-2 points

OpenAI must evolve to offer something beyond generative AI.

OpenAI's compute bills are enormous. It would need far more paying customers just to keep the service somewhat viable.

https://futurism.com/the-byte/chatgpt-costs-openai-every-day

2 points

The potential for cost reduction in this field is orders of magnitude. Look at Llama running on everything down to a Raspberry Pi after 2 months.

There are massive gains to be made: once we have dedicated hardware for transformers, that's orders of magnitude more.

Ever notice how your phone can play back 24h of video but dies after 3h of browsing? That's dedicated hardware codec support.

-3 points

Yeah, but Llama's quality cannot compete with the ChatGPT models. (It doesn't matter which model you use: if you want good and FAST results, you need serious compute.) We do have commercial dedicated AI chips from NVDA; last time I checked, you had to place an order just to get a price. George Hotz, who is also working on something similar, said on a Lex Fridman podcast that a personal AI rig would have to be closer to a mainframe in size.

Nothing I have seen so far leads me to believe that generative AI becomes more efficient on weaker hardware.

3 points

On the current trajectory, Llama 2 (L2) 70B models are easily beating GPT-3.5 and approaching GPT-4 performance. An A6000 can run them comfortably, and this is only a few months after release.

Nah, the trajectory is not in favor of proprietary models, especially since they will have to be dumbed down more and more due to alignment.

https://www.anyscale.com/blog/llama-2-is-about-as-factually-accurate-as-gpt-4-for-summaries-and-is-30x-cheaper?trk=feed_main-feed-card_feed-article-content
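
For anyone who wants to sanity-check the "an A6000 can run them" part, here is a rough back-of-the-envelope sketch. It only counts the memory needed for the weights and ignores the KV cache, activations, and quantization overhead, so treat the numbers as a lower bound. The function names are made up for illustration, the 48 GB figure is the RTX A6000's VRAM, and the bit widths are my assumption (the comment above doesn't say which precision is meant).

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


def fits_in_vram(n_params: float, bits_per_weight: float, vram_gb: float) -> bool:
    """True if the weights alone fit in the given VRAM (no KV cache or activations)."""
    return weight_memory_gb(n_params, bits_per_weight) <= vram_gb


N_PARAMS = 70e9        # a 70B-parameter model
A6000_VRAM_GB = 48     # RTX A6000 memory; everything else here is an assumption

for bits in (16, 8, 4):
    size = weight_memory_gb(N_PARAMS, bits)
    verdict = "fits" if fits_in_vram(N_PARAMS, bits, A6000_VRAM_GB) else "does not fit"
    print(f"{bits:>2}-bit weights: {size:.0f} GB, {verdict} in {A6000_VRAM_GB} GB")
```

By this estimate only the 4-bit case fits on a single card, so "comfortably" most likely implies a quantized model rather than full 16-bit weights.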


Technology

!technology@lemmy.ml

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.