LLM inference can be batched, amortizing the cost of each forward pass across requests. If you have too few concurrent customers, you can’t fill the optimal batch size.
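For intuition, here is a minimal sketch of what batching means in practice, using Hugging Face transformers (the "gpt2" model and the padding setup are just stand-ins; real serving stacks batch at a lower level): one padded forward pass serves several prompts at once, so the per-request cost drops.

    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("gpt2")
    tokenizer.pad_token = tokenizer.eos_token   # gpt2 has no pad token by default
    tokenizer.padding_side = "left"             # left-pad so generation lines up
    model = AutoModelForCausalLM.from_pretrained("gpt2")

    prompts = [
        "The capital of France is",
        "Batching works because",
        "GPUs spend most of their time",
    ]

    # One padded forward pass serves all three requests at once, so the
    # fixed cost of streaming weights through the GPU is amortized.
    inputs = tokenizer(prompts, return_tensors="pt", padding=True)
    with torch.no_grad():
        out = model.generate(**inputs, max_new_tokens=20,
                             pad_token_id=tokenizer.eos_token_id)
    print(tokenizer.batch_decode(out, skip_special_tokens=True))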
That said, the optimal batch size on today’s hardware is not big (<20). I would be very surprised if they couldn’t fill it within any few-seconds window.
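To make the "few-seconds window" concrete, here is a toy request collector (the names WINDOW, MAX_BATCH, and the queue are illustrative, not any real serving API): block until the first request arrives, then keep accepting requests until the batch is full or the window closes, and hand the whole batch to one forward pass.

    import queue
    import time

    WINDOW = 2.0      # seconds to wait while filling a batch
    MAX_BATCH = 20    # the "optimal batch size" ceiling discussed above

    requests: "queue.Queue[str]" = queue.Queue()

    def collect_batch() -> list[str]:
        batch = [requests.get()]          # block until at least one request
        deadline = time.monotonic() + WINDOW
        while len(batch) < MAX_BATCH:
            remaining = deadline - time.monotonic()
            if remaining <= 0:
                break                     # window closed; ship a partial batch
            try:
                batch.append(requests.get(timeout=remaining))
            except queue.Empty:
                break                     # no more arrivals within the window
        return batch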
I would swear that in an earlier version of this message, the optimal batch size was estimated to be as large as twenty.
This sounds like an attempt to demand that others disprove the assertion that they’re losing money, in a discussion of an article about Sam saying they’re losing money.
What? I’m not doubting what he said. Just surprised. Look at this. I really hope Sam IPOs his company so I can short it.