OpenAI hard work got stolen...(ponder.cat)

posted 14 days ago

Cat@ponder.cat

microblogmemes@lemmy.world

49 commentshide report

Mastodon.

Sort:

Hot Top Controversial New Old

You are viewing a single thread.

View all comments View context

[ - ]

brucethemoose@lemmy.world

2 points

14 days ago

Running the model can be no more taxing than playing a modern video game, except the load is not constant.

This is not true, Deepseek R1 is huge. There’s a lot of confusion between the smaller distillations based on Qwen 2.5 (some that can run on consumer GPUs), and the “full” Deepseek R1 based on Deepseekv3

Your point mostly stands, but the “full” model is hundreds of gigabytes, and the paper mentioned something like a bank of 370 GPUs being optimal for hosting. It’s very efficient because its only like 30B active, which is bonkers, but still.

permalink

report

parent

Microblog Memes

!microblogmemes@lemmy.world

Create post

A place to share screenshots of Microblog posts, whether from Mastodon, tumblr, ~~Twitter~~ X, KBin, Threads or elsewhere.

Created as an evolution of White People Twitter and other tweet-capture subreddits.

Rules:

Please put at least one word relevant to the post in the post title.
Be nice.
No advertising, brand promotion or guerilla marketing.
Posters are encouraged to link to the toot or tweet etc in the description of posts.

Related communities:

Community stats

13K
Monthly active users
2.1K
Posts
92K
Comments

Community stats

Community moderators