You are viewing a single thread.
View all comments View context
6 points

The layman’s explanation of how an LLM works is it tries to predict the most likely word, or sequence of words, that follow from the last. This is based all on the input training set, which is compiled into a big bucket of probabilities. All text input influences those internal probabilities which in turn generates likely output. This is also why these things are error-prone because it’s really just hyper-sophisticated predictive text, and is doing its best to “play the odds.”

You can also view an LLM as one fiendishly massive if/else statement that chews on text tokens. There’s also some random seeding thrown in for more variation in output, but these things are 100% repeatable if you use the same seed every time; it’s just compiled logic.

permalink
report
parent
reply
3 points

Hehe best illustration. “big bucket of probabilities” …hell yeah

permalink
report
parent
reply
3 points

Yup. I had this in my head at the time:

permalink
report
parent
reply

memes

!memes@lemmy.world

Create post

Community rules

1. Be civil

No trolling, bigotry or other insulting / annoying behaviour

2. No politics

This is non-politics community. For political memes please go to !politicalmemes@lemmy.world

3. No recent reposts

Check for reposts when posting a meme, you can only repost after 1 month

4. No bots

No bots without the express approval of the mods or the admins

5. No Spam/Ads

No advertisements or spam. This is an instance rule and the only way to live.

Sister communities

Community stats

  • 13K

    Monthly active users

  • 2.3K

    Posts

  • 47K

    Comments