Could it be because a statistical relation isn’t the same as a semantic one? No, I must be prompting it wrong. I’ll just add “engineer” to my title and then everyone will take me seriously.
The problem is not the LLMs, but what people are trying to do with them.
They are currently spoons, but people are desperately wishing they were katanas.
They work really well for soup, but they can’t cut steak. But they’re being hyped as super ninja steak knives, and people are getting pissed when they can’t cut steak.
If you give them watery, soupy tasks they can do successfully, they can lighten your workload, as long as you’re aware of what they are and aren’t good at.
What people want LLMs to be able to do, ie. “Steak” tasks:
- write complex documents
- apply complex knowledge/rules to a situation
- write complex code and create entire programs based on vague description
What LLMs can currently do, ie. “Soup” tasks:
- check this document and fix all spelling, punctuation and grammatical errors
- summarise this paragraph as dot points
- write a python program that sorts my photographs into folders based on the year they were taken
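For a sense of scale, the last soup task above comes out to roughly the following. This is only a minimal sketch under stated assumptions: the function name `sort_photos_by_year` and the extension list are made up for illustration, and it uses each file's modification time as a stand-in for the date the photo was taken, since reading real EXIF capture dates needs a third-party library.

```python
import shutil
from datetime import datetime
from pathlib import Path

# Assumption: these extensions are what counts as a "photograph" here.
PHOTO_EXTENSIONS = {".jpg", ".jpeg", ".png", ".heic"}


def sort_photos_by_year(source_dir, dest_dir):
    """Move photos from source_dir into dest_dir/<year>/ and return the new paths."""
    source = Path(source_dir)
    dest = Path(dest_dir)
    moved = []
    # sorted() snapshots the listing so moving files does not disturb iteration.
    for path in sorted(source.iterdir()):
        if not path.is_file() or path.suffix.lower() not in PHOTO_EXTENSIONS:
            continue
        # Assumption: modification time approximates the capture date.
        year = datetime.fromtimestamp(path.stat().st_mtime).year
        target_dir = dest / str(year)
        target_dir.mkdir(parents=True, exist_ok=True)
        target = target_dir / path.name
        if target.exists():
            continue  # leave name collisions in place rather than overwrite
        shutil.move(str(path), str(target))
        moved.append(target)
    return moved
```

Calling `sort_photos_by_year("unsorted", "sorted")` would leave you with folders like `sorted/2019/` and `sorted/2021/`. Non-photo files and anything that would overwrite an existing file stay where they are.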
Half of Lemmy is hyping katanas, the other half is yelling “Why won’t my spoon cut this steak?!! AI is so dumb!!!”
Update: wow, the pure vitriol pouring out of the replies is just stunning. Seems there are a lot of you out there who have, in one way or another, tied your ego very strongly to either the success or failure of AI.
Take a step back, friends, and go outside for a while.
“spoons and katanas” has got to be the most baby brained analogy. are you a child
Food analogy
This level of discourse wouldn’t fly on 4chan, how is it so popular with LLM fans?
needs to be a car analogy
- What people want LLMs to do, i.e. Corvette tasks
- What LLMs actually do, i.e. Trabant tasks
“What LLMs can currently do: summarise this paragraph as dot points”
The entire point here is that they can’t?
Clearly this post is about LLMs not succeeding at this task, but anecdotally I’ve seen it work OK and also fail. Just like humans, who are the benchmark, except the LLMs are faster.
good god this entire post is the most tortured believer whataboutism I’ve encountered this month and there’s extremely strong competition here
“are currently spoons, but people are desperately wishing they were katanas”
“ie. ‘Steak’ tasks”
you should make a youtube channel, The Katana Steak-Eater. I’d watch the shit out of that at least one saturday afternoon
I’d offer congratulations on obfuscating a bad claim with a poor analogy, but you didn’t even do that very well.
they don’t do any of that soup shit reliably either and reading the article might have told you that
Is it only me, or is the linked article not super long on details & is reaching a conclusion from 2 examples? This is important & I need to hear more, & I’m generally biased against AI at this point, but the article isn’t doing enough to convince me
did you click through to any of the inline citations? David’s shorter articles on pivot mostly gather and summarize those, so if you need to read the original research and its conclusions that’s where to go
Dang everyone here needs to look at a tree or a cat or something. Energy is wack in here
how the hell did this of all the posts turn into a promptfondler shooting gallery