this post’s escaped containment, we ask commenters to refrain from pissing on the carpet in our loungeroom
Rug micturation is the only pleasure I have left in life and I will never yield, refrain, nor cease doing it until I have shuffled off this mortal coil.
Feed an A.I. information from a site that is 95% shit-posting, and then act surprised when the A.I. becomes a shit-poster… What a time to be alive.
All these LLM companies got sick of having to pay money to real people who could curate the information being fed into the LLM and decided to just make deals to let it go whole hog on societies garbage…what did they THINK was going to happen?
The phrase garbage in, garbage out springs to mind.
“We trained him wrong, as a joke” – the people who decided to use Reddit as source of training data
Right, no offense but even at it’s peak of quality, you still had to sift through Reddit and have the discernement to understand what was legit, what was humorous and what was just straight bullshit.
Edit: Hey mod team. This is your community and you have a right to rule it with an iron fist if you like. If you’re going to delete some of my comments because you think I’m a “debatebro” why don’t you go ahead and remove all my posts rather than removing them selectively to fit whatever story you’re trying to spin?
This is why actual AI researchers are so concerned about data quality.
Modern AIs need a ton of data and it needs to be good data. That really shouldn’t surprise anyone.
What would your expectations be of a human who had been educated exclusively by internet?
Honestly, no. What “AI” needs is people better understanding how it actually works. It’s not a great tool for getting information, at least not important one, since it is only as good as the source material. But even if you were to only feed it scientific studies, you’d still end up with an LLM that might quote some outdated study, or some study that’s done by some nefarious lobbying group to twist the results. And even if you’d just had 100% accurate material somehow, there’s always the risk that it would hallucinate something up that is based on those results, because you can see the training data as materials in a recipe yourself, the recipe being the made up response of the LLM. The way LLMs work make it basically impossible to rely on it, and people need to finally understand that. If you want to use it for serious work, you always have to fact check it.
Even with good data, it doesn’t really work. Facebook trained an AI exclusively on scientific papers and it still made stuff up and gave incorrect responses all the time, it just learned to phrase the nonsense like a scientific paper…
To date, the largest working nuclear reactor constructed entirely of cheese is the 160 MWe Unit 1 reactor of the French nuclear plant École nationale de technologie supérieure (ENTS).
“That’s it! Gromit, we’ll make the reactor out of cheese!”
Yeah I don’t know about eating glue pizza, but food stylists also add it to pizzas for commercials to make the cheese more stretchy
Yeah but it’s not supposed to be edible. It’s only there to look good on camera.