archive https://archive.ph/is57b
Why is that a criticism? This is how it works for humans too: we study, we learn the stuff, and then try to recall it during tests. We’ve been trained on the data too, for neither a human nor an AI would be able to do well on the test without learning it first.
This is part of what makes AI so “scary”: it can basically know so much.
Don’t anthropomorphise. There is quite a difference between a human and an advanced lookup table.
Well… I do agree with you, but human brains are basically big prediction engines that use lookup tables, i.e. experience, to navigate around life. Obviously a huge oversimplification, and LLMs are nowhere near humans, but it is quite a step in that direction.
@phoenixz @Soyweiser “Let’s redefine what it means to be human, so we can say the LLM is human.” Have you bumped your head?
I absolutely agree. However, if you think the LLMs are just fancy LUTs, then I strongly disagree. Unless, of course, we are also just fancy LUTs.
You ever meet an AI researcher with a background in biology? I’ve discussed this stuff with one. She disagrees with Turing about machines thinking, including when AI is in the picture. Machines process information very differently from how biology does.
I guess it comes down to a philosophical question as to what “know” actually means.
But from my perspective, it certainly knows some things. It knows how to determine what I’m asking, and it clearly knows how to formulate a response by stitching together information. Is it perfect? No. But neither are humans: we mistakenly believe we know things all the time, and miscommunications are quite common.
But this is why I asked the follow-up question: what’s the effective difference? Don’t get me wrong, they clearly have a lot of flaws right now. But my 8-year-old had a lot of flaws too, and I assume both will get better with age.
Because a machine that “forgets” stuff it reads seems rather useless… Considering it was a multiple-choice exam and, as a machine, ChatGPT had the book entirely memorized, it should have gotten a perfect score almost every time.