ThoughtGoblin
Oh, okay, so I’m not crazy!
Saw this while scrolling down /c/all and immediately noticed something was off with the tiny leg on the left. The only obviously weird thing (to me) was that the planters on the left have the suspending wires attached to the leaves. I still wasn’t sure if it was AI generated.
In a few years these are going to be absolutely indistinguishable. What a time to be alive!
Not really, though it’s hard to know what exactly is or isn’t encoded in the network. It likely retains more of the salient and highly referenced content, since that would come up in its training set more often. But entire works are basically impossible, just because of the sheer ratio between the size of the training data and the size of the resulting model. Not to mention that GPT’s mode of operation mostly discourages long-form rote memorization. It’s a statistical model, after all, and the enemy of “objective” state.
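To put the size-ratio argument in rough numbers, here's a back-of-the-envelope sketch. Every figure in it is a placeholder I picked for illustration (parameter count, 16-bit weights, corpus size), not a real number for any actual GPT model, and `weight_bytes_per_training_byte` is just a made-up helper name.

```python
def weight_bytes_per_training_byte(param_count, bytes_per_param, training_bytes):
    """Bytes of model weights available per byte of training text."""
    return (param_count * bytes_per_param) / training_bytes

# Placeholder values, chosen only to show the shape of the argument.
params = 100e9           # hypothetical parameter count
bytes_per_param = 2      # assumes 16-bit weights
training_bytes = 10e12   # hypothetical corpus size in bytes

ratio = weight_bytes_per_training_byte(params, bytes_per_param, training_bytes)
print(f"{ratio:.3f} bytes of weights per byte of training text")
```

With numbers in that ballpark the model has only a small fraction of a byte per byte of text it saw, which is the intuition behind "it can't hold entire works." It doesn't rule out memorizing a small, heavily repeated subset, which fits the point about highly referenced content.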
Furthermore, GPT isn’t coherent enough for long-form content. With its small context window, it just has trouble remembering big things like books. And since it doesn’t have access to any “senses” but text broken into word-like tokens, concepts like pages or “how many” give it issues.
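A quick sketch of the context-window point, under loudly stated assumptions: the window size, the 1.3 tokens-per-word rule of thumb, and the "book" length are all illustrative guesses, and a whitespace split stands in for a real tokenizer.

```python
def estimate_tokens(text, tokens_per_word=1.3):
    """Crude token estimate: whitespace word count times an assumed ratio."""
    return int(len(text.split()) * tokens_per_word)

def fits_in_window(text, window_tokens):
    """True if the estimated token count fits in the context window."""
    return estimate_tokens(text) <= window_tokens

WINDOW = 4096                   # hypothetical context window, in tokens
short_post = "word " * 200      # stand-in for a forum comment
book = "word " * 100_000        # stand-in for a novel-length text

print(fits_in_window(short_post, WINDOW))
print(fits_in_window(book, WINDOW))
```

The comment-sized text fits and the book-sized one doesn't, by a wide margin. Anything outside the window simply isn't visible to the model at that moment.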
None of the leaked prompts really mention “don’t reveal copyrighted information” either, so it seems the creators really aren’t concerned, which you’d think they would be if it did have this tendency. It’s more likely to make up entire pieces of content from the summaries it does remember.