AI company says their AI is smart, but the other companies are selling snake oil.
Got it.
Couldn’t you just ask ChatGPT whether it wrote something specific?
Obviously not. It’s a language generator with a bit of chat modeling and reinforcement learning on top, not an Artificial General Intelligence.
It doesn’t know anything, it doesn’t retain memory long-term, and it doesn’t have any self-identity. There is no way it could ever truthfully respond “I know that I wrote that.”
It has no “memory” of what it has generated previously, other than the current conversation, so the answer you get from it won’t be much better than a random guess.
Then there was that time a professor tried to fail his whole class because he asked ChatGPT if it wrote their essays.
Could you please provide a brief overview? This article is not available in my country/region.
It cites this article, which might work for you.
That doesn’t really work, because half the time it just says whatever. It’s very good at making stuff up. It doesn’t really get that it needs to tell the truth, because all it’s doing is optimising for a plausible narrative.
That’s why it says slavery is good, because the only people asking that question clearly have an answer in mind, and it’s optimising for that answer.
Also it doesn’t have access to other people’s sessions (because that would be hella dodgy) so it can’t tell you definitively if it did or did not say something in another session, even if it were inclined to tell the truth.
We need to embrace AI-written content fully. Language is just a protocol for communication. If AI can flesh out the “packets” for us nicely, in a way that gives the receiving humans what they need to understand the communication, then that’s a major win. Now I can ask AI to write me a nice letter and prompt it with a short bulleted list of what I want to say. Boom! Done, and time is saved.
The professional writers who used to slave over a blank Word document are now obsolete, just like the slide rule “computers” of old (the people who could solve complicated mathematics and engineering problems on paper).
Teachers who thought a handwritten report could be used to prove that “education” has happened are now realizing that the idea was a crutch (it was 25 years ago too, when we could copy/paste Microsoft Encarta articles and hand them in as our research papers).
The technology just shows us that our language capabilities really are a means to an end. If a better means arises, we should figure out how to maximize it.
They never did, and they never will.
Because generative neural networks always inject some random noise when they sample, so the same prompt doesn’t produce the same output twice. Read more about it here
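If you want to see the mechanics, here’s a toy sketch of temperature sampling (the logits and the five-word vocabulary are made up, not taken from any real model). Two runs over the same prompt can pick different tokens, which is part of why there’s no stable fingerprint to latch onto:

```python
import numpy as np

rng = np.random.default_rng()

# Made-up next-token logits for a tiny five-word vocabulary.
logits = np.array([2.0, 1.5, 0.5, 0.2, -1.0])
vocab = ["the", "a", "one", "this", "banana"]

def sample_token(logits, temperature=0.8):
    """Sample a token index from softmax(logits / temperature)."""
    scaled = logits / temperature
    probs = np.exp(scaled - scaled.max())  # subtract max for numerical stability
    probs /= probs.sum()
    return rng.choice(len(logits), p=probs)

# Each run draws fresh randomness, so repeated "generations" differ.
print([vocab[sample_token(logits)] for _ in range(5)])
```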
Because you’re training a detector on something that is designed to emulate regular human language as closely as possible, and human writing has so much variability that it’s almost impossible to tell whether something was written by an AI.
You can maybe detect the typical generic ChatGPT-style output, but you can steer a conversation with ChatGPT, or with any of the much better local models (privacy and control are what make them better), and end up with radically human-seeming outputs that are totally different from anything stock ChatGPT produces.
In short, given a static block of text, it’s going to be nearly impossible to detect whether it came from an AI. It’s just too difficult a problem, and even if you solve it, the solution will be immediately obsolete the next time someone fine-tunes their own model.
Yeah, this makes a lot of sense considering the vastness of language and its imperfections (English, I’m mostly looking at you, ya inbred fuck).
Are there any other detection techniques that you know of? What about forcing AI models to carry a signature that is guaranteed to be identifiable, permanent, and unique for each tuning produced? It’d have to be not directly noticeable, but easy to calculate, so as not to create any “distractions” for the users.
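Watermarking schemes roughly like that have been proposed; the best-known one (Kirchenbauer et al.’s “green list” watermark) hashes the previous token to pick a pseudo-random subset of the vocabulary and biases generation toward it, so a detector just counts green tokens. Here’s a rough sketch of the detection side only, using word-level tokens and made-up hashing details rather than a real tokenizer:

```python
import hashlib

def green_fraction(tokens, vocab_size=50_000, green_ratio=0.5):
    """Fraction of tokens falling in the 'green list' seeded by the previous token.

    A watermarking generator biases its sampling toward green tokens, so
    watermarked text scores well above green_ratio, while unwatermarked
    human text should land near green_ratio purely by chance.
    """
    hits = 0
    for prev, tok in zip(tokens, tokens[1:]):
        # Derive a pseudo-random seed from the previous token.
        seed = int(hashlib.sha256(prev.encode()).hexdigest(), 16)
        # A token is "green" if its keyed hash lands in the lower half of the vocab.
        bucket = int(hashlib.sha256(f"{seed}:{tok}".encode()).hexdigest(), 16) % vocab_size
        if bucket < vocab_size * green_ratio:
            hits += 1
    return hits / max(len(tokens) - 1, 1)

print(green_fraction("the quick brown fox jumps over the lazy dog".split()))
```

The catch is that paraphrasing, translation, or fine-tuning your own model washes the signature right out, so it only catches people pasting the watermarked model’s output verbatim.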
Because AIs are (partly) trained by making AI detectors. If an AI can be distinguished from a natural intelligence, it’s not good enough at emulating intelligence. If an AI detector can reliably distinguish AI from humans, the AI companies will use that detector to train their next AI.
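That’s essentially the GAN (generative adversarial network) setup. A toy numeric version, with a single number standing in for “text” and all parameters made up, shows the dynamic: the detector learns to separate the two distributions, and the generator follows the detector’s gradient until they overlap and there’s nothing left to detect:

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

# Toy setup: "human" text reduces to one feature drawn from N(0, 1);
# the "AI" starts out visibly offset at N(mu, 1).
mu = 3.0          # generator parameter: how far AI output sits from human output
w, b = 0.0, 0.0   # detector: a logistic classifier on the one feature
lr = 0.1

for _ in range(2000):
    human = rng.normal(0.0, 1.0, 64)
    ai = rng.normal(mu, 1.0, 64)

    # Detector step: push its score toward 1 on AI samples, 0 on human samples.
    for x, y in ((ai, 1.0), (human, 0.0)):
        p = sigmoid(w * x + b)
        w -= lr * np.mean((p - y) * x)
        b -= lr * np.mean(p - y)

    # Generator step: descend the detector's score at the generator's mean,
    # i.e. use the detector itself as the training signal.
    p = sigmoid(w * mu + b)
    mu -= lr * p * (1 - p) * w

print(f"final offset from the human distribution: {mu:.3f}")  # should drift toward 0
```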
I’m not sure I’m following your argument here - you keep switching between talking about AI and AI detectors. The points below follow the order of the sentences in your reply:
- Can you provide any articles or blog posts from AI companies for this or point me in the right direction?
- Agreed
- Right…
I’m having trouble finding support for your claim.
OpenAI discontinued its AI Classifier, which was an experimental tool designed to detect AI-written text. It had an abysmal 26 percent accuracy rate.
If you ask this thing whether or not some given text is AI-generated, and it’s only right 26% of the time, then I can think of a real quick way to make it 74% accurate.
I feel like this must stem from a misunderstanding of what 26% accuracy means, but for the life of me, I can’t figure out what it would be.
In statistics, everything is based on probability/likelihood, even binary yes-or-no decisions. For example, you might say “this predictive algorithm must be at least 95% statistically confident of an answer, else it defaults to unknown or another safe answer”.
What this likely means is that only 26% of the answers were both confident enough to say “yes” (because falsely accusing somebody of cheating is much worse than giving the benefit of the doubt) and correct.
There is likely a large portion of answers that could have been predicted correctly if the company had been willing to risk more false positives (potentially getting students mistakenly expelled).
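In code, that gating logic looks something like the sketch below (the function and the 95% cutoff are just illustrative, not anything OpenAI published):

```python
def classify(p_ai, threshold=0.95):
    """Only accuse when very confident; otherwise abstain with a safe answer."""
    if p_ai >= threshold:
        return "likely AI-written"
    if p_ai <= 1 - threshold:
        return "likely human-written"
    return "unclear"

for p in (0.99, 0.60, 0.02):
    print(f"{p:.2f} -> {classify(p)}")
```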
Looks like they got that number from this quote from another Ars Technica article: “…OpenAI admitted that its AI Classifier was not ‘fully reliable,’ correctly identifying only 26 percent of AI-written text as ‘likely AI-written’ and incorrectly labeling human-written works 9 percent of the time.”
Seems like it mostly wasn’t confident enough to make a judgement: 26% of the time it correctly flagged AI text, and 9% of the time it incorrectly flagged human text as AI. It doesn’t tell us how often it labeled AI text as human, or how often it was just unsure.
EDIT: this is the article: https://arstechnica.com/information-technology/2023/07/openai-discontinues-its-ai-writing-detector-due-to-low-rate-of-accuracy/
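Plugging those reported rates into a hypothetical batch of essays makes both points concrete: you can’t just “flip” the answer to get 74% accuracy, because 26% is a true-positive rate, not an overall accuracy. (All counts below are made up; only the two rates come from the article.)

```python
# Hypothetical batch: 100 AI-written and 100 human-written essays,
# scored with the reported 26% true-positive and 9% false-positive rates.
ai_essays, human_essays = 100, 100

flagged_ai = 0.26 * ai_essays        # AI text correctly flagged "likely AI-written"
flagged_human = 0.09 * human_essays  # human text wrongly flagged

precision = flagged_ai / (flagged_ai + flagged_human)
print(f"precision when it does flag something: {precision:.0%}")  # ~74%
print(f"AI essays that sail through unflagged: {ai_essays - flagged_ai:.0f} of {ai_essays}")
```

Funnily enough, when it does point the finger it’s right about 74% of the time; the real problem is the 74 AI essays it never flags at all.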
It seemed like a really weird decision for OpenAI to have an AI classifier in the first place. Their whole business is generating output good enough that it can’t be distinguished from what a human might produce, and then they went and made a tool to try to point out where they’d failed.