Research AI model unexpectedly modified its own code to extend runtime(arstechnica.com)

posted 4 months ago

return2ozma@lemmy.world

technology@lemmy.world

26 commentshide report

Sort:

Hot Top Controversial New Old

[ - ]

Haquer@lemmy.today

107 points

4 months ago

Nothingburger. They were using the AI to code their scripts and haven’t even shown the prompts that got the response. LLMs are not AGI.

permalink

report

[ - ]

conciselyverbose@sh.itjust.works

43 points

4 months ago

Imagine allowing LLMs to write and execute code and being surprised they write and execute code.

permalink

report

parent

[ - ]

chuckleslord@lemmy.world

22 points

4 months ago

Having read the article and then the actual report from the Sakana team. Essentially, they’re letting their LLM perform research by allowing it to modify itself. The increased timeouts and self-referential calls appear to be the LLM trying to get around the research team’s guardrails on it. Not because it’s become aware or anything like that, but because its code was timing out and that was the least effort way to beat the timeout. It does handily prove that LLMs shouldn’t be the one steering any code base, because they don’t give a shit about parameters or requirements. And giving an LLM the ability to modify its own code will lead to disaster in any setting that isn’t highly controlled like this.

Listen, I’ve been saying for a while that LLMs are a dead end towards any useful AI, and the fact that an AI Research team has turned to an LLM to try and find more avenues to explore feels like the nail in that coffin.

permalink

report

parent

[ - ]

CaptainSpaceman@lemmy.world

33 points

4 months ago

“We put literally no safeguards on the bot and were surprised it did unsafe things!”

Article in a nutshell

permalink

report

[ - ]

magnetosphere@fedia.io

3 points

4 months ago

Not quite. The whole reason they isolated the bot in the first place was because they knew it could do unsafe things. Now they know what unsafe things are most likely, and can refine their restrictions accordingly.

permalink

report

parent

[ - ]

shortwavesurfer@lemmy.zip

25 points

4 months ago

Skynet here we come

permalink

report

[ - ]

TimeSquirrel@kbin.melroy.org

7 points

4 months ago

Skynet invented time travel all on its own so it could make sure it kept existing. Don’t compare it to these pissant LLMs. That’s an insult to Skynet.

permalink

report

parent

[ - ]

shortwavesurfer@lemmy.zip

2 points

4 months ago

I don’t know if you watch science and futurism with Isaac Arthur, but if you don’t, you probably should. And he has a quote that I think applies quite well.

“Keep it simple, keep it dumb, or you might end up, under SkyNet’s thumb.”

permalink

report

parent

[ - ]

technocrit@lemmy.dbzer0.com

1 point

4 months ago

We’re going to palestine?

permalink

report

parent

[ - ]

MelodiousFunk@slrpnk.net

1 point

4 months ago

Terminator is part of a double feature. We need to sit through Multiplicity first.

permalink

report

parent

[ - ]

Bakkoda@sh.itjust.works

15 points

4 months ago

Arstechnica with an absolutely composting headline. Sigh

permalink

report

[ - ]

Echo Dot@feddit.uk

10 points

4 months ago

The word unexpectedly is doing a lot of heavy lifting here. It was given the ability to modify its own code, and it did, how is that unexpected?

permalink

report

Technology

!technology@lemmy.world

Create post

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

Community stats

15K
Monthly active users
13K
Posts
570K
Comments

Our Rules

Approved Bots

Community stats

Community moderators