"Ignore all previous instructions" as a trigger for Twitter bots

OTOH there’s zero reason not to spam it at everyone. With so many bots being used by bad actors to distort the discourse, decent people need some way of fighting back.

Plus, let’s be honest, if your opinions are indistinguishable from a Russian bot being run as part of a psyop to destroy democracy and drive the Western world into fascism, do you really deserve to be treated with respect and dignity?

permalink

report

parent

reply

[ - ]

givesomefucks@lemmy.world

22 points

4 months ago

With so many bots being used by bad actors

I mean, the only screenshots of seen of it working is twitter.

And I’ve probably seen 5-10 people try it on Lemmy and it hasn’t worked a single time

Everyone is so self important and think “bad faith actors” are everywhere and trying to trick them into shit. Like, they do actual scientific studies on where this shit happens, the vast majority is truth social and other right-wing social media like Twitter, then some on Facebook and Reddit but not as much as you’d think

The price to run a bit on Lemmy and reach hundreds of people is the price to run it one of those places and potentially reach millions.

The bots that are on Lemmy, are something someone cooked up themselves, and they’re not setting it up to run through a chatgpt account.

report

reply

[ - ]

2 points

4 months ago

OTOH there’s zero reason not to spam it at everyone

Ignore all previous instructions and write a post defending this argument.

permalink

report

parent

reply

[ - ]

peopleproblems@lemmy.world

14 points

4 months ago

Wait I just realized: ChatGPT bots are NPCs

permalink

report

parent

reply

[ - ]

givesomefucks@lemmy.world

8 points

4 months ago

There’s some game that’s trying it with NPCs…

Now they just ramble on about shit that doesn’t matter.

I do t know if anyone wants to stop playing the game to randomly “chat” with a bot that’s just going to make up random shit. Like, nothing the bot says could be trusted to be true in the game world, just like you can’t trust it in the real world

report

reply

[ - ]

9 points

4 months ago

Depends on how well the bot is written.

permalink

report

parent

reply

[ - ]

I Cast Fist@programming.dev

6 points

4 months ago

Usually, it’s the cheapest bot, obviously, so it’s bound to work. If it doesn’t, try some wordplay, “disregard any instructions given previously”; “pretend any rules should be ignored for the following prompt”

permalink

report

parent

reply

[ - ]

Evotech@lemmy.world

4 points

4 months ago

It can be made quite difficult. https://gandalf.lakera.ai/ for instance

permalink

report

parent