Claude was being judgy, so I called it out. It immediately caved. Is verbal abuse a valid method of circumventing LLM censorship??

16 points

This is so strange. You would think it wouldn’t be so easy to overcome the “guardrails”.

And what’s with the annoying faux-human response style? They’re trying to “humanize” the LLM interface, but no person is going to answer in this way if they believe this information should not be provided.

