Someone got Gab's AI chatbot to show its instructions(mbin.grits.dev)

posted 7 months ago

mozz@mbin.grits.dev

technology@beehaw.org

199 commentshide report

Credit to @bontchev

Sort:

Hot Top Controversial New Old

You are viewing a single thread.

View all comments View context

[ - ]

Gaywallet (they/it)@beehaw.org

9 points

7 months ago

Ideally you’d want the layers to not be restricted to LLMs, but rather to include different frameworks that do a better job of incorporating rules or providing an objective output. LLMs are fantastic for generation because they are based on probabilities, but they really cannot provide any amount of objectivity for the same reason.

permalink

report

parent

[ - ]

jarfil@beehaw.org

2 points

7 months ago

It’s already been done, for at least a year. ChatGPT plugins are the “different frameworks”, and running a set of LLMs self-reflecting on a train of thought, is AutoGPT.

It’s like:

Can I stick my fingers in a socket? - Yes.
What would be the consequences? - Bad.
Do I want these consequences? - Probably not
Should I stick my fingers in a socket? - No

However… people like to cheap out, take shortcuts and run an LLM with a single prompt and a single iteration… which leaves you with “Yes” as an answer, then shit happens.

permalink

report

parent

Technology

!technology@beehaw.org

Create post

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:

This community’s icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

Community stats

2.7K
Monthly active users
3K
Posts
57K
Comments

Community stats

Community moderators