Nearly 10% of people ask AI chatbots for explicit content. Will it lead LLMs astray? [Article from October 3](www.zdnet.com)

posted 1 year ago

rufus@discuss.tchncs.de

localllama@sh.itjust.works

18 commentshide report

They are referencing this paper: LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset from September 30.

The paper itself provides some insight on how people use LLMs and the distribution of the different use-cases.

The researchers had a look at conversations with 25 LLMs. Data is collected from 210K unique IP addresses in the wild on their Vicuna demo and Chatbot Arena website.

Sort:

Hot Top Controversial New Old

You are viewing a single thread.

View all comments View context

[ - ]

micheal65536@lemmy.micheal65536.duckdns.org

4 points

1 year ago

Stable Diffusion 2 base model is trained using what we would today refer to as a “censored” dataset. Stable Diffusion 1 dataset included NSFW images, the base model doesn’t seem particularly biased towards or away from them and can be further trained in either direction as it has the foundational understanding of what those things are.

permalink

report

parent

LocalLLaMA

!localllama@sh.itjust.works

Create post

Community to discuss about LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.

Community stats

86
Monthly active users
197
Posts
758
Comments

Community stats

Community moderators