LocalLLaMA

2254 readers

1 users here now

Community to discuss about LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.

founded 1 year ago

MODERATORS

[email protected]

Nearly 10% of people ask AI chatbots for explicit content. Will it lead LLMs astray? [Article from October 3] (www.zdnet.com)

submitted 1 year ago* (last edited 1 year ago) by [email protected] to c/[email protected]

17 comments fedilink hide all child comments

They are referencing this paper: LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset from September 30.

The paper itself provides some insight on how people use LLMs and the distribution of the different use-cases.

The researchers had a look at conversations with 25 LLMs. Data is collected from 210K unique IP addresses in the wild on their Vicuna demo and Chatbot Arena website.

you are viewing a single comment's thread
view the rest of the comments

[–] Eheran 15 points 1 year ago (3 children)

Will it lead them astray? That is simply not possible. LLMs don't learn from input. They are trained on a dataset and then can not "learn" new things. That would require a new training, resulting in a new LLM.

Of course, the creator of the LLM could adapt to this and include more of different such things in the training data. But that is a very deliberate action.

[–] [email protected] 3 points 1 year ago* (last edited 1 year ago) (2 children)

I agree, that question at the end is a bit too clickbaity. It is of concern for the next iterations of LLMs if the 'wrong' kind of usage creeps into their datasets. But that's what AI safety is for and you better curate your datasets and align the models for your intended use-case. AFAIK all the professional LLMs have some research done on their biases. And that's also part of legislative attempts like what the EU is currently debating.

As I use LLMs for that 10% use-case I like them to know about those concepts. I believe stable diffusion is a bit ahead on this, didn't they strip nude pictures from the dataset at some point and that's why lots of people still use SD1.5 as the basis for their projects?

[–] [email protected] 5 points 1 year ago

Stable Diffusion 2 base model is trained using what we would today refer to as a "censored" dataset. Stable Diffusion 1 dataset included NSFW images, the base model doesn't seem particularly biased towards or away from them and can be further trained in either direction as it has the foundational understanding of what those things are.

load more comments (1 replies)