memes

11759 readers

2660 users here now

Community rules

1. Be civil

No trolling, bigotry or other insulting / annoying behaviour

2. No politics

This is non-politics community. For political memes please go to [email protected]

3. No recent reposts

Check for reposts when posting a meme, you can only repost after 1 month

4. No bots

No bots without the express approval of the mods or the admins

5. No Spam/Ads

No advertisements or spam. This is an instance rule and the only way to live.

A collection of some classic Lemmy memes for your enjoyment

Sister communities

[email protected] : Star Trek memes, chat and shitposts
[email protected] : Lemmy Shitposts, anything and everything goes.
[email protected] : Linux themed memes
[email protected] : for those who love comic stories.

founded 2 years ago

MODERATORS

Tenthrow

The_Picard_Maneuver

[email protected]

504

Education - It's about to get wild (lemmy.zip)

submitted 1 year ago by [email protected] to c/memes

35 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] jacksilver 25 points 1 year ago (3 children)

It's interesting, because people say they can only get better, but I'm not sure that's true. What happens when most new text data is being generated by LLMs or we accidentally start labeling images created through diffusion as real. Seems like there is a potential for these models to implode.

[–] FierySpectre 11 points 1 year ago (2 children)

They actually tested that, trained a model using only the outputs of the previous generation of model. It takes less iterations of that to completely lose quality than you'd think.

[–] jacksilver 4 points 1 year ago

Do you have any links on that, it was something I had wanted to explore, but never had the time or money.

[–] [email protected] 3 points 1 year ago

They go insane pretty quickly don't they? As in it all just become a jumble.

[–] [email protected] 5 points 1 year ago

Given that people quite frequently try and present AI generated content as real, I'd say this will be a huge problem in the future.

[–] danielbln 1 points 1 year ago

Microsoft has shown with Phi-2 (https://www.microsoft.com/en-us/research/blog/phi-2-the-surprising-power-of-small-language-models/) that synthetic data generation can be a great source for training data.