this post was submitted on 21 Nov 2024
332 points (96.6% liked)
Technology
59755 readers
2149 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I think people will still “contribute” because they also don’t care that their use of certain platforms leaks data used to target ads at them.
In the same vein though, once AI essentially destroys a site like Stack Overflow, where will AI companies source new training data with updated information? Also, we are likely to see something like 50% of content being AI generated. Are AI models then going to train on the content they themselves created? What is the impact of that? What is the use?
It leads to model collapse. The second AI starts to focuses on certain patterns in the output of the first AI instead of the actual content and you get degraded output. They are pattern matching machines after all. Repeat the cycle a few times and all output becomes gibberish. Think of it as data incest.
So the AI companies are pretty desperate for more fresh user data. More data is the only way they have currently to push through the diminishing returns.