this post was submitted on 07 Mar 2024
574 points (97.7% liked)
Technology
59887 readers
2844 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Let's pretend for a moment that we know that Reddit has any sort of decent versioning system, and that it keeps the old versions of your comments alongside the newer ones, and that it's feeding the LLM with the old version. (Does it? I have my doubts, given that Reddit Inc. isn't exactly competent.)
Even then, I think that it's sensible to use this tool, to scorch the earth and discourage other human users from adding their own content to that platform. It still means less data for Google to say "it's a bunch of users, who cares about the intellectual property of those filthy things? Their data is now my data. Feed it ~~to the wolves~~ to Gemini".
Let's also pretend that reddit isn't a cesspool of bots, marketing campaigns, foreign agents, incels, racists, Republicans, gun nuts, shit posters, trolls...the list goes on.
Is it even that valuable? It didn't take long for that Microsoft bot to turn into Hitler, feeding reddit into an "AI" is like speed running Ultron.
It's still somewhat valuable due to the size of the corpus (it's huge) and because people used to share technical expertise there.