this post was submitted on 17 Feb 2024
Technology
It will get trained on some comment posts.
And what's to stop instance owners from selling their data?
Thanks to federation, the copies of the eggs are. You can’t stop one instance from selling data sourced from federated content until it’s too late.
The only thing stopping them is the fact that anyone who wants the data can just utilize the federation protocol to take any data they want, and there's not a lot anyone can do about it. You can't sell something that's trivial to get for free.
If the question you're really asking is "what's stopping content on Lemmy/Mastodon/etc from being used to train an LLM?" the answer is, nothing.
You can't put a price tag on it. Nothing is stopping anyone from scraping all of the data for free.
A mass user exodus to one of the many other identical instances. Also, data brokers prolly aren't interested in going after each instance, because no single instance has enough data to make it worthwhile. Yet again, the fediverse proves its resistance to enshittification.
Lmao, if it gets as big as Reddit then it's worth scraping. It's not the fediverse making it less worthwhile, just the size.
Yes, it's not worth running an instance! So let's all run one! LOL. It's so worth it. Fuck reddit.
You OK, bud?
shame
I wish they had evil lawyers looking after such stuff and sold strictly opt-in data to AI corps. Free for FOSS, though.