this post was submitted on 06 Sep 2024
1730 points (90.2% liked)
Technology
66020 readers
8303 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
- Check for duplicates before posting, duplicates may be removed
- Accounts 7 days and younger will have their posts automatically removed.
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Maybe if you would pay for training data they would let you use copyright data or something?
Their business strategy is built on top of assumption they won't. They don't want this door opened at all. It was a great deal for Google to buy Reddit's data for some $mil., because it is a huge collection behind one entity. Now imagine communicating to each individual site owner whose resources they scrapped.
If that could've been how it started, the development of these AI tools could be much slower because of (1) data being added to the bunch only after an agreement, (2) more expenses meaning less money for hardware expansion and (3) investors and companies being less hyped up about that thing because it doesn't grow like a mushroom cloud while following legal procedures. Also, (4) the ability to investigate and collect a public list of what sites they have agreement with is pretty damning making it's own news stories and conflicts.
Had the company paid for the training data and/or left it as voluntary, there would be less of a problem with it to begin with.
Part of the problem is that they didn't, but are still using it for commercial purposes.
The other part of it is they broke the rules so they need to face the consequences. They are asking for forgiveness and in this case I don't think they deserve it.