this post was submitted on 11 Feb 2025
236 points (98.8% liked)
Technology
62130 readers
7101 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
- Accounts 7 days and younger will have their posts automatically removed.
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
https://natlawreview.com/article/court-training-ai-model-based-copyrighted-data-not-fair-use-matter-law
It sounds like the case you mentioned had a government entity doing the annotation, which makes it public even though it's not literally the law.
Reuters seems to have argued that while the law and cases are public, their tagging, summarization and keyword highlighting is editorial.
The judge agreed and highlighted that since westlaw isn't required to view the documents that everyone is entitled to see, training using their copy, including the headers, isn't justified.
It's much like how a set of stories being in the public domain means you can copy each of them, but my collection of those stories has curation that makes it so you can't copy my collection as a whole, assuming my work curating the collection was in some way creative and not just "alphabetical order".
Another major point of the ruling seems to rely on the company aiming to directly compete with Reuters, which undermines the fair use argument.