Technology

62973 readers

3939 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

328

The New York Times sues OpenAI and Microsoft for copyright infringement (edition.cnn.com)

submitted 1 year ago by L4s to c/technology

50 comments fedilink hide all child comments

The New York Times sues OpenAI and Microsoft for copyright infringement::The New York Times has sued OpenAI and Microsoft for copyright infringement, alleging that the companies’ artificial intelligence technology illegally copied millions of Times articles to train ChatGPT and other services to provide people with information – technology that now competes with the Times.

you are viewing a single comment's thread
view the rest of the comments

[–] kromem 29 points 1 year ago* (last edited 1 year ago) (2 children)

What's the value of old journalism?

It's a product where the value curve is heavily weighted towards recency.

In theory, the greatest value theft is when the AP writes a piece and two dozen other 'journalists' copy the thing changing the text just enough not to get sued. Which is completely legal, but what effectively killed investigative journalism.

A LLM taking years old articles and predicting them until it can effectively learn relationships between language itself and events described in those articles isn't some inherent value theft.

It's not the training that's the problem, it's the application of the models that needs policing.

Like if someone took a LLM, fed it recently published news stories in the prompts with RAG, and had it rewrite them just differently enough that no one needed to visit the original publisher.

Even if we have it legal for humans to do that (which really we might want to revisit, or at least create a special industry specific restriction regarding), maybe we should have different rules for the models.

But to try to claim a LLM that's allowing coma patients to communicate or to problem solve self-driving algorithms or to diagnose medical issues is stealing the value of old NYT articles in its doing so is not really an argument I see much value in.

[–] jacksilver 10 points 1 year ago (1 children)

Except no one is claiming that LLMs are the problem, they're claiming GPT, or more specifically GPTs training data, is the problem. Transformer models still have a lot of potential, but the question the NYT is asking is "can you just takes anyone else's work to train them".

[–] kromem 3 points 1 year ago

There's a similar suit against Meta for Llama.

And yes, we will end up seeing as the dust settles if training a LLM is fair use in case law.

[–] ChucklesMacLeroy 2 points 1 year ago

Really gave me a whole new perspective. Thanks for that.