this post was submitted on 13 Jul 2023
263 points (95.8% liked)

Technology


A lawsuit claims Google took people's data without their knowledge or consent to train its AI products, including chatbot Bard.

[–] fubo 7 points 1 year ago (1 children)

If I read a bunch of copyrighted books, and answer questions based on the knowledge I have acquired from them, I do not owe the authors anything.

[–] [email protected] 1 points 1 year ago

TLDR: maybe it’s like a library? Libraries pay for books, even digital copies.

Presumably somebody bought a copy of the book, even if you found it on the coffee table.

This seems more like going through the trash for anything legible, reading billboards and taking free newspapers. It just happens that a lot of the stuff put out at the curb was copyrighted material. In fact, almost every website has © in the footer, so clearly the sentiment is “don’t copy my original content”, especially without credit. But if the AI is not reproducing, in whole or in part, the copyrighted material, then it does seem a bit late to try to claw back value just because someone else found a way to monetize what you put out on the open web. I think that’s what’s going to have to be proven, one way or another.

Maybe another way to look at an LLM is as an enormous library, but instead of borrowing books and periodicals, as a user you are borrowing the pre-digested knowledge directly. Libraries have complex agreements in place with publishers, so that rights holders are compensated. Say what you will about these contracts, but they are a precedent. What is perhaps without precedent is how to handle the rest of the trash this library is indiscriminately gathering up.