this post was submitted on 04 Jul 2023
210 points (97.7% liked)

Technology

60133 readers
3044 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 2 years ago
MODERATORS
 

An update to Google's privacy policy suggests that the entire public internet is fair game for it's AI projects.

you are viewing a single comment's thread
view the rest of the comments
[–] CIA_chatbot 1 points 2 years ago (1 children)

I mean you can dread it all you want, because that is LITERALLY how it works today. Google, OpenAI and Microsoft already have multiple lawsuits for stealing people’s copyrights to train their LLMs.

Copyright is assigned automatically. If I make a blog post, that is automatically my copyrighted material. As the creator I get to choose how it’s used, not Google

If I took some proprietary Google code and used it without permission you know damn well they would sue my ass into oblivion. Copyright has to protect the small as well as the giant.

[–] drmoose 1 points 2 years ago* (last edited 2 years ago) (1 children)

I don't think you understand.

Let's imagine everything is copyrighted. Who will be able to create LLMs now? Google/Meta who can afford to literally hire thousands of people on below minimum wage creating annotations or smaller companies and free projects? You are literally empowering the thing you're complaining about.

Public data is public and that's good for general balance. It removed the moats.

[–] CIA_chatbot 1 points 2 years ago (1 children)

I don’t think you understand? You’re talking about some “information must be free Star Trek future” that doesn’t exist. I’m talking about the exact legal framework that exists today.

If I write a short story somewhere, why the fuck should someone be able to profit off of it because they pointed a bot at my site? How do you prevent giant corps from eventually squashing and owning everything?

Just because something is publicly accessible doesn’t mean it’s public. I would maybe start here

https://www.copyright.gov/help/faq/faq-general.html#protect

If Google or Meta wants to make an LLM off my content, they can fucking have the decency to ask or pound sand. Adding a clause to some policy somewhere doesn’t auto-magically remove my legal rights

[–] drmoose 1 points 2 years ago

If I write a short story somewhere,

it's a two way street - if you want to benefit from the free flow of information (your story being public) you should also bear the costs. I feel we've reached the end of this thread so lets just agree to disagree. Maybe my distaste for copyrighting information is too great here for you to convince me otherwise :)