this post was submitted on 01 Feb 2024
586 points (97.3% liked)

Memes

45727 readers
1403 users here now

Rules:

  1. Be civil and nice.
  2. Try not to excessively repost, as a rule of thumb, wait at least 2 months to do it if you have to.

founded 5 years ago
MODERATORS
586
very upsetting (lemmy.ml)
submitted 9 months ago* (last edited 9 months ago) by [email protected] to c/[email protected]
 

captiona screenshot of the text:

Tech companies argued in comments on the website that the way their models ingested creative content was innovative and legal. The venture capital firm Andreessen Horowitz, which has several investments in A.I. start-ups, warned in its comments that any slowdown for A.I. companies in consuming content “would upset at least a decade’s worth of investment-backed expectations that were premised on the current understanding of the scope of copyright protection in this country.”

underneath the screenshot is the "Oh no! Anyway" meme, featuring two pictures of Jeremy Clarkson saying "Oh no!" and "Anyway"

screenshot (copied from this mastodon post) is of a paragraph of the NYT article "The Sleepy Copyright Office in the Middle of a High-Stakes Clash Over A.I."

you are viewing a single comment's thread
view the rest of the comments
[–] jamyang 9 points 9 months ago (1 children)

Tech illiterate guy here. All these Ml models require training data, right? So all these AI companies that develop new ML based chat/video/image apps require data. So where exactly do they? It can't be that their entire dataset is licensed, isn't it?

If so, are there any firms that are using these orgs for data theft? How to know if the model has been trained on your data? Sorry if this is not the right place to ask.

[–] Dkarma 9 points 9 months ago* (last edited 9 months ago) (1 children)

You know how you look at a pic on the internet and don't pay? The AI is basically doing the same thing only it's collecting the effect of the data points ( like pixels in a picture) more accurately. The input no matter what it is only moves a set of weights. That's all. It does not copy anything it is trained on.

Yes it can reproduce with some level of accuracy any work just like a painter or musician could replay a piece they see or hear.

Again, this is not theft any more than u hearing a Song or viewing a selfie.

[–] jamyang 1 points 9 months ago (1 children)

only it’s collecting the effect of the data points ( like pixels in a picture) more accurately

Isn't that the entire point of creativity. though? What separates an artist from a bad painter is the positioning of pixels on a 2-Dimensional plane? If the model collects the positions of pixels together with the pixel RGB (color? Don't know the technical term for it), then the model is effectively stealing the "pixel configuration and makeup" of that artist which can be reproduced by the said model anywhere if similar prompts were passed to it?

[–] Dkarma 2 points 9 months ago

Focus. We are talking about copyright. Copyright doesn't cover this at all.