this post was submitted on 17 Mar 2025

464 points (98.7% liked)

Technology

66939 readers

5599 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

464

China announces plan to label all AI-generated content with watermarks and metadata. (english.news.cn)

submitted 2 days ago by [email protected] to c/technology

50 comments fedilink hide all child comments

China has released a set of guidelines on labeling internet content that is generated or composed by artificial intelligence (AI) technology, which are set to take effect on Sept. 1.

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 61 points 2 days ago (3 children)

This is a smart and ethical way to include AI into everyday use, though I hope the watermarks are not easily removed.

[–] Cocodapuf 4 points 1 day ago (1 children)

I'm going to develop a new AI designed to remove watermarks from AI generated content. I'm still looking for investors if you're interested! You could get in on the ground floor!

[–] BreadstickNinja 1 points 17 hours ago

I've got a system that removes the watermark and adds two or three bonus fingers, free of charge! Silicon Valley VC is gonna be all over this.

[–] [email protected] 37 points 2 days ago* (last edited 2 days ago) (1 children)

Think a layer deeper how can it misused to control naratives.

You read some wild allegation, no AI marks (they required to be visible), so must written by someone? Right? What if someone, even the government jumps out as said someone use an illiegal AI to generate the text? The questioning of the matter will suddently from verifying if the allegation decribed happened, to if it itself is real. The public sentiment will likely overwhelmed by "Is this fakenews?" or "Is the allegation true?" Compound that with trusted entities, discrediting anything become easier.

Give you a real example. Before Covid spread globally there was a Chinese whistleblower, worked in the hospital and get infected. He posted a video online about how bad it was, and quickly got taken down by the government. What if it happened today with the regulation in full force? Government can claim it is AI generated. The whistleblower doesn't exist. Nor the content is real. 3 days later, they arrested a guy, claiming he spread fakenews using AI. They already have a very efficient way to control naratives, and this piece of garbage just give them an express way.

You though that only a China thing? No, every entities including governments are watching, especially the self-claimed friend of Putin and Xi, and the absolute free speech lover. Don't think it is too far to reach you yet.

[–] [email protected] 16 points 2 days ago (1 children)

It's still a good thing. The alternative is people posting AI content as though it is real content, which is a worldwide problem destroying entire industries. All AI content should by law have to be clearly labeled.

[–] [email protected] 7 points 2 days ago* (last edited 2 days ago) (1 children)

Then what AI generated slop without label are to the plain eyes? That label just encourge the laziness of the brain as an "easy filter." Those slop without label just evelated itself to be somewhat real, becuase the label exist exploiting the laziness.

Before you said some AI slop are clearly identifiable, you can't rule out everyone can, and every piece are that identifiable. And for those images that looks a little unrealistic, just decrease the resolution to very grainy and hide those details. That will work 9 out of 10. You can't rule out that 0.1% content that pass sanity check can't do 99.9% damage.

After all, human are emotional creatures, and sansationism is real. The urge of share something emotional is why misinformation and disinformation are so common these days. People will overlook details when the urge hits.

Somethimes, labeling can do more harm than good. It just give a false sense.

[–] [email protected] 6 points 2 days ago (1 children)

Just because something is theoretically circumventable doesn't mean we shouldn't make it as hard as possible to circumvent it.

The reason why misinformation is so common these days is because of concerted effort by fascists to obtain control over media companies. Once they are in power and have significant influence within those companies they can poison them, turning them into massive misinformation engines churning out content at a pace even faster than we ever believed possible. This problem has existed since the rise of mass media especially in the 19th century. But social media presents far faster and more direct throughlines to spreading misinformation to the masses.

And those masses do not care if something is labeled as AI or not. They will believe it one way or the other. This still doesn't change that it is necessary to directly label AI generated content as such. What is and isn't made by a human is extremely important. We cannot equate algorithms with people, and it's necessary to make that distinguishment as clearly as possible.

[–] [email protected] 4 points 2 days ago* (last edited 2 days ago) (1 children)

The problem is you can't make a digital label that hard to circumvent. Much like a signature, you sign something you want to prove it is genuinely from you, but you won't sign something that's not from you while not signing things that are, especially in digital format. Digital signature can just be stripped out of the data. Watermarks on images can now patched with the help of inpainting models. Disclaimers in text can just be deleted. The default shouldn't be "This thing doesn't have an AI label so it would be written by human." The label itself it a slippery slope that helps misinformation spread faster and aid building alternate facts. Adding a label won't help people identify contents generated with ML models, but let them defer the identification to that mere label because it said so, or didn't.

Misinformation didn't spread fast simply because fascists obtained controls on medias. Just look at how China, Russia, and Iran launch misinformation campaigns. They didn't have to control those media, but some seed accounts that make sensational title that attracts people in more powerful position and recognition to spread it out. For more info on misinformation and disinformation, I recommend you watch Ryan McBeth's video on YT.

Yes, we need a way to identify what is and what not generated by ML models, but that should not be done by labeling ML contents.

[–] [email protected] 1 points 2 days ago (1 children)

I'm curious what you would suggest to aid identifying generated content if not clear labeling. Sure its circumventable but again its more than what already exists. It provides legal precedence for repercussions to companies trying to pass off AI generated content as human created.

[–] [email protected] 2 points 2 days ago* (last edited 2 days ago)

Please allow me to have a little bit of time deep thoughts and organize myself. It might take a while, but I will give you a response.

[–] [email protected] 7 points 2 days ago (3 children)

It will be relatively easy to strip that stuff off. It might help a little bit with internet searches or whatever, but anyone spreading deepfakes will probably not be stopped by that. Still better than nothing, I guess.

[–] [email protected] 1 points 3 hours ago

You can use things like steganography to embed data into the AI output.

Imagine a text has certain letters in certain places which can give you a probability rating that it's AI generated, or errant pixels of certain colors.

Printers already do something like this, printing imperceptible dots on pages.

[–] spankmonkey 13 points 2 days ago

Having an unreliable verification method is worse than nothing.

[–] [email protected] 2 points 2 days ago

it will be relatively easy to strip off

How so? If it's anything like llm text based "water marks" the watermark is an integral part of the output. For an llm it's about downrating certain words in the output, I'm guessing for photos you could do the same with certain colors, so if this variation of teal shows up more than this variation then it's made by ai.

I guess the difference with images is that since you're not doing the "guess the next word" aspect and feeding the output from the previous step into the next one, you can't generate the red green list from the previous output.