this post was submitted on 23 Dec 2024
122 points (96.2% liked)

Fuck AI

1772 readers
209 users here now

"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

founded 10 months ago
MODERATORS
 

This is a proposal by some AI bro to add a file called llms.txt that contains a version of your websites text that is easier to process for LLMs. Its a similar idea to the robots.txt file for webcrawlers.

Wouldn't it be a real shame if everyone added this file to their websites and filled them with complete nonsense. Apparently you only need to poison 0.1% of the training data to get an effect.

you are viewing a single comment's thread
view the rest of the comments
[โ€“] [email protected] 25 points 1 month ago (3 children)

We could respect this convention the same way the IA webcrawlers respect robot.txt ๐Ÿคทโ€โ™‚๏ธ

[โ€“] [email protected] 9 points 1 month ago (1 children)

Do webcrawlers from places other than Iowa respect that file differently?

[โ€“] [email protected] 10 points 1 month ago (2 children)

Sorry: Intelligence Artificielle <=> Artificial Intelligence

[โ€“] [email protected] 4 points 1 month ago

No worries. I was just making a joke.

[โ€“] [email protected] 1 points 1 month ago

๐ŸŽ๐Ÿง 

[โ€“] [email protected] 4 points 1 month ago

I've had a page that bans by ip listed as 'dont visit here' on my robots.txt file for seven months now. It's not listed anywhere else. I have no banned IPs on there yet. Admittedly, i've only had 15 visitors in that past six months though.

[โ€“] draughtcyclist 2 points 1 month ago

Seriously. I've never seen a convention so aggressively ignored. This isn't the brilliant idea some think it is.