this post was submitted on 23 Dec 2024
122 points (96.2% liked)

Fuck AI

This is a proposal by some AI bro to add a file called llms.txt that contains a version of your website's text that is easier for LLMs to process. It's a similar idea to the robots.txt file for web crawlers.

Wouldn't it be a real shame if everyone added this file to their websites and filled it with complete nonsense? Apparently you only need to poison 0.1% of the training data to get an effect.

[email protected] 25 points 2 months ago (3 children)

We could respect this convention the same way the IA webcrawlers respect robots.txt 🤷‍♂️

[email protected] 9 points 2 months ago (1 children)

Do webcrawlers from places other than Iowa respect that file differently?

[email protected] 10 points 2 months ago (2 children)

Sorry: Intelligence Artificielle <=> Artificial Intelligence

[email protected] 4 points 2 months ago

No worries. I was just making a joke.

[email protected] 1 points 2 months ago

๐ŸŽ๐Ÿง 

[email protected] 4 points 2 months ago

I've had a page that bans by IP listed as "don't visit here" in my robots.txt file for seven months now. It's not listed anywhere else. I have no banned IPs on there yet. Admittedly, I've only had 15 visitors in the past six months though.
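
For anyone who wants to try the same trick, a trap like that could look roughly like this. It is only a minimal sketch, not the commenter's actual setup: the `/trap/` path, the `Honeypot` class, the `polite_crawler_allows` helper, and the in-memory ban set are all made up for illustration. A real site would do the banning in its web server or firewall.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical trap path: mentioned only in robots.txt, linked from nowhere.
TRAP_PATH = "/trap/"

# The robots.txt served by the site. Every crawler is asked to stay out.
ROBOTS_TXT = f"User-agent: *\nDisallow: {TRAP_PATH}\n"


class Honeypot:
    """Bans any client IP that requests the disallowed trap path."""

    def __init__(self):
        self.banned = set()  # in-memory only; an assumption for this sketch

    def handle(self, ip, path):
        """Return an HTTP-style status code for one request."""
        if ip in self.banned:
            return 403
        if path.startswith(TRAP_PATH):
            # Only a client that read robots.txt and ignored it ends up here.
            self.banned.add(ip)
            return 403
        return 200


def polite_crawler_allows(path, agent="ExampleBot"):
    """What a crawler that honors robots.txt would decide for `path`."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, path)
```

A crawler that honors the file never requests anything under `/trap/`, so it never gets banned. Only a client that ignores the rule (or uses robots.txt as a list of places to look) trips it, and after that all of its requests are refused.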

draughtcyclist 2 points 2 months ago

Seriously. I've never seen a convention so aggressively ignored. This isn't the brilliant idea some think it is.