this post was submitted on 06 Jun 2024
813 points (98.8% liked)
Technology
59673 readers
3161 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
What are the ways that US domains can block AI? I figure pay walls, and captchas, but is there something we can add to robots.txt that has any teeth against AI scraping? I mean would we even know if they obeyed it anyway? How do we set traps and keep this shit out?
Capthchas haven't worked against serious actors for years and companies could easily pay for a user account. Anything a normal tech illiterate person can do, companies can automate. You sort of have to trust their pinky promise of not scraping content.