this post was submitted on 24 Sep 2024
265 points (98.2% liked)
Technology
60346 readers
5030 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
...and everybody was shocked! Absolutely shocked.
Shocked? You'd think all the people outraged at having their websites scraped would be delighted. That's probably the real reason for this.
It's not the scraping itself, but the purpose of the scraping, that can be problematic. There are good reasons for public sites to allow scraping.
I have the distinct impression that a number of people would object to the purpose of re-hosting their content as part of a commercial service, especially one run by Google.
Anyway, now no one has to worry about Google helping people bypass their robots.txt or IP-blocks or whatever counter-measures they take. And Google doesn't have to worry about being sued. Next stop: The Wayback Machine.