If it's publicly accessible it's scrape-able. He already tried to make tweets not publicly accessible and look how long that lasted.
It’s not even clear that it’s illegal to scrape publicly available data, so I don’t know what the TOS would be enforcing.
It doesn't have to be illegal for them to sue you for violating their policies. It would be a civil suit for damages.
Lol, then they would have to demonstrate that there were damages. The worst a TOS violation will get you is a ban.
Bandwidth costs.
Peanuts.
It's still a billionaire suing you for those peanuts.
Suing for 50 dollars or so? Fine.
Unfortunately, they have more money to blow on legal fees. The threat of a suit is enough to keep most people from fucking around and finding out.
Would it be violating policy if the scraper only took the public data and never actually agreed to the ToS, since they didn't make an account?
It's not publicly available anymore. If you're not logged in you don't see anything anymore except tweets you have a direct link to. Even then you don't see any replies, and the number of tweets you can see per day is limited.
Yep, I would never remember that on the odd occasions I looked at Twitter, then I'd just leave the site after being prompted to log in.
Lol this has the same energy as those NFT idiots crying about people taking screenshots of their stupid monkeys.
Noooo, stop scraping my dataaaa!
Never waste an opportunity to use "muh" in place of "my"
No thanks!
And the next generation of AI probably only needs a fraction of the data it needs now so the need to scrape the data is gone.
I hope someone makes some manic bot that scrapes every last tweet and posts it on a duplicate site called Y
Might as well go whole hog and do the entire alphabet. Then do one for every iteration of every letter combination.
He's still mad at those researchers for scraping the data that shows that ever since he took over, the antisemitism, racism and general bigotry has gone up on the platform.
Good luck enforcing that.
Let the Supreme Court enforce it ;)
It should, but it won't.
I’m pretty sure both parties must agree to the terms before they legally bind anyone so wouldn’t this just apply to logged in users?
Accessing the website is often viewed as accepting the terms, so that wouldn't hold up. Not that they'd have a legal standpoint on the issue.
Accessing the website is often viewed as accepting the terms
The scraping bot can't read the terms
But even if it could, it wouldn't give a damn :-)
By reading this message you agree to my terms that I'm really cool
Lol, and your username
How do you read the terms without accessing their website?
How on earth do they plan on enforcing that? xD
They don't have to enforce it. If someone says bad things about Twitter by analysing their content, Twitter can sue them for scraping.
“Our interns spent 500 hours collecting the raw data”.
This is hilariously unenforceable as long as Twitter is on the public internet.
I thought this was an article about the X Windows system based on the preview for the article. Boy are those two similar-looking.
Realistically, very few people know about the X Window System, and even fewer care about it.
You could always join wayland.social
Or just stop using X altogether.
Crawling for me, not thee!
Just update robots.txt, coward!
Took a look at their robots.txt, it appears to block all bots except Google.
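For anyone curious how a "block everyone except Google" rule plays out, here's a rough sketch using Python's standard `urllib.robotparser`. The `SAMPLE_ROBOTS_TXT` below is a made-up example of that pattern, not a copy of their actual file, and `MyScraper`, `is_allowed` and the example.com URL are just placeholders. Also keep in mind robots.txt is only a request: a bot that doesn't check it is not stopped by it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt that allows Googlebot and disallows every other bot.
# This is an illustrative pattern, NOT the real file from any site.
SAMPLE_ROBOTS_TXT = """\
User-agent: Googlebot
Allow: /

User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.com/some/page"  # placeholder URL
    for agent in ("Googlebot", "MyScraper"):
        print(agent, is_allowed(SAMPLE_ROBOTS_TXT, agent, page))
```

With the sample rules above, `Googlebot` is allowed and the made-up `MyScraper` is not, but only a crawler that actually runs a check like this will honor it.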
don't!
You heard him, scrape more.
So he’s going to sue google then?