this post was submitted on 07 Jul 2023
Technology
They could do classic web crawling, yes. But that is:
- super slow
- easy to detect
- easy to block
- illegal in many places for companies to do for the sake of selling shit, since the users have not given you consent to use their data
I think they try to pull the WhatsApp stunt here: when you sign up to WhatsApp, WhatsApp will send your whole contact list to Meta and update it on every change in order to "connect the phone numbers on your phone with WhatsApp users" (or so they say). They have structured this process in a way that they're not at fault, but the user is. Since the user "sent" them the numbers, they are not the ones who need consent to use the data, the user needed that. Same with the fediverse. "No. We didn't steal any data without consent! Our users should have had that consent when they subscribed to [email protected]! The data was pushed to us from there, we ain't doin' nothin' wrong!"
I don't think anyone needs consent to do research using your public posts though. For example, you can literally scrape the whole of Twitter and run sentiment analysis, and nobody can do anything about it.
Yes, you can. Yet that will not give you the interaction history (who liked what and such), and it is way less convenient than "set up ActivityPub in your own app real quick and have the whole fediverse send shit to me nicely formatted with interaction data ready to be used". Legal issues arise in some spots when doing web-scraping-things, like when you copy and use copyrighted imagery or happen to scrape stuff you weren't allowed to see for some reason.
All of those hurdles are out of the way automatically when you literally just use the inner workings of the service the data is from. No user can complain when Meta collects data sent to them via ActivityPub. That is literally what this protocol is used to do and the inner core of any application running it. If you don't want your data to be sent to other instances around the world: Don't use the protocol, right?
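To make the "nicely formatted with interaction data" point concrete, here is a rough sketch of what a receiving server could do with a single delivered activity. The payload follows the general shape of an ActivityStreams 2.0 "Like" activity, but the URLs, the `RAW_ACTIVITY` constant, and the `extract_interaction` and `_as_id` helpers are all made up for illustration. It is not how Meta or any particular instance actually processes its inbox.

```python
import json

# Hypothetical payload: the shape follows ActivityStreams 2.0 "Like"
# activities, but the actor and object URLs are invented for this example.
RAW_ACTIVITY = """
{
  "@context": "https://www.w3.org/ns/activitystreams",
  "type": "Like",
  "actor": "https://example.social/users/alice",
  "object": "https://example.world/post/12345"
}
"""


def _as_id(value):
    """Return the identifier of a reference.

    A reference may be a bare URL string or an embedded object
    that carries its URL under "id".
    """
    if isinstance(value, dict):
        return value.get("id")
    return value


def extract_interaction(raw):
    """Reduce one activity to a (kind, who, what) record."""
    activity = json.loads(raw)
    return {
        "kind": activity.get("type"),
        "who": _as_id(activity.get("actor")),
        "what": _as_id(activity.get("object")),
    }


interaction = extract_interaction(RAW_ACTIVITY)
print(interaction)
```

No HTML parsing or guessing is involved here: "who liked what" arrives as structured fields, which is the convenience the comment above is talking about.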
They can get the data in many different ways, this is just the most convenient one.
What does that do for Threads users though? They can't interact with the posts. It would make Threads suck more, as suddenly there is this group of users in some strange parallel universe that you can see but can't interact with, and they can't see you.
Interesting, thanks for that! I didn't think of it that way.