this post was submitted on 28 Nov 2024
31 points (91.9% liked)

Hacker News

268 readers
328 users here now

RSS Feed of HackerNews

founded 2 months ago
MODERATORS
top 11 comments
sorted by: hot top controversial new old
[โ€“] [email protected] 26 points 6 days ago (1 children)

Can't really have an open federated protocol if you want to be able to prevent scraping ๐Ÿคท

[โ€“] [email protected] 2 points 5 days ago (1 children)

the protocol might be open but joining your own server is not open so not really like the fediverse.

[โ€“] [email protected] 1 points 5 days ago* (last edited 5 days ago) (1 children)

Weelll, depends on what you mean with "joining your own server." The model is definitely different from ActivityPub's, but lots of people on the network already have their own PDSs or "personal data servers". They do still go through Bluesky's relay, but nothing is really stopping people from running relays as such, it's just fairly costly (as in some hundreds of dollars per month) currently as they need to hold the full state and history of the network (but apparently that's being worked on.)

But it's definitely a federated protocol even though it's different from AP

[โ€“] [email protected] 1 points 5 days ago (1 children)

but you kind of said yourself that in practice its not really.

[โ€“] [email protected] 1 points 4 days ago (1 children)

Right, but that doesn't mean the protocol isn't federated

[โ€“] [email protected] 1 points 4 days ago (1 children)

yes but it makes federation pointless.

[โ€“] [email protected] 1 points 4 days ago

So just because someone else isn't running a relay now while they figure out how to make them less costly to run, it's pointless for the protocol to be partially federated already and support further federation?

[โ€“] PixelatedSaturn 9 points 6 days ago

Same with Mastodon.

[โ€“] [email protected] 5 points 5 days ago* (last edited 5 days ago) (1 children)

This isn't exactly surprising, is it? Odds are that Lemmy is also being scrapped the shit out of.

And it would be fine if it was a bunch of nobodies training their homebrew small language models, for the sake of whatever.

Except that it isn't - it's a bunch of big arse companies, with a "NEED MOAR DATAS!" approach, and more than enough money to bake the already too warm planet, since they struggle with the fact that those "things" called "humans" care about consent. "This thing didn't opt out, so training on its data is fair game!". Just to shove the tech back into the thing's throat, in the hopes that it makes the tech eventually profitable.

...I guess that my point is that this should be handled legally, not through closing down the protocol. The issue is not people scraping it, but who does it, and why.

[โ€“] buddascrayon 2 points 5 days ago

Anyone who doesn't believe that literally everything and anything they post on the internet is being scraped for LLM's is an idiot.

[โ€“] [email protected] 4 points 6 days ago

The nature of the beast.