this post was submitted on 28 Nov 2024
31 points (91.9% liked)

Hacker News

268 readers
328 users here now

RSS Feed of HackerNews

founded 2 months ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[โ€“] [email protected] 5 points 5 days ago* (last edited 5 days ago) (1 children)

This isn't exactly surprising, is it? Odds are that Lemmy is also being scrapped the shit out of.

And it would be fine if it was a bunch of nobodies training their homebrew small language models, for the sake of whatever.

Except that it isn't - it's a bunch of big arse companies, with a "NEED MOAR DATAS!" approach, and more than enough money to bake the already too warm planet, since they struggle with the fact that those "things" called "humans" care about consent. "This thing didn't opt out, so training on its data is fair game!". Just to shove the tech back into the thing's throat, in the hopes that it makes the tech eventually profitable.

...I guess that my point is that this should be handled legally, not through closing down the protocol. The issue is not people scraping it, but who does it, and why.

[โ€“] buddascrayon 2 points 5 days ago

Anyone who doesn't believe that literally everything and anything they post on the internet is being scraped for LLM's is an idiot.