this post was submitted on 27 Jun 2023

datahoarder


Who are we?

We are digital librarians. Among us are represented the various reasons to keep data -- legal requirements, competitive requirements, uncertainty of permanence of cloud services, distaste for transmitting your data externally (e.g. government or corporate espionage), cultural and familial archivists, internet collapse preppers, and people who do it themselves so they're sure it's done right. Everyone has their reasons for curating the data they have decided to keep (either forever or For A Damn Long Time). Along the way we have sought out like-minded individuals to exchange strategies, war stories, and cautionary tales of failures.

We are one. We are legion. And we're trying really hard not to forget.

-- 5-4-3-2-1-bang from this thread

founded 4 years ago

Here's the bottom line....

  • Reddit exists to serve you ads and to farm and sell your data.
  • Reddit doesn't like or support your data hoarding.
  • Reddit only cares if you're making them money.
  • Reddit says one thing and does another.
  • Reddit will strip and ban mods that aren't willing to bend over.

We could go on, but you get the point... You have no say here, you lick the boots or fuck you.


So the API is about to be shafted, many apps/bots will die, other things will change; you know what's up. But the more important thing for the DataHoarding community is that Reddit has now very effectively killed Pushshift from a data hoarding perspective. Pushshift was the only place you could get the most complete, up-to-date Reddit data in bulk.

Reddit has now taken control of Pushshift, had them delete bulk data downloads, prevented them from releasing new dumps, and limited PS API access to only those mods Reddit approves of.


/r/DataHoarder moving forward....

We will continue to exist and operate as we have for as long as Reddit allows us to. We will promote alternatives for those of you who wish to leave and find DataHoarder communities elsewhere. We will promote every project, tool and download that seeks to keep Reddit data available to both DataHoarders and researchers. We will continue to hoard. We will not hit any fucking delete buttons.

New rule.

We see a lot of basic, vaguely DH-related tech support questions here, and we're going to be more actively removing these posts. Many of them also clearly break rule 1, as they're asked every other week.

Sidebar updates.


Happy Hoarding.

iempqob4 7 points 1 year ago

The past few weeks I've seriously missed the content over at DataHoarder.

Was nice to see mention of this community over here, hope to see more activity soon.