this post was submitted on 11 Jun 2023
41 points (97.7% liked)

Self Hosted - Self-hosting your services.

11401 readers
1 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules

Important

Beginning of January 1st 2024 this rule WILL be enforced. Posts that are not tagged will be warned and if not fixed within 24h then removed!

Cross-posting

If you see a rule-breaker please DM the mods!

founded 3 years ago
MODERATORS
 

If I were to create my own instance federated with all the other instances, as of today, how much data would I be storing, since I would make a copy of all the content?

I know this will vary a lot, but I’m looking for a ballpark figure to have an idea. I don’t think it would be a lot, but I can’t find an estimate anywhere.

Reposted from https://lemmy.world/post/55030 as I think this community is probably a better fit

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 9 points 1 year ago* (last edited 1 year ago) (1 children)

Our smaller instance that has been federating for a bit more than a year now (started in March 2022) is now at 2.4gb for the database and 7gb for the image storage (which probably needs some clean-up from previous image spam waves).

[–] [email protected] 3 points 1 year ago (1 children)

@poVoq @ndr this will surely grow a lot, if you look at other activity pub compatible systems you'll see a huge grow, it depends on the retention of "old" post and media, if you say just store all for a year you might keep it smaller, but if you want to replace #Reddit or so it would be better to keep stuff a bit longer, but then on the other hand the #Fediverse is probable not meant to store stuff for long term.

On my Friendica node I have a rather short period to store foreign posts and media, and my storage is only about 47 GB, most of the media is stored in the database as well (easier and faster to backup, much slower to retrieve) and it is a single user instance with just a handful of bots besides the account I write from.

[–] [email protected] 0 points 1 year ago (1 children)

Keeping more of the history is probably a good thing if we want to replace Reddit. Think of all the homelab/server posts you've used that are over a year old. Good info can last a while.

[–] [email protected] 2 points 1 year ago (1 children)

@sedawk probably... maybe threads can be closed and then some data can be removed.

This is a culprit of the Fediverse players, they're unlikely to keep all data forever (as some services do, at least until they're gone for good like Google Waves). Storage costs money, the need will grow forever, maybe some more cost effective storage can be used for old data/posts/threads/media, just like internet archive does, they don't used fast storage, so it takes seconds to load old website versions. But that also seems like a big leap for amateur technology enthusiasts (like most admins of Fediverse systems are).

[–] [email protected] 1 points 1 year ago

Does the Fediverse not allow closing out threads after a while? I just assumed they had an "age out" feature.

Once something ages out it could be dumped as static data basically and compressed even.