this post was submitted on 03 Jul 2023
220 points (97.4% liked)

No Stupid Questions

I know that Lemmy is open source and it can only get better from here on out, but I do wonder if any experts can weigh in on whether the foundation is well written. Or are we building on top of 4 years' worth of tech debt?

[–] [email protected] 19 points 2 years ago (3 children)

I disagree that it being a monolith is immediately a problem, but also

In fact you scale a monolith the same way you scale micro services.

This is just not true. With microservices, it is easy to scale out individual services to multiple instances as demand requires them. Hosting a fleet of entire Lemmy instances is far more expensive than just small slices of it that may require the additional processing power.

[–] [email protected] 16 points 2 years ago (2 children)

What microservices would you split Lemmy into? The database, image hosting and the UI are already separate.

[–] [email protected] 4 points 2 years ago (1 children)

Well, I'm going to start by repeating that I don't agree that being monolithic is necessarily a problem right now.

The immediate thought in my mind would be all of the federation logic. That's where all of the instances seem to be lagging behind, and it seems the common fix is "just increase the workers to one billion". Apparently that does something meaningful, but the developer in me wants to know how a few cores can put so many workers to use.
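One possible answer to the "how can a few cores use so many workers" question, assuming the workers spend most of their time waiting on the network rather than computing: waiting costs almost no CPU, so many workers can share one core. The sketch below is not Lemmy's code (Lemmy is written in Rust); `deliver` and `run_workers` are made-up names, and `asyncio.sleep` stands in for the HTTP request to a remote instance.

```python
import asyncio
import time


async def deliver(activity_id: int, latency: float) -> int:
    # Stand-in for sending one activity to a remote instance. The worker
    # only waits here, so it needs almost no CPU time.
    await asyncio.sleep(latency)
    return activity_id


async def run_workers(n_workers: int, n_activities: int, latency: float) -> float:
    """Drain a queue of activities with n_workers and return elapsed seconds."""
    queue: asyncio.Queue = asyncio.Queue()
    for i in range(n_activities):
        queue.put_nowait(i)

    async def worker() -> None:
        while True:
            try:
                activity_id = queue.get_nowait()
            except asyncio.QueueEmpty:
                return  # nothing left to deliver
            await deliver(activity_id, latency)

    start = time.perf_counter()
    await asyncio.gather(*(worker() for _ in range(n_workers)))
    return time.perf_counter() - start


if __name__ == "__main__":
    # Same single thread every time; only the number of workers changes.
    for n in (1, 10, 100):
        elapsed = asyncio.run(run_workers(n, 100, 0.01))
        print(f"{n:>3} workers: {elapsed:.2f}s")
```

If the work were CPU-bound instead (say, heavy signature checking or serialization), adding workers beyond the core count would not help in the same way, which is why it matters where the time actually goes.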

Spinning federation off into a microservice means you could deploy it on something like Cloud Run or AWS ECS, and have it autoscale as the workload demands it. Seems like a pretty prime candidate to me.

[–] Fauzruk 2 points 2 years ago

To me this sounds like a code / DB problem more so than a monolith vs microservice issue. You can totally run only the worker part of a monolith inside AWS ECS and have it autoscale; this is not specific to microservices.

[–] BURN 1 points 2 years ago* (last edited 2 years ago)

I’d split it out into a few systems

  • Signup/Login/Account management
  • Posting/Commenting/Voting
  • Moderation Controls
  • DB Readers and Writers in different services
  • Community management (may get lumped into moderation controls, but separation of duties may be needed)
  • Edit: Federation is a big one I keep forgetting about

The ultimate goal, and I don’t know how possible it is with Rust, would be to have a way to run those either as individual services or as parts of one larger monolith. Smaller instances would be able to run it as one binary, while larger instances like lemmy.world or lemmy.ml could run each part independently. That would allow easier scaling on large instances while (hopefully) leaving it just as simple to deploy on a small scale too.

There’s no reason to split it into 100 different services, but 4-5 mid-sized ones might help.
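A minimal sketch of the "one binary or several services" idea, in Python for brevity rather than Rust. Everything here is hypothetical: the `APP_ROLES` variable, the role names, and the `run_*` functions are illustrations, not anything Lemmy actually provides. The same program starts every part by default and only the selected parts when told to.

```python
import os


def run_api() -> str:
    return "api: serving HTTP requests"


def run_federation() -> str:
    return "federation: delivering outbound activities"


def run_scheduler() -> str:
    return "scheduler: running periodic tasks"


# Every part the codebase knows how to run, keyed by role name.
ROLES = {"api": run_api, "federation": run_federation, "scheduler": run_scheduler}


def parse_roles(raw: str) -> list[str]:
    """Turn 'api,federation' into a validated list; 'all' selects every role."""
    if raw.strip().lower() == "all":
        return list(ROLES)
    names = [part.strip().lower() for part in raw.split(",") if part.strip()]
    unknown = [name for name in names if name not in ROLES]
    if unknown:
        raise ValueError(f"unknown role(s): {', '.join(unknown)}")
    return names


def start(raw_roles: str) -> list[str]:
    # A real service would spawn long-running tasks here; this sketch
    # just reports which parts would be started.
    return [ROLES[name]() for name in parse_roles(raw_roles)]


if __name__ == "__main__":
    # Small instance: leave APP_ROLES unset and run everything in one process.
    # Large instance: start one process with APP_ROLES=api, another with
    # APP_ROLES=federation, and scale them separately.
    for line in start(os.environ.get("APP_ROLES", "all")):
        print(line)
```

The same switch also gives a way to make sure something like a scheduler runs in only one process while the other parts are replicated.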

[–] [email protected] 2 points 2 years ago* (last edited 2 years ago) (1 children)

Lemmy's backend is native code, not run in a virtual machine or interpreted from text like most backends (Java and PHP are still so, so popular, as is Python). You're not going to pay much extra for an extra megabyte or 10 of RAM being used per instance for the extra code sitting idle. It certainly shouldn't use much processing power when not in use.

[–] [email protected] 4 points 2 years ago (1 children)

I'd be less concerned with memory (of which Lemmy seems to use very little), and much more concerned with CPU core count. I touched on it in my other comment, but I don't understand how a few cores are supposed to handle the ridiculous number of federation workers people are setting their instances to.

[–] [email protected] 1 points 2 years ago (1 children)

Does code that exists but isn't being executed often actually impact CPU usage that much?

The Linux kernel has like 30 million lines of code and you probably compile a fifth of it into your actual kernel binary by default. That's still several million lines, but it doesn't use much CPU at all. Rather, eliminating all the excess code keeps your size down so you can load it from disk in 0.0001 seconds instead of 0.0002.

Now if code that doesn't need to run often runs more often because we scaled the monolith horizontally, that IS a problem. But it's not inherent to the monolith design pattern; it's a specific instance of bad design.

[–] [email protected] 2 points 2 years ago* (last edited 2 years ago) (1 children)

Again, my knowledge of the Lemmy codebase is very small, and we could possibly host the monolith in microservices style. The point I am making is this (when it comes to scaling monolith vs microservices):

If the federation logic were split out, we could configure it to run on super tiny Docker containers on Google Cloud or AWS. Any time we needed it to, it would autoscale to handle the traffic. The configuration for these containers could be super minimal: little memory, no storage, and multiple weak CPU cores. This would be super affordable while still being able to handle as much traffic from federation as we ask it to. One of the cool things with Google Cloud Run is that it handles load balancing between containers for you (just point the federation traffic at the necessary URL).

IF Lemmy has things like background services, scheduled tasks, etc., this would significantly muddy the water (we would need each service to be able to handle being run on a multitude of instances, or we would need to be able to disable each one instance by instance). And if we just scaled by spinning up more instances of Lemmy, we would also need to ensure that only federation traffic is heading to the weaker instances that we spun up for that purpose, or we would need to ensure that each spun-up instance has enough resources to handle federation traffic along with the main application.

I feel like I need to state once more: I don't necessarily think Lemmy needs to move to microservices. Only that scaling monolith vs microservices is not necessarily the same.

[–] [email protected] 1 points 2 years ago (1 children)

I suppose that's completely fair - async workers tend to fare well as standalone services and are often split off even in monoliths. But splitting it might not actually win you THAT much compared to just scaling the whole thing. Not until we're talking like 100 runners to 1 API instance or something. It gives you a bit of additional flexibility, but won't necessarily make a huge difference in total resource cost. It is still a good idea, though, because it results in cleaner code and, as outlined before, tinier Docker images.

Also, Google Cloud Run is probably not a good idea for many instance owners. Autoscaling can lead to unexpected costs if set up by an amateur. But that's an unrelated can of worms.

[–] [email protected] 1 points 2 years ago

Autoscaling can lead to unexpected costs if set up by an amateur. But that’s an unrelated can of worms.

Agreed (along with all things cloud!). That said, Cloud Run does a good job of letting you define how many connections each container can handle, and a max number of containers (and min) to scale to.
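To show why those limits cap the bill, here is a deliberately simplified model of concurrency-based scaling. It is not Cloud Run's actual autoscaling algorithm, and `containers_needed` and its parameter names are made up for this example: the container count follows the number of in-flight requests but never leaves the configured range.

```python
import math


def containers_needed(in_flight: int, per_container: int,
                      min_containers: int, max_containers: int) -> int:
    """Containers a simple concurrency-based autoscaler would run.

    in_flight:      requests being handled right now
    per_container:  how many concurrent requests one container may take
    min_containers / max_containers: the configured scaling bounds
    """
    if in_flight < 0 or per_container < 1:
        raise ValueError("in_flight must be >= 0 and per_container >= 1")
    if not 0 <= min_containers <= max_containers:
        raise ValueError("need 0 <= min_containers <= max_containers")
    wanted = math.ceil(in_flight / per_container)
    # Clamp to the configured range: the maximum is what bounds the cost.
    return max(min_containers, min(max_containers, wanted))


if __name__ == "__main__":
    for load in (0, 50, 500, 50_000):
        print(load, containers_needed(load, 80, 0, 10))
```

With a maximum set, a traffic spike turns into queued or rejected requests rather than an unbounded number of containers, so the trade-off is availability versus cost.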

[–] sosodev 1 points 2 years ago (1 children)

You can easily scale a monolith. You typically horizontally replicate any web server (monolith or not) to handle whatever traffic you're getting. It shouldn't really matter what type of traffic it is. Plenty of the world's biggest websites run monoliths in production. You know how people used to say "Rails doesn't scale"? Well, they were wrong, because Rails monoliths are behind some huge companies like GitHub and Shopify.

The Lemmy backend is also quite lightweight and parallel, so it's cheap and effective to replicate.

In my professional experience microservices are usually a dumpster fire from both a dev and an ops perspective (I'm a Site Reliability Engineer).

[–] [email protected] 2 points 2 years ago

This doesn't really contradict what I said. I only wished to express that the two don't scale exactly the same way (in terms of execution).