Upcoming 0.18 upgrade, 404 errors and infrastructure costs

sunaurus@lemm.ee · edit-2 1 year ago

Upcoming 0.18 upgrade, 404 errors and infrastructure costs

xavier666@lemm.ee · 1 year ago

For storage, I can understand how horizontal scaling works (add more storage nodes to, say glusterfs). But how does it work for CPU? Since adding a 2CPU VM can be physically on another server, it would need lemmy to work in a highly distributed manner, i.e., CPU instructions need to cross the network.

Is this distributed feature a part of lemmy or is there another abstraction layer?

sunaurus@lemm.ee · edit-2 1 year ago

This is where our load balancer comes in. All requests go through the load balancer, and this load balancer will try to evenly distribute the requests to all of our backend servers.

Is this distributed feature a part of lemmy … ?

In fact it’s the opposite - Lemmy has so far had some assumptions built in to the code which make it quite hard to run on multiple servers. I have made some modifications in order to improve this (and contributed those modifications back to the main repo as well). It’s one of the things I want to keep improving as we grow.

xavier666@lemm.ee · edit-2 1 year ago

Here is my oversimplified understanding of the backend of lemm.ee This

Am I correct? Or is there another loadbalancer in front of the DB?

Sorry for asking so many questions, but I’m new to system design and trying to learn about practical deployments.

sunaurus@lemm.ee · edit-2 1 year ago

That’s pretty close, but there are some nuances.

One of the servers is currently exclusively dedicated to handling images (processing, indexing, resizing, uploading to object storage)
One of the servers is only handling Lemmy HTTP requests
One of the servers is handling Lemmy HTTP requests + at the same time also handling Lemmy background tasks (different cleanups, updating the front page rankings, etc)

Additionally, we are not using Docker at all for lemm.ee. Not that I have anything against Docker - I use it regularly in other projects - it just wouldn’t provide any advantages for lemm.ee at the moment.

xavier666@lemm.ee · 1 year ago

Thanks for the clarifications. I now understand the architecture of lemm.ee.

However, by the way you have horizontally scaled things, it had to be done manually. You basically tried to decouple different lemmy functionalities and put them in different servers. It’s not as simple as setting a simple env variable as the number of servers.

Also, with this approach i feel like it’s possible some servers will be loaded more than others. Eg, server 1 which handles images will be more CPU/RAM-heavy, where as server 2 which handles HTTP requests will be mostly network-heavy. So there will be cases where the scaling is not unform.

Please don’t consider this as criticism (i personally just play around with my raspberry pi) but rather as observations.

Upcoming 0.18 upgrade, 404 errors and infrastructure costs

Upcoming 0.18 upgrade, 404 errors and infrastructure costs

Hello, fellow lemmings!

Upcoming 0.18 upgrade

Why do we even want 0.18?

Random 404 errors

Server costs

Pinning updates on the front page