Lemmy Federation Architecture Change Proposal

HTTP_404_NotFound@lemmyonline.com · 1 year ago

Lemmy Federation Architecture Change Proposal

HTTP_404_NotFound@lemmyonline.com · edit-2 1 year ago

for fucks sake, dude, be collaborative, and not defensive. This isn’t reddit, I am not out to attack your karma.

If every instance, hosts a community, and Every other instance, subscribes to every one of those communities, that would lead to a full-mesh between all instances, resulting in worst-case scenario, ie, following the formula I provided for a full-mesh topology.

That is indeed, the worst case scenario, I have provided, explained, and documented in my examples.

If my example is too hard to understand, lets use an easier example

Count the number of instances on https://lemmy.ml/instances

Assume every one of those instances subscribes to !asklemmy.

Now, count the number of instances on https://lemmy.world/instances

Assume, every one of those instances subscribes to !lemmyworld.

Now, count the number of instances on https://beehaw.org/instances

Assume, every one of those instances subscribes to !technology.

It does. not. scale.

delcake@lemmy.songsforno.one · 1 year ago

In no way is the person you’re responding to speaking defensively. They’ve discussed the reason why your extrapolation to a full-mesh connective worst-case scenario isn’t based in the reality of how ActivityPub functions. But you don’t seem to be willing to entertain the notion that the federation of any given action never exceeds the number of instances subscribed to the community that generated it.

Even should every instance subscribe to every community on every other instance, the recipient of a federated action doesn’t turn around and rebroadcast that action back on to the network because it is not the authoritative host of that community. Therefore what this discussion is lacking is proof of where this exponential broadcast storm of federated actions comes from in your assertion.

bdonvr@thelemmy.club · edit-2 1 year ago

Apologies if I came off as hostile.

I mean I get what you’re saying - I just don’t see the practical use. The centralized hub replication servers would have to basically foot a huge bill for the fediverse, and do so silently and invisibly to the end user. As it is, most instances run on goodwill or donations. A silent, invisible server is hard to gather donations for. Who would run them?

Furthermore the topology you propose is essentially what we already have. A few large instances hold most of the largest communities. I don’t see that changing. This brings a fairly good balance - smaller instances pretty much only have to listen for updates from a few other instances, only the big instances are doing the hard work of notifying hundreds of others. They are already our “hubs”. Small instances really hardly do practically any hard work, the one I run for example just listens to maybe a dozen instances send updates, and occasionally sends out an update when one of my users interacts.

I suppose I just don’t understand how this could be implemented in practice- or rather how it could be useful to do so. It would strictly enforce a sort of centralization that right now is only a natural consequence of user behavior, while seemingly only bringing theoretical benefits unlikely to be realized.

King@vlemmy.net · 1 year ago

Yes, it is a “full mesh” diagram. But for each specific “federated” action, it is a simple hub and spoke distribution. The hosting server will send the federated action to each subscribed node. The nodes don’t need to check in with each other for that specific action.

I too believe that Federation is going to have scaling issues. But not due to full mesh

HTTP_404_NotFound@lemmyonline.com · 1 year ago

I am onboard with you there-

But, would not not agree- delegating and offloading those federation actions to a dedicated pool of servers, would not assist scalability?

That way- each instance doesn’t need to maintain all of the connections?

King@vlemmy.net · 1 year ago

There is no need to “maintain all of the connections”. The server opens a connection, sends the data, then closes the connection.

HTTP_404_NotFound@lemmyonline.com · 1 year ago

I realize that…

Let’s- set the record straight here.

Do you think the current implementation of federation works well?

Fauxreigner@beehaw.org · 1 year ago

Federation isn’t working well, but it’s not working well because the big instances aren’t able to keep up with all of the inbound/outbound messages, and if a message fails, that’s it. Right now there’s no automated way to resync and catch up on missed activity.

cyd@vlemmy.net · 1 year ago

How was syncing done in Usenet? It has a very similar decentralized model, and I don’t recall there being problems of data loss due to desyncing between servers.

Lemmy Federation Architecture Change Proposal

Lemmy Federation Architecture Change Proposal

[Architecture] Hub-Spoke model for federation? · Issue #3245 · LemmyNet/lemmy