ZFS has served me well at home and at work for the past ~20 years, but I’m starting to hit scaling limits with single nodes. 2024 will likely see my office deploy our first 4-5 node Ceph cluster, and I’d like to prepare for that day in my homelab. To that end, I’ve assembled a hardware template for a 4-node system below and would welcome any thoughts or recommendations on the design (or anything else).

Ideally, if we come up with a useful cost-effective hardware template, others in the community will be able to use it to build out their own clusters.

Priorities

  1. Minimize upfront hardware costs
  2. Minimize recurring operational costs (i.e., power consumption)
  3. Performance (my only use case: streaming 100-200GB files to 1 client at 80-120 MB/sec. Largely a WORM workload. The goal is to saturate a single 1GbE link.)

Current Hardware

  • Data HDDs: 24x 8TB SATA, Seagate BarraCuda (ST8000DM008)
  • Data HDDs: 24x 12TB SATA, Western Digital Red Pro NAS (WD121KFBX)
  • Data HDDs: 24x 16TB SATA, Western Digital Gold Enterprise (WD161KRYZ)
  • Data HDDs: 24x 20TB SATA, Seagate Exos X20 (ST20000NM007D)
  • Chassis: 4x 24-bay 4U hot-swap NAS case w/ 6x 6Gbps SFF-8087 backplanes, Innovision (S46524)
  • Motherboard: 4x Supermicro X11SSL-F
  • CPU: 4x Intel Xeon E3-1230 v6 (4c/8t) @ 3.50GHz
  • RAM: 4x 64GiB (4x16GiB kit) Supermicro DDR4-2400 VLP ECC UDIMM
  • HBA: 4x LSI 9201-16i 6Gbps 16-port + 4x AOC-USAS2-L8i 6Gbps 8-port
  • OS SSDs: 8x 250GB SATA, Samsung 870 EVO (MZ-77E250B) (2x mirror per chassis)

Current Plan

Make 4x OSD nodes using the above HDDs, chassis, motherboards, CPUs, RAM, HBAs, and SSDs. Distribute the HDDs so that 6 of each drive model go into each 24-bay chassis: 6x 8TB, 6x 12TB, 6x 16TB, and 6x 20TB. This ensures each node is equally sized at 336TB raw.
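
For the OSD layer itself I’m assuming a cephadm-style deployment, where each chassis is added as a host and the orchestrator turns every unused HDD into an OSD. A minimal sketch (hostnames are placeholders):

    # Add each 24-bay chassis to the cluster (placeholder hostnames).
    ceph orch host add osd-node1
    ceph orch host add osd-node2
    ceph orch host add osd-node3
    ceph orch host add osd-node4
    # Create one OSD on every unused disk across all hosts.
    ceph orch apply osd --all-available-devices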

Buy 4x 10GbE PCIe NICs, one for each node, plus a 10GbE switch and cables.

I have not yet spec’d out the Monitor and Manager Nodes and was considering running them on the same hardware as the OSD nodes. Thoughts on this are welcome.
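
If I do co-locate them, my understanding is that with cephadm it comes down to a placement spec pinning those daemons to some of the OSD nodes, roughly like the sketch below (hostnames are placeholders):

    # Run 3 monitors and 2 managers on the OSD nodes (placeholder hostnames).
    ceph orch apply mon --placement="osd-node1 osd-node2 osd-node3"
    ceph orch apply mgr --placement="osd-node1 osd-node2"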

Questions

  1. Is the above hardware capable of fully saturating a 1GbE link to 1 client? Use case: streaming a single 100-200GB file to a speciality piece of lab equipment at a rock-steady rate above 80MB/sec, without the stream ever stalling. This client has some truly awful firmware and can crash if its buffers run empty, so I’m trying to design for a constant 80+ MB/sec. Read behavior is largely sequential: at most 10-15 seeks per file, as 10-20GB sections are streamed in order. (A rough way to benchmark this is sketched after this list.)

  2. Are these CPUs (4-cores/8-threads at 3.5 GHz) enough to handle 24 OSD daemons per chassis?

  3. Is 64GiB per chassis enough for 24 OSD daemons per chassis? (See the memory-target sketch after this list.)

  4. I’d like this design to be able to withstand the failure of any 12 drives anywhere in the cluster. It’s not clear to me how I’d specify that from a CRUSH failure-domain perspective. Guidance here is welcome.
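
For question 1, my plan for a first sanity check is a plain RADOS benchmark against a scratch pool before the lab client ever touches the cluster; the pool name is a placeholder:

    # Write benchmark objects for 60s and keep them, then read them back sequentially.
    rados bench -p testpool 60 write --no-cleanup
    rados bench -p testpool 60 seq
    # Remove the benchmark objects afterwards.
    rados -p testpool cleanup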
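
For question 3, my understanding is that the relevant knob is osd_memory_target, which defaults to 4GiB per OSD; 24 OSDs at the default would want roughly 96GiB, so staying at 64GiB per node would mean dialing it down to about 2GiB per OSD and accepting smaller BlueStore caches:

    # Cap each OSD's target memory at ~2GiB (the default is 4GiB).
    ceph config set osd osd_memory_target 2147483648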
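
For question 4, the only lever I’m aware of is the erasure-code profile and its crush-failure-domain. With 4 hosts, something like k=2, m=2 with a host failure domain survives the loss of any two whole hosts, but an arbitrary 12 drives failing in just the wrong pattern can still take out a PG; guaranteeing survival of any 12 drives would, as far as I can tell, need something impractical like m>=12 with an osd failure domain. So treat the sketch below purely as a starting point; the profile and pool names are placeholders:

    # Hypothetical EC profile: 2 data + 2 coding chunks, at most one chunk per host.
    ceph osd erasure-code-profile set ec22 k=2 m=2 crush-failure-domain=host
    # Erasure-coded pool using that profile (pool name and PG counts are placeholders).
    ceph osd pool create ecdata 256 256 erasure ec22

Usable capacity under erasure coding is k/(k+m) of raw, so k=2, m=2 keeps only half of the raw space; the profile choice is what sets the final usable figure.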

  • ZombieLinux@alien.top · 10 months ago

    This isn’t a bad plan. 64GB of RAM might be on the low side depending on what else is running.

    You’ll 100% saturate that 1GbE link. You might even saturate a 10GbE link. I’d recommend going with 25GbE or 40GbE links for your Ceph cluster; some Mellanox switches can be found relatively cheap.

    Also, it should be an odd number to avoid split brain.

    • zacharyfreeman70@alien.top (OP) · 10 months ago

      Regarding 64GiB RAM being too low, I suspect you might be right. Perhaps going up to 96GiB or 128GiB would be better? The total raw storage in the cluster will be 1,344TB, but with erasure coding/redundancy, usable storage should fall just below the petabyte level. I’m keen to minimize buying any new hardware, but if 64GiB per node just isn’t feasible, then I’m fine with spending to get to the bare-minimum goal of saturating that 1GbE link.

      Speaking of which, regarding saturating the 1GbE link, that’s good to know. So long as I can do that, that’s all that really matters performance-wise.

      Regarding “it should be an odd number to avoid split brain”, does the “it” in your statement mean the number of OSD nodes, the number of Manager nodes or the number of Monitor nodes?

  • Sporkers@alien.top · 10 months ago

    CPU seems low from what I have read in other places, and if writes matter, a few used enterprise SSDs per chassis as DB/WAL devices for the HDDs would be nice.

    • zacharyfreeman70@alien.top (OP) · 10 months ago

      In your experience, what have you found to be the bare minimum? A 4c/8t CPU at 3.5GHz does indeed sound a bit undersized for 24 HDD-based OSDs, so I’d be curious to read what others are running.

      • Sporkers@alien.top · 10 months ago

        I don’t really have a lot of experience with this; I just read a ton and built a modest 5-node homelab cluster, and 5 nodes seemed to be the minimum count you want to be at. The recommendations for Ceph are now pretty vague: the documentation has changed in recent years to talk about IOPS per core, but without hard numbers. So it depends on how much performance you really expect out of it; the higher the expectation, the more cores you give it. NVMe OSDs definitely scale with more cores: going from 2 to 4 cores shows 100% IOPS scaling in the Ceph docs, and they keep scaling decently past that when isolating a single OSD for performance testing with enterprise NVMe drives.

        But you are using HDDs, in a homelab, and on a budget. I think your 4 cores would be the extreme low-budget, not-expecting-performance option for that many OSDs; 8 cores would be the more regular budget minimum. I’d do 12-16 cores if I had heavier use/performance goals, and more than 64GB RAM per node, especially if the monitors are co-located. The next level up would be to add maybe 4-8 used enterprise-class NVMe drives per node, spread the DB/WAL for the OSDs across those NVMe drives, and add more cores to handle them (roughly what that looks like is sketched below).
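
        To be concrete, something like the following is roughly how ceph-volume lets you put the BlueStore DB/WAL for a batch of HDD OSDs on a shared NVMe; the device names are just placeholders, not a recommendation for your exact layout:

          # Create HDD-backed OSDs with their BlueStore DB/WAL carved out of a shared NVMe.
          # /dev/sda..sdf and /dev/nvme0n1 are placeholder device names.
          ceph-volume lvm batch /dev/sda /dev/sdb /dev/sdc /dev/sdd /dev/sde /dev/sdf \
              --db-devices /dev/nvme0n1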