u/Mert007

Assume that you have two servers that host one node each, with different number of mounted disks such as:

Node 1 mount1, mount2
Node 2 mount3, mount4, mount5

In my cluster, let's say that I have two pods running,

Pod1 Saves critical data. Uses PVC
Pod2 Saves non-critical data. Uses PVC

My questions are:

  • Is there a way to RAID in Kubernetes across volumes for different mounts.
  • Is there a way that I can RAID copy only the data saved through Pod1 (so not necessarily on all data stored)?
  • If so, is there a way to set preferences to a RAID, such that it prefers using RAID across nodes first hand?

I'm aware of snapshots, and tools that help you backup your volumes both inside and outside your cluster, such as K10. But since RAID5 for instance is an effective way to backup data, and scales very well as more mounts are inserted, I think I prefer that long-term.

Am I perhaps seeing this wrong, and you do perhaps have a better solution in mind? My goal is to backup data, take as little storage as possible while doing so and have the backup spread out across nodes for disaster recovery.

Thanks!

Edit: For clarification, I'm aware that RAID is not the same as backup in the sense of if data is deleted, you can still recover it. RAID is a backup in a lower level which gives resiliency in case of failure. If you wish to make sure that you don't lose data because of drive failures AND accidental deletes, you need both RAID and snapshots.

reddit.com
u/Mert007 — 17 days ago