Assume that you have two servers that host one node each, with different number of mounted disks such as:
| Node 1 | mount1, mount2 |
|---|---|
| Node 2 | mount3, mount4, mount5 |
In my cluster, let's say that I have two pods running,
| Pod1 | Saves critical data. Uses PVC |
|---|---|
| Pod2 | Saves non-critical data. Uses PVC |
My questions are:
- Is there a way to RAID in Kubernetes across volumes for different mounts.
- Is there a way that I can RAID copy only the data saved through Pod1 (so not necessarily on all data stored)?
- If so, is there a way to set preferences to a RAID, such that it prefers using RAID across nodes first hand?
I'm aware of snapshots, and tools that help you backup your volumes both inside and outside your cluster, such as K10. But since RAID5 for instance is an effective way to backup data, and scales very well as more mounts are inserted, I think I prefer that long-term.
Am I perhaps seeing this wrong, and you do perhaps have a better solution in mind? My goal is to backup data, take as little storage as possible while doing so and have the backup spread out across nodes for disaster recovery.
Thanks!
Edit: For clarification, I'm aware that RAID is not the same as backup in the sense of if data is deleted, you can still recover it. RAID is a backup in a lower level which gives resiliency in case of failure. If you wish to make sure that you don't lose data because of drive failures AND accidental deletes, you need both RAID and snapshots.