
Hello all! For the Community Showcase Day we wanted to share an open-source tool we’ve been building to alleviate a recurring problem we've faced when administrating Proxmox clusters at scale.
We often hear from Sys Admins moving from VMware to Proxmox VE that they miss having a global inventory extractor for capacity management and compliance audit similar to RVTools. With this in mind, we engineered cv4pve-report.
It's entirely agentless; it will read from the native PVE REST APIs, then combine everything into a structured and cross-linked spreadsheet (or if you want a huge CSV dump you can feed into Big Data processing, pure CSV is also possible).
Some of the main technical points we aimed at:
- Deep Extraction: not only simple CPU/RAM, but also RRD historical metrics, SDN configuration and rules, firewall rules, snapshots present (with RAM retained state), and even physical disk SMART telemetry.
- API Load Management: As you might imagine querying very dense clusters against the PVE API will hit the PVE daemon pretty hard; therefore we implemented 3 scanning modes (Fast, Standard, Full) that trade depth of scan against API load on PVE.
- Network topology: generates a SVG vector diagram depicting the whole network setup within the DC, from the Physical NIC up to the leaf VMs and including bonds/bridges/etc.
The Question to you all: Let us know what you think! Any comments, feedback, etc. are greatly appreciated. How are you performing documentation, compliance audit and capacity planning currently within your PVE clusters? What metrics/edge cases are a pain point to extract natively in PVE and could be a useful feature to add?
Here is the GitHub repo so you can have it and play around with it: https://github.com/Corsinvest/cv4pve-report
Many thanks in advance!