One big complexity to note would be working the Ceph health messages into this, but luckily their levels are fairly consistent (WARN and ERR being roughly -10 and -50 respectively) and they output…
This actually wouldn't work as well as-is as would be obvious. Probably better to make our own "plugin" system that allows arbitrary things to monitor then build a coherent check framework around…
I definitely think leveraging part of the check_mk_agent system for this might be worthwhile. At regular intervals (say every minute so every ~12 keepalives), the plugins can run and save output…
I think this could be done in a crude way by doing:
virsh pauseon the VM- Quickly snapshotting all volumes
virsh unpauseon the VM
It should be quick enough to be impreceptable…
From some quick testing, the consolecallback script does not work properly - it prints no output and immediately exits on any input. This is also the only example I can find so far which does not…