Commit Graph

26 Commits

Author SHA1 Message Date
40b7d68853 Separate monitoring and move to 60s interval
Removes the dependency of the monitoring subsystem from the node
keepalives, and runs them at a 60s interval to avoid excessive backups
if a plugin takes too long.

Adds its own logs and related items as required.

Finally adds a new required argument to the run() of plugins, the
coordinator state, which can be used by a plugin to determine actions
based on whether the node is a primary, secondary, or non-coordinator.
2023-09-15 16:47:11 -04:00
9114255af5 Add *.update-* obsolete configs to dpkg plugin 2023-04-10 15:39:40 -04:00
0b583bfdaf Bump IPMI timeout to 2 seconds 2023-03-07 09:25:27 -05:00
4c2d99f8a6 Fix bug with SMART info 2023-02-23 13:21:23 -05:00
bcff6650d0 Set timeout on IPMI command 2023-02-23 11:10:09 -05:00
a11206253d Fix ZK check location 2023-02-23 11:04:02 -05:00
dc4e56db4b Add IPMI monitoring check 2023-02-22 15:02:08 -05:00
9805681f94 Use consistent connection with other checks 2023-02-22 15:02:08 -05:00
6c9abb2abe Add Libvirtd monitoring check 2023-02-22 15:02:08 -05:00
a1122c6e71 Add Zookeeper monitoring check 2023-02-22 15:02:08 -05:00
3696f81597 Add PostgreSQL monitoring check 2023-02-22 15:02:08 -05:00
5ca0d903b6 Adjust comment message 2023-02-22 15:02:08 -05:00
c9ceb3159b Remove obsolete LINKSPEED variable 2023-02-22 01:04:25 -05:00
6525a2568b Adjust health delta of load to 50
This is a very bad situation and should be critical.
2023-02-22 01:03:12 -05:00
09a005d3d7 Adjust health delta of EDAC Uncorrected to 50
This is a very bad situation and should be critical.
2023-02-22 01:01:54 -05:00
388f6556c0 Remove extra text from packages plugin 2023-02-15 16:28:41 -05:00
6c7be492b8 Move Ceph health to global cluster health 2023-02-15 15:46:13 -05:00
8565cf26b3 Add disk monitoring plugin 2023-02-15 11:30:49 -05:00
ca91be51e1 Improve ethtool parsing speeds 2023-02-14 15:49:58 -05:00
e29d0e89eb Add NIC monitoring plugin 2023-02-14 15:43:52 -05:00
a3c31564ca Flip condition in EDAC check 2023-02-13 21:58:56 -05:00
71139fa66d Add EDAC check plugin 2023-02-13 21:43:13 -05:00
9c14d84bfc Add node health value and send out API 2023-02-13 15:53:39 -05:00
d8f346abdd Move Ceph cluster health reporting to plugin
Also removes several outputs from the normal keepalive that were
superfluous/static so that the main output fits on one line.
2023-02-13 13:29:40 -05:00
2ee52e44d3 Move Ceph cluster health reporting to plugin
Also removes several outputs from the normal keepalive that were
superfluous/static so that the main output fits on one line.
2023-02-13 12:13:56 -05:00
3c742a827b Initial implementation of monitoring plugin system 2023-02-13 12:06:26 -05:00