22 Commits

Author SHA1 Message Date
794cea4a02 Reverse ordering, run checks before starting timer 2023-09-15 22:25:37 -04:00
479e156234 Run monitoring plugins once on startup 2023-09-15 17:53:16 -04:00
86830286f3 Adjust message printing to be on one line 2023-09-15 17:00:34 -04:00
4d51318a40 Make monitoring interval configurable 2023-09-15 16:54:51 -04:00
cba6f5be48 Fix wording of non-coordinator state 2023-09-15 16:51:04 -04:00
254303b9d4 Use coordinator_state instead of router_state
Makes it much clearer what this variable represents.
2023-09-15 16:47:56 -04:00
40b7d68853 Separate monitoring and move to 60s interval
Removes the dependency of the monitoring subsystem on the node
keepalives, and runs the monitoring plugins at a 60s interval to avoid
excessive backups if a plugin takes too long.

Adds its own logs and related items as required.

Finally, adds a new required argument to the run() method of plugins:
the coordinator state, which a plugin can use to determine actions
based on whether the node is a primary, secondary, or non-coordinator.
2023-09-15 16:47:11 -04:00
570da99605 Avoid failures if no children found 2023-09-02 01:36:17 -04:00
7c07fbefff Adjust keepalive health printing and ordering 2023-02-24 11:08:30 -05:00
202dc3ed59 Correct error handling if monitoring plugins fail 2023-02-24 10:19:41 -05:00
e45b3108a2 Add health delta change to message output 2023-02-22 15:02:08 -05:00
118237a53b Fix bad string value for message 2023-02-22 15:02:08 -05:00
1093ca6264 Disallow health less than 0 2023-02-15 16:50:24 -05:00
0ecf219910 Run setup during plugin loads 2023-02-15 10:11:38 -05:00
0f4edc54d1 Use percentage in keepalive output 2023-02-15 01:56:02 -05:00
14d29f2986 Adjust text on log message 2023-02-13 22:21:23 -05:00
bc88d764b0 Add logging flag for monitoring plugin output 2023-02-13 22:04:39 -05:00
b07396c39a Fix bugs if plugins fail to load 2023-02-13 21:51:48 -05:00
1ea4800212 Set node health to None when restarting 2023-02-13 15:54:46 -05:00
9c14d84bfc Add node health value and send out API 2023-02-13 15:53:39 -05:00
d8f346abdd Move Ceph cluster health reporting to plugin
Also removes several outputs from the normal keepalive that were
superfluous/static so that the main output fits on one line.
2023-02-13 13:29:40 -05:00
3c742a827b Initial implementation of monitoring plugin system 2023-02-13 12:06:26 -05:00
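
The behaviour described in 40b7d68853 and the commits after it (a monitoring loop separate from the keepalives, a configurable interval, one run on startup, a coordinator state passed to each plugin's run(), health never below 0) could look roughly like the Python sketch below. This is an illustration only, not the project's code: PluginResult, ExamplePlugin, run_plugins, MonitoringTimer and the state strings are all hypothetical names, and a starting health of 100 is an assumption.

```python
import threading
from dataclasses import dataclass


@dataclass
class PluginResult:
    """Hypothetical result type: a health delta and a one-line message."""
    plugin_name: str
    health_delta: int = 0
    message: str = "OK"


class ExamplePlugin:
    """Hypothetical plugin; only run() taking the coordinator state is from the log."""
    name = "example"

    def run(self, coordinator_state):
        # Assumed policy: do the real work only on the primary coordinator.
        if coordinator_state == "primary":
            return PluginResult(self.name, 0, "primary checks passed")
        return PluginResult(self.name, 0, f"skipped on {coordinator_state} node")


def run_plugins(plugins, coordinator_state):
    """Run every plugin once; return (health, results) with health clamped at 0."""
    health = 100  # assumed starting value
    results = []
    for plugin in plugins:
        try:
            result = plugin.run(coordinator_state)
        except Exception as exc:
            # One failing plugin must not abort the whole monitoring pass.
            result = PluginResult(plugin.name, 0, f"plugin failed: {exc}")
        results.append(result)
        health -= result.health_delta
    return max(health, 0), results


class MonitoringTimer:
    """Runs the checks once immediately, then every `interval` seconds."""

    def __init__(self, plugins, get_coordinator_state, interval=60):
        self.plugins = plugins
        self.get_coordinator_state = get_coordinator_state
        self.interval = interval
        self._stop = threading.Event()
        self._thread = None

    def _loop(self):
        while True:
            # Checks run before the wait, so the first pass happens at startup.
            health, results = run_plugins(self.plugins, self.get_coordinator_state())
            summary = "; ".join(f"{r.plugin_name}: {r.message}" for r in results)
            print(f"health {health}% | {summary}")  # one line per pass
            if self._stop.wait(self.interval):
                break

    def start(self):
        self._thread = threading.Thread(target=self._loop, daemon=True)
        self._thread.start()

    def stop(self):
        self._stop.set()
        if self._thread is not None:
            self._thread.join()
```

A caller would construct MonitoringTimer([ExamplePlugin()], lambda: "primary", interval=60), call start() once at daemon startup and stop() at shutdown. Waiting on a threading.Event rather than sleeping lets stop() interrupt the interval immediately.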