705ec802a3
Bump version to 0.9.67
2023-08-27 14:47:20 -04:00
0b90f37518
Bump version to 0.9.66
2023-08-27 11:41:22 -04:00
1e083d7652
Bump version to 0.9.65
2023-08-23 01:56:57 -04:00
075dbe7cc9
Bump version to 0.9.64
2023-08-18 12:34:27 -04:00
b5f996febd
Fix bugs for node flush for stop/shutdown/restart
...
Previously VMs in stop/shutdown/restart states wouldn't be properly
handled during a node flush. This fixes the bugs and ensures that the
transient VM states (shutdown/restart) are completed before proceeding,
and then avoids setting a stopped/shutdown VM to shutdown/auotstart.
2023-08-18 11:25:59 -04:00
3a90fda109
Bump version to 0.9.63
2023-04-28 14:47:04 -04:00
2c3a3cdf52
Use try when watching health value in NodeInstance
2023-03-07 09:53:01 -05:00
7c07fbefff
Adjust keepalive health printing and ordering
2023-02-24 11:08:30 -05:00
202dc3ed59
Correct error handling if monitoring plugins fail
2023-02-24 10:19:41 -05:00
45ad3b9a17
Bump version to 0.9.62
2023-02-22 18:13:45 -05:00
e45b3108a2
Add health delta change to message output
2023-02-22 15:02:08 -05:00
118237a53b
Fix bad string value for message
2023-02-22 15:02:08 -05:00
1093ca6264
Disallow health less than 0
2023-02-15 16:50:24 -05:00
f4eef30770
Add JSON health to cluster data
2023-02-15 15:26:57 -05:00
0ecf219910
Run setup during plugin loads
2023-02-15 10:11:38 -05:00
0f4edc54d1
Use percentage in keepalie output
2023-02-15 01:56:02 -05:00
14d29f2986
Adjust text on log message
2023-02-13 22:21:23 -05:00
bc88d764b0
Add logging flag for montioring plugin output
2023-02-13 22:04:39 -05:00
b07396c39a
Fix bugs if plugins fail to load
2023-02-13 21:51:48 -05:00
1ea4800212
Set node health to None when restarting
2023-02-13 15:54:46 -05:00
9c14d84bfc
Add node health value and send out API
2023-02-13 15:53:39 -05:00
d8f346abdd
Move Ceph cluster health reporting to plugin
...
Also removes several outputs from the normal keepalive that were
superfluous/static so that the main output fits on one line.
2023-02-13 13:29:40 -05:00
2ee52e44d3
Move Ceph cluster health reporting to plugin
...
Also removes several outputs from the normal keepalive that were
superfluous/static so that the main output fits on one line.
2023-02-13 12:13:56 -05:00
3c742a827b
Initial implementation of monitoring plugin system
2023-02-13 12:06:26 -05:00
aeb238f43c
Bump version to 0.9.61
2023-02-08 10:08:05 -05:00
a49510ecc8
Bump version to 0.9.60
2022-12-06 15:42:55 -05:00
92feeefd26
Bump version to 0.9.59
2022-11-15 15:50:15 -05:00
095bcb2373
Bump version to 0.9.58
2022-11-07 12:27:48 -05:00
d65f512897
Bump version to 0.9.57
2022-11-06 01:39:50 -04:00
c3bc55eff8
Bump version to 0.9.56
2022-10-27 14:21:04 -04:00
726d0a562b
Update copyright header year
2022-10-06 11:55:27 -04:00
f1df1cfe93
Bump version to 0.9.55
2022-10-04 13:21:40 -04:00
5942aa50fc
Avoid raise/handle deadlocks
...
Can cause log flooding in some edge cases and isn't really needed any
longer. Use a proper conditional followed by an actual error handler.
2022-10-03 14:04:12 -04:00
239c392892
Bump version to 0.9.54
2022-08-23 11:01:05 -04:00
9b499b9f48
Bump version to 0.9.53
2022-08-12 17:47:11 -04:00
2a21d48128
Bump version to 0.9.52
2022-08-12 11:09:25 -04:00
8d0f26ff7a
Add additional kb_ values to OSD stats
...
Allows for easier parsing later to get e.g. % values and more details on
the used amounts.
2022-08-11 11:06:36 -04:00
645b525ad7
Bump version to 0.9.51
2022-07-25 23:25:41 -04:00
932b3c55a3
Bump version to 0.9.50
2022-07-06 16:01:14 -04:00
92e2ff7449
Fix bug with space-containing detect strings
2022-07-06 15:58:57 -04:00
51ad2058ed
Bump version to 0.9.49
2022-05-06 15:49:39 -04:00
7a40c7a55b
Add support for replacing/refreshing OSDs
...
Adds commands to both replace an OSD disk, and refresh (reimport) an
existing OSD disk on a new node. This handles the cases where an OSD
disk should be replaced (either due to upgrades or failures) or where a
node is rebuilt in-place and an existing OSD must be re-imported to it.
This should avoid the need to do a full remove/add sequence for either
case.
Also cleans up some aspects of OSD removal that are identical between
methods (e.g. using safe-to-destroy and sleeping after stopping) and
fixes a bug if an OSD does not truly exist when the daemon starts up.
2022-05-06 15:32:06 -04:00
3801fcc07b
Fix bug with initial JSON for stats
2022-05-02 13:28:19 -04:00
c741900baf
Refactor OSD removal to use new ZK data
...
With the OSD LVM information stored in Zookeeper, we can use this to
determine the actual block device to zap rather than relying on runtime
determination and guestimation.
2022-05-02 12:52:22 -04:00
464f0e0356
Store additional OSD information in ZK
...
Ensures that information like the FSIDs and the OSD LVM volume are
stored in Zookeeper at creation time and updated at daemon start time
(to ensure the data is populated at least once, or if the /dev/sdX
path changes).
This will allow safer operation of OSD removals and the potential
implementation of re-activation after node replacements.
2022-05-02 12:11:39 -04:00
cea8832f90
Ensure initial OSD stats is populated
...
Values are all invalid but this ensures the client won't error out when
trying to show an OSD that has never checked in yet.
2022-04-29 16:50:30 -04:00
5807351405
Bump version to 0.9.48
2022-04-29 15:03:52 -04:00
d6ca74376a
Fix bugs with forced removal
2022-04-29 14:03:07 -04:00
4d698be34b
Add OSD removal force option
...
Ensures a removal can continue even in situations where some step(s)
might fail, for instance removing an obsolete OSD from a replaced node.
2022-04-29 11:16:33 -04:00
ea709f573f
Bump version to 0.9.47
2021-12-28 22:03:08 -05:00