9842f628a3
Use GO zkcli instead of trash BASH one
...
With Debian 12+ the interactive zkCli.sh is completely broken, failing
to load the JLine module regardless of what is done, and no one in the
project seems to care about that.
Instead let's swap this out for the (non-interactive) GO-based zkcli of
https://github.com/go-zkcli/zkcli instead, as this suits our needs
nicely.
2023-12-13 15:48:12 -05:00
b2a7eecf2c
Only set exclude_perf_counts on Debian 12+
2023-12-10 12:43:34 -05:00
09eba9de80
Add Prometheus Zookeeper exporter
2023-12-10 12:39:35 -05:00
1eb618e8f3
Fix typo
2023-12-10 03:15:02 -05:00
1f34427a0c
Add user as well
2023-12-10 03:05:11 -05:00
c7d914696b
Add configuration for Postgres Prom exporter
2023-12-10 02:53:11 -05:00
258b5792da
Add Libvirt Prometheus exporter
2023-12-10 01:42:09 -05:00
7a2c16e2a5
Disable Zookeeper Prometheus export
...
This does not work as this functionality is built out of the Debian
packages. Hopefully it will be enabled one day...
2023-12-10 01:41:06 -05:00
23bb3cb4d9
Add JVM info to Zookepeer metric
2023-12-10 01:16:44 -05:00
54bf70d336
Enable Prometheus metrics in Zookeeper too
2023-12-10 00:32:37 -05:00
513313d60f
Limit FRR Prom exporter to 12+
...
Package did not exist on Debian 10/11
2023-12-10 00:31:21 -05:00
bf10ede298
Add additional Prometheus exporters
2023-12-10 00:24:19 -05:00
9e21aecf97
Ignore errors then check for PVC package
...
This helps work around apt issues when running from the oneshot
update-pvc-daemons playbook. On a new install, this will be OK. On an
upgrade, the apt tasks will fail OK but then the verification that
pvc-client-cli will ensure that things are actually sane before
proceeding.
2023-12-09 23:56:34 -05:00
865c7d0872
Add Ceph Prometheus configurations (0.9.84)
2023-12-09 23:51:29 -05:00
18054c01a0
Remove obsolete config templates
2023-12-09 23:05:16 -05:00
5111ae47c4
Lower default monitoring interval to 15s
...
Faults are also reported on the monitoring interval, so 60s seems like
too long. Lower this to 15 seconds by default instead.
2023-12-01 16:06:24 -05:00
73ad2a7751
Avoid removal of old versions at all
...
We simply shouldn't do this here. Let's leave them hanging around unless
removed in another way (e.g. in update-pvc-daemons)
2023-12-01 02:08:22 -05:00
97b1469a70
Set ownership of pvc.conf
2023-12-01 01:57:56 -05:00
d59c9ce1ea
Add safety to removal of legacy configs
...
This conditional will ensure that, the first time pvc.conf is installed
(or, subsequent times, until it stabilizes), the legacy configs will not
be removed. Then, on the next run in which pvc.conf does not change,
they will be removed.
This should provide a safety valve during a 0.9.83 update with the
update-pvc-daemons playbook: if the update succeeds, on the next run,
the legacy configs will be purged; otherwise, they will still be present
and can be used for fallback just in case.
This probably isn't needed, but just in case I'd rather be safe.
2023-12-01 01:45:48 -05:00
1cfda69e5e
Remove autobackup.yaml and fix quoting
2023-12-01 01:43:14 -05:00
9408bf709c
Only install pvcapid on coordinators
...
There should be no reason for the API to be installed on non-coordinator
hosts, so separate it out.
2023-12-01 01:40:56 -05:00
9d2af41d3f
Install new packages and remove old confs
2023-11-30 03:29:24 -05:00
b1d6915cf4
Write new pvc.conf style configuration (0.9.82+)
2023-11-28 16:10:23 -05:00
c8764159f6
Readd queue configuration with updated options
2023-11-05 23:37:49 -05:00
523f7da71e
Add KeyDB (Redis clone) to configuration
...
Replaces Redis for PVC >= 0.9.81
2023-11-05 19:24:30 -05:00
2f9603c82f
Adjust pvcapid.yaml for 0.9.81 worker queue config
2023-11-04 12:52:21 -04:00
e15e2dfaab
Remove erroneous netmask from floating IP defaults
2023-11-04 12:51:07 -04:00
103e9fe147
Add restart overrides for ceph-mgr
...
Needed because ceph-mgr seems to crash frequently under Debian 12 when
adding or removing OSDs. The default settings do not restart it
properly, so this override does.
2023-11-03 14:25:31 -04:00
15a5b581f1
Disable failing socket services
2023-11-03 12:10:19 -04:00
90417621d7
Add autobackup support to pvc-ansible
2023-10-27 02:08:20 -04:00
17f819ea3f
Don't set "latest" for libvirt packages
...
Avoids errors during runs before upgrades.
2023-10-24 10:41:47 -04:00
c11f896a60
Fix zk_status check target znode
2023-10-22 00:42:43 -04:00
f4bbdb7c86
Use full path for uuidgen
2023-09-29 03:00:53 -04:00
83636388f0
Add configurable monitoring interval
2023-09-15 22:31:16 -04:00
8ebb8a8339
Disable autoscale via command
...
As per [1] the ceph.conf option does not work properly and must be set this way.
[1] https://stackoverflow.com/questions/63853436/ceph-octopus-setting-autoscale-mode-from-ceph-conf-file
2023-09-02 01:59:47 -04:00
a10b3e8d4a
Lower default pgs and disable autoscale
2023-09-01 23:54:10 -04:00
cf426408f2
Restore original setting
2023-09-01 16:18:20 -04:00
3680717daa
Remove extra restarts on bootstrap
2023-09-01 15:42:30 -04:00
1f4cd92d63
Fix bad calls to node primary
2023-09-01 15:42:30 -04:00
6da9956811
Fix delegate_to
2023-09-01 15:42:30 -04:00
fb60093750
Ignore errors in Patroni restart handler
2023-09-01 15:42:30 -04:00
7b061966ad
Ignore errors in Patroni
...
Required during upgrades as the service may be masked.
2023-09-01 15:42:30 -04:00
1e497413e8
Remove extra whitespace
2023-09-01 15:42:30 -04:00
64ce09122d
Add additional primary node switch
2023-09-01 15:42:30 -04:00
353399a407
Ensure core pg_hba entries are present
2023-09-01 15:42:30 -04:00
b21778f117
Fix Patroni upgrade and D12 support
2023-09-01 15:42:30 -04:00
9411679004
Fix reboot
2023-09-01 15:42:30 -04:00
7c8b6919fe
Add Debian 12 Patroni config
2023-09-01 15:42:30 -04:00
2ba8f1cfc3
Add retries to all apt commands
2023-09-01 15:42:30 -04:00
d54844746e
Ignore errors enabling vhostmd
...
Seems to cause issues in bookworm.
2023-09-01 15:42:30 -04:00