Commit Graph

249 Commits

Author SHA1 Message Date
Joshua Boniface 9842f628a3 Use GO zkcli instead of trash BASH one
With Debian 12+ the interactive zkCli.sh is completely broken, failing
to load the JLine module regardless of what is done, and no one in the
project seems to care about that.

Instead let's swap this out for the (non-interactive) GO-based zkcli of
https://github.com/go-zkcli/zkcli instead, as this suits our needs
nicely.
2023-12-13 15:48:12 -05:00
Joshua Boniface b2a7eecf2c Only set exclude_perf_counts on Debian 12+ 2023-12-10 12:43:34 -05:00
Joshua Boniface 09eba9de80 Add Prometheus Zookeeper exporter 2023-12-10 12:39:35 -05:00
Joshua Boniface 1eb618e8f3 Fix typo 2023-12-10 03:15:02 -05:00
Joshua Boniface 1f34427a0c Add user as well 2023-12-10 03:05:11 -05:00
Joshua Boniface c7d914696b Add configuration for Postgres Prom exporter 2023-12-10 02:53:11 -05:00
Joshua Boniface 258b5792da Add Libvirt Prometheus exporter 2023-12-10 01:42:09 -05:00
Joshua Boniface 7a2c16e2a5 Disable Zookeeper Prometheus export
This does not work as this functionality is built out of the Debian
packages. Hopefully it will be enabled one day...
2023-12-10 01:41:06 -05:00
Joshua Boniface 23bb3cb4d9 Add JVM info to Zookepeer metric 2023-12-10 01:16:44 -05:00
Joshua Boniface 54bf70d336 Enable Prometheus metrics in Zookeeper too 2023-12-10 00:32:37 -05:00
Joshua Boniface 513313d60f Limit FRR Prom exporter to 12+
Package did not exist on Debian 10/11
2023-12-10 00:31:21 -05:00
Joshua Boniface bf10ede298 Add additional Prometheus exporters 2023-12-10 00:24:19 -05:00
Joshua Boniface 9e21aecf97 Ignore errors then check for PVC package
This helps work around apt issues when running from the oneshot
update-pvc-daemons playbook. On a new install, this will be OK. On an
upgrade, the apt tasks will fail OK but then the verification that
pvc-client-cli will ensure that things are actually sane before
proceeding.
2023-12-09 23:56:34 -05:00
Joshua Boniface 865c7d0872 Add Ceph Prometheus configurations (0.9.84) 2023-12-09 23:51:29 -05:00
Joshua Boniface 18054c01a0 Remove obsolete config templates 2023-12-09 23:05:16 -05:00
Joshua Boniface 5111ae47c4 Lower default monitoring interval to 15s
Faults are also reported on the monitoring interval, so 60s seems like
too long. Lower this to 15 seconds by default instead.
2023-12-01 16:06:24 -05:00
Joshua Boniface 73ad2a7751 Avoid removal of old versions at all
We simply shouldn't do this here. Let's leave them hanging around unless
removed in another way (e.g. in update-pvc-daemons)
2023-12-01 02:08:22 -05:00
Joshua Boniface 97b1469a70 Set ownership of pvc.conf 2023-12-01 01:57:56 -05:00
Joshua Boniface d59c9ce1ea Add safety to removal of legacy configs
This conditional will ensure that, the first time pvc.conf is installed
(or, subsequent times, until it stabilizes), the legacy configs will not
be removed. Then, on the next run in which pvc.conf does not change,
they will be removed.

This should provide a safety valve during a 0.9.83 update with the
update-pvc-daemons playbook: if the update succeeds, on the next run,
the legacy configs will be purged; otherwise, they will still be present
and can be used for fallback just in case.

This probably isn't needed, but just in case I'd rather be safe.
2023-12-01 01:45:48 -05:00
Joshua Boniface 1cfda69e5e Remove autobackup.yaml and fix quoting 2023-12-01 01:43:14 -05:00
Joshua Boniface 9408bf709c Only install pvcapid on coordinators
There should be no reason for the API to be installed on non-coordinator
hosts, so separate it out.
2023-12-01 01:40:56 -05:00
Joshua Boniface 9d2af41d3f Install new packages and remove old confs 2023-11-30 03:29:24 -05:00
Joshua Boniface b1d6915cf4 Write new pvc.conf style configuration (0.9.82+) 2023-11-28 16:10:23 -05:00
Joshua Boniface c8764159f6 Readd queue configuration with updated options 2023-11-05 23:37:49 -05:00
Joshua Boniface 523f7da71e Add KeyDB (Redis clone) to configuration
Replaces Redis for PVC >= 0.9.81
2023-11-05 19:24:30 -05:00
Joshua Boniface 2f9603c82f Adjust pvcapid.yaml for 0.9.81 worker queue config 2023-11-04 12:52:21 -04:00
Joshua Boniface e15e2dfaab Remove erroneous netmask from floating IP defaults 2023-11-04 12:51:07 -04:00
Joshua Boniface 103e9fe147 Add restart overrides for ceph-mgr
Needed because ceph-mgr seems to crash frequently under Debian 12 when
adding or removing OSDs. The default settings do not restart it
properly, so this override does.
2023-11-03 14:25:31 -04:00
Joshua Boniface 15a5b581f1 Disable failing socket services 2023-11-03 12:10:19 -04:00
Joshua Boniface 90417621d7 Add autobackup support to pvc-ansible 2023-10-27 02:08:20 -04:00
Joshua Boniface 17f819ea3f Don't set "latest" for libvirt packages
Avoids errors during runs before upgrades.
2023-10-24 10:41:47 -04:00
Joshua Boniface c11f896a60 Fix zk_status check target znode 2023-10-22 00:42:43 -04:00
Joshua Boniface f4bbdb7c86 Use full path for uuidgen 2023-09-29 03:00:53 -04:00
Joshua Boniface 83636388f0 Add configurable monitoring interval 2023-09-15 22:31:16 -04:00
Joshua Boniface 8ebb8a8339 Disable autoscale via command
As per [1] the ceph.conf option does not work properly and must be set this way.

[1] https://stackoverflow.com/questions/63853436/ceph-octopus-setting-autoscale-mode-from-ceph-conf-file
2023-09-02 01:59:47 -04:00
Joshua Boniface a10b3e8d4a Lower default pgs and disable autoscale 2023-09-01 23:54:10 -04:00
Joshua Boniface cf426408f2 Restore original setting 2023-09-01 16:18:20 -04:00
Joshua Boniface 3680717daa Remove extra restarts on bootstrap 2023-09-01 15:42:30 -04:00
Joshua Boniface 1f4cd92d63 Fix bad calls to node primary 2023-09-01 15:42:30 -04:00
Joshua Boniface 6da9956811 Fix delegate_to 2023-09-01 15:42:30 -04:00
Joshua Boniface fb60093750 Ignore errors in Patroni restart handler 2023-09-01 15:42:30 -04:00
Joshua Boniface 7b061966ad Ignore errors in Patroni
Required during upgrades as the service may be masked.
2023-09-01 15:42:30 -04:00
Joshua Boniface 1e497413e8 Remove extra whitespace 2023-09-01 15:42:30 -04:00
Joshua Boniface 64ce09122d Add additional primary node switch 2023-09-01 15:42:30 -04:00
Joshua Boniface 353399a407 Ensure core pg_hba entries are present 2023-09-01 15:42:30 -04:00
Joshua Boniface b21778f117 Fix Patroni upgrade and D12 support 2023-09-01 15:42:30 -04:00
Joshua Boniface 9411679004 Fix reboot 2023-09-01 15:42:30 -04:00
Joshua Boniface 7c8b6919fe Add Debian 12 Patroni config 2023-09-01 15:42:30 -04:00
Joshua Boniface 2ba8f1cfc3 Add retries to all apt commands 2023-09-01 15:42:30 -04:00
Joshua Boniface d54844746e Ignore errors enabling vhostmd
Seems to cause issues in bookworm.
2023-09-01 15:42:30 -04:00