376 Commits

Author SHA1 Message Date
b2a7eecf2c Only set exclude_perf_counts on Debian 12+ 2023-12-10 12:43:34 -05:00
09eba9de80 Add Prometheus Zookeeper exporter 2023-12-10 12:39:35 -05:00
1eb618e8f3 Fix typo 2023-12-10 03:15:02 -05:00
1f34427a0c Add user as well 2023-12-10 03:05:11 -05:00
c7d914696b Add configuration for Postgres Prom exporter 2023-12-10 02:53:11 -05:00
258b5792da Add Libvirt Prometheus exporter 2023-12-10 01:42:09 -05:00
7a2c16e2a5 Disable Zookeeper Prometheus export
This does not work because this functionality is compiled out of the
Debian packages. Hopefully it will be enabled one day...
2023-12-10 01:41:06 -05:00
23bb3cb4d9 Add JVM info to Zookeeper metric 2023-12-10 01:16:44 -05:00
54bf70d336 Enable Prometheus metrics in Zookeeper too 2023-12-10 00:32:37 -05:00
513313d60f Limit FRR Prom exporter to 12+
Package did not exist on Debian 10/11
2023-12-10 00:31:21 -05:00
bf10ede298 Add additional Prometheus exporters 2023-12-10 00:24:19 -05:00
8bf3bbdeb1 Enable Prometheus exporter on nodes by default 2023-12-10 00:18:44 -05:00
9e21aecf97 Ignore errors then check for PVC package
This helps work around apt issues when running from the oneshot
update-pvc-daemons playbook. On a new install, this will be OK. On an
upgrade, the apt tasks may fail harmlessly, but the subsequent check for
pvc-client-cli will ensure that things are actually sane before
proceeding.
2023-12-09 23:56:34 -05:00
865c7d0872 Add Ceph Prometheus configurations (0.9.84) 2023-12-09 23:51:29 -05:00
18054c01a0 Remove obsolete config templates 2023-12-09 23:05:16 -05:00
5111ae47c4 Lower default monitoring interval to 15s
Faults are also reported on the monitoring interval, so 60s seems like
too long. Lower this to 15 seconds by default instead.
2023-12-01 16:06:24 -05:00
73ad2a7751 Avoid removal of old versions at all
We simply shouldn't do this here. Let's leave them hanging around unless
removed in another way (e.g. in update-pvc-daemons)
2023-12-01 02:08:22 -05:00
97b1469a70 Set ownership of pvc.conf 2023-12-01 01:57:56 -05:00
d59c9ce1ea Add safety to removal of legacy configs
This conditional will ensure that, the first time pvc.conf is installed
(or, subsequent times, until it stabilizes), the legacy configs will not
be removed. Then, on the next run in which pvc.conf does not change,
they will be removed.

This should provide a safety valve during a 0.9.83 update with the
update-pvc-daemons playbook: if the update succeeds, on the next run,
the legacy configs will be purged; otherwise, they will still be present
and can be used for fallback just in case.

This probably isn't needed, but just in case I'd rather be safe.
2023-12-01 01:45:48 -05:00
1cfda69e5e Remove autobackup.yaml and fix quoting 2023-12-01 01:43:14 -05:00
9408bf709c Only install pvcapid on coordinators
There should be no reason for the API to be installed on non-coordinator
hosts, so separate it out.
2023-12-01 01:40:56 -05:00
9d2af41d3f Install new packages and remove old confs 2023-11-30 03:29:24 -05:00
1e89a1440c Enable modelines by default 2023-11-28 16:13:49 -05:00
b1d6915cf4 Write new pvc.conf style configuration (0.9.82+) 2023-11-28 16:10:23 -05:00
c8764159f6 Re-add queue configuration with updated options 2023-11-05 23:37:49 -05:00
523f7da71e Add KeyDB (Redis clone) to configuration
Replaces Redis for PVC >= 0.9.81
2023-11-05 19:24:30 -05:00
2f9603c82f Adjust pvcapid.yaml for 0.9.81 worker queue config 2023-11-04 12:52:21 -04:00
e15e2dfaab Remove erroneous netmask from floating IP defaults 2023-11-04 12:51:07 -04:00
103e9fe147 Add restart overrides for ceph-mgr
Needed because ceph-mgr seems to crash frequently under Debian 12 when
adding or removing OSDs. The default settings do not restart it
properly, so this override does.
2023-11-03 14:25:31 -04:00
15a5b581f1 Disable failing socket services 2023-11-03 12:10:19 -04:00
90417621d7 Add autobackup support to pvc-ansible 2023-10-27 02:08:20 -04:00
17f819ea3f Don't set "latest" for libvirt packages
Avoids errors during runs before upgrades.
2023-10-24 10:41:47 -04:00
c11f896a60 Fix zk_status check target znode 2023-10-22 00:42:43 -04:00
5764695699 Add AMD microcode as well 2023-10-03 13:36:56 -04:00
f4bbdb7c86 Use full path for uuidgen 2023-09-29 03:00:53 -04:00
82accb3b5e Install intel-microcode on Intel CPUs
Required otherwise needrestart fails.
2023-09-20 16:43:08 -04:00
83636388f0 Add configurable monitoring interval 2023-09-15 22:31:16 -04:00
e995f3750b Fix incorrect repo name in Bullseye 2023-09-09 19:28:47 -04:00
85253e9706 Enable pass-through IOMMU on Bookworm 2023-09-05 16:35:58 -04:00
6ac6b74023 Update key name 2023-09-05 13:50:37 -04:00
80f5a4f260 Add dpkg-cleanup step to base config 2023-09-05 10:32:40 -04:00
8ebb8a8339 Disable autoscale via command
As per [1] the ceph.conf option does not work properly and must be set this way.

[1] https://stackoverflow.com/questions/63853436/ceph-octopus-setting-autoscale-mode-from-ceph-conf-file
2023-09-02 01:59:47 -04:00
a10b3e8d4a Lower default pgs and disable autoscale 2023-09-01 23:54:10 -04:00
cf426408f2 Restore original setting 2023-09-01 16:18:20 -04:00
3680717daa Remove extra restarts on bootstrap 2023-09-01 15:42:30 -04:00
1f4cd92d63 Fix bad calls to node primary 2023-09-01 15:42:30 -04:00
6da9956811 Fix delegate_to 2023-09-01 15:42:30 -04:00
fb60093750 Ignore errors in Patroni restart handler 2023-09-01 15:42:30 -04:00
7b061966ad Ignore errors in Patroni
Required during upgrades as the service may be masked.
2023-09-01 15:42:30 -04:00
1e497413e8 Remove extra whitespace 2023-09-01 15:42:30 -04:00