159 Commits

Author SHA1 Message Date
8c2d117a3c Wait longer when restarting services
From 15 -> 30 seconds to ensure more time for stabilization before
proceeding with the next.
2023-09-01 15:42:26 -04:00
647ca1c446 Add default features flag to ceph.conf generator
Coupled with the removal of explicit --image-features flags to the RBD
command in PVC itself, this ensures that only the two features supported
on kernel 4.19 are enabled by default.
2023-09-01 15:42:26 -04:00
3d64ad2420 Typo fix 2023-09-01 15:42:26 -04:00
eaea860b61 Lower autopurge interval to 1 hour 2023-09-01 15:42:26 -04:00
524f857f56 Add some Zookeeper configuration tweaks 2023-09-01 15:42:26 -04:00
13556918d7 Disable any systemd start rate limiting
Because Zookeeper is supremely stupid (see last commit) we want to
disable start limiting. It needs to keep trying forever until it starts.
2023-09-01 15:42:26 -04:00
8eecc95f2f Ensure Zookeeper restarts itself
The Zookeeper daemon does not appear to exit with any status other than
0, even after a fatal error. Work around this.
2023-09-01 15:42:26 -04:00
b03ecf0125 Add -XX:+AlwaysPreTouch option for Zookeeper 2023-09-01 15:42:26 -04:00
b842276002 Lower keep count for Zookeeper vacuum to 3
Required to keep disk space growth down when using zookeeper_logging
functionality.
2023-09-01 15:42:26 -04:00
681afd1d1b Fix excessive whitespace 2023-09-01 15:42:26 -04:00
2d31e6c8ea Fix memory tuning issues 2023-09-01 15:42:26 -04:00
71b6da6555 Adjust package lists per Debian version 2023-09-01 15:42:26 -04:00
a52d4cbf37 Add Zookeeper logging configs 2023-09-01 15:42:26 -04:00
e760114b8d Fix bootstrap collection path for Ceph 2023-09-01 15:42:26 -04:00
0802cca980 Support both versions of psycopg2 and kazoo 2023-09-01 15:42:26 -04:00
31a677b444 Fix Patroni ACL to use subnet mask 2023-09-01 15:42:26 -04:00
a2ed38b459 Add generic SR-IOV configuration 2023-09-01 15:42:26 -04:00
388db6ad1d Use IPs for Patroni configuration 2023-09-01 15:42:26 -04:00
d455b31905 Bump max connections in Zookeeper to 200 2023-09-01 15:42:26 -04:00
f105f0497c Configure Zookeeper only on Cluster address 2023-09-01 15:42:26 -04:00
7e94dddb4c Ensure libvirtd restarts when unit changes 2023-09-01 15:42:26 -04:00
0bbb91fc8b Add override custom libvirtd.service unit
This has no functional change on Buster, but on Bullseye this overrides
the stupid socket-based activation shenanigans that the default unit
tries to do, as well as the breaking replacement of the
/etc/default/libvirt variable names.
2023-09-01 15:42:26 -04:00
0114ad8ed5 Add python3 version of psycopg2 explicitly 2023-09-01 15:42:26 -04:00
027a819a83 Move some other tasks to bootstrap role
Avoids an issue where the pvcnoded service is stopped on non-bootstrap
runs.
2023-09-01 15:42:25 -04:00
6a61f8f7bf Update relative path to bootstrap files 2023-09-01 15:42:25 -04:00
4caab67d03 Remove superfluous symlink 2023-09-01 15:42:25 -04:00
39b8229c35 Add libguestfs-tools to libvirt role deps 2023-09-01 15:42:25 -04:00
0bf9c6209c Fix incorrect systemd enabling in Patroni 2023-09-01 15:42:25 -04:00
4f5dbee8ee Correct bugs during bootstrap
1. Ensure Zookeeper restarts and checks out successfully before
proceeding with other steps.
2. Make sure PVC itself doesn't start prematurely.
2023-09-01 15:42:25 -04:00
26dbd082ef Retry pgsql bootstrap startup 6 times
This will sometimes fail, so retry it several times
2023-09-01 15:42:25 -04:00
e9f08ad100 Retry msgr2 enabling 6 times
This will sometimes fail, so retry it several times
2023-09-01 15:42:25 -04:00
a77e41bf7c Remove invalid timezone entries in postgres conf 2023-09-01 15:42:25 -04:00
cba276e248 Add default values 2023-09-01 15:42:24 -04:00
be94bc134f Add configurable ZK memory limits 2023-09-01 15:42:24 -04:00
6e74ac44a5 Remove libjemalloc package 2023-09-01 15:42:24 -04:00
2bd5cc5a25 Tune Zookeeper memory usage
Use Xms and Xmx=128M to reduce overall Zookeeper memory usage.
2023-09-01 15:42:24 -04:00
b4e36d146a Add tuning for Ceph OSDs 2023-09-01 15:42:24 -04:00
24764fe704 Don't use libjemalloc for Ceph daemons
This was an artifact of a much, much older Ceph configuration I ran, and
is not relevant with newer Ceph versions like those used in PVC.
Performance testing with Nautilus and Bluestore reveals a minimal
performance hit, and using `jemalloc` prevents cache autotuning from
being effective, so remove it.
2023-09-01 15:42:24 -04:00
458e7b4872 Use new init command location
Command was renamed in the PVC CLI to facilitate other "task" actions
like backup/restore.
2023-09-01 15:42:24 -04:00
bcb5962353 Add jute.maxbuffer to Zookeeper environment ops
Adds this option based on the findings of
https://github.com/python-zk/kazoo/issues/630, whereby restores of >1MB
in size would fail. This is considered an unsafe option, but given our
usecase no actual znode should ever exceed this limit; this is purely
for the large transactions that come from a `pvc task restore` action to
an empty Zookeeper instance.
2023-09-01 15:42:24 -04:00
f79fb605de Support using existing SSL certs on system
Add the additional pvc_api_ssl_cert_path and pvc_api_ssl_key_path
group_vars options, which can be used to set the SSL details to existing
files on the filesystem if desired. If these are empty (or nonexistent),
the original pvc_api_ssl_cert and pvc_api_ssl_key raw format options
will be used as they were.

Allows the administrator to use outside methods (such as Let's Encrypt)
to obtain the certs locally on the system, avoiding changes to the
group_vars and redeployment to manage SSL keys.
2023-09-01 15:42:24 -04:00
2caed2ae12 Rename remaining "pvc_prov" items to pvc_api 2023-09-01 15:42:24 -04:00
2a2d318dbc Change name of default API database
From pvcprov to pvcapi to reflect the changing use of this database.
2023-09-01 15:42:24 -04:00
833d99a360 Add comments to defaults 2023-09-01 15:42:24 -04:00
8109f13386 Add additional configuration to group_vars
Also include defaults and the new pvc_vm_shutdown_timeout option.
2023-09-01 15:42:24 -04:00
72df058684 Ensure ZK prioritizes IPv4 2023-09-01 15:42:24 -04:00
457e18a850 Use FQDN for Zookeeper server entries 2023-09-01 15:42:24 -04:00
fbbf5ffe09 Use cluster_group variable for paths
Instead of trying to automagic this group out of the Ansible hostvars,
just make it explicitly defined in the group_vars to avoid any
confusion.
2023-09-01 15:42:23 -04:00
a925e4bd40 Ignore errors in bringing up bootstrap interfaces 2023-09-01 15:42:23 -04:00
e3ad750412 Add storage components to default pvcnoded.yaml 2023-09-01 15:42:23 -04:00