206 Commits

Author SHA1 Message Date
a52d4cbf37 Add Zookeeper logging configs 2023-09-01 15:42:26 -04:00
7bacbd5dd6 Don't fail if IPMI tasks fail 2023-09-01 15:42:26 -04:00
eef0f959dd Add GRUB, Plymouth themes and issue for PVC 2023-09-01 15:42:26 -04:00
6d3e5ac728 Fix zkcli for good 2023-09-01 15:42:26 -04:00
e760114b8d Fix bootstrap collection path for Ceph 2023-09-01 15:42:26 -04:00
bace67b8bf Add GRUB configuration to Ansible role 2023-09-01 15:42:26 -04:00
0802cca980 Support both versions of psycopg2 and kazoo 2023-09-01 15:42:26 -04:00
31a677b444 Fix Patroni ACL to use subnet mask 2023-09-01 15:42:26 -04:00
35089f6dda Fix zkcli alias to use hostname 2023-09-01 15:42:26 -04:00
9dc9139c35 Use short ansible_hostname in ipmi fragment 2023-09-01 15:42:26 -04:00
329bc9690e Add ipmitool to packages list 2023-09-01 15:42:26 -04:00
a2ed38b459 Add generic SR-IOV configuration 2023-09-01 15:42:26 -04:00
0fc889df32 Ensure we can connect to Patroni 2023-09-01 15:42:26 -04:00
388db6ad1d Use IPs for Patroni configuration 2023-09-01 15:42:26 -04:00
d455b31905 Bump max connections in Zookeeper to 200 2023-09-01 15:42:26 -04:00
f105f0497c Configure Zookeeper only on Cluster address 2023-09-01 15:42:26 -04:00
7e94dddb4c Ensure libvirtd restarts when unit changes 2023-09-01 15:42:26 -04:00
c9df64bc7d Ensure deb-src is present for bullseye 2023-09-01 15:42:26 -04:00
0bbb91fc8b Add custom libvirtd.service override unit
This makes no functional change on Buster, but on Bullseye it overrides
the stupid socket-based activation shenanigans that the default unit
tries to do, as well as the breaking rename of the
/etc/default/libvirt variable names.
2023-09-01 15:42:26 -04:00
3a67dc129b Ensure DEBIAN_FRONTEND is noninteractive 2023-09-01 15:42:26 -04:00
0114ad8ed5 Add python3 version of psycopg2 explicitly 2023-09-01 15:42:26 -04:00
a548bdcc6a Use inventory_hostname for IPMI dict 2023-09-01 15:42:26 -04:00
6104e0a5a5 Use independent fact to work around codename 2023-09-01 15:42:26 -04:00
5c46bb0db7 Ensure backup_keys isn't empty 2023-09-01 15:42:25 -04:00
d69770b776 Avoid writing hosts if empty 2023-09-01 15:42:25 -04:00
f4e49b9d3e Ensure apt-update runs if configs update 2023-09-01 15:42:25 -04:00
9438ab46d7 Add bullseye support 2023-09-01 15:42:25 -04:00
dc83f91bd8 Add directory creation to backup script 2023-09-01 15:42:25 -04:00
5466df7065 Add PostgreSQL to daily backup script 2023-09-01 15:42:25 -04:00
c9742fe2e5 Update tags and fix backup keys to var 2023-09-01 15:42:25 -04:00
7c7ca4a229 Allow intra-cluster orphan NTP sync
Due to the requirement of Ceph to have all peer nodes tightly
synchronized with each other to come online, PVC nodes need a way to
synchronize to each other even in the absence of an external time
reference. This is especially relevant if a set of nodes is left
offline for an extended period (>1-2 weeks), since their hardware clocks
will drift. If Internet connectivity then depends on a VM hosted on the
cluster, this creates a catch-22 and the cluster will not start
properly.

This configuration accomplishes that: if no suitable peers with a stratum
better than 6 are found, the hosts will enter orphan mode. Since they are now
all configured as "peers" with each other, they will collectively decide
on one of them to become the source and sync to it. A local stratum 10
fudge is added so that at least one of the nodes can become this source.

While this is not an ideal use of NTP, it is by far the cleanest
solution to this problem, and does not impact normal functionality when
the two configured stratum-2 servers are reachable.
2023-09-01 15:42:25 -04:00
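
A minimal sketch of the kind of ntpd configuration the orphan-mode commit
above describes, assuming classic ntpd directives (server, peer, tos orphan,
and a fudge on the local clock driver). The helper name, the host names, and
the iburst option are hypothetical and are not taken from the repository; the
actual template in the role may differ.

```python
def render_orphan_ntp_conf(upstream, peers, orphan_stratum=6, local_stratum=10):
    """Render an ntpd config fragment for orphan-mode peering.

    Hypothetical helper for illustration only.
    upstream: external time servers used during normal operation.
    peers:    the other cluster nodes (exclude the local node itself).
    """
    lines = []
    # Normal operation: sync to the configured external servers.
    for host in upstream:
        lines.append(f"server {host} iburst")
    # Every other cluster node is a symmetric peer of this one.
    for host in peers:
        lines.append(f"peer {host}")
    # Enter orphan mode when no source better than this stratum is reachable.
    lines.append(f"tos orphan {orphan_stratum}")
    # Local clock driver with a fudged stratum, so a node can act as the source.
    lines.append("server 127.127.1.0")
    lines.append(f"fudge 127.127.1.0 stratum {local_stratum}")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Example host names are assumptions, not values from the repository.
    print(render_orphan_ntp_conf(
        ["ntp1.example.org", "ntp2.example.org"],
        ["node2", "node3"],
    ))
```

With the external servers reachable, the peer and orphan lines stay idle; they
only matter when the nodes have nothing better to follow than each other.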
027a819a83 Move some other tasks to bootstrap role
Avoids an issue where the pvcnoded service is stopped on non-bootstrap
runs.
2023-09-01 15:42:25 -04:00
e53342474c Remove GRUB config from base role
This is not actually ideal.
2023-09-01 15:42:25 -04:00
4666db17cb Fix version sorting bugs in kernel-cleanup.sh 2023-09-01 15:42:25 -04:00
6903627150 Add additional items to base role
Backups, GRUB configuration, and IPMI configuration.
2023-09-01 15:42:25 -04:00
c96ad603b0 Fix sudoers to use conditional deploy_username 2023-09-01 15:42:25 -04:00
29363ebf80 Allow configurable fail2ban IPs 2023-09-01 15:42:25 -04:00
d9be39a048 Allow customization of deploy username 2023-09-01 15:42:25 -04:00
4dc5ebdba0 Move to more dynamic apt configs
Allow specifying repository URLs in the group_vars, and add
release-specific template files to support future version changes.
2023-09-01 15:42:25 -04:00
6a61f8f7bf Update relative path to bootstrap files 2023-09-01 15:42:25 -04:00
4caab67d03 Remove superfluous symlink 2023-09-01 15:42:25 -04:00
57e5953fd1 Add sensible sorting of kernel removals 2023-09-01 15:42:25 -04:00
2a72a826f5 Remove cruft and add mkpasswd setup 2023-09-01 15:42:25 -04:00
bf02da693f Correct bad indentation in base role 2023-09-01 15:42:25 -04:00
39b8229c35 Add libguestfs-tools to libvirt role deps 2023-09-01 15:42:25 -04:00
1f6cb077fa Update tags and add kernel-cleanup script 2023-09-01 15:42:25 -04:00
0bf9c6209c Fix incorrect systemd enabling in Patroni 2023-09-01 15:42:25 -04:00
c0dc6fad4e Add some additional compression libraries 2023-09-01 15:42:25 -04:00
a4be011884 Add local domain to resolver config 2023-09-01 15:42:25 -04:00
4f5dbee8ee Correct bugs during bootstrap
1. Ensure Zookeeper restarts and checks out successfully before
proceeding with other steps.
2. Make sure PVC itself doesn't start prematurely.
2023-09-01 15:42:25 -04:00