Joshua Boniface
4ccc23bd85
Add python3 version of psycopg2 explicitly
2021-06-22 03:15:23 -04:00
Joshua Boniface
8a140f70dc
Use inventory_hostname for IPMI dict
2021-06-22 03:15:23 -04:00
Joshua Boniface
836c946c72
Use independent fact to work around codename
2021-06-07 10:54:55 -04:00
Joshua Boniface
69c037c136
Ensure backup_keys isn't empty
2021-06-06 00:41:53 -04:00
Joshua Boniface
16715f0e35
Adjust comments and IPMI value
2021-06-06 00:41:34 -04:00
Joshua Boniface
030b47ac16
Add example backup_keys var
2021-06-05 02:06:36 -04:00
Joshua Boniface
fb7a214fbc
Ignore a virtualenv if used
2021-06-05 01:54:25 -04:00
Joshua Boniface
6d8f08e1b2
Add updated IPMI configuration to match base role
2021-06-05 01:47:09 -04:00
Joshua Boniface
6b79e5db31
Avoid writing hosts if empty
2021-06-05 01:12:00 -04:00
Joshua Boniface
8fa8590eb8
Ensure apt-update runs if configs update
2021-06-05 01:03:35 -04:00
Joshua Boniface
9dc0949b47
Add bullseye support
2021-06-05 00:56:02 -04:00
Joshua Boniface
998e5a8752
Add directory creation to backup script
2021-06-01 10:16:08 -04:00
Joshua Boniface
0aa328e350
Add PostgreSQL to daily backup script
2021-06-01 10:10:22 -04:00
Joshua Boniface
9deee94332
Update tags and fix backup keys to var
2021-05-27 12:29:19 -04:00
Joshua Boniface
cae8cfc4cb
Add norestart policy for apt updates
2021-05-27 01:38:43 -04:00
Joshua Boniface
491ea77306
Add README and daemon upgrade playbook, cleanups
2021-05-20 11:02:47 -04:00
Joshua Boniface
e76832de91
Allow inter-cluster orphan NTP sync
...
Due to the requirement of Ceph to have all peer nodes tightly
synchronized with each other to come online, PVC nodes need a way to
synchronize to each other even in the absence of an external time
reference. This is especially prevalent if a set of nodes are left
offline for an extended period (>1-2 weeks), since their hardware clocks
will drift. If the resulting Internet connectivity is then dependent on
a VM, this will cause a catch-22 and the cluster will not properly
start.
This configuration will accomplish that - if no suitable >6 stratum
peers are found, the hosts will enter orphan mode. Since they are now
all configured as "peers" with each other, they will collectively decide
on one of them to become the source and sync to it. A local stratum 10
fudge is added so that at least one of the nodes can become this source.
While this is not an ideal use of NTP, it is by far the cleanest
solution to this problem, and does not impact normal functionality when
the two configured stratum-2 servers are reachable.
2021-05-19 11:03:18 -04:00
Joshua Boniface
238449904f
Move some other tasks to bootstrap role
...
Avoids an issue where the pvcnoded service is stopped on non-bootstrap
runs.
2021-05-13 10:17:38 -04:00
Joshua Boniface
7536732f30
Remove GRUB config from base role
...
This is not actually ideal.
2021-05-12 14:55:57 -04:00
Joshua Boniface
04bc9730a0
Fix version sorting bugs in kernel-cleanup.sh
2021-05-12 14:40:18 -04:00
Joshua Boniface
45322e0f9e
Add additional items to base role
...
Backups, GRUB configuration, and IPMI configuration.
2021-05-12 13:53:15 -04:00
Joshua Boniface
da9eafcdfa
Fix sudoers to use conditional deploy_username
2021-04-13 16:50:05 -04:00
Joshua Boniface
70ba4b240f
Allow configurable fail2ban IPs
2021-04-13 16:44:49 -04:00
Joshua Boniface
ce3554b530
Allow customization of deploy username
2021-04-13 11:30:42 -04:00
Joshua Boniface
593a81e07c
Fix group_vars to match new setup
2021-04-08 14:15:11 -04:00
Joshua Boniface
3819cd87fd
Move to more dynamic apt configs
...
Allow specifying repository URLs in the group_vars, and add
release-specific template files to support future version changes.
2021-04-08 14:14:25 -04:00
Joshua Boniface
3e1d3a90b0
Update root password in default group_vars
2021-04-08 14:08:21 -04:00
Joshua Boniface
404751f695
Update relative path to bootstrap files
2021-04-08 14:04:56 -04:00
Joshua Boniface
622cef1586
Remove superfluous symlink
2021-04-08 13:50:47 -04:00
Joshua Boniface
6589a9cd38
Add sensible sorting of kernel removals
2021-04-08 13:46:43 -04:00
Joshua Boniface
6598637e91
Remove cruft and add mkpasswd setup
2021-04-08 13:46:30 -04:00
Joshua Boniface
25674731cd
Update file copyright header
2021-03-25 16:58:58 -04:00
Joshua Boniface
dcd0b48d94
Correct bad indentation in base role
2021-03-18 09:36:49 -04:00
Joshua Boniface
82fa85834a
Add libguestfs-tools to libvirt role deps
2021-03-15 13:39:37 -04:00
Joshua Boniface
510db0df58
Add cleanup to update oneshot playbook
2021-02-02 15:41:38 -05:00
Joshua Boniface
ca3a5e144f
Update tags and add kernel-cleanup script
2021-02-02 15:41:38 -05:00
Joshua Boniface
1c05c8729f
Fix incorrect systemd enabling in Patroni
2021-01-28 16:28:02 -05:00
Joshua Boniface
4b179b66ed
Add reboot to purge
2021-01-28 14:13:15 -05:00
Joshua Boniface
71edb9db15
Remove log dirs during purge
2021-01-28 14:12:40 -05:00
Joshua Boniface
f4974d648d
Add some additional compression libraries
2021-01-28 13:34:58 -05:00
Joshua Boniface
fa0aeec88e
Add local domain to resolver config
2021-01-28 13:34:26 -05:00
Joshua Boniface
04ca8f73d2
Correct bugs during bootstrap
...
1. Ensure Zookeeper restarts and checks out successfully before
proceeding with other steps.
2. Make sure PVC itself doesn't start prematurely.
2021-01-28 13:32:36 -05:00
Joshua Boniface
21e3e0e172
Add removal of Zookeeper keys too
2021-01-28 13:26:46 -05:00
Joshua Boniface
20d802f0b0
Update purge script
2021-01-27 17:08:38 -05:00
Joshua Boniface
b7f251ea16
Retry pgsql bootstrap startup 6 times
...
This will sometimes fail, so retry it several times
2021-01-27 15:45:36 -05:00
Joshua Boniface
7b08610efa
Retry msgr2 enabling 6 times
...
This will sometimes fail, so retry it several times
2021-01-27 14:13:09 -05:00
Joshua Boniface
c4c285c7b3
Remove invalid timezone entries in postgres conf
2021-01-26 15:20:25 -05:00
Joshua Boniface
97869ca5c3
Reorder Ceph stop and lower some waits
2021-01-07 11:11:16 -05:00
Joshua Boniface
d35250b870
Add tasks to verify node has finished (un)flushing
2021-01-07 10:49:23 -05:00
Joshua Boniface
cd164d1984
Increase all wait timeouts to 30s
...
Ensure that even on slow(er) clusters, these timeouts have more time to
complete before proceeding so the task won't fail.
2021-01-05 16:17:19 -05:00