237 Commits

Author SHA1 Message Date
b4e9ed5d39 Ensure DEBIAN_FRONTEND is noninteractive 2021-06-22 03:15:23 -04:00
4ccc23bd85 Add python3 version of psycopg2 explicitly 2021-06-22 03:15:23 -04:00
8a140f70dc Use inventory_hostname for IPMI dict 2021-06-22 03:15:23 -04:00
836c946c72 Use independent fact to work around codename 2021-06-07 10:54:55 -04:00
69c037c136 Ensure backup_keys isn't empty 2021-06-06 00:41:53 -04:00
6b79e5db31 Avoid writing hosts if empty 2021-06-05 01:12:00 -04:00
8fa8590eb8 Ensure apt-update runs if configs update 2021-06-05 01:03:35 -04:00
9dc0949b47 Add bullseye support 2021-06-05 00:56:02 -04:00
998e5a8752 Add directory creation to backup script 2021-06-01 10:16:08 -04:00
0aa328e350 Add PostgreSQL to daily backup script 2021-06-01 10:10:22 -04:00
9deee94332 Update tags and fix backup keys to var 2021-05-27 12:29:19 -04:00
e76832de91 Allow inter-cluster orphan NTP sync
Due to the requirement of Ceph to have all peer nodes tightly
synchronized with each other to come online, PVC nodes need a way to
synchronize to each other even in the absence of an external time
reference. This is especially prevalent if a set of nodes are left
offline for an extended period (>1-2 weeks), since their hardware clocks
will drift. If the resulting Internet connectivity is then dependent on
a VM, this will cause a catch-22 and the cluster will not properly
start.

This configuration will accomplish that - if no suitable >6 stratum
peers are found, the hosts will enter orphan mode. Since they are now
all configured as "peers" with each other, they will collectively decide
on one of them to become the source and sync to it. A local stratum 10
fudge is added so that at least one of the nodes can become this source.

While this is not an ideal use of NTP, it is by far the cleanest
solution to this problem, and does not impact normal functionality when
the two configured stratum-2 servers are reachable.
2021-05-19 11:03:18 -04:00
238449904f Move some other tasks to bootstrap role
Avoids an issue where the pvcnoded service is stopped on non-bootstrap
runs.
2021-05-13 10:17:38 -04:00
7536732f30 Remove GRUB config from base role
This is not actually ideal.
2021-05-12 14:55:57 -04:00
04bc9730a0 Fix version sorting bugs in kernel-cleanup.sh 2021-05-12 14:40:18 -04:00
45322e0f9e Add additional items to base role
Backups, GRUB configuration, and IPMI configuration.
2021-05-12 13:53:15 -04:00
da9eafcdfa Fix sudoers to use conditional deploy_username 2021-04-13 16:50:05 -04:00
70ba4b240f Allow configurable fail2ban IPs 2021-04-13 16:44:49 -04:00
ce3554b530 Allow customization of deploy username 2021-04-13 11:30:42 -04:00
3819cd87fd Move to more dynamic apt configs
Allow specifying repository URLs in the group_vars, and add
release-specific template files to support future version changes.
2021-04-08 14:14:25 -04:00
404751f695 Update relative path to bootstrap files 2021-04-08 14:04:56 -04:00
622cef1586 Remove superfluous symlink 2021-04-08 13:50:47 -04:00
6589a9cd38 Add sensible sorting of kernel removals 2021-04-08 13:46:43 -04:00
6598637e91 Remove cruft and add mkpasswd setup 2021-04-08 13:46:30 -04:00
dcd0b48d94 Correct bad indentation in base role 2021-03-18 09:36:49 -04:00
82fa85834a Add libguestfs-tools to libvirt role deps 2021-03-15 13:39:37 -04:00
ca3a5e144f Update tags and add kernel-cleanup script 2021-02-02 15:41:38 -05:00
1c05c8729f Fix incorrect systemd enabling in Patroni 2021-01-28 16:28:02 -05:00
f4974d648d Add some additional compression libraries 2021-01-28 13:34:58 -05:00
fa0aeec88e Add local domain to resolver config 2021-01-28 13:34:26 -05:00
04ca8f73d2 Correct bugs during bootstrap
1. Ensure Zookeeper restarts and checks out successfully before
proceeding with other steps.
2. Make sure PVC itself doesn't start prematurely.
2021-01-28 13:32:36 -05:00
b7f251ea16 Retry pgsql bootstrap startup 6 times
This will sometimes fail, so retry it several times
2021-01-27 15:45:36 -05:00
7b08610efa Retry msgr2 enabling 6 times
This will sometimes fail, so retry it several times
2021-01-27 14:13:09 -05:00
c4c285c7b3 Remove invalid timezone entries in postgres conf 2021-01-26 15:20:25 -05:00
7585553225 Add default values 2020-12-21 00:20:45 -05:00
ac071f4bf0 Add configurable ZK memory limits 2020-12-21 00:20:45 -05:00
98e3e39570 Remove libjemalloc package 2020-12-21 00:20:45 -05:00
8e104113d7 Tune Zookeeper memory usage
Use Xms and Xmx=128M to reduce overall Zookeeper memory usage.
2020-12-21 00:20:45 -05:00
de04105a38 Add tuning for Ceph OSDs 2020-12-21 00:20:45 -05:00
28c86d170f Don't use libjemalloc for Ceph daemons
This was an artifact of a much, much older Ceph configuration I ran, and
is not relevant with newer Ceph versions like those used in PVC.
Performance testing with Nautilus and Bluestore reveals a minimal
performance hit, and using `jemalloc` prevents cache autotuning from
being effective, so remove it.
2020-12-21 00:20:45 -05:00
cb96ef4e7a Use new init command location
Command was renamed in the PVC CLI to facilitate other "task" actions
like backup/restore.
2020-11-24 12:22:34 -05:00
3c0c3e8e56 Add jute.maxbuffer to Zookeeper environment ops
Adds this option based on the findings of
https://github.com/python-zk/kazoo/issues/630, whereby restores of >1MB
in size would fail. This is considered an unsafe option, but given our
usecase no actual znode should ever exceed this limit; this is purely
for the large transactions that come from a `pvc task restore` action to
an empty Zookeeper instance.
2020-11-24 12:20:25 -05:00
da8c357d38 Add PVC status MOTD script 2020-11-17 12:48:53 -05:00
9f84609808 Set proper mode on agent plugins 2020-10-27 15:48:57 -04:00
2d1b76ecdf Add check-mk-agent plugin installs
These are used by various Ansible tasks, even if the administrator is
not using Check_MK for monitoring.
2020-10-27 15:41:20 -04:00
2b0398dec8 Add PCI and USB utils 2020-10-05 16:10:10 -04:00
934f73af0f Support using existing SSL certs on system
Add the additional pvc_api_ssl_cert_path and pvc_api_ssl_key_path
group_vars options, which can be used to set the SSL details to existing
files on the filesystem if desired. If these are empty (or nonexistent),
the original pvc_api_ssl_cert and pvc_api_ssl_key raw format options
will be used as they were.

Allows the administrator to use outside methods (such as Let's Encrypt)
to obtain the certs locally on the system, avoiding changes to the
group_vars and redeployment to manage SSL keys.
2020-08-26 14:11:14 -04:00
2edea75fbe Use generic Debian repos and PVC component 2020-08-26 12:16:39 -04:00
2f2123b70e Rename remaining "pvc_prov" items to pvc_api 2020-08-25 13:01:48 -04:00
d79c587384 Change name of default API database
From pvcprov to pvcapi to reflect the changing use of this database.
2020-08-25 02:00:29 -04:00