Commit Graph

380 Commits

Author SHA1 Message Date
Joshua Boniface 5797535997 Adjust documentation and behaviour of cpuset
1. Detail the caveats and specific situations, and reference the
documentation, which provides more details.

2. Always install the configs, but use /etc/default/ceph-osd-cpuset to
control whether the script does anything or not (so the "osd" cset set is
always active, just not set in a special way).
2021-09-29 20:49:00 -04:00
Joshua Boniface 81cf341c32 Install cset configs even if disabled
The setup script handles this instead.
2021-09-29 10:23:01 -04:00
Joshua Boniface 645249b57e Allow dynamic enabling/disabling of cset
Add a separate config to handle enable/disable on the system itself.
2021-09-29 10:21:47 -04:00
Joshua Boniface 8ac2a5ea0c Adjust default ceph.conf parameters
1. Remove an explicit OSD journal size, especially such a small one (no
clue why I ever added that...)

2. Add max scrubs, disable scrub during recovery, and set scrub sleep.

3. Add max backfills, tune recovery sleep to 0 to prioritize recovery.
2021-09-28 02:09:50 -04:00
Joshua Boniface 732bfe732c Add Ceph OSD cpuset tuning options
Allows an administrator to set CPU pinning with the cpuset tool for Ceph
OSDs, in situations where CPU contention with VMs or other system tasks
may be negatively affecting OSD performance. This is optional, advanced
tuning and is disabled by default.
2021-09-27 00:27:57 -04:00
Joshua Boniface d7b07925bb Fix bad flag 2021-09-09 13:07:15 -04:00
Joshua Boniface de64437ff6 Add interpreter var to sites group 2021-09-09 13:04:50 -04:00
Joshua Boniface 77c84cec52 Add package installs for different Debian versions 2021-09-09 12:59:18 -04:00
Joshua Boniface a91112fa71 Move paths and keys to defaults 2021-08-24 15:25:42 -04:00
Joshua Boniface 2e9d02ab52 Add additional CMK checks 2021-08-21 15:41:44 -04:00
Joshua Boniface 9b103f48ad Update default repo domain
We now have parallelvirtualcluster.org set up, so use that instead of
the bonifacelabs domain.
2021-08-21 01:00:53 -04:00
Joshua Boniface b37d6c3009 Wait longer when restarting services
Increase the wait from 15 to 30 seconds to allow more time for
stabilization before proceeding with the next service.
2021-07-30 11:46:49 -04:00
Joshua Boniface b62731199f Add default features flag to ceph.conf generator
Coupled with the removal of explicit --image-features flags from the RBD
command in PVC itself, this ensures that only the two features supported
on kernel 4.19 are enabled by default.
2021-07-30 11:39:24 -04:00
Joshua Boniface 2cc4548af6 Fix sources.list for Bullseye 2021-07-26 00:36:39 -04:00
Joshua Boniface 311f388f56 Increase flush/unflush wait timeout
Bump this from 10 minutes (60 * 10 seconds) to 30 minutes (180 * 10
seconds) to ensure there is sufficient time for (relatively) large VMs
to migrate with (relatively) slow networking.
2021-07-22 16:16:27 -04:00
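
The timeout arithmetic in the commit above can be checked with a short sketch. It assumes the wait is a polling loop that retries a check a fixed number of times with a fixed delay between attempts. The commit message gives only the figures (60 * 10 seconds and 180 * 10 seconds), so the names `total_wait_seconds` and `wait_until` are hypothetical and are not taken from the playbook itself.

```python
import time


def total_wait_seconds(retries: int, delay_seconds: int) -> int:
    """Upper bound on the time a retry loop sleeps before giving up."""
    return retries * delay_seconds


def wait_until(check, retries: int, delay_seconds: int, sleep=time.sleep) -> bool:
    """Poll check() up to `retries` times, sleeping between attempts.

    Hypothetical helper for illustration only; returns True as soon as
    check() succeeds and False once every attempt has been used.
    """
    for _ in range(retries):
        if check():
            return True
        sleep(delay_seconds)
    return False


if __name__ == "__main__":
    old = total_wait_seconds(60, 10)   # previous limit from the commit message
    new = total_wait_seconds(180, 10)  # new limit from the commit message
    print(f"old: {old // 60} minutes, new: {new // 60} minutes")
```

Under this assumption, raising the retry count rather than the delay keeps the polling interval at 10 seconds, so a fast migration is still detected promptly while a slow one gets up to 30 minutes.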
Joshua Boniface 942743daef Use wait on secondary and delay for 15 seconds 2021-07-22 09:35:00 -04:00
Joshua Boniface 01020ab29c Rename Daemon upgrade playbook to match 2021-07-22 09:34:26 -04:00
Joshua Boniface dd2fe47881 Typo fix 2021-07-20 13:59:47 -04:00
Joshua Boniface 9e42e6ae88 Lower autopurge interval to 1 hour 2021-07-20 13:57:59 -04:00
Joshua Boniface 13dd41bb3e Add some Zookeeper configuration tweaks 2021-07-19 16:31:40 -04:00
Joshua Boniface f294817b55 Disable any systemd start rate limiting
Because Zookeeper is supremely stupid (see last commit) we want to
disable start limiting. It needs to keep trying forever until it starts.
2021-07-19 13:21:16 -04:00
Joshua Boniface b112663ef0 Ensure Zookeeper restarts itself
The Zookeeper daemon does not appear to exit with any status other than
0, even after a fatal error. Work around this.
2021-07-19 13:03:09 -04:00
Joshua Boniface bd4d94568e Add -XX:+AlwaysPreTouch option for Zookeeper 2021-07-19 12:46:21 -04:00
Joshua Boniface e232ab00da Lower keep count for Zookeeper vacuum to 3
Required to keep disk space growth down when using zookeeper_logging
functionality.
2021-07-19 09:51:07 -04:00
Joshua Boniface 2fa1b8fb2b Readjust warning one more time
The actual cause of the issue wasn't memory, but still leave this
warning in (albeit a little less strict).
2021-07-18 22:15:13 -04:00
Joshua Boniface 3adacf3107 Fix excessive whitespace 2021-07-18 22:13:09 -04:00
Joshua Boniface f285cf4ffc Tweak the memory warning to be even stricter 2021-07-18 21:41:45 -04:00
Joshua Boniface 87cfb643a7 Adjust warning message about Zookeeper memory size 2021-07-18 20:36:13 -04:00
Joshua Boniface 764c2c3928 Fix memory tuning issues 2021-07-18 18:51:21 -04:00
Joshua Boniface 10a1754285 Adjust package lists per Debian version 2021-07-18 18:36:58 -04:00
Joshua Boniface b33096202e Fix bad Ansible variable name 2021-07-18 17:49:42 -04:00
Joshua Boniface 0e046b48d4 Add Zookeeper logging configs 2021-07-18 17:47:02 -04:00
Joshua Boniface a1362c4363 Don't fail if IPMI tasks fail 2021-07-07 10:42:30 -04:00
Joshua Boniface 9ad420eaa5 Reduce timeouts in upgrade playbook 2021-07-06 09:47:06 -04:00
Joshua Boniface 6a3b00807c Fix name to be more clear 2021-07-06 09:30:30 -04:00
Joshua Boniface bb094193b4 Adjust ordering of flush task 2021-07-06 09:28:59 -04:00
Joshua Boniface 96544aabb8 Add GRUB, Plymouth themes and issue for PVC 2021-06-30 02:50:18 -04:00
Joshua Boniface 9d4eb89bde Fix zkcli for good 2021-06-29 18:16:02 -04:00
Joshua Boniface c0ad9740f4 Fix bootstrap collection path for Ceph 2021-06-29 17:52:21 -04:00
Joshua Boniface 3d47b12b76 Add GRUB configuration to Ansible role 2021-06-29 17:48:55 -04:00
Joshua Boniface 120871ee45 Support both versions of psycopg2 and kazoo 2021-06-29 17:29:01 -04:00
Joshua Boniface 231cb7b2aa Fix Patroni ACL to use subnet mask 2021-06-29 16:47:55 -04:00
Joshua Boniface d794197633 Fix zkcli alias to use hostname 2021-06-29 16:47:42 -04:00
Joshua Boniface 9855088a8e Use short ansible_hostname in ipmi fragment 2021-06-29 15:38:19 -04:00
Joshua Boniface 10e8947cb0 Add ipmitool to packages list 2021-06-29 15:30:54 -04:00
Joshua Boniface 53872c0056 Add generic SR-IOV configuration 2021-06-22 03:47:03 -04:00
Joshua Boniface d88ba7272d Ensure we can connect to Patroni 2021-06-22 03:28:36 -04:00
Joshua Boniface 84bf1d7efa Use IPs for Patroni configuration 2021-06-22 03:27:01 -04:00
Joshua Boniface ae45da3f85 Bump max connections in Zookeeper to 200 2021-06-22 03:15:23 -04:00
Joshua Boniface c6590f8ab9 Configure Zookeeper only on Cluster address 2021-06-22 03:15:23 -04:00