384 Commits

Author SHA1 Message Date
5ab40fa15f Update bondX configuration 2021-10-10 02:31:47 -04:00
2c0e09f657 Add setting bridge_mtu to config 2021-10-09 19:29:22 -04:00
74066e6ceb Avoid errors if noout fails 2021-10-07 16:31:52 -04:00
859cfbb51e Add smartmontools to base package list 2021-10-07 15:18:45 -04:00
5797535997 Adjust documentation and behaviour of cpuset
1. Detail the caveats and specific situations and ref the documentation
which will provide more details.

2. Always install the configs, but use /etc/default/ceph-osd-cpuset to
control if the script does anything or not (so, the "osd" cset set is
always active just not set in a special way.
2021-09-29 20:49:00 -04:00
81cf341c32 Install cset configs even if disabled
The setup script handles this instead.
2021-09-29 10:23:01 -04:00
645249b57e Allow dynamic enabling/disabling of cset
Add a separate config to handle enable/disable on the system itself.
2021-09-29 10:21:47 -04:00
8ac2a5ea0c Adjust default ceph.conf parameters
1. Remove an explicit OSD journal size, especially such a small one (no
clue why I ever added that...)

2. Add max scrubs, disable scrub during recovery, and set scrub sleep.

3. Add max backfills, tune recovery sleep to 0 to prioritize recovery.
2021-09-28 02:09:50 -04:00
732bfe732c Add Ceph OSD cpuset tuning options
Allows an administrator to set CPU pinning with the cpuset tool for Ceph
OSDs, in situations where CPU contention with VMs or other system tasks
may be negatively affecting OSD performance. This is optional, advanced
tuning and is disabled by default.
2021-09-27 00:27:57 -04:00
d7b07925bb Fix bad flag 2021-09-09 13:07:15 -04:00
de64437ff6 Add interpreter var to sites group 2021-09-09 13:04:50 -04:00
77c84cec52 Add package installs for different Debian versions 2021-09-09 12:59:18 -04:00
a91112fa71 Move paths and keys to defaults 2021-08-24 15:25:42 -04:00
2e9d02ab52 Add additional CMK checks 2021-08-21 15:41:44 -04:00
9b103f48ad Update default repo domain
We now have parallelvirtualcluster.org set up, so use that instead of
the bonifacelabs domain.
2021-08-21 01:00:53 -04:00
b37d6c3009 Wait longer when restarting services
From 15 -> 30 seconds to ensure more time for stabilization before
proceeding with the next.
2021-07-30 11:46:49 -04:00
b62731199f Add default features flag to ceph.conf generator
Coupled with the removal of explicit --image-features flags to the RBD
command in PVC itself, this ensures that only the two features supported
on kernel 4.19 are enabled by default.
2021-07-30 11:39:24 -04:00
2cc4548af6 Fix sources.list for Bullseye 2021-07-26 00:36:39 -04:00
311f388f56 Increase flush/unflush wait timeout
Bump this from 10 minutes (60 * 10 seconds) to 30 minutes (180 * 10
seconds) to ensure there is sufficient time for (relatively) large VMs
to migrate with (relatively) slow networking.
2021-07-22 16:16:27 -04:00
942743daef Use wait on secondary and delay for 15 seconds 2021-07-22 09:35:00 -04:00
01020ab29c Rename Daemon upgrade playbook to match 2021-07-22 09:34:26 -04:00
dd2fe47881 Typo fix 2021-07-20 13:59:47 -04:00
9e42e6ae88 Lower autopurge interval to 1 hour 2021-07-20 13:57:59 -04:00
13dd41bb3e Add some Zookeeper configuration tweaks 2021-07-19 16:31:40 -04:00
f294817b55 Disable any systemd start rate limiting
Because Zookeeper is supremely stupid (see last commit) we want to
disable start limiting. It needs to keep trying forever until it starts.
2021-07-19 13:21:16 -04:00
b112663ef0 Ensure Zookeeper restarts itself
The Zookeeper daemon does not appear to exit with any status other than
0, even after a fatal error. Work around this.
2021-07-19 13:03:09 -04:00
bd4d94568e Add -XX:+AlwaysPreTouch option for Zookeeper 2021-07-19 12:46:21 -04:00
e232ab00da Lower keep count for Zookeeper vacuum to 3
Required to keep disk space growth down when using zookeeper_logging
functionality.
2021-07-19 09:51:07 -04:00
2fa1b8fb2b Readjust warning one more time
The actual cause of the issue wasn't memory, but still leave this
warning in (albeit a little less strict).
2021-07-18 22:15:13 -04:00
3adacf3107 Fix excessive whitespace 2021-07-18 22:13:09 -04:00
f285cf4ffc Tweak the memory warning to be even stricter 2021-07-18 21:41:45 -04:00
87cfb643a7 Adjust warning message about Zookeeper memory size 2021-07-18 20:36:13 -04:00
764c2c3928 Fix memory tuning issues 2021-07-18 18:51:21 -04:00
10a1754285 Adjust package lists per Debian version 2021-07-18 18:36:58 -04:00
b33096202e Fix bad Ansible variable name 2021-07-18 17:49:42 -04:00
0e046b48d4 Add Zookeeper logging configs 2021-07-18 17:47:02 -04:00
a1362c4363 Don't fail if IPMI tasks fail 2021-07-07 10:42:30 -04:00
9ad420eaa5 Reduce timeouts in upgrade playbook 2021-07-06 09:47:06 -04:00
6a3b00807c Fix name to be more clear 2021-07-06 09:30:30 -04:00
bb094193b4 Adjust ordering of flush task 2021-07-06 09:28:59 -04:00
96544aabb8 Add GRUB, Plymouth themes and issue for PVC 2021-06-30 02:50:18 -04:00
9d4eb89bde Fix zkcli for good 2021-06-29 18:16:02 -04:00
c0ad9740f4 Fix bootstrap collection path for Ceph 2021-06-29 17:52:21 -04:00
3d47b12b76 Add GRUB configuration to Ansible role 2021-06-29 17:48:55 -04:00
120871ee45 Support both versions of psycopg2 and kazoo 2021-06-29 17:29:01 -04:00
231cb7b2aa Fix Patroni ACL to use subnet mask 2021-06-29 16:47:55 -04:00
d794197633 Fix zkcli alias to use hostname 2021-06-29 16:47:42 -04:00
9855088a8e Use short ansible_hostname in ipmi fragment 2021-06-29 15:38:19 -04:00
10e8947cb0 Add ipmitool to packages list 2021-06-29 15:30:54 -04:00
53872c0056 Add generic SR-IOV configuration 2021-06-22 03:47:03 -04:00