244 Commits

Author SHA1 Message Date
fc5bcf139c Fix name of IPMI check again 2023-09-01 15:42:28 -04:00
44cedf66c9 Fix name of ipmi check 2023-09-01 15:42:28 -04:00
9f7dbfb4f8 Add IPMI check to tasks 2023-09-01 15:42:28 -04:00
b9ae4d1009 Adjust headers and add LOM check 2023-09-01 15:42:27 -04:00
48fb21af75 Add node list to PVC MOTD 2023-09-01 15:42:27 -04:00
e009cf4076 Fix whitespaced manufacturer and bad [[ 2023-09-01 15:42:27 -04:00
e65f1d15a6 Add coordinator state to MOTD 2023-09-01 15:42:27 -04:00
894ce9b517 Support unknown manufacturers in MOTD 2023-09-01 15:42:27 -04:00
55ec177919 Ignore errors restarting libvirtd
This seems to inexplicably fail sometimes. We can just ignore it.
2023-09-01 15:42:27 -04:00
b814ec60f6 Add resolv.conf customization 2023-09-01 15:42:27 -04:00
ddecb94348 Disable unified cgroup heirarchy on kernel cmdline
This is required on Debian 11 to use the cset tool, since the newer
systemd implementation of a unified cgroup hierarchy is not compatible
with the cset tool.

Ref for future use:
  https://github.com/lpechacek/cpuset/issues/40
2023-09-01 15:42:27 -04:00
be3ce67574 Use inventory_hostname in IPMI fragment 2023-09-01 15:42:27 -04:00
5f05835721 Update bondX configuration 2023-09-01 15:42:27 -04:00
4cb2d7835c Add setting bridge_mtu to config 2023-09-01 15:42:27 -04:00
9f16995f59 Add smartmontools to base package list 2023-09-01 15:42:27 -04:00
6e2d661134 Adjust documentation and behaviour of cpuset
1. Detail the caveats and specific situations and ref the documentation
which will provide more details.

2. Always install the configs, but use /etc/default/ceph-osd-cpuset to
control if the script does anything or not (so, the "osd" cset set is
always active just not set in a special way.
2023-09-01 15:42:27 -04:00
83bd1b1efd Install cset configs even if disabled
The setup script handles this instead.
2023-09-01 15:42:27 -04:00
7927ec4f11 Allow dynamic enabling/disabling of cset
Add a separate config to handle enable/disable on the system itself.
2023-09-01 15:42:27 -04:00
2ae9b9075a Adjust default ceph.conf parameters
1. Remove an explicit OSD journal size, especially such a small one (no
clue why I ever added that...)

2. Add max scrubs, disable scrub during recovery, and set scrub sleep.

3. Add max backfills, tune recovery sleep to 0 to prioritize recovery.
2023-09-01 15:42:27 -04:00
6e48d6fe84 Add Ceph OSD cpuset tuning options
Allows an administrator to set CPU pinning with the cpuset tool for Ceph
OSDs, in situations where CPU contention with VMs or other system tasks
may be negatively affecting OSD performance. This is optional, advanced
tuning and is disabled by default.
2023-09-01 15:42:27 -04:00
45424a28ce Fix bad flag 2023-09-01 15:42:27 -04:00
044a14fa6d Add package installs for different Debian versions 2023-09-01 15:42:27 -04:00
ae40227ea1 Move paths and keys to defaults 2023-09-01 15:42:27 -04:00
f25a80ff53 Add additional CMK checks 2023-09-01 15:42:26 -04:00
8c2d117a3c Wait longer when restarting services
From 15 -> 30 seconds to ensure more time for stabilization before
proceeding with the next.
2023-09-01 15:42:26 -04:00
647ca1c446 Add default features flag to ceph.conf generator
Coupled with the removal of explicit --image-features flags to the RBD
command in PVC itself, this ensures that only the two features supported
on kernel 4.19 are enabled by default.
2023-09-01 15:42:26 -04:00
86eaeed2b4 Fix sources.list for Bullseye 2023-09-01 15:42:26 -04:00
3d64ad2420 Typo fix 2023-09-01 15:42:26 -04:00
eaea860b61 Lower autopurge interval to 1 hour 2023-09-01 15:42:26 -04:00
524f857f56 Add some Zookeeper configuration tweaks 2023-09-01 15:42:26 -04:00
13556918d7 Disable any systemd start rate limiting
Because Zookeeper is supremely stupid (see last commit) we want to
disable start limiting. It needs to keep trying forever until it starts.
2023-09-01 15:42:26 -04:00
8eecc95f2f Ensure Zookeeper restarts itself
The Zookeeper daemon does not appear to exit with any status other than
0, even after a fatal error. Work around this.
2023-09-01 15:42:26 -04:00
b03ecf0125 Add -XX:+AlwaysPreTouch option for Zookeeper 2023-09-01 15:42:26 -04:00
b842276002 Lower keep count for Zookeeper vacuum to 3
Required to keep disk space growth down when using zookeeper_logging
functionality.
2023-09-01 15:42:26 -04:00
681afd1d1b Fix excessive whitespace 2023-09-01 15:42:26 -04:00
2d31e6c8ea Fix memory tuning issues 2023-09-01 15:42:26 -04:00
71b6da6555 Adjust package lists per Debian version 2023-09-01 15:42:26 -04:00
4b0a4ae73c Fix bad Ansible variable name 2023-09-01 15:42:26 -04:00
a52d4cbf37 Add Zookeeper logging configs 2023-09-01 15:42:26 -04:00
7bacbd5dd6 Don't fail if IPMI tasks fail 2023-09-01 15:42:26 -04:00
eef0f959dd Add GRUB, Plymouth themes and issue for PVC 2023-09-01 15:42:26 -04:00
6d3e5ac728 Fix zkcli for good 2023-09-01 15:42:26 -04:00
e760114b8d Fix bootstrap collection path for Ceph 2023-09-01 15:42:26 -04:00
bace67b8bf Add GRUB configuration to Ansible role 2023-09-01 15:42:26 -04:00
0802cca980 Support both versions of psycopg2 and kazoo 2023-09-01 15:42:26 -04:00
31a677b444 Fix Patroni ACL to use subnet mask 2023-09-01 15:42:26 -04:00
35089f6dda Fix zkcli alias to use hostname 2023-09-01 15:42:26 -04:00
9dc9139c35 Use short ansible_hostname in ipmi fragment 2023-09-01 15:42:26 -04:00
329bc9690e Add ipmitool to packages list 2023-09-01 15:42:26 -04:00
a2ed38b459 Add generic SR-IOV configuration 2023-09-01 15:42:26 -04:00