341 Commits

Author SHA1 Message Date
ca3a5e144f Update tags and add kernel-cleanup script 2021-02-02 15:41:38 -05:00
1c05c8729f Fix incorrect systemd enabling in Patroni 2021-01-28 16:28:02 -05:00
4b179b66ed Add reboot to purge 2021-01-28 14:13:15 -05:00
71edb9db15 Remove log dirs during purge 2021-01-28 14:12:40 -05:00
f4974d648d Add some additional compression libraries 2021-01-28 13:34:58 -05:00
fa0aeec88e Add local domain to resolver config 2021-01-28 13:34:26 -05:00
04ca8f73d2 Correct bugs during bootstrap
1. Ensure Zookeeper restarts and checks out successfully before
proceeding with other steps.
2. Make sure PVC itself doesn't start prematurely.
2021-01-28 13:32:36 -05:00
21e3e0e172 Add removal of Zookeeper keys too 2021-01-28 13:26:46 -05:00
20d802f0b0 Update purge script 2021-01-27 17:08:38 -05:00
b7f251ea16 Retry pgsql bootstrap startup 6 times
This will sometimes fail, so retry it several times
2021-01-27 15:45:36 -05:00
7b08610efa Retry msgr2 enabling 6 times
This will sometimes fail, so retry it several times
2021-01-27 14:13:09 -05:00
c4c285c7b3 Remove invalid timezone entries in postgres conf 2021-01-26 15:20:25 -05:00
97869ca5c3 Reorder Ceph stop and lower some waits 2021-01-07 11:11:16 -05:00
d35250b870 Add tasks to verify node has finished (un)flushing 2021-01-07 10:49:23 -05:00
cd164d1984 Increase all wait timeouts to 30s
Ensure that even on slow(er) clusters, these timeouts have more time to
complete before proceeding so the task won't fail.
2021-01-05 16:17:19 -05:00
7585553225 Add default values 2020-12-21 00:20:45 -05:00
ac071f4bf0 Add configurable ZK memory limits 2020-12-21 00:20:45 -05:00
98e3e39570 Remove libjemalloc package 2020-12-21 00:20:45 -05:00
8e104113d7 Tune Zookeeper memory usage
Use Xms and Xmx=128M to reduce overall Zookeeper memory usage.
2020-12-21 00:20:45 -05:00
de04105a38 Add tuning for Ceph OSDs 2020-12-21 00:20:45 -05:00
28c86d170f Don't use libjemalloc for Ceph daemons
This was an artifact of a much, much older Ceph configuration I ran, and
is not relevant with newer Ceph versions like those used in PVC.
Performance testing with Nautilus and Bluestore reveals a minimal
performance hit, and using `jemalloc` prevents cache autotuning from
being effective, so remove it.
2020-12-21 00:20:45 -05:00
f277acc974 Disable pvc-flush service while rebooting
Prevents the flush daemon from starting on node boot, before the
playbook is actually ready to unflush the node.
2020-12-15 14:32:50 -05:00
8b474760ed Tweak oneshot script
Cleanly stop daemons; check if OSDs are back before continuing; wait
less
2020-11-26 10:51:54 -05:00
cb96ef4e7a Use new init command location
Command was renamed in the PVC CLI to facilitate other "task" actions
like backup/restore.
2020-11-24 12:22:34 -05:00
3c0c3e8e56 Add jute.maxbuffer to Zookeeper environment ops
Adds this option based on the findings of
https://github.com/python-zk/kazoo/issues/630, whereby restores of >1MB
in size would fail. This is considered an unsafe option, but given our
usecase no actual znode should ever exceed this limit; this is purely
for the large transactions that come from a `pvc task restore` action to
an empty Zookeeper instance.
2020-11-24 12:20:25 -05:00
da8c357d38 Add PVC status MOTD script 2020-11-17 12:48:53 -05:00
9f84609808 Set proper mode on agent plugins 2020-10-27 15:48:57 -04:00
b4ba4f9eda Add cluster safe update playbook
This playbook will perform a oneshot upgrade of the systems in the
cluster, including performing a clean and safe reboot of the node(s) if
required (either due to services needing a restart, or the kernel
changing). It runs in serial=1 and only reboots if needed.
2020-10-27 15:41:20 -04:00
2d1b76ecdf Add check-mk-agent plugin installs
These are used by various Ansible tasks, even if the administrator is
not using Check_MK for monitoring.
2020-10-27 15:41:20 -04:00
2b0398dec8 Add PCI and USB utils 2020-10-05 16:10:10 -04:00
934f73af0f Support using existing SSL certs on system
Add the additional pvc_api_ssl_cert_path and pvc_api_ssl_key_path
group_vars options, which can be used to set the SSL details to existing
files on the filesystem if desired. If these are empty (or nonexistent),
the original pvc_api_ssl_cert and pvc_api_ssl_key raw format options
will be used as they were.

Allows the administrator to use outside methods (such as Let's Encrypt)
to obtain the certs locally on the system, avoiding changes to the
group_vars and redeployment to manage SSL keys.
2020-08-26 14:11:14 -04:00
2edea75fbe Use generic Debian repos and PVC component 2020-08-26 12:16:39 -04:00
2f2123b70e Rename remaining "pvc_prov" items to pvc_api 2020-08-25 13:01:48 -04:00
d79c587384 Change name of default API database
From pvcprov to pvcapi to reflect the changing use of this database.
2020-08-25 02:00:29 -04:00
663d525bb1 Add comments to defaults 2020-08-21 09:40:51 -04:00
7f60033282 Add more detailed comments 2020-08-20 21:42:44 -04:00
e32dfe6200 Add additional configuration to group_vars
Also include defaults and the new pvc_vm_shutdown_timeout option.
2020-08-20 21:39:44 -04:00
1e8579ed9a Add nice warning to purge script 2020-08-19 16:35:05 -04:00
774595cdb7 Ensure ZK prioritizes IPv4 2020-08-19 13:10:03 -04:00
c9b487f5e6 Use FQDN for Zookeeper server entries 2020-08-19 12:47:06 -04:00
a0e4f3bd30 Improve SSH configuration for nodes
Ensure hostbased auth works with configs, remove erroneous old
conditional for authtypes, remove obsolete config option.
2020-08-06 15:56:01 -04:00
6851d42885 Use Google DNS instead of Cloudflare
For some reason Cloudflare works in fewer places than Google, so just
use it instead.
2020-08-06 13:22:30 -04:00
6b8232d38e Use cluster_group variable for paths
Instead of trying to automagic this group out of the Ansible hostvars,
just make it explicitly defined in the group_vars to avoid any
confusion.
2020-08-06 13:20:14 -04:00
a488f62ef8 Ignore errors in bringing up bootstrap interfaces 2020-07-27 13:08:24 -04:00
69b0590b54 Add storage components to default pvcnoded.yaml 2020-06-06 21:15:10 -04:00
646219737c Ensure uuid-runtime is installed 2020-05-12 11:15:01 -04:00
8c7b2dde86 Add update to purge command 2020-04-06 15:20:20 -04:00
b0186b85c2 Use correct syntax for init command 2020-04-06 15:19:49 -04:00
af1927e384 Use consistent naming in patroni.yml 2020-04-06 14:33:13 -04:00
417dde5b1b Remove obsolete issue-gen script on install 2020-04-06 13:55:51 -04:00