Joshua Boniface
ca3a5e144f
Update tags and add kernel-cleanup script
2021-02-02 15:41:38 -05:00
Joshua Boniface
1c05c8729f
Fix incorrect systemd enabling in Patroni
2021-01-28 16:28:02 -05:00
Joshua Boniface
4b179b66ed
Add reboot to purge
2021-01-28 14:13:15 -05:00
Joshua Boniface
71edb9db15
Remove log dirs during purge
2021-01-28 14:12:40 -05:00
Joshua Boniface
f4974d648d
Add some additional compression libraries
2021-01-28 13:34:58 -05:00
Joshua Boniface
fa0aeec88e
Add local domain to resolver config
2021-01-28 13:34:26 -05:00
Joshua Boniface
04ca8f73d2
Correct bugs during bootstrap
...
1. Ensure Zookeeper restarts and checks out successfully before
proceeding with other steps.
2. Make sure PVC itself doesn't start prematurely.
2021-01-28 13:32:36 -05:00
Joshua Boniface
21e3e0e172
Add removal of Zookeeper keys too
2021-01-28 13:26:46 -05:00
Joshua Boniface
20d802f0b0
Update purge script
2021-01-27 17:08:38 -05:00
Joshua Boniface
b7f251ea16
Retry pgsql bootstrap startup 6 times
...
This will sometimes fail, so retry it several times
2021-01-27 15:45:36 -05:00
Joshua Boniface
7b08610efa
Retry msgr2 enabling 6 times
...
This will sometimes fail, so retry it several times
2021-01-27 14:13:09 -05:00
Joshua Boniface
c4c285c7b3
Remove invalid timezone entries in postgres conf
2021-01-26 15:20:25 -05:00
Joshua Boniface
97869ca5c3
Reorder Ceph stop and lower some waits
2021-01-07 11:11:16 -05:00
Joshua Boniface
d35250b870
Add tasks to verify node has finished (un)flushing
2021-01-07 10:49:23 -05:00
Joshua Boniface
cd164d1984
Increase all wait timeouts to 30s
...
Ensure that even on slow(er) clusters, these timeouts have more time to
complete before proceeding so the task won't fail.
2021-01-05 16:17:19 -05:00
Joshua Boniface
7585553225
Add default values
2020-12-21 00:20:45 -05:00
Joshua Boniface
ac071f4bf0
Add configurable ZK memory limits
2020-12-21 00:20:45 -05:00
Joshua Boniface
98e3e39570
Remove libjemalloc package
2020-12-21 00:20:45 -05:00
Joshua Boniface
8e104113d7
Tune Zookeeper memory usage
...
Use Xms and Xmx=128M to reduce overall Zookeeper memory usage.
2020-12-21 00:20:45 -05:00
Joshua Boniface
de04105a38
Add tuning for Ceph OSDs
2020-12-21 00:20:45 -05:00
Joshua Boniface
28c86d170f
Don't use libjemalloc for Ceph daemons
...
This was an artifact of a much, much older Ceph configuration I ran, and
is not relevant with newer Ceph versions like those used in PVC.
Performance testing with Nautilus and Bluestore reveals a minimal
performance hit, and using `jemalloc` prevents cache autotuning from
being effective, so remove it.
2020-12-21 00:20:45 -05:00
Joshua Boniface
f277acc974
Disable pvc-flush service while rebooting
...
Prevents the flush daemon from starting on node boot, before the
playbook is actually ready to unflush the node.
2020-12-15 14:32:50 -05:00
Joshua Boniface
8b474760ed
Tweak oneshot script
...
Cleanly stop daemons; check if OSDs are back before continuing; wait
less
2020-11-26 10:51:54 -05:00
Joshua Boniface
cb96ef4e7a
Use new init command location
...
Command was renamed in the PVC CLI to facilitate other "task" actions
like backup/restore.
2020-11-24 12:22:34 -05:00
Joshua Boniface
3c0c3e8e56
Add jute.maxbuffer to Zookeeper environment ops
...
Adds this option based on the findings of
https://github.com/python-zk/kazoo/issues/630 , whereby restores of >1MB
in size would fail. This is considered an unsafe option, but given our
usecase no actual znode should ever exceed this limit; this is purely
for the large transactions that come from a `pvc task restore` action to
an empty Zookeeper instance.
2020-11-24 12:20:25 -05:00
Joshua Boniface
da8c357d38
Add PVC status MOTD script
2020-11-17 12:48:53 -05:00
Joshua Boniface
9f84609808
Set proper mode on agent plugins
2020-10-27 15:48:57 -04:00
Joshua Boniface
b4ba4f9eda
Add cluster safe update playbook
...
This playbook will perform a oneshot upgrade of the systems in the
cluster, including performing a clean and safe reboot of the node(s) if
required (either due to services needing a restart, or the kernel
changing). It runs in serial=1 and only reboots if needed.
2020-10-27 15:41:20 -04:00
Joshua Boniface
2d1b76ecdf
Add check-mk-agent plugin installs
...
These are used by various Ansible tasks, even if the administrator is
not using Check_MK for monitoring.
2020-10-27 15:41:20 -04:00
Joshua Boniface
2b0398dec8
Add PCI and USB utils
2020-10-05 16:10:10 -04:00
Joshua Boniface
934f73af0f
Support using existing SSL certs on system
...
Add the additional pvc_api_ssl_cert_path and pvc_api_ssl_key_path
group_vars options, which can be used to set the SSL details to existing
files on the filesystem if desired. If these are empty (or nonexistent),
the original pvc_api_ssl_cert and pvc_api_ssl_key raw format options
will be used as they were.
Allows the administrator to use outside methods (such as Let's Encrypt)
to obtain the certs locally on the system, avoiding changes to the
group_vars and redeployment to manage SSL keys.
2020-08-26 14:11:14 -04:00
Joshua Boniface
2edea75fbe
Use generic Debian repos and PVC component
2020-08-26 12:16:39 -04:00
Joshua Boniface
2f2123b70e
Rename remaining "pvc_prov" items to pvc_api
2020-08-25 13:01:48 -04:00
Joshua Boniface
d79c587384
Change name of default API database
...
From pvcprov to pvcapi to reflect the changing use of this database.
2020-08-25 02:00:29 -04:00
Joshua Boniface
663d525bb1
Add comments to defaults
2020-08-21 09:40:51 -04:00
Joshua Boniface
7f60033282
Add more detailed comments
2020-08-20 21:42:44 -04:00
Joshua Boniface
e32dfe6200
Add additional configuration to group_vars
...
Also include defaults and the new pvc_vm_shutdown_timeout option.
2020-08-20 21:39:44 -04:00
Joshua Boniface
1e8579ed9a
Add nice warning to purge script
2020-08-19 16:35:05 -04:00
Joshua Boniface
774595cdb7
Ensure ZK prioritizes IPv4
2020-08-19 13:10:03 -04:00
Joshua Boniface
c9b487f5e6
Use FQDN for Zookeeper server entries
2020-08-19 12:47:06 -04:00
Joshua Boniface
a0e4f3bd30
Improve SSH configuration for nodes
...
Ensure hostbased auth works with configs, remove erroneous old
conditional for authtypes, remove obsolete config option.
2020-08-06 15:56:01 -04:00
Joshua Boniface
6851d42885
Use Google DNS instead of Cloudflare
...
For some reason Cloudflare works in fewer places than Google, so just
use it instead.
2020-08-06 13:22:30 -04:00
Joshua Boniface
6b8232d38e
Use cluster_group variable for paths
...
Instead of trying to automagic this group out of the Ansible hostvars,
just make it explicitly defined in the group_vars to avoid any
confusion.
2020-08-06 13:20:14 -04:00
Joshua Boniface
a488f62ef8
Ignore errors in bringing up bootstrap interfaces
2020-07-27 13:08:24 -04:00
Joshua Boniface
69b0590b54
Add storage components to default pvcnoded.yaml
2020-06-06 21:15:10 -04:00
Joshua Boniface
646219737c
Ensure uuid-runtime is installed
2020-05-12 11:15:01 -04:00
Joshua Boniface
8c7b2dde86
Add update to purge command
2020-04-06 15:20:20 -04:00
Joshua Boniface
b0186b85c2
Use correct syntax for init command
2020-04-06 15:19:49 -04:00
Joshua Boniface
af1927e384
Use consistent naming in patroni.yml
2020-04-06 14:33:13 -04:00
Joshua Boniface
417dde5b1b
Remove obsolete issue-gen script on install
2020-04-06 13:55:51 -04:00