Commit Graph

183 Commits

Author SHA1 Message Date
Joshua Boniface 39b8229c35 Add libguestfs-tools to libvirt role deps 2023-09-01 15:42:25 -04:00
Joshua Boniface 0bf9c6209c Fix incorrect systemd enabling in Patroni 2023-09-01 15:42:25 -04:00
Joshua Boniface 4f5dbee8ee Correct bugs during bootstrap
1. Ensure Zookeeper restarts and checks out successfully before
proceeding with other steps.
2. Make sure PVC itself doesn't start prematurely.
2023-09-01 15:42:25 -04:00
Joshua Boniface 26dbd082ef Retry pgsql bootstrap startup 6 times
This will sometimes fail, so retry it several times
2023-09-01 15:42:25 -04:00
Joshua Boniface e9f08ad100 Retry msgr2 enabling 6 times
This will sometimes fail, so retry it several times
2023-09-01 15:42:25 -04:00
Joshua Boniface a77e41bf7c Remove invalid timezone entries in postgres conf 2023-09-01 15:42:25 -04:00
Joshua Boniface cba276e248 Add default values 2023-09-01 15:42:24 -04:00
Joshua Boniface be94bc134f Add configurable ZK memory limits 2023-09-01 15:42:24 -04:00
Joshua Boniface 6e74ac44a5 Remove libjemalloc package 2023-09-01 15:42:24 -04:00
Joshua Boniface 2bd5cc5a25 Tune Zookeeper memory usage
Use Xms and Xmx=128M to reduce overall Zookeeper memory usage.
2023-09-01 15:42:24 -04:00
Joshua Boniface b4e36d146a Add tuning for Ceph OSDs 2023-09-01 15:42:24 -04:00
Joshua Boniface 24764fe704 Don't use libjemalloc for Ceph daemons
This was an artifact of a much, much older Ceph configuration I ran, and
is not relevant with newer Ceph versions like those used in PVC.
Performance testing with Nautilus and Bluestore reveals a minimal
performance hit, and using `jemalloc` prevents cache autotuning from
being effective, so remove it.
2023-09-01 15:42:24 -04:00
Joshua Boniface 458e7b4872 Use new init command location
Command was renamed in the PVC CLI to facilitate other "task" actions
like backup/restore.
2023-09-01 15:42:24 -04:00
Joshua Boniface bcb5962353 Add jute.maxbuffer to Zookeeper environment ops
Adds this option based on the findings of
https://github.com/python-zk/kazoo/issues/630, whereby restores of >1MB
in size would fail. This is considered an unsafe option, but given our
usecase no actual znode should ever exceed this limit; this is purely
for the large transactions that come from a `pvc task restore` action to
an empty Zookeeper instance.
2023-09-01 15:42:24 -04:00
Joshua Boniface f79fb605de Support using existing SSL certs on system
Add the additional pvc_api_ssl_cert_path and pvc_api_ssl_key_path
group_vars options, which can be used to set the SSL details to existing
files on the filesystem if desired. If these are empty (or nonexistent),
the original pvc_api_ssl_cert and pvc_api_ssl_key raw format options
will be used as they were.

Allows the administrator to use outside methods (such as Let's Encrypt)
to obtain the certs locally on the system, avoiding changes to the
group_vars and redeployment to manage SSL keys.
2023-09-01 15:42:24 -04:00
Joshua Boniface 2caed2ae12 Rename remaining "pvc_prov" items to pvc_api 2023-09-01 15:42:24 -04:00
Joshua Boniface 2a2d318dbc Change name of default API database
From pvcprov to pvcapi to reflect the changing use of this database.
2023-09-01 15:42:24 -04:00
Joshua Boniface 833d99a360 Add comments to defaults 2023-09-01 15:42:24 -04:00
Joshua Boniface 8109f13386 Add additional configuration to group_vars
Also include defaults and the new pvc_vm_shutdown_timeout option.
2023-09-01 15:42:24 -04:00
Joshua Boniface 72df058684 Ensure ZK prioritizes IPv4 2023-09-01 15:42:24 -04:00
Joshua Boniface 457e18a850 Use FQDN for Zookeeper server entries 2023-09-01 15:42:24 -04:00
Joshua Boniface fbbf5ffe09 Use cluster_group variable for paths
Instead of trying to automagic this group out of the Ansible hostvars,
just make it explicitly defined in the group_vars to avoid any
confusion.
2023-09-01 15:42:23 -04:00
Joshua Boniface a925e4bd40 Ignore errors in bringing up bootstrap interfaces 2023-09-01 15:42:23 -04:00
Joshua Boniface e3ad750412 Add storage components to default pvcnoded.yaml 2023-09-01 15:42:23 -04:00
Joshua Boniface 12d50cfca6 Use correct syntax for init command 2023-09-01 15:42:23 -04:00
Joshua Boniface 92ccc0a737 Use consistent naming in patroni.yml 2023-09-01 15:42:23 -04:00
Joshua Boniface c35c58389d Use short names in PVC configs 2023-09-01 15:42:23 -04:00
Joshua Boniface 157c56fd46 Use shortname for Zookeeper 2023-09-01 15:42:23 -04:00
Joshua Boniface 6a3c32f306 Use local CLI command instead of API to init 2023-09-01 15:42:23 -04:00
Joshua Boniface c71415317a Use only short names in Ceph MON config 2023-09-01 15:42:23 -04:00
Joshua Boniface 52862f9daf Fix conditional checks with inventory_hostname 2023-09-01 15:42:23 -04:00
Joshua Boniface 91313e848e Handle bridge creation more sensibly 2023-09-01 15:42:23 -04:00
Joshua Boniface 6d3999eaab Don't restart pvcd.service on bootstrap 2023-09-01 15:42:23 -04:00
Joshua Boniface 0d9e209b45 Allow deb migrations to be installed 2023-09-01 15:42:23 -04:00
Joshua Boniface 4b89aff1d8 Add symlink for pvc files dir 2023-09-01 15:42:23 -04:00
Joshua Boniface 8c15edd75c Handle creation and collection on bootstrap better 2023-09-01 15:42:23 -04:00
Joshua Boniface b4079cae88 Use new in-built database migrations in API 2023-09-01 15:42:23 -04:00
Joshua Boniface 0e5cb688dc Use new package and file names
References parallelvirtualclient/pvc#79
2023-09-01 15:42:23 -04:00
Joshua Boniface 999e50a68f Don't mess with upstream at all during bootstrap
This caused some major breakage and is not required.
2023-09-01 15:42:23 -04:00
Joshua Boniface 42d76618e3 Modify add_cluster_ips to support new bridges 2023-09-01 15:42:22 -04:00
Joshua Boniface 32b719cb4a Enable and start vhostmd service 2023-09-01 15:42:22 -04:00
Joshua Boniface b654be8825 Add source_volume column to storage table 2023-09-01 15:42:22 -04:00
Joshua Boniface e3f83713a0 Add new empty script entry 2023-09-01 15:42:22 -04:00
Joshua Boniface f68ba7a735 Add bridge_device entry to config
Used to properly allow bridged networks to be formed.

Ref parallelvirtualcluster/pvc#64
2023-09-01 15:42:22 -04:00
Joshua Boniface 9848eb10bb Fix additional reference to userdata_template 2023-09-01 15:42:22 -04:00
Joshua Boniface f3212d5e4f Adjust provisioner database schema 2023-09-01 15:42:22 -04:00
Joshua Boniface bc1d9cd33b Set msgr2 mode on Ceph monitors 2023-09-01 15:42:22 -04:00
Joshua Boniface 372b949930 Apply fix with some tweaks to other serial handlers 2023-09-01 15:42:22 -04:00
Joshua Boniface 15768130e2 Change ordering of networks in file 2023-09-01 15:42:22 -04:00
Joshua Boniface 146e660a21 Replace broken "serial" restarts with a new method 2023-09-01 15:42:22 -04:00
Joshua Boniface 4a17a9918a Correct bad address in pvcd.yaml 2023-09-01 15:42:22 -04:00
Joshua Boniface 744e3d0988 Set provisioner database in pvcd.yaml 2023-09-01 15:42:22 -04:00
Joshua Boniface ba7270ab23 Add and remove floating IP during cluster bootstrap 2023-09-01 15:42:22 -04:00
Joshua Boniface 9546f34c34 Move netmask to separate config part 3 2023-09-01 15:42:22 -04:00
Joshua Boniface 211f83995b Ensure the Patroni ZK is clean for bootstrap 2023-09-01 15:42:22 -04:00
Joshua Boniface c27244f72d Move netmask to separate config part 2 2023-09-01 15:42:22 -04:00
Joshua Boniface a557b12e1c Don't try to do crazy restart ordering, it fails 2023-09-01 15:42:22 -04:00
Joshua Boniface 45185c5a16 Change Patroni scope to just pvc 2023-09-01 15:42:22 -04:00
Joshua Boniface 21c9aa3e6a Move netmask to separate config 2023-09-01 15:42:22 -04:00
Joshua Boniface e76dc2b796 Use API endpoint to bootstrap PVC cluster 2023-09-01 15:42:22 -04:00
Joshua Boniface f1e31ed169 Add additional API configuration 2023-09-01 15:42:22 -04:00
Joshua Boniface da24aaf5ff Install Provisioner schema to database 2023-09-01 15:42:22 -04:00
Joshua Boniface 484325698f Add provisioner database schema 2023-09-01 15:42:22 -04:00
Joshua Boniface cb3e4c2fa3 Remove empty newline 2023-09-01 15:42:22 -04:00
Joshua Boniface f76802be6d Remove invalid flag to ceph-authtool 2023-09-01 15:42:22 -04:00
Joshua Boniface 4b488a56ea Don't become for uuidgen 2023-09-01 15:42:22 -04:00
Joshua Boniface ff68f8a2a5 Move Ceph access to storage network 2023-09-01 15:42:22 -04:00
Joshua Boniface 6ce036f61e Improve script to run ZK cleanup on all hosts 2023-09-01 15:42:22 -04:00
Joshua Boniface 9448cf3d90 Add jq dependency 2023-09-01 15:42:21 -04:00
Joshua Boniface 7689e659fe Make vacuum script more comprehensive 2023-09-01 15:42:21 -04:00
Joshua Boniface 8163769183 Add check_mk check for PVC status 2023-09-01 15:42:21 -04:00
Joshua Boniface 2e0d5c9d44 Improve daily vacuum script 2023-09-01 15:42:21 -04:00
Joshua Boniface 94ef3490ab Add daily Zookeeper data cleanup 2023-09-01 15:42:21 -04:00
Joshua Boniface 15a2bf1418 Add custom systemd unit for Zookeeper
We're 100% systemd here, and the lack of control/information that the
old-school ZK initscript provides is frustrating. Replace it with our
own simple unit file.
2023-09-01 15:42:21 -04:00
Joshua Boniface f98a2ee433 Add logrotate configuration 2023-09-01 15:42:21 -04:00
Joshua Boniface c0acd3e994 Add daily Postgres vacuum script 2023-09-01 15:42:21 -04:00
Joshua Boniface 2cee9f08a0 Add Zookeeper autopurge @72h 2023-09-01 15:42:21 -04:00
Joshua Boniface c83995ff1d Update config template with recent changes
1) Add debug flag
2) Move intervals config up one level
2023-09-01 15:42:21 -04:00
Joshua Boniface 41c2dcd330 Set debug value in API config 2023-09-01 15:42:21 -04:00
Joshua Boniface 823310e8a3 Limit database tasks to coordinators only
Non-coordinators don't need these configurations as they shouldn't run
there.
2023-09-01 15:42:21 -04:00
Joshua Boniface 1e562a58da Support new log flags and update default log conf
Tweak the defaults a bit; pvc-ansible assumes we're running under
systemd, so set a log format that's better for it (no colour or date).
2023-09-01 15:42:21 -04:00
Joshua Boniface 035aef7c47 Replace tabs with spaces 2023-09-01 15:42:21 -04:00
Joshua Boniface 72d22a9c3c Remove bad content from pvc-api.yml 2023-09-01 15:42:21 -04:00
Joshua Boniface 36657e28f9 Add quote around ZK nodes in Patroni 2023-09-01 15:42:21 -04:00
Joshua Boniface db3198aadc Bring up underlying interfaces 2023-09-01 15:42:21 -04:00
Joshua Boniface 5d3de3ece2 Complete configuration of API via Ansible 2023-09-01 15:42:21 -04:00
Joshua Boniface 32bcbbeca0 Enable client API in Daemon config 2023-09-01 15:42:21 -04:00
Joshua Boniface d5516d891c Add client API to configuration 2023-09-01 15:42:21 -04:00
Joshua Boniface c5b07fcdad Add only coordinators in Patroni config 2023-09-01 15:42:21 -04:00
Joshua Boniface cfbe724458 Install ethtool 2023-09-01 15:42:21 -04:00
Joshua Boniface c4382eb208 Make libvirt listen only on cluster IP 2023-09-01 15:42:20 -04:00
Joshua Boniface f82bb6a414 Add debootstrap to package list 2023-09-01 15:42:20 -04:00
Joshua Boniface d8e9b5353f Don't try to set pool limits on libvirt key
I figured a * wildcard would work, but no it doesn't. Libvirt needs
the ability to talk to any pool arbitrarily since PVC can create and
remove them at will.
2023-09-01 15:42:20 -04:00
Joshua Boniface 0352dd7f8f Create mgr after starting monitors 2023-09-01 15:42:20 -04:00
Joshua Boniface e118267843 Set nicer default Patroni values re: replication 2023-09-01 15:42:20 -04:00
Joshua Boniface dbf6e52f3c Split PVC bootstrap into separate task 2023-09-01 15:42:20 -04:00
Joshua Boniface 935b4c48ae Correct bug with libvirt permissions 2023-09-01 15:42:20 -04:00
Joshua Boniface 958d2525da Handle restarting ceph-mon/mgr sequentially 2023-09-01 15:42:20 -04:00
Joshua Boniface 596ce789b1 Enable pool deletion in ceph.conf 2023-09-01 15:42:20 -04:00
Joshua Boniface c4825e6de6 Support for detailed keepalive logging 2023-09-01 15:42:20 -04:00