parallelvirtualcluster/pvc

Author	SHA1	Message	Date
Joshua M. Boniface	70ba364f1d	Flip VM state condition to remove shutdown Don't cause health degredation for shutdown state, and flip the list around to make it clearer.	2023-02-16 20:32:33 -05:00
Joshua M. Boniface	1f8561d59a	Format cluster health like node healths Make a cleaner construct here.	2023-02-16 12:33:36 -05:00
Joshua M. Boniface	1093ca6264	Disallow health less than 0	2023-02-15 16:50:24 -05:00
Joshua M. Boniface	29584e5636	Add per-node health entries for 3rd party checks	2023-02-15 16:44:49 -05:00
Joshua M. Boniface	f4e8449356	Fix bugs and formatting of health messages	2023-02-15 16:28:56 -05:00
Joshua M. Boniface	ec79acf061	Fix linting of cluster.py file	2023-02-15 15:48:31 -05:00
Joshua M. Boniface	00586074cf	Modify cluster health to use new values	2023-02-15 15:45:43 -05:00
Joshua M. Boniface	f4eef30770	Add JSON health to cluster data	2023-02-15 15:26:57 -05:00
Joshua M. Boniface	b07396c39a	Fix bugs if plugins fail to load	2023-02-13 21:51:48 -05:00
Joshua M. Boniface	e6f9e6e0e8	Fix several bugs and optimize output	2023-02-13 16:36:15 -05:00
Joshua M. Boniface	9c14d84bfc	Add node health value and send out API	2023-02-13 15:53:39 -05:00
Joshua M. Boniface	3c742a827b	Initial implementation of monitoring plugin system	2023-02-13 12:06:26 -05:00
Joshua M. Boniface	671a907236	Allow rename in disable state	2023-01-30 11:48:43 -05:00
Joshua M. Boniface	38d63d9837	Flip behaviour of memory selectors It didn't make any sense to me for mem(prov) to be the default selector, since this has too many caveats versus mem(free). Switch to using mem(free) as the default (i.e. "mem") and make memprov the alternative.	2022-11-15 15:45:59 -05:00
Joshua M. Boniface	79eb994a5e	Ensure equality of none and None for selector	2022-11-07 11:59:53 -05:00
Joshua M. Boniface	8af7189dd0	Add module tag for daemon lib	2022-11-04 03:47:18 -04:00
Joshua M. Boniface	726d0a562b	Update copyright header year	2022-10-06 11:55:27 -04:00
Joshua M. Boniface	881550b610	Actually fix VM sorting Due to the executor the previous attempt did not work.	2022-08-12 17:46:29 -04:00
Joshua M. Boniface	bcabd7d079	Always sort VM list Same justification as previous commit.	2022-08-09 12:05:40 -04:00
Joshua M. Boniface	05a316cdd6	Ensure the node list is sorted Otherwise the node entries could come back in an arbitrary order; since this is an ordered list of dictionaries that might not be expected by the API consumers, so ensure it's always sorted.	2022-08-09 12:03:49 -04:00
Joshua M. Boniface	d8d3feee22	Add selector help and adjust flag name 1. Add documentation on the node selector flags. In the API, reference the daemon configuration manual which now includes details in this section; in the CLI, provide the help in "pvc vm define" in detail and then reference that command's help in the other commands that use this field. 2. Ensure the naming is consistent in the CLI, using the flag name "--node-selector" everywhere (was "--selector" for "pvc vm" commands and "--node-selector" for "pvc provisioner" commands).	2022-06-10 02:42:06 -04:00
Joshua M. Boniface	f8cdcb30ba	Add migration selector via free memory Closes #152	2022-05-18 03:47:16 -04:00
Joshua M. Boniface	c401a1f655	Use consistent language for primary mode I didn't call it "router" anywhere else, but the state in the list is called "coordinator" so, call it "coordinator mode".	2022-05-06 15:40:52 -04:00
Joshua M. Boniface	7a40c7a55b	Add support for replacing/refreshing OSDs Adds commands to both replace an OSD disk, and refresh (reimport) an existing OSD disk on a new node. This handles the cases where an OSD disk should be replaced (either due to upgrades or failures) or where a node is rebuilt in-place and an existing OSD must be re-imported to it. This should avoid the need to do a full remove/add sequence for either case. Also cleans up some aspects of OSD removal that are identical between methods (e.g. using safe-to-destroy and sleeping after stopping) and fixes a bug if an OSD does not truly exist when the daemon starts up.	2022-05-06 15:32:06 -04:00
Joshua M. Boniface	464f0e0356	Store additional OSD information in ZK Ensures that information like the FSIDs and the OSD LVM volume are stored in Zookeeper at creation time and updated at daemon start time (to ensure the data is populated at least once, or if the /dev/sdX path changes). This will allow safer operation of OSD removals and the potential implementation of re-activation after node replacements.	2022-05-02 12:11:39 -04:00
Joshua M. Boniface	d6ca74376a	Fix bugs with forced removal	2022-04-29 14:03:07 -04:00
Joshua M. Boniface	4d698be34b	Add OSD removal force option Ensures a removal can continue even in situations where some step(s) might fail, for instance removing an obsolete OSD from a replaced node.	2022-04-29 11:16:33 -04:00
Joshua M. Boniface	1142454934	Add pool PGs count modification Allows an administrator to adjust the PG count of a given pool. This can be used to increase the PGs (for example after adding more OSDs) or decrease it (to remove OSDs, reduce CPU load, etc.).	2021-12-28 21:53:29 -05:00
Joshua M. Boniface	bbfad340a1	Add PGs count to pool list	2021-12-28 21:12:02 -05:00
Joshua M. Boniface	c73939e1c5	Fix issue if pool stats have not updated yet	2021-12-28 21:03:10 -05:00
Joshua M. Boniface	25fe45dd28	Add device class tiers to Ceph pools Allows specifying a particular device class ("tier") for a given pool, for instance SSD-only or NVMe-only. This is implemented with Crush rules on the Ceph side, and via an additional new key in the pool Zookeeper schema which is defaulted to "default".	2021-12-28 20:58:15 -05:00
Joshua M. Boniface	6ccd19e636	Standardize fuzzy matching and use fullmatch Solves two problems: 1. How match fuzziness was used was very inconsistent; make them all the same, i.e. "if is_fuzzy and limit, apply .* to both sides". 2. Use re.fullmatch instead of re.match to ensure exact matching of the regex to the value. Without fuzziness, this would sometimes cause inconsistent behavior, for instance if a limit was non-fuzzy "vm", expecting to match the actual "vm", but also matching "vm1" too.	2021-12-06 16:35:29 -05:00
Joshua M. Boniface	0d857d5ab8	Use positive check rather than negative Ensure the VM is start before doing shutdown/stop, rather than being stopped. Prevents overwrite of existing disable state and other weirdness.	2021-11-06 04:08:33 -04:00
Joshua M. Boniface	5f193a6134	Perform automatic shutdown/stop on VM disable Instead of requiring the VM to already be stopped, instead allow disable state changes to perform a shutdown first. Also add a force option which will do a hard stop instead of a shutdown. References #148	2021-11-06 03:57:24 -04:00
Joshua M. Boniface	c41664d2da	Reformat code with Black code formatter Unify the code style along PEP and Black principles using the tool.	2021-11-06 03:02:43 -04:00
Joshua M. Boniface	87cda72ca9	Fix invalid schema key Addresses #144	2021-10-09 18:42:33 -04:00
Joshua M. Boniface	24de0f4189	Add MTU to network creation/modification Addresses #144	2021-10-09 17:51:32 -04:00
Joshua M. Boniface	50d8aa0586	Add handlers for client network MTUs Refactors some of the code in VXNetworkInterface to handle MTUs in a more streamlined fashion. Also fixes a bug whereby bridge client networks were being explicitly given the cluster dev MTU which might not be correct. Now adds support for this option explicitly in the configs, and defaults to 1500 for safety (the standard Ethernet MTU). Addresses #144	2021-10-09 17:02:27 -04:00
Joshua M. Boniface	c0f7ba0125	Add limit negation to VM list When using the "state", "node", or "tag" arguments to a VM list, add support for a "negate" flag to look for all VMs not in the state, node, or tag state.	2021-10-07 11:50:52 -04:00
Joshua M. Boniface	65df807b09	Add support for configurable OSD DB ratios The default of 0.05 (5%) is likely ideal in the initial implementation, but allow this to be set explicitly for maximum flexibility in space-constrained or performance-critical use-cases.	2021-09-24 01:06:39 -04:00
Joshua M. Boniface	adc8a5a3bc	Add separate OSD DB device support Adds in three parts: 1. Create an API endpoint to create OSD DB volume groups on a device. Passed through to the node via the same command pipeline as creating/removing OSDs, and creates a volume group with a fixed name (osd-db). 2. Adds API support for specifying whether or not to use this DB volume group when creating a new OSD via the "ext_db" flag. Naming and sizing is fixed for simplicity and based on Ceph recommendations (5% of OSD size). The Zookeeper schema tracks the block device to use during removal. 3. Adds CLI support for the new and modified API endpoints, as well as displaying the block device and DB block device in the OSD list. While I debated supporting adding a DB device to an existing OSD, in practice this ended up being a very complex operation involving stopping the OSD and setting some options, so this is not supported; this can be specified during OSD creation only. Closes #142	2021-09-23 13:59:49 -04:00
Joshua M. Boniface	58db537093	Add memory and vCPU checks to VM define/modify Ensures that a VM won't: (a) Have provisioned more RAM than there is available on a given node. Due to memory overprovisioning, this is simply a "is the VM memory count more than the node count", and doesn't factor in free or used memory on a node, total cluster usage, etc. So if a node has 64GB total RAM, the VM limit is 64GB. It is up to an administrator to ensure sanity below that value. (b) Have provisioned more vCPUs than there are CPU cores on the node, minus 2 to account for hypervisor/storage processes. Will ensure there is no severe CPU contention caused by a single VM having more vCPUs than there are actual execution threads available. Closes #139	2021-09-13 01:51:21 -04:00
Joshua M. Boniface	e71a6c90bf	Add pool size check when resizing volumes Closes #140	2021-09-12 19:54:51 -04:00
Joshua M. Boniface	e962743e51	Add VM device hot attach/detach support Adds a new API endpoint to support hot attach/detach of devices, and the corresponding client-side logic to use this endpoint when doing VM network/storage add/remove actions. The live attach is now the default behaviour for these types of additions and removals, and can be disabled if needed. Closes #141	2021-09-12 19:33:00 -04:00
Joshua M. Boniface	73e8149cb0	Remove explicit image-features from rbd cmd This should be managed in ceph.conf with the `rbd default features` configuration option instead, and thus can be tailored to the underlying OS version.	2021-07-30 11:33:59 -04:00
Joshua M. Boniface	4a7246b8c0	Ensure RBD resize has bytes appended If this isn't, the resize will be interpreted as a MB value and result in an absurdly big volume instead. This is the same consistency validation that occurs on add.	2021-07-30 11:25:13 -04:00
Joshua M. Boniface	c49351469b	Revert "Ensure consistent sizing of volumes" This reverts commit `dc03e95bbf`.	2021-07-29 15:30:00 -04:00
Joshua M. Boniface	dc03e95bbf	Ensure consistent sizing of volumes Convert from human to bytes, then to megabytes and always pass this to the RBD command. This ensures consistency regardless of what is actually passed by the user.	2021-07-29 15:14:25 -04:00
Joshua M. Boniface	45f23c12ea	Remove logs from schema validation These are managed entirely by the logging subsystem not by the schema handler due to catch-22's.	2021-07-20 00:00:37 -04:00
Joshua M. Boniface	b14bc7e3a3	Add retry to log writes	2021-07-19 13:11:28 -04:00

1 2 3 4 5 ...

276 Commits