Adds a new physical network interface stats parser to the node
keepalives, and leverages this information to provide a network
utilization overview in the Prometheus metrics.
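As an illustration of the kind of parsing involved, a minimal sketch assuming a Linux host exposing per-interface counters in /proc/net/dev (the function names are illustrative, not the actual implementation):

    def read_interface_stats():
        # Parse /proc/net/dev: skip the two header lines, then take
        # rx_bytes (field 0) and tx_bytes (field 8) per interface.
        stats = {}
        with open("/proc/net/dev") as f:
            for line in f.readlines()[2:]:
                iface, data = line.split(":", 1)
                fields = data.split()
                stats[iface.strip()] = {
                    "rx_bytes": int(fields[0]),
                    "tx_bytes": int(fields[8]),
                }
        return stats

    def utilization(prev, cur, interval_s):
        # Convert counter deltas between two keepalive runs into bits/sec,
        # suitable for export as Prometheus gauges.
        return {
            iface: {
                "rx_bps": (cur[iface]["rx_bytes"] - prev[iface]["rx_bytes"]) * 8 / interval_s,
                "tx_bps": (cur[iface]["tx_bytes"] - prev[iface]["tx_bytes"]) * 8 / interval_s,
            }
            for iface in cur
            if iface in prev
        }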
Moves all tasks run by the Celery worker into a discrete package/module
for easier installation. Also adjusts several parameters throughout to
accomplish this.
For whatever reason, a Celery worker on <5.2.x was not picking these up.
Move them back to the root of the module so they are properly picked up
on these older versions, while still preventing the routing functions
from being called during API doc generation.
Avoids calling unworkable functions when generating API docs etc. by
isolating them into a Celery startup function called by Daemon.py.
Also updates to the Celery 4+ settings format.
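For reference, the Celery 4+ format replaces the old uppercase
CELERY_*/BROKER_* setting names with lowercase keys; a minimal sketch
with placeholder values:

    from celery import Celery

    celery = Celery("pvc_tasks")  # app name is illustrative

    # The pre-4.x style used e.g. BROKER_URL and CELERY_RESULT_BACKEND;
    # the 4.x+ equivalents are lowercase:
    celery.conf.update(
        broker_url="pyamqp://localhost//",  # placeholder broker
        result_backend="rpc://",            # placeholder backend
    )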
Full UUIDs were obnoxiously long, so switch to using just the first
8-character section of a UUID instead. Keeps the list nice and short,
makes them easier to copy, and is just generally nicer.
Could this cause uniqueness problems? Perhaps, but I don't see that
happening nearly frequently enough to matter.
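For scale, the first dash-separated section of a UUID4 is 8 hex
characters, i.e. about 4.3 billion possible values:

    from uuid import uuid4

    task_id = uuid4()
    short_id = str(task_id).split("-")[0]  # e.g. "1a2b3c4d"
    # 16**8 = 4,294,967,296 values, so collisions within a short-lived
    # task list are vanishingly unlikely in practice.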
Move the create_vm and run_benchmark tasks to use the new Celery
subsystem, handlers, and wait command. Remove the obsolete, dedicated
API endpoints.
Standardize the CLI client and move the repeated handler code into a
separate common function.
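A sketch of what such a shared handler might look like, assuming a
polling-style status call against the API (the function names and
response shape are assumptions, not the actual client code):

    import time

    def wait_for_task(get_task_status, task_id, interval=1.0):
        # Poll the task-status endpoint until the Celery task reaches a
        # terminal state, then hand the result back to the caller.
        while True:
            status = get_task_status(task_id)
            if status["state"] in ("SUCCESS", "FAILURE"):
                return status
            time.sleep(interval)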
By default, tasks will continue to run as they did, on the primary
coordinator's task runner. However, this opens the possibility of
defining more tasks that will run on other nodes or coordinators.
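In Celery terms this can be done with standard per-queue routing; a
hedged sketch (the broker, queue, and app names are made up):

    from celery import Celery

    celery = Celery("pvc_tasks", broker="pyamqp://localhost//")  # placeholder

    @celery.task
    def run_benchmark(pool):
        ...

    # A worker on a specific node would subscribe to its own queue, e.g.:
    #   celery -A pvc_tasks worker -Q run_on.hv1
    # and a task can then be pinned to that node at submission time:
    result = run_benchmark.apply_async(args=("testpool",), queue="run_on.hv1")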
Redis did not provide a distributed solution for the worker, which
precluded several important planned functions. So instead, move to using
Zookeeper + PostgreSQL as the broker and result backend, respectively.
This should be a seamless drop-in change, but for future uses it
requires the database host to be the primary coordinator IP rather than
localhost, so that writes to the database can occur from non-primary
hosts.
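In Celery configuration terms this maps to the standard Zookeeper
transport and database result backend URL schemes; a minimal sketch with
placeholder host and credentials:

    from celery import Celery

    celery = Celery(
        "pvc_tasks",
        # Kombu's Zookeeper transport as the message broker:
        broker="zookeeper://10.0.0.1:2181/",
        # Database result backend pointed at the primary coordinator's
        # PostgreSQL rather than localhost, so non-primary hosts can write:
        backend="db+postgresql://pvcuser:pvcpass@10.0.0.1:5432/pvcdb",
    )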
1. Simplify this by leveraging the existing remove_osd/add_osd
functions, since its task was functionally identical to running those
two in sequence (see the sketch after this list).
2. Add support for split OSDs within the command (replacing all OSDs on
the block device(s) as required).
3. Add additional configurability and flexibility around the old device,
weight, and external DB LVs.
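A sketch of the simplification in item 1; only the remove_osd/add_osd
names come from the actual change, and the signatures here are assumed:

    def replace_osd(zkhandler, osd_id, new_device, weight=None):
        # Hypothetical wrapper: a replacement is functionally a removal of
        # the old OSD followed by an addition on the new device, in sequence.
        if not remove_osd(zkhandler, osd_id):
            return False
        return add_osd(zkhandler, new_device, weight=weight)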
Allows creating multiple OSDs on a single (NVMe) block device,
leveraging the "ceph-volume lvm batch" command. Replaces the previous
method of creating OSDs.
Also adds a new ZK item for each OSD indicating if it is split or not.
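For reference, "ceph-volume lvm batch" supports this directly via its
--osds-per-device flag; an illustrative invocation (the device path and
OSD count are examples):

    import subprocess

    # Create two OSDs on a single NVMe block device in one pass:
    subprocess.run(
        ["ceph-volume", "lvm", "batch", "--osds-per-device", "2", "/dev/nvme0n1"],
        check=True,
    )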
Adds a new API query parameter to define the file size, which is then
used for the temporary image. This is required for at least VMDK files
to work properly in qemu-img convert.
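A sketch of the flow this enables, with illustrative paths and size
(qemu-img's -n flag skips creation of the pre-sized target):

    import subprocess

    file_size = "20G"  # supplied via the new API query parameter

    # Pre-create the temporary image at the declared size, then convert
    # the uploaded file into it without recreating the target:
    subprocess.run(
        ["qemu-img", "create", "-f", "raw", "/tmp/upload.img", file_size],
        check=True,
    )
    subprocess.run(
        ["qemu-img", "convert", "-n", "-f", "vmdk", "-O", "raw",
         "/srv/ova/disk0.vmdk", "/tmp/upload.img"],
        check=True,
    )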
It didn't make any sense to me for mem(prov) to be the default selector,
since this has too many caveats versus mem(free). Switch to using
mem(free) as the default (i.e. "mem") and make memprov the alternative.
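The difference between the two selectors, as a sketch (the field names
are illustrative):

    def select_target_node(nodes, selector="mem"):
        # "mem" (default): pick the node with the most actually-free memory.
        # "memprov" (alternative): pick the node with the least provisioned
        # memory, i.e. memory promised to defined VMs whether used or not.
        if selector == "mem":
            return max(nodes, key=lambda n: n["memory_free"])
        if selector == "memprov":
            return min(nodes, key=lambda n: n["memory_provisioned"])
        raise ValueError(f"unknown selector {selector}")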
1. Ensure that system_template and script are not nullable in the DB
(see the schema sketch after this list).
2. Ensure that the CLI and API enforce the above and clean up CLI
arguments for profile add.
3. Ensure that, before uploading OVAs, a 'default_ova' provisioning
script is present.
4. Use the 'default_ova' script for new OVA uploads.
5. Ensure that OVA details are properly added to the vm_data dict in the
provisioner vmbuilder.
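What item 1 implies at the schema level, sketched in SQLAlchemy terms
(the model and column types are assumptions based on the description):

    from sqlalchemy import Column, Integer
    from sqlalchemy.orm import declarative_base

    Base = declarative_base()

    class Profile(Base):  # hypothetical model; the real schema may differ
        __tablename__ = "profile"
        id = Column(Integer, primary_key=True)
        system_template = Column(Integer, nullable=False)  # no longer nullable
        script = Column(Integer, nullable=False)           # no longer nullable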
1. Add documentation on the node selector flags. In the API, reference
the daemon configuration manual which now includes details in this
section; in the CLI, provide the help in "pvc vm define" in detail and
then reference that command's help in the other commands that use this
field.
2. Ensure the naming is consistent in the CLI, using the flag name
"--node-selector" everywhere (was "--selector" for "pvc vm" commands and
"--node-selector" for "pvc provisioner" commands).
Adds commands to both replace an OSD disk, and refresh (reimport) an
existing OSD disk on a new node. This handles the cases where an OSD
disk should be replaced (either due to upgrades or failures) or where a
node is rebuilt in-place and an existing OSD must be re-imported to it.
This should avoid the need to do a full remove/add sequence for either
case.
Also cleans up some aspects of OSD removal that are identical between
methods (e.g. using safe-to-destroy and sleeping after stopping) and
fixes a bug that occurred if an OSD did not truly exist when the daemon
started up.
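The shared safe-to-destroy handling might look roughly like this,
polling Ceph's own check before proceeding (the helper name and poll
interval are illustrative):

    import subprocess
    import time

    def wait_until_safe_to_destroy(osd_id, interval=5):
        # "ceph osd safe-to-destroy" exits 0 once the OSD can be removed
        # without risking data; poll until then, sleeping between checks.
        while True:
            ret = subprocess.run(
                ["ceph", "osd", "safe-to-destroy", f"osd.{osd_id}"],
                capture_output=True,
            )
            if ret.returncode == 0:
                return
            time.sleep(interval)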