parallelvirtualcluster/pvc

Author	SHA1	Message	Date
Joshua M. Boniface	3705daff43	Better handle failing RBD lock frees If the VM is not in a stop state, failing to free the lock is now considered a fatal error and will put the domain into fail state, aborting the start. This is better than being unsafe or trying to start a VM which will fail to boot due to read-only volumes.	2020-12-14 16:04:38 -05:00
Joshua M. Boniface	7c99a7bda7	Safely reset RBD locks on failed VMs Should correct issues on cold start as well as if a VM crashes uncleanly, which would prevent the VM from starting due to stale RBD locks. This implementation has four parts: 1. Update how IP addresses are handled, specifically by replacing all previous instances of "vni_ipaddr" with "vni_floatingipaddr", and then adding the "vni_ipaddr" with the real data for this node's IPs. Also include the storage IPs in this where they weren't before, so each this_node actually has the local IPs plus floating IPs. This enables the next two steps. 2. Modify flush_locks to take this_node as an argument, and update the run_command function to only operate against this node, rather than on the primary coordinator. 3. Have the flush_locks check each lock against the current node, to verify that the lock is actually held by the current node. This is the only way to do this safely. During fencing, we override this by not passing a this_node which bypasses this check. 4. Have the VM start do the check for VM failure/startup and execute a flush_locks before actually starting the VM.	2020-12-14 15:53:18 -05:00
Joshua M. Boniface	70dfcd434f	Ensure inmigrate is cleared on failure	2020-11-17 12:57:37 -05:00
Joshua M. Boniface	260b39ebf2	Lint: E302 expected 2 blank lines, found X	2020-11-07 14:45:24 -05:00
Joshua M. Boniface	ab0b932fe3	Lint: E125 continuation line with same indent as next logical line	2020-11-07 13:49:54 -05:00
Joshua M. Boniface	e553c5d42a	Lint: E122 continuation line missing indentation or outdented	2020-11-07 13:12:26 -05:00
Joshua M. Boniface	7932be3948	Lint: E261 at least two spaces before inline comment	2020-11-07 13:11:03 -05:00
Joshua M. Boniface	3f242cd437	Lint: E202 whitespace before '}'	2020-11-07 12:57:42 -05:00
Joshua M. Boniface	e333f2b935	Lint: E201 whitespace after '{'	2020-11-07 12:38:31 -05:00
Joshua M. Boniface	4b47a2424c	Lint: E303 too many blank lines (2)	2020-11-06 21:16:52 -05:00
Joshua M. Boniface	5da314902f	Lint: F841 local variable '<variable>' is assigned to but never used	2020-11-06 21:13:13 -05:00
Joshua M. Boniface	aecb845d6a	Lint: E713 test for membership should be 'not in'	2020-11-06 20:37:52 -05:00
Joshua M. Boniface	57c51d3234	Lint: E711 comparison to None should be 'if cond is not None:'	2020-11-06 19:37:13 -05:00
Joshua M. Boniface	ce01b41d81	Lint: E711 comparison to None should be 'if cond is None:'	2020-11-06 19:36:36 -05:00
Joshua M. Boniface	4d6f36aca0	Lint: E712 comparison to False should be 'if cond is False:' or 'if not cond:'	2020-11-06 19:35:51 -05:00
Joshua M. Boniface	d9e7b7ec15	Lint: F401 <library> imported but unused	2020-11-06 19:22:49 -05:00
Joshua M. Boniface	63f4f9aed7	Lint: E722 do not use bare 'except'	2020-11-06 18:55:10 -05:00
Joshua M. Boniface	ec0b8acf90	Support per-VM migration type selectors Allow a VM to specify its migration type as a default choice. The valid options are "default" (i.e. behave as now), "live" which forces a live migration only, and "shutdown" which forces a shutdown migration only. The new option is treated as a VM meta option and is set to default if not found.	2020-10-29 12:01:29 -04:00
Joshua M. Boniface	890023cbfc	Make sender wait dynamic based on receiver	2020-10-21 14:43:54 -04:00
Joshua M. Boniface	28abb018e3	Improve some timeouts and conditionals	2020-10-21 12:00:10 -04:00
Joshua M. Boniface	017953c2e6	Move lock release to phase D	2020-10-21 11:07:01 -04:00
Joshua M. Boniface	82b4d3ed1b	Add missing prefix statements to loggers	2020-10-21 10:52:53 -04:00
Joshua M. Boniface	bae366a316	Add waits and only receive check on send	2020-10-21 10:43:42 -04:00
Joshua M. Boniface	351076c15e	Check if node changed during final check Avoids situations where two migrates, to different nodes, happen in rapid succession. Aborts the migration if the current target node no longer matches what was set at the start of the execution.	2020-10-21 02:52:36 -04:00
Joshua M. Boniface	42514b9a50	Improve messages further	2020-10-21 02:41:42 -04:00
Joshua M. Boniface	611e47f338	Add messages to migration aborts Results in some information duplication, but ensures logging of the reason a migration was aborted separate from the error(s) this may generate.	2020-10-21 02:38:42 -04:00
Joshua M. Boniface	1523959074	Move where setting last_ vars happens	2020-10-21 02:24:00 -04:00
Joshua M. Boniface	ef762359f4	Adjust timing to avoid migrating to self quickly Add another separate state lock, release it earlier, and ensure timings are good to avoid double-migrating one VM.	2020-10-21 02:17:55 -04:00
Joshua M. Boniface	398d33778f	Avoid stopping duplicates, just lock our own key	2020-10-20 16:10:39 -04:00
Joshua M. Boniface	a6d492ed9f	Remove spurious writes and adjust sleep	2020-10-20 16:04:26 -04:00
Joshua M. Boniface	11fa3b0df3	Remove additional wait and add last_node entries These allow for aborting a migration to retain the previous settings and override what the client set.	2020-10-20 15:58:55 -04:00
Joshua M. Boniface	442aa4e420	Tweak timers further	2020-10-20 15:43:59 -04:00
Joshua M. Boniface	3910843660	Add missing break	2020-10-20 15:39:29 -04:00
Joshua M. Boniface	70f3fdbfb9	Tweak the delays slightly on receive	2020-10-20 15:38:07 -04:00
Joshua M. Boniface	7cb0241a12	Attempt live migrates 3 times before proceeding	2020-10-20 15:33:41 -04:00
Joshua M. Boniface	9fb33ed7a7	Increase peer lock acquiring timers	2020-10-20 15:26:59 -04:00
Joshua M. Boniface	abfe0108ab	Better handle aborting migrations	2020-10-20 15:22:16 -04:00
Joshua M. Boniface	567fe8f36b	Wait for existing migrations before proceeding	2020-10-20 15:12:32 -04:00
Joshua M. Boniface	ec7b78b9b8	Add additional short sleep in receive	2020-10-20 13:29:17 -04:00
Joshua M. Boniface	224c8082ef	Alter text of synchronization messages	2020-10-20 13:08:18 -04:00
Joshua M. Boniface	f9e7e9884f	Improve handling of VM migrations The VM migration code was very old, very spaghettified, and prone to strange failures. Improve this by taking cues from the node primary migration. Use synchronization between the nodes to ensure lockstep completion of the migration in discrete steps. A proper queue can be built later to integrate with this code more cleanly. References #108	2020-10-20 13:01:55 -04:00
Joshua M. Boniface	a4b80be5ed	Add provisioned memory to node info Adds a separate field to the node memory, "provisioned", which totals the amount of memory provisioned to all VMs on the node, regardless of state, and in contrast to "allocated" which only counts running VMs. Allows for the detection of potential overprovisioned states when factoring in non-running VMs. Includes the supporting code to get this data, since the original implementation of VM memory selection was dependent on the VM being running and getting this from libvirt. Now, if the VM is not active, it gets this from the domain XML instead.	2020-10-18 14:17:15 -04:00
Joshua M. Boniface	3cc7df63f2	Add configurable VM shutdown timeout Closes #102	2020-08-20 21:26:12 -04:00
Joshua M. Boniface	0e5c681ada	Clean up imports Make several imports more specific to reduce redundant code imports and improve memory utilization.	2020-08-11 12:09:10 -04:00
Joshua M. Boniface	ce60836c34	Allow enforcement of live migration Provides a CLI and API argument to force live migration, which triggers a new VM state "migrate-live". The node daemon VMInstance during migrate will read this flag from the state and, if enforced, will not trigger a shutdown migration. Closes #95	2020-06-06 12:00:44 -04:00
Joshua M. Boniface	b5434ba744	Fix typo in variable name	2020-06-06 11:29:48 -04:00
Joshua M. Boniface	b9e5b14f94	Update lastnode too if a self-migrate is aborted References #92	2020-06-04 10:28:04 -04:00
Joshua M. Boniface	5d2031d99e	Prevent a VM migrating to the same node Prevents a rare edge case where a node can end up "migrating" to itself. Quick hack to fix this, though like most of the VM management should probably be rethought/rewritten later. Fixes #92	2020-06-04 10:26:47 -04:00
Joshua M. Boniface	c02bc0b46a	Correct issues with VM lock freeing Code was bad and using a depricated feature.	2020-03-02 12:45:12 -05:00
Joshua M. Boniface	1e4350ca6f	Properly handle takeover state in VXNetworks Most of these actions/conditionals were looking for primary state, but were failing during node takeover. Update the conditionals to look for both router states instead. Also add a wait to lock flushing until a takeover is completed.	2020-03-02 10:41:00 -05:00

1 2

51 Commits