parallelvirtualcluster/pvc - pvc

Commit Graph

Author	SHA1	Message	Date
Joshua Boniface	57b041dc62	Ensure default for vLAN and QOS is 0 not empty	2021-06-17 01:54:37 -04:00
Joshua Boniface	5607a6bb62	Avoid overwriting VF data Ensures that the configuration of a VF is not overwritten in Zookeeper on a node restart. The SRIOVVFInstance handlers were modified to start with None values, so that the DataWatch statements will always trigger updates to the live system interfaces on daemon startup, thus ensuring that the config stored in Zookeeper is applied to the system on startup (mostly relevant after a cold boot or if the API changes them during a daemon restart).	2021-06-17 01:45:22 -04:00
Joshua Boniface	8f1af2a642	Ignore hostdev interfaces in VM net stat gathering Prevents errors if a SR-IOV hostdev interface is configured until this is more defined.	2021-06-17 01:33:11 -04:00
Joshua Boniface	e7b6a3eac1	Implement SR-IOV PF and VF instances Adds support for the node daemon managing SR-IOV PF and VF instances. PFs are added to Zookeeper automatically based on the config at startup during network configuration, and are otherwise completely static. PFs are automatically removed from Zookeeper, along with all coresponding VFs, should the PF phy device be removed from the configuration. VFs are configured based on the (autocreated) VFs of each PF device, added to Zookeeper, and then a new class instance, SRIOVVFInstance, is used to watch them for configuration changes. This will enable the runtime management of VF settings by the API. The set of keys ensures that both configuration and details of the NIC can be tracked. Most keys are self-explanatory, especially for PFs and the basic keys for VFs. The configuration tree is also self-explanatory, being based entirely on the options available in the `ip link set {dev} vf` command. Two additional keys are also present: `used` and `used_by`, which will be able to track the (boolean) state of usage, as well as the VM that uses a given VIF. Since the VM side implementation will support both macvtap and direct "hostdev" assignments, this will ensure that this state can be tracked on both the VF and the VM side.	2021-06-17 01:33:03 -04:00
Joshua Boniface	0ad6d55dff	Add initial SR-IOV support to node daemon Adds configuration values for enabled flag and SR-IOV devices to the configuration and sets up the initial SR-IOV configuration on daemon startup (inserting the module, configuring the VF count, etc.).	2021-06-15 22:56:09 -04:00
Joshua Boniface	e4a65230a1	Just do the shutdown command itself	2021-06-15 02:32:14 -04:00
Joshua Boniface	284c581845	Ensure shutdown migrations actually time out Without this a VM that fails to respond to a shutdown will just spin forever, blocking state changes.	2021-06-15 00:23:15 -04:00
Joshua Boniface	953e46055a	Fix issue with loading None version schema	2021-06-14 21:09:55 -04:00
Joshua Boniface	d2bcfe5cf7	Bump version to 0.9.20	2021-06-14 18:06:27 -04:00
Joshua Boniface	ef1701b4c8	Handle an additional exception case	2021-06-14 17:15:40 -04:00
Joshua Boniface	08dc756549	Actually disable the pvcapid service Prevents it from trying to start itself during updates or reboots on non-primary coordinators.	2021-06-14 17:13:22 -04:00
Joshua Boniface	0a9c0c1ccb	Use a nicer reload method on hot schema update Instead of exiting and trusting systemd to restart us, instead leverage the os.execv() call to reload the process in the current PID context. Also improves the log messages so it's very clear what's going on.	2021-06-14 17:10:21 -04:00
Joshua Boniface	e34a7d4d2a	Handle hot reloads properly A hot reload isn't possible due to DataWatch and ChildrenWatch constructs, so we instead need to terminate the daemon to "apply" the schema update. Thus we use exit code 150 (Application defined in LSB) and reorder some of the elements of the schema validation to ensure things happen in the right order.	2021-06-14 12:52:43 -04:00
Joshua Boniface	1f49bfa1b2	Fix name of schema element	2021-06-13 20:56:17 -04:00
Joshua Boniface	647bce2a22	Ensure we don't grab None data	2021-06-13 16:43:25 -04:00
Joshua Boniface	26b1f531e9	Fix bad variable interpolation	2021-06-13 14:37:23 -04:00
Joshua Boniface	be9f1e8636	Use more compatible is_alive in thread	2021-06-13 14:36:27 -04:00
Joshua Boniface	b694945010	Fix incorrect name bug	2021-06-10 01:11:14 -04:00
Joshua Boniface	058c2ceef3	Convert VXNetworkInstance to new ZK schema handler	2021-06-10 00:36:18 -04:00
Joshua Boniface	e7d60260a0	Fix typo in CephInstance path	2021-06-10 00:36:02 -04:00
Joshua Boniface	85aba7cc18	Convert VMInstance to new ZK schema handler	2021-06-09 23:15:08 -04:00
Joshua Boniface	7e42118e6f	Adjust lock schema in NodeInstance and VMInstance Removes a superfluous lock and puts the sync_lock keys in more usable places.	2021-06-09 22:51:00 -04:00
Joshua Boniface	2704badfbe	Convert VMConsole... to new ZK schema handler	2021-06-09 22:08:32 -04:00
Joshua Boniface	450bf6b153	Convert NodeInstance to new ZK schema handler	2021-06-09 22:07:32 -04:00
Joshua Boniface	b94fe88405	Convert fencing to new ZK schema handler	2021-06-09 21:29:01 -04:00
Joshua Boniface	610f6e8f2c	Convert CephInstance to new ZK schema handler	2021-06-09 21:17:09 -04:00
Joshua Boniface	f913f42a6d	Replace schema paths with updated zkhandler	2021-06-09 20:29:42 -04:00
Joshua Boniface	e475552391	Fix some bugs with hot reload	2021-06-09 00:03:26 -04:00
Joshua Boniface	5540bdc86b	Add automatic schema upgrade to nodes Performs an automatic schema upgrade when all nodes are updated to the latest version. Addresses #129	2021-06-08 23:35:39 -04:00
Joshua Boniface	3c102b3769	Add per-node schema tracking This will allow nodes to start with their own schema versions, and then be updated simultaneously by the API. References #129	2021-06-08 23:35:39 -04:00
Joshua Boniface	a4aaf89681	Add ZKSchema loading and validation to Daemon Also removes some previous hack migrations from pre-0.9.19. Addresses #129	2021-06-08 23:35:39 -04:00
Joshua Boniface	126f0742cd	Add Zookeeper schema manager to zkhandler Adds a new class, ZKSchema, to handle schema management in Zookeeper in an automated and consistent way. This should solve several issues: 1. Pain in managing changes to ZK keys 2. Pain in handling those changes during live upgrades 3. Simplifying the codebase to remove hardcoded ZK paths The current master schema for PVC 0.9.19 is committed as version 0. Addresses #129	2021-06-08 23:35:39 -04:00
Joshua Boniface	5843d8aff4	Fix fence call to findTargetNode	2021-06-08 23:34:49 -04:00
Joshua Boniface	cf96bb009f	Bump version to 0.9.19	2021-06-06 01:47:41 -04:00
Joshua Boniface	719954b70b	Fix missing list comma	2021-06-06 01:39:43 -04:00
Joshua Boniface	7dea5d2fac	Move logger to common, fix buffering	2021-06-01 18:50:26 -04:00
Joshua Boniface	3a5226b893	Add missing flushed output	2021-06-01 18:30:18 -04:00
Joshua Boniface	de2ff2e01b	Fix removed function args	2021-06-01 17:02:36 -04:00
Joshua Boniface	cd75413667	Increase initial lock timer With the new library the reader seems to be a little too quick, so hold the write lock for 1 second instead of 1/2 second to ensure it is caught.	2021-06-01 17:00:11 -04:00
Joshua Boniface	9764090d6d	Merge node common with daemon common	2021-06-01 12:22:11 -04:00
Joshua Boniface	12ac3686de	Convert missed elements to new zkhandler	2021-06-01 11:57:21 -04:00
Joshua Boniface	5740d0f2d5	Remove obsolete zkhandler.py	2021-06-01 11:55:44 -04:00
Joshua Boniface	889f4cdf47	Convert common to new zkhandler	2021-06-01 11:55:32 -04:00
Joshua Boniface	8f66a8d00e	Fix missed zkhandler conversion	2021-06-01 11:53:33 -04:00
Joshua Boniface	6beea0693c	Convert fencing to new zkhandler	2021-06-01 11:53:21 -04:00
Joshua Boniface	1c9a7a6479	Convert VXNetworkInstance to new zkhandler	2021-06-01 11:49:39 -04:00
Joshua Boniface	790098f181	Convert VMInstance to new zkhandler	2021-06-01 11:46:27 -04:00
Joshua Boniface	8a4a41e092	Convert NodeInstance to new zkhandler	2021-06-01 11:27:35 -04:00
Joshua Boniface	a48bf2d71e	More gracefully handle none selectors Allow selection of "none" as the node selector, and handle this by always using the cluster default instead of writing it in.	2021-06-01 11:13:13 -04:00
Joshua Boniface	a0b9087167	Set Daemon migration selector in zookeeper	2021-06-01 10:52:41 -04:00
Joshua Boniface	33a54cf7f2	Move configuration keys to /config tree	2021-06-01 10:48:55 -04:00
Joshua Boniface	d6a8cf9780	Convert MetadataAPIInstance to new zkhandler	2021-05-31 19:55:09 -04:00
Joshua Boniface	abd619a3c1	Convert DNSAggregatorInstance to new zkhandler	2021-05-31 19:55:01 -04:00
Joshua Boniface	ef5fe78125	Convert CepnInstance to new zkhandler	2021-05-31 19:51:27 -04:00
Joshua Boniface	f6d0e89568	Properly add absent node type	2021-05-31 19:26:27 -04:00
Joshua Boniface	ede3e88cd7	Modify node daemon root to use updated zkhandler	2021-05-31 03:14:09 -04:00
Joshua Boniface	c23a53d082	Add daemon_lib symlink to pvcnoded	2021-05-30 00:00:07 -04:00
Joshua Boniface	0c75a127b2	Bump version to 0.9.18	2021-05-23 17:23:10 -04:00
Joshua Boniface	9de14c46fb	Bump version to 0.9.17	2021-05-19 17:06:29 -04:00
Joshua Boniface	fe15bdb854	Bump version to 0.9.16	2021-05-10 01:13:21 -04:00
Joshua Boniface	b851a6209c	Catch all other exceptions in subprocess run Found a rare glitch where the subprocess pipes would not engage, causing a daemon crash. Catch these exceptions with a retcode of 255 instead of bailing out. Closes #124	2021-05-10 01:07:25 -04:00
Joshua Boniface	5ceb57e540	Handle emptying corrupted console log files Libvirt will someones write junk out to console log files, which breaks the log parser deque with a UnicodeDecodeError. If this happens, clear the log and re-open the deque again for newer updates. Closes #123	2021-05-10 01:03:04 -04:00
Joshua Boniface	669338c22b	Bump version to 0.9.15	2021-04-08 13:37:47 -04:00
Joshua Boniface	c4ac75b973	Bump version to 0.9.14	2021-03-30 10:27:37 -04:00
Joshua Boniface	0bf276fd51	Update copyright year in headers	2021-03-25 17:01:55 -04:00
Joshua Boniface	f4ec161aa2	Update file copyright header. Remove the option to select a later version of the GPL.	2021-03-25 16:58:02 -04:00
Joshua Boniface	0ccfc41398	Bump version to 0.9.13	2021-02-17 11:37:59 -05:00
Joshua Boniface	9100c63e99	Add stored_bytes to pool stats information	2021-02-09 01:46:01 -05:00
Joshua Boniface	aba567d6c9	Add nice startup banners to both daemons Add nicer easy-to-find (yay ASCII art) banners for the startup printouts of both the node and API daemons. Also adds the safe loader to pvcnoded to prevent hassle messages and a version string in the API daemon file.	2021-02-08 02:51:43 -05:00
Joshua Boniface	0db8fd9da6	Bump version to 0.9.12	2021-01-28 16:29:58 -05:00
Joshua Boniface	a44f134230	Remove systemd deps on zookeeper and libvirt This caused a serious race condition, since the IPs managed by PVC had not yet come up, but Zookeeper was trying to start and bind to them, which of course failed. Remove these dependencies entirely - the daemon itself starts these services during initialization and they do not need to be started by systemd first.	2021-01-28 16:25:02 -05:00
Joshua Boniface	9fbe35fd24	Bump version to 0.9.11	2021-01-05 15:58:26 -05:00
Joshua Boniface	a24724d9f0	Use external ceph cmd for ceph df	2020-12-26 14:04:21 -05:00
Joshua Boniface	78c017d51d	Remove erroneous extra colon in log output	2020-12-20 16:06:35 -05:00
Joshua Boniface	1b6613c280	Add live VNC information to domain output Sets in the node daemon, returns via the API, and shows in the CLI, information about the live VNC listen address and port for VNC-enabled VMs. Closes #115	2020-12-20 16:00:55 -05:00
Joshua Boniface	d6ef722997	Fix bad log message	2020-12-15 10:51:52 -05:00
Joshua Boniface	518d699c15	Bump version to 0.9.10	2020-12-15 10:45:15 -05:00
Joshua Boniface	ac3ef3d792	Revamp fencing order Prevents unnecessarily excessive timeouts if IPMI connections time out; before, would have to go through 3 timed out commands at ~20s each before failure was registered; reduced to 1 if the first times out.	2020-12-15 02:48:25 -05:00
Joshua Boniface	3705daff43	Better handle failing RBD lock frees If the VM is not in a stop state, failing to free the lock is now considered a fatal error and will put the domain into fail state, aborting the start. This is better than being unsafe or trying to start a VM which will fail to boot due to read-only volumes.	2020-12-14 16:04:38 -05:00
Joshua Boniface	7c99a7bda7	Safely reset RBD locks on failed VMs Should correct issues on cold start as well as if a VM crashes uncleanly, which would prevent the VM from starting due to stale RBD locks. This implementation has four parts: 1. Update how IP addresses are handled, specifically by replacing all previous instances of "vni_ipaddr" with "vni_floatingipaddr", and then adding the "vni_ipaddr" with the real data for this node's IPs. Also include the storage IPs in this where they weren't before, so each this_node actually has the local IPs plus floating IPs. This enables the next two steps. 2. Modify flush_locks to take this_node as an argument, and update the run_command function to only operate against this node, rather than on the primary coordinator. 3. Have the flush_locks check each lock against the current node, to verify that the lock is actually held by the current node. This is the only way to do this safely. During fencing, we override this by not passing a this_node which bypasses this check. 4. Have the VM start do the check for VM failure/startup and execute a flush_locks before actually starting the VM.	2020-12-14 15:53:18 -05:00
Joshua Boniface	89c7e225a0	Move OSD stats uploading to primary only Instead of each node uploading its own OSD stats, which would not work if the PVC daemon wasn't running, instead have the primary upload stats for all OSDs in the cluster.	2020-12-09 02:46:09 -05:00
Joshua Boniface	b36ec43a2d	Bump version to 0.9.9	2020-12-09 02:20:20 -05:00
Joshua Boniface	ce5ee11841	Bump version to 0.9.8	2020-11-24 12:26:57 -05:00
Joshua Boniface	d4a28d7a58	Bump version to 0.9.7	2020-11-19 10:48:28 -05:00
Joshua Boniface	e69eb93cb3	Bump version to 0.9.6	2020-11-17 13:01:54 -05:00
Joshua Boniface	70dfcd434f	Ensure inmigrate is cleared on failure	2020-11-17 12:57:37 -05:00
Joshua Boniface	a4e5323e81	Bump version to 0.9.5	2020-11-17 12:34:04 -05:00
Joshua Boniface	9053edacd8	Bump version to 0.9.4	2020-11-10 15:33:50 -05:00
Joshua Boniface	baac8f24fd	Bump version to 0.9.3	2020-11-09 10:28:15 -05:00
Joshua Boniface	11702f4bc8	Bump version to 0.9.2	2020-11-08 02:03:29 -05:00
Joshua Boniface	6f66b77a00	Lint: E121/E126 continuation line under/over-indented for hanging indent	2020-11-07 15:06:21 -05:00
Joshua Boniface	9135c5e3e4	Lint: E241 multiple spaces after ','	2020-11-07 14:52:39 -05:00
Joshua Boniface	260b39ebf2	Lint: E302 expected 2 blank lines, found X	2020-11-07 14:45:24 -05:00
Joshua Boniface	ab0b932fe3	Lint: E125 continuation line with same indent as next logical line	2020-11-07 13:49:54 -05:00
Joshua Boniface	f5988ad53d	Lint: F821 undefined name 'pool'/'volume' This class is actually entirely unused but is kept for consistency with the others. It may be used someday for something.	2020-11-07 13:34:18 -05:00
Joshua Boniface	c3dfe2e381	Lint: F821 undefined name 'myshorthostname'	2020-11-07 13:31:19 -05:00
Joshua Boniface	961ebb4c01	Lint: E305 expected 2 blank lines after class or function definition, found X	2020-11-07 13:17:49 -05:00
Joshua Boniface	e553c5d42a	Lint: E122 continuation line missing indentation or outdented	2020-11-07 13:12:26 -05:00
Joshua Boniface	7932be3948	Lint: E261 at least two spaces before inline comment	2020-11-07 13:11:03 -05:00
Joshua Boniface	d2490419c5	Lint: E202 whitespace before ']'	2020-11-07 13:02:54 -05:00
Joshua Boniface	d2e5ede399	Lint: E202 whitespace before ')'	2020-11-07 12:58:54 -05:00
Joshua Boniface	3f242cd437	Lint: E202 whitespace before '}'	2020-11-07 12:57:42 -05:00
Joshua Boniface	b7daa8e1f6	E201 whitespace after '['	2020-11-07 12:39:59 -05:00
Joshua Boniface	c88965e898	Lint: E201 whitespace after '('	2020-11-07 12:39:27 -05:00
Joshua Boniface	e333f2b935	Lint: E201 whitespace after '{'	2020-11-07 12:38:31 -05:00
Joshua Boniface	3cb92fed75	Lint: E401 multiple imports on one line	2020-11-07 12:29:32 -05:00
Joshua Boniface	27c6ac2b66	Lint: W605 invalid escape sequence '\d' This is the only one where forcing an `r` type to the string was required; the remainder of W605 were replaced with character class enclosures.	2020-11-07 12:22:20 -05:00
Joshua Boniface	8ba267a59e	Lint: E211 whitespace before '['/'('	2020-11-07 12:20:01 -05:00
Joshua Boniface	39cc992e9b	Lint: E306 expected 1 blank line before a nested definition, found 0	2020-11-07 12:17:38 -05:00
Joshua Boniface	8c623023d5	Lint: F811 redefinition of unused '<function>'	2020-11-07 12:14:29 -05:00
Joshua Boniface	5b3ee363b2	Lint: E222 multiple spaces after operator	2020-11-07 12:10:24 -05:00
Joshua Boniface	fad27a7f4d	Lint: E131 continuation line unaligned for hanging indent	2020-11-06 22:29:49 -05:00
Joshua Boniface	2eef6a1c21	Lint: E265 block comment should start with '# '	2020-11-06 21:32:17 -05:00
Joshua Boniface	4b47a2424c	Lint: E303 too many blank lines (2)	2020-11-06 21:16:52 -05:00
Joshua Boniface	cb2defbde9	Lint: W391 blank line at end of file	2020-11-06 21:14:19 -05:00
Joshua Boniface	5da314902f	Lint: F841 local variable '<variable>' is assigned to but never used	2020-11-06 21:13:13 -05:00
Joshua Boniface	98a573bbc7	Lint: E402 module level import not at top of file	2020-11-06 20:40:32 -05:00
Joshua Boniface	aecb845d6a	Lint: E713 test for membership should be 'not in'	2020-11-06 20:37:52 -05:00
Joshua Boniface	fde8ea2fea	Lint: W291 trailing whitespace	2020-11-06 19:44:14 -05:00
Joshua Boniface	57c51d3234	Lint: E711 comparison to None should be 'if cond is not None:'	2020-11-06 19:37:13 -05:00
Joshua Boniface	ce01b41d81	Lint: E711 comparison to None should be 'if cond is None:'	2020-11-06 19:36:36 -05:00
Joshua Boniface	4d6f36aca0	Lint: E712 comparison to False should be 'if cond is False:' or 'if not cond:'	2020-11-06 19:35:51 -05:00
Joshua Boniface	fb4aafcea9	Lint: E111 indentation is not a multiple of four	2020-11-06 19:26:22 -05:00
Joshua Boniface	d9e7b7ec15	Lint: F401 <library> imported but unused	2020-11-06 19:22:49 -05:00
Joshua Boniface	ebf254f62d	Lint: W293 blank line contains whitespace	2020-11-06 19:11:07 -05:00
Joshua Boniface	63f4f9aed7	Lint: E722 do not use bare 'except'	2020-11-06 18:55:10 -05:00
Joshua Boniface	56ba7b1457	Bump version to 0.9.1	2020-10-29 12:16:38 -04:00
Joshua Boniface	ec0b8acf90	Support per-VM migration type selectors Allow a VM to specify its migration type as a default choice. The valid options are "default" (i.e. behave as now), "live" which forces a live migration only, and "shutdown" which forces a shutdown migration only. The new option is treated as a VM meta option and is set to default if not found.	2020-10-29 12:01:29 -04:00
Joshua Boniface	5d08ad9573	Fix incorrect keepalive interval setting	2020-10-26 11:44:45 -04:00
Joshua Boniface	0f299777f1	Modify version to 3-digit numbering I expect 0.9 will be fairly long-lived, so add another decimal place so I may continue adding tweaks to it. THIS IS NOT SEMVER.	2020-10-26 02:13:11 -04:00
Joshua Boniface	890023cbfc	Make sender wait dynamic based on receiver	2020-10-21 14:43:54 -04:00
Joshua Boniface	28abb018e3	Improve some timeouts and conditionals	2020-10-21 12:00:10 -04:00
Joshua Boniface	017953c2e6	Move lock release to phase D	2020-10-21 11:07:01 -04:00
Joshua Boniface	82b4d3ed1b	Add missing prefix statements to loggers	2020-10-21 10:52:53 -04:00
Joshua Boniface	bae366a316	Add waits and only receive check on send	2020-10-21 10:43:42 -04:00
Joshua Boniface	351076c15e	Check if node changed during final check Avoids situations where two migrates, to different nodes, happen in rapid succession. Aborts the migration if the current target node no longer matches what was set at the start of the execution.	2020-10-21 02:52:36 -04:00
Joshua Boniface	42514b9a50	Improve messages further	2020-10-21 02:41:42 -04:00
Joshua Boniface	611e47f338	Add messages to migration aborts Results in some information duplication, but ensures logging of the reason a migration was aborted separate from the error(s) this may generate.	2020-10-21 02:38:42 -04:00
Joshua Boniface	1523959074	Move where setting last_ vars happens	2020-10-21 02:24:00 -04:00
Joshua Boniface	ef762359f4	Adjust timing to avoid migrating to self quickly Add another separate state lock, release it earlier, and ensure timings are good to avoid double-migrating one VM.	2020-10-21 02:17:55 -04:00
Joshua Boniface	398d33778f	Avoid stopping duplicates, just lock our own key	2020-10-20 16:10:39 -04:00
Joshua Boniface	a6d492ed9f	Remove spurious writes and adjust sleep	2020-10-20 16:04:26 -04:00
Joshua Boniface	11fa3b0df3	Remove additional wait and add last_node entries These allow for aborting a migration to retain the previous settings and override what the client set.	2020-10-20 15:58:55 -04:00
Joshua Boniface	442aa4e420	Tweak timers further	2020-10-20 15:43:59 -04:00
Joshua Boniface	3910843660	Add missing break	2020-10-20 15:39:29 -04:00
Joshua Boniface	70f3fdbfb9	Tweak the delays slightly on receive	2020-10-20 15:38:07 -04:00
Joshua Boniface	7cb0241a12	Attempt live migrates 3 times before proceeding	2020-10-20 15:33:41 -04:00
Joshua Boniface	9fb33ed7a7	Increase peer lock acquiring timers	2020-10-20 15:26:59 -04:00
Joshua Boniface	abfe0108ab	Better handle aborting migrations	2020-10-20 15:22:16 -04:00
Joshua Boniface	567fe8f36b	Wait for existing migrations before proceeding	2020-10-20 15:12:32 -04:00
Joshua Boniface	ec7b78b9b8	Add additional short sleep in receive	2020-10-20 13:29:17 -04:00
Joshua Boniface	224c8082ef	Alter text of synchronization messages	2020-10-20 13:08:18 -04:00
Joshua Boniface	f9e7e9884f	Improve handling of VM migrations The VM migration code was very old, very spaghettified, and prone to strange failures. Improve this by taking cues from the node primary migration. Use synchronization between the nodes to ensure lockstep completion of the migration in discrete steps. A proper queue can be built later to integrate with this code more cleanly. References #108	2020-10-20 13:01:55 -04:00
Joshua Boniface	726501f4d4	Add additional logging to flush selector Adds additional debug logging to the flush selector to determine how any why any given node is selected. Useful for troubleshooting strange choices.	2020-10-20 12:34:18 -04:00
Joshua Boniface	7cc33451b9	Improve Munin check with extinfo	2020-10-19 11:01:00 -04:00
Joshua Boniface	c6e34c7dc6	Bump base version to 0.9	2020-10-18 14:31:19 -04:00
Joshua Boniface	f749633f7c	Use provisioned memory for mem migration selector Use the new "provisioned" memory field, instead of the "allocated" memory field, to determine the optimal node when using the "mem" migration selector. This will take into account non-running VMs in the calculation as well as running VMs.	2020-10-18 14:17:15 -04:00
Joshua Boniface	a4b80be5ed	Add provisioned memory to node info Adds a separate field to the node memory, "provisioned", which totals the amount of memory provisioned to all VMs on the node, regardless of state, and in contrast to "allocated" which only counts running VMs. Allows for the detection of potential overprovisioned states when factoring in non-running VMs. Includes the supporting code to get this data, since the original implementation of VM memory selection was dependent on the VM being running and getting this from libvirt. Now, if the VM is not active, it gets this from the domain XML instead.	2020-10-18 14:17:15 -04:00
Joshua Boniface	aa5f8c93fd	Entirely disable IPv6 on bridged interfaces Prevents any potential leakage due to autoconfigured IPv6 on bridged interfaces. These are exclusively VM-side bridges, and the PVC host should not have any IPv6 configuration on them, ever.	2020-10-15 11:00:59 -04:00
Joshua Boniface	9366977fe6	Copy d_domain before iterating Prevents a bug where the thread can crash due to a change in the d_domain object while running the for loop. By copying and iterating over the copy, this becomes safer.	2020-09-16 15:12:37 -04:00
Joshua Boniface	65b44f2955	Avoid breaking keepalive during incoming migration The keepalive was getting stuck gathering memoryStats from the non-running VM, since it was in a paused state. Avoid this by just skipping past the rest of the stats gathering if the VM isn't running.	2020-08-28 01:47:36 -04:00
Joshua Boniface	78dec77987	Bump version to 0.8	2020-08-26 10:24:44 -04:00
Joshua Boniface	1dcc1f6d55	Rename sample database for API From pvcprov to pvcapi to facilitate the changing nature of this database and its expansion to benchmark results.	2020-08-25 01:59:35 -04:00
Joshua Boniface	921e57ca78	Fix syntax error	2020-08-20 23:05:56 -04:00
Joshua Boniface	3cc7df63f2	Add configurable VM shutdown timeout Closes #102	2020-08-20 21:26:12 -04:00
Joshua Boniface	7e2114b536	Add initial monitoring configurations to daemon Initial work to support multiple monitoring agents including Munin, Check_MK, and NRPE at the least.	2020-08-17 17:05:55 -04:00
Joshua Boniface	e8e65934e3	Use logger prefix for thread debug logs	2020-08-17 14:30:21 -04:00
Joshua Boniface	24fda8a73f	Use new debug logger for DNS Aggregator	2020-08-17 14:26:43 -04:00
Joshua Boniface	9b3ef6d610	Add connect timeout to Ceph This doesn't seem to actually do anything (like most of these timeouts...) but add it just for posterity.	2020-08-17 13:58:14 -04:00
Joshua Boniface	b451c0e8e3	Add additional start/finish debug messages	2020-08-17 13:11:03 -04:00
Joshua Boniface	f9b126a106	Make zkhandler accept failures more robustly Most of these would silently fail if there was e.g. an issue with the ZK connection. Instead, encase things in try blocks and handle the exceptions in a more graceful way, returning None or False if applicable. Except for locks, which should retry 5 times before aborting.	2020-08-17 13:03:36 -04:00
Joshua Boniface	553f96e7ef	Use logger for debug output Using simple print statements was annoying (lack of timing info and formatting), so move to using the debug logger for these instead with a custom state ('d') with white text to differentiate them. Also indicate which subthread of the keepalive each task is being executed in for easier tracing of issues.	2020-08-17 12:46:52 -04:00
Joshua Boniface	65add58c9a	Properly properly handle issue	2020-08-16 11:38:39 -04:00
Joshua Boniface	0a01d84290	Tie fence timers to keepalive_interval Also wait 2 full keepalive intervals after fencing before doing anything else, to give the Ceph cluster a chance to recover.	2020-08-15 12:38:03 -04:00
Joshua Boniface	4afb288429	Properly handle missing domain_name fail	2020-08-15 12:07:23 -04:00
Joshua Boniface	985ad5edc0	Warn if fencing will fail Verify our IPMI state on startup, and then warn if fencing will fail. For now, this is sufficient, but in future (requires refactoring) we might want to adjust how fencing occurs based on this information.	2020-08-13 14:42:18 -04:00
Joshua Boniface	0587bcbd67	Go back to manual command for OSD stats Using the Ceph library was a disaster here; it had no timeout or way to force it to continue, so keepalives would become stuck and trigger fence storms. Go back to the manual osd dump command with a 2s timeout which is far more reliable and can be adequately terminated if it runs long.	2020-08-12 22:31:25 -04:00
Joshua Boniface	09c1bb6a46	Increase start delay of flush service	2020-08-11 14:17:35 -04:00
Joshua Boniface	e0cb4a58c3	Ensure zk_listener is readded after reconnect	2020-08-11 12:46:15 -04:00
Joshua Boniface	099c58ead8	Fix missing char in log message	2020-08-11 12:40:35 -04:00
Joshua Boniface	0e5c681ada	Clean up imports Make several imports more specific to reduce redundant code imports and improve memory utilization.	2020-08-11 12:09:10 -04:00
Joshua Boniface	46ffe352e3	Better handle subthread timeouts in keepalive Prevent the main keepalive thread from getting stuck due to a subthread taking an enormous time. If this happens, the rest of the main keepalive will continue onward, thus ensuring that the main keepalive does not fail for a significant number of cycles, which would cause a fence.	2020-08-11 11:37:26 -04:00
Joshua Boniface	ccee124c8b	Adjust fence failcount limit to 6 (30s) The previous saving throw limit (3/15s) seems to have been too low. I was observing bizarre failures where a node would be fenced while it was still starting up. Some of this may have been related to Zookeeper connections taking too long, but this was inconsistent. Increase this to 6 saving throws (30s). This provides significantly more time for a node to properly check in on startup before another node fences it. In the real world, 15s vs 30s isn't that big of a downtime change, but prevents false-positive fences.	2020-08-05 22:40:07 -04:00
Joshua Boniface	02343079c0	Improve fencing migrate layout Open the option to do this in parallel with some threads	2020-08-05 22:26:01 -04:00
Joshua Boniface	37b83aad6a	Add logging and use better conditional	2020-08-05 21:57:36 -04:00
Joshua Boniface	876f2424e0	Ensure dead state isn't written erroneously	2020-08-05 21:57:11 -04:00
Joshua Boniface	5871380e1b	Avoid crashing VM stats thread if domain migrated	2020-06-10 17:10:46 -04:00
Joshua Boniface	654a3cb7fa	Improve debug output and use ceph df util data	2020-06-06 22:52:49 -04:00
Joshua Boniface	9b65d3271a	Improve handling of Ceph status gathering Use the Rados library instead of random OS commands, which massively improves the performance of these tasks. Closes #97	2020-06-06 22:30:25 -04:00
Joshua Boniface	598b2025e8	Use Rados and add Ceph entries to pvcnoded.yaml	2020-06-06 21:12:51 -04:00
Joshua Boniface	70b787d1fd	Move all VM functions into thread	2020-06-06 15:44:05 -04:00
Joshua Boniface	e1310a05f2	Implement recording of VM stats during keepalive	2020-06-06 15:34:03 -04:00
Joshua Boniface	2ad6860dfe	Move Ceph statistics gathering into thread	2020-06-06 13:25:02 -04:00
Joshua Boniface	cebb4bbc1a	Comment cleanup	2020-06-06 13:20:40 -04:00
Joshua Boniface	a672e06dd2	Move fencing to end of keepalive function	2020-06-06 13:19:11 -04:00
Joshua Boniface	1db73bb892	Move libvirt closure into previous section	2020-06-06 13:18:37 -04:00
Joshua Boniface	c1956072f0	Rename update_zookeeper function to node_keepalive	2020-06-06 12:49:50 -04:00
Joshua Boniface	ce60836c34	Allow enforcement of live migration Provides a CLI and API argument to force live migration, which triggers a new VM state "migrate-live". The node daemon VMInstance during migrate will read this flag from the state and, if enforced, will not trigger a shutdown migration. Closes #95	2020-06-06 12:00:44 -04:00
Joshua Boniface	b5434ba744	Fix typo in variable name	2020-06-06 11:29:48 -04:00
Joshua Boniface	b9e5b14f94	Update lastnode too if a self-migrate is aborted References #92	2020-06-04 10:28:04 -04:00
Joshua Boniface	5d2031d99e	Prevent a VM migrating to the same node Prevents a rare edge case where a node can end up "migrating" to itself. Quick hack to fix this, though like most of the VM management should probably be rethought/rewritten later. Fixes #92	2020-06-04 10:26:47 -04:00
Joshua Boniface	5f9836f96d	Add error message to OSD parse fail	2020-05-12 11:04:38 -04:00
Joshua Boniface	95c59ba629	Improve flush handling slightly	2020-05-12 11:04:38 -04:00
Joshua Boniface	72a38fd437	Correct changed dhcp_reservations key name	2020-05-09 10:00:53 -04:00
Joshua Boniface	b580760537	Add missing fmt_cyan variable	2020-05-08 18:15:02 -04:00
Joshua Boniface	331027d124	Add further tweaks to takeover state checks Just ensure that everything is proper state before proceeding	2020-04-22 11:16:19 -04:00
Joshua Boniface	ae4f36b881	Hook flush into more services Trying to ensure that pvc-flush completes before anything tries to shut down.	2020-04-14 19:58:53 -04:00
Joshua Boniface	611e0edd80	Reorder last keepalive during cleanup Make sure the stopping of the keepalive timer and final keepalive update are done as the last step before complete shutdown. The previous setup could conceivably result in a node being fenced should the cleanup operations take longer than ~45 seconds, for instance if primary node switchover took too long or blocked, or log watchers failed to stop quickly enough. Ensures that keepalives will continue to be run during the shutdown process until the last possible moment.	2020-04-12 03:49:29 -04:00
Joshua Boniface	b413e042a6	Improve handling of primary contention Previously, contention could occasionally cause a flap/dual primary contention state due to the lack of checking within this function. This could cause a state where a node transitions to primary than is almost immediately shifted away, which could cause undefined behaviour in the cluster. The solution includes several elements: * Implement an exclusive lock operation in zkhandler * Switch the become_primary function to use this exclusive lock * Implement exclusive locking during the contention process * As a failsafe, check stat versions before setting the node as the primary node, in case another node already has * Delay the start of takeover/relinquish operations by slightly longer than the lock timeout * Make the current router_state conditions more explicit (positive conditionals rather than negative conditionals) The new scenario ensures that during contention, only one secondary will ever succeed at acquiring the lock. Ideally, the other would then grab the lock and pass, but in testing this does not seem to be the case - the lock always times out, so the failsafe check is technically not needed but has been left as an added safety mechanism. With this setup, the node that fails the contention will never block the switchover nor will it try to force itself onto the cluster after another node has successfully won contention. Timeouts may need to be adjusted in the future, but the base timeout of 0.4 seconds (and transition delay of 0.5 seconds) seems to work reliably during preliminary tests.	2020-04-12 03:40:17 -04:00
Joshua Boniface	e672d799a6	Set flush after pvcapid.service This may or may not help, but should in theory prevent the flush from trying to run after a (locally-running) API daemon is terminated, which could cause an API failure and a failure to flush.	2020-04-12 01:48:50 -04:00
Joshua Boniface	a130f19a19	Depend pvcnoded on Zookeeper (harder) and libvirtd	2020-04-09 09:57:53 -04:00
Joshua Boniface	a671d9d457	Use consistent tense in messages	2020-04-08 22:00:51 -04:00
Joshua Boniface	fee1c7dd6c	Reorder cleanup and gracefully wait for flushes	2020-04-08 22:00:08 -04:00
Joshua Boniface	5d58bee34f	Add some time around noded startup/shutdown Otherwise, systemd kills networking before the node daemon fully stops and it goes into "dead" status, which is super annoying.	2020-04-01 23:59:14 -04:00
Joshua Boniface	f668412941	Don't use Requires as the dep is too hard Requires seems to flush on every service restart which is NOT what we want. Use Wants instead.	2020-04-01 15:15:37 -04:00
Joshua Boniface	a0ebc0d3a7	Add more robust requirements to pvc-flush service	2020-04-01 15:09:44 -04:00
Joshua Boniface	98a7005c1b	Add significant TimeoutSec to pvc-flush service This will stop systemd from killing the service in the middle of a flush or unflush operation, which completely defeats the purpose. 30 minutes was chosen as this is a very large but still somewhat manageable value, which should cover even a very large very loaded cluster with room to spare.	2020-04-01 01:24:09 -04:00
Joshua Boniface	0a367898a0	Don't trigger aggregator fail if fine	2020-03-12 13:22:12 -04:00
Joshua Boniface	c02bc0b46a	Correct issues with VM lock freeing Code was bad and using a depricated feature.	2020-03-02 12:45:12 -05:00
Joshua Boniface	1e4350ca6f	Properly handle takeover state in VXNetworks Most of these actions/conditionals were looking for primary state, but were failing during node takeover. Update the conditionals to look for both router states instead. Also add a wait to lock flushing until a takeover is completed.	2020-03-02 10:41:00 -05:00
Joshua Boniface	57768f2583	Remove an obsolete script	2020-02-19 21:40:23 -05:00
Joshua Boniface	e4e4e336b4	Handle invalid cursor setup cleanly This seems to happen only during termination, so catch it and continue so the loop terminates.	2020-02-19 16:29:59 -05:00
Joshua Boniface	d2a5fe59c0	Use transitional takeover states for migration Use a pair of transitional states, "takeover" and "relinquish", when transitioning between primary and secondary coordinator states. This provides a clsuter-wide record that the nodes are still working during their synchronous transition states, and should allow clients to determine when the node(s) have fully switched over. Also add an additional 2 seconds of wait at the end of the transition jobs to ensure everything has had a chance to start before proceeding. References #72	2020-02-19 14:06:54 -05:00
Joshua Boniface	9c7041f12c	Update package version to 0.7	2020-02-15 23:25:47 -05:00
Joshua Boniface	7ace5b5056	Remove /ceph/cmd pipe for (most) Ceph commands Addresses #80	2020-02-08 23:40:02 -05:00
Joshua Boniface	37310e5455	Correct name of systemd target	2020-02-08 20:39:07 -05:00
Joshua Boniface	ce985234c3	Use consistent naming of components Rename "pvcd" to "pvcnoded", and "pvc-api" to "pvcapid" so names for the daemons are fully consistent. Update the names of the configuration files as well to match this new formatting. References #79	2020-02-08 19:34:07 -05:00
Joshua Boniface	4505b239eb	Rename API and common Debian packages Closes #79	2020-02-08 18:50:38 -05:00
Joshua Boniface	74228eb063	Bump version to 0.6	2020-02-08 18:27:39 -05:00
Joshua Boniface	90e42683c6	Reduce sleep time during VM migrations	2020-02-04 17:52:37 -05:00
Joshua Boniface	20c8466296	Handle invalid search fields better	2020-02-04 17:35:24 -05:00
Joshua Boniface	ab28bf40d1	Change ordering of services during primary switch Fixes #77	2020-01-30 09:18:56 -05:00
Joshua Boniface	5d73974e95	Fix several bugs around load-based migrations	2020-01-29 17:35:10 -05:00
Joshua Boniface	0b31bab797	Add more helpful config parse error message	2020-01-22 12:09:31 -05:00
Joshua Boniface	4c1b78d7a4	Use dictionary get() to prevent crashes Use the get() function throughout to prevent crashes in various scenarios if the profile data isn't present or consistent.	2020-01-13 09:21:57 -05:00
Joshua Boniface	4ad29f669d	Update default configuration samples	2020-01-12 21:33:15 -05:00
Joshua Boniface	0d2e22a111	Normalize all static networks with bridges Modifies the storage and upstream networks to mirror the cluster network, with a bridge on top of the underlying specified dev, and all IPs bound to the bridge. Allows creating VMs in the storage or upstream networks, as well as the cluster network, should the administrator choose to do so (manually).	2020-01-12 19:04:31 -05:00
Joshua Boniface	1671a87dd4	Fix the flush service	2020-01-11 17:04:12 -05:00
Joshua Boniface	b6474198a4	Implement cluster maintenance mode Implements a "maintenance mode" for PVC clusters. For now, the only thing this mode does is disable node fencing while the state is true. This allows the administrator to tell PVC that network connectivity, etc. might be interrupted and to avoid fencing nodes. Closes #70	2020-01-09 10:53:27 -05:00
Joshua Boniface	4e5bce4975	Update copyright header year to 2020	2020-01-08 19:38:02 -05:00
Joshua Boniface	c515d63340	Add provision state for VMs	2020-01-08 17:40:02 -05:00
Joshua Boniface	21d87f5e51	Add v6 configurations to dnsmasq These options were only applied with v4 networks; now, use the v6 address in a dual-stack or v6-only network.	2020-01-06 23:48:04 -05:00
Joshua Boniface	f326fd99e2	Properly fix IPv4 no-DHCP networking	2020-01-06 22:31:37 -05:00
Joshua Boniface	38dae8b32f	Change name of cluster in patronictl command	2020-01-06 16:37:17 -05:00
Joshua Boniface	2d2bdb879e	Use get() instead of direct dict reference	2020-01-06 16:34:39 -05:00
Joshua Boniface	30d4470c8f	Only print AXFR errors in debug mode	2020-01-06 16:04:37 -05:00
Joshua Boniface	bbfadac5e1	Fix dnsmasq options for DHCP-disabled networks	2020-01-06 16:04:26 -05:00
Joshua Boniface	7b3e267f7a	Implement bridge_device for bridged VNIs Required due to #64. Bridged networks were being created on top of a vLAN if the Cluster network was a vLAN device, rather than being created on the underlying device. This came from a previous revision of the cluster architecture guidelines where Cluster was supposed to be a raw device rather than a vLAN. This fixed the problem by implementing a configuration field for a "bridge_device", a NIC device that can then have the bridged vLANs created on top of it. Fixes #64	2020-01-06 14:44:56 -05:00
Joshua Boniface	094ac8c3a8	Ensure stdout is used	2020-01-06 12:34:35 -05:00
Joshua Boniface	13548b791d	Add additional debugging and fix pool_idx loop var	2020-01-06 11:31:22 -05:00
Joshua Boniface	e7bc4f7328	Handle empty None-type hostname	2020-01-05 22:46:56 -05:00
Joshua Boniface	be20ba02a7	Handle VM states in flush more accurately We don't want to block forever on a failure, so limit valid waiting states to just those we know it should be in during a migration.	2020-01-05 15:21:16 -05:00
Joshua Boniface	7311fa561b	Fix bad join with new table name	2020-01-04 15:17:27 -05:00
Joshua Boniface	bf89050e8b	Update userdata table name	2020-01-04 15:10:37 -05:00
Joshua Boniface	20ae2186f9	Run VM state actions in a thread Prevents blocking the main thread(s) while a VM is changing state. In particular, this caused some issues with nodes not responding to cancellation/reversal of a flush/ready state until the previous migration was finished, which could cause issues. This entire subset of actions is now threaded and so can run on its own in the background.	2019-12-26 11:08:16 -05:00
Joshua Boniface	b3483fa810	Add explicit returns from flush/ready threads	2019-12-26 11:08:00 -05:00
Joshua Boniface	47cf0a8006	Ensure migration out occurs	2019-12-25 21:11:02 -05:00
Joshua Boniface	77db36a891	Ensure migration out occurs	2019-12-25 21:02:46 -05:00
Joshua Boniface	9a39d739e8	Ensure we empty of flush_thread	2019-12-25 20:29:17 -05:00
Joshua Boniface	a66b834ae4	Fix several small bugs	2019-12-19 18:58:53 -05:00
Joshua Boniface	b17b7bf22b	Add black magic to minimize ping losses This particular arping interval/count, along with forcing it to run in the foreground, seems to minimize the packet loss when the primary coordinator transitions. Through extensive testing, this value results in the, consistently, least amount of loss: 1-2 pings, at an 0.025s ping interval, return "TTL exceeded", with no other loss, and only when the node the test VM is on is the one switching to secondary state. No other combination of values here, nor tweaks to other parts of the code, seem able to reduce this further, therefore this is likely the best configuration possible.	2019-12-19 18:57:32 -05:00
Joshua Boniface	8c252aeecc	Implemented coordinated locked node transitions The previous method was a "throw it in the sea"-type migration with some (very arbitrary) sleep statements thrown in for good measure. Reimplement this with some hard locking. During each phase of the transition, the nodes acquire read/write shared locks to a Zookeeper key so that they can tightly coordinate the actions of transferring each part of the primary state between them. This is done in a subthread to prevent strange blocking issues that were encountered, likely due to business in the existing main thread.	2019-12-19 10:56:34 -05:00
Joshua Boniface	0841ddf8b0	Handle integrity errors in DNS aggregator	2019-12-19 10:45:06 -05:00
Joshua Boniface	98764f1edd	Clean up some aspects of node switchover	2019-12-18 21:39:40 -05:00
Joshua Boniface	23188199cb	Handle failing Patroni events more gracefully	2019-12-18 21:12:22 -05:00
Joshua Boniface	2b1b78622e	Fix invalid arping option It made little difference and didn't error, but was incorrect.	2019-12-18 12:06:40 -05:00
Joshua Boniface	364ab10673	Add slight delay when stopping the metadata API	2019-12-18 11:56:04 -05:00
Joshua Boniface	39c9f911cc	Increase arping interval to 0.2s	2019-12-15 14:55:34 -05:00
Joshua Boniface	686af31c08	Reduce arping interval to 0.1s	2019-12-15 12:30:45 -05:00
Joshua Boniface	0a94fac407	Fix bugs around passing master Was not passing properly and getting stuck sometimes, so modify the checking and route creation a bit to prevent it. Seems to work.	2019-12-15 00:08:18 -05:00
Joshua Boniface	b3e21a5bf8	Integrate metadata API into node daemon	2019-12-14 16:41:01 -05:00
Joshua Boniface	8c36e7618a	Modify node daemon to follow API	2019-12-14 14:13:26 -05:00
Joshua Boniface	78f053d81f	Recreate network in aggregator if DNS changes	2019-12-13 00:03:47 -05:00
Joshua Boniface	0a8dd30a48	Restart dnsmasq when network details change	2019-12-12 23:51:22 -05:00
Joshua Boniface	6fa828e721	Don't stop the provisioner worker It should probably just be running on all nodes all the time already, but is started when a node first becomes primary.	2019-12-12 23:08:02 -05:00
Joshua Boniface	c1b6ce0ff7	Reorder starting clients	2019-12-12 23:03:34 -05:00
Joshua Boniface	b854d53fab	Add API management to node daemon	2019-12-12 22:59:07 -05:00
Joshua Boniface	88a181b20d	Allow metadata API in nft rules	2019-12-11 17:04:29 -05:00
Joshua Boniface	1fb560e996	Add DNS nameservers to networks	2019-12-08 23:55:45 -05:00
Joshua Boniface	9cb5561e77	Move default NS record to upstream_domain	2019-12-08 23:05:32 -05:00
Joshua Boniface	3471f4e57a	Remove obsolete pvc-nsX and add pvc-ns name Should point towards the floating IP.	2019-12-08 20:20:20 -05:00
Joshua Boniface	356c12db2e	Add ceph df output to pool data Allows additional information visible in the `ceph df` command, including pool free space and used percentage.	2019-12-06 00:47:27 -05:00
Joshua Boniface	531578fd28	Use consistent tense for VM states Replace "failed" with "fail" and "disabled" with "disable" for consistency with the remaining states.	2019-10-23 23:57:59 -04:00
Joshua Boniface	040ca33683	Clean up handling of OSD dump command	2019-10-22 12:51:29 -04:00
Joshua Boniface	190623bdd9	Use empty string for node limit	2019-10-22 12:32:14 -04:00
Joshua Boniface	f0e0a38a20	Fix bug in config element retrieval	2019-10-22 12:30:23 -04:00
Joshua Boniface	237a37015d	Set upstream IP in key if changed	2019-10-21 16:50:41 -04:00
Joshua Boniface	10ae260b92	Properly handle empty node limit	2019-10-17 13:34:11 -04:00
Joshua Boniface	03447d3374	Update copyright string year to include 2019	2019-10-13 12:09:51 -04:00
Joshua Boniface	116013695f	Fix bugs with bad strings	2019-10-12 18:43:29 -04:00
Joshua Boniface	18fc49fc6c	Use node instead of hypervisor consistently	2019-10-12 01:59:08 -04:00
Joshua Boniface	8dc0c8f0ac	Fix minor bugs	2019-10-12 01:36:50 -04:00
Joshua Boniface	5995353597	Implement VM metadata and use it Implements the storing of three VM metadata attributes: 1. Node limits - allows specifying a list of hosts on which the VM must run. This limit influences the migration behaviour of VMs. 2. Per-VM node selectors - allows each VM to have its migration autoselection method specified, to automatically allow different methods per VM based on the administrator's preferences. 3. VM autorestart - allows a VM to be automatically restarted from a stopped state, presumably due to a failure to find a target node (either due to limits or otherwise) during a flush/fence recovery, on the next node unflush/ready state of its home hypervisor. Useful mostly in conjunction with limits to ensure that VMs which were shut down due to there being no valid migration targets are started back up when their node becomes ready again. Includes the full client interaction with these metadata options, including printing, as well as defining a new function to modify this metadata. For the CLI it is set/modified either on `vm define` or via the `vm meta` command. For the API it is set/modified either on a POST to the `/vm` endpoint (during VM definition) or on POST to the `/vm/<vm>` endpoint. For the API this replaces the previous reserved word for VM creation from scratch as this will no longer be implemented in-daemon (see #22). Closes #52	2019-10-12 01:17:39 -04:00
Joshua Boniface	76e6b42389	Add clone_volume backend command	2019-10-10 14:09:07 -04:00
Joshua Boniface	983daceaed	Fix shutdown abort during restart Restart state, being different from shutdown, would trigger an abort of the shutdown. Fix this by including restart in the valid states to continue.	2019-09-07 12:08:31 -04:00
Joshua Boniface	7c4d18691a	Implement configurable replcfg (node-side) Implements administrator-selectable replication configurations for new pools in PVC clusters, overriding the default of copies=3,mincopies=2.	2019-08-23 21:58:54 -04:00
Joshua Boniface	267a3d16e5	Bump version to 0.5	2019-08-08 20:56:27 -04:00
Joshua Boniface	2880a761c0	Move Ceph command pipe to new location Matching the new /cmd/domain pipe, move Ceph pipe to /cmd/ceph.	2019-08-07 14:47:27 -04:00
Joshua Boniface	b7546e3711	Fix bugs in command pipeline for VMs	2019-08-07 14:13:01 -04:00
Joshua Boniface	0ff2d7d537	Use shlex for command splitting This will preserve quoted strings, required for the rbd lock commands.	2019-08-07 14:02:57 -04:00
Joshua Boniface	a2a630f6a0	Add pipeline for VM lock flush cmd	2019-08-07 13:49:33 -04:00
Joshua Boniface	496216321e	Move lock flushing to VMInstance Prepares for reuse of this function via client commands.	2019-08-07 13:36:56 -04:00
Joshua Boniface	0446b2db02	Catch exceptions if Patroni is not up	2019-08-07 11:46:58 -04:00
Joshua Boniface	7e77752ce5	Add limit to Patroni switchover attempts	2019-08-07 11:46:42 -04:00
Joshua Boniface	33a963c2af	Improve fence output on failure and increase delay	2019-08-07 11:35:49 -04:00
Joshua Boniface	e92a57606d	Use better forceful arping command Send ARP responses with the source IP in it to force update even if the old primary did not cleanly terminate (during fencing for instance).	2019-08-07 11:29:38 -04:00
Joshua Boniface	ef3b6b3723	Arping 3 times instead of 2 During fence 2 is not always enough for the network to recognize the change in primary coordinator.	2019-08-07 11:15:36 -04:00
Joshua Boniface	3b27a88128	Allow abort of shutdown state Adds some logic to allow an active shutdown state to be aborted by changing the VM to another state. Useful mostly if a VM is doing funky things and not responding to the shutdown, but the administrator either doesn't want to wait for the timer to expire (forcing an immediate termination) or wishes to abort the shutdown attempt. Fixes #49	2019-08-07 10:58:18 -04:00
Joshua Boniface	e2ae58b62c	Add the missing newline to the string compare	2019-08-04 17:00:33 -04:00
Joshua Boniface	d0d5ab4425	Fix bug if the switchover target is the same	2019-08-04 16:51:11 -04:00
Joshua Boniface	a329376d33	Lock primary_node key during primary switchover Also implements a looping to switch over the Patroni leader to ensure this always follows the primary and clean up the code around here a bit.	2019-08-04 16:42:06 -04:00
Joshua Boniface	710d2cf9c2	Fix record duplication bug and general cleanup Fixes #47	2019-08-01 13:11:45 -04:00
Joshua Boniface	8bdec03cf1	Properly support debug logging via config	2019-08-01 11:22:27 -04:00
Joshua Boniface	c6e58796ba	Clean up redundant return section	2019-07-31 23:57:31 -04:00
Joshua Boniface	7380f45b1b	Improve dnsmasq interface handling listen-address is enough; adding interface too causes weird issues where dnsmasq is listening on an IPv6 global wildcard too which conflicts with the PowerDNS instance.	2019-07-31 10:03:56 -04:00
Joshua Boniface	324990739e	Make DNS aggregator listen on port 53 Using the non-standard port was a pain. Now that all the DNSMasq stuff works, move back to the default port.	2019-07-30 09:20:01 -04:00
Joshua Boniface	717d00cfcf	Implement snapshot rename in node daemon [4/2] Implements #44	2019-07-28 23:06:12 -04:00
Joshua Boniface	83b806d0b5	Move intervals config one level up Makes for a slightly-better-organized configuration and explanation.	2019-07-28 19:33:23 -04:00
Joshua Boniface	68ca493b3b	Fix bad error code	2019-07-26 20:53:01 -04:00
Joshua Boniface	837666a15e	Revamp renamekey function The function had numerous bugs and didn't work. Fix them up.	2019-07-26 16:38:05 -04:00
Joshua Boniface	35363671a0	Implement Ceph volume resize and rename Includes a simple implementation of a zookeeper "rename" facility, allowing a key and all data to be replaced by a new key with a different name but containing all the same child elements and data. [2/2] Implements #44	2019-07-26 15:13:21 -04:00
Joshua Boniface	50367c9190	Improve OSD create messages	2019-07-26 11:41:51 -04:00
Joshua Boniface	96bc181877	Set the routerstate on daemon startup Allows switching from coordinator to not coordinator with a service restart.	2019-07-12 09:51:56 -04:00
Joshua Boniface	2a220cd16e	Nicer colour output for coordinator state client	2019-07-12 09:31:42 -04:00
Joshua Boniface	439c5f18c3	Add router_state to output of keepalives	2019-07-11 20:11:05 -04:00
Joshua Boniface	f30be555c1	Improve message output for logging Improve some formatting of the messages being printed to make it nicer for long-term logging.	2019-07-10 22:38:32 -04:00
Joshua Boniface	ac36870a86	Implement hup for log rotation This function was long-existent, but never used; implement it.	2019-07-10 22:22:02 -04:00
Joshua Boniface	58f4222ee7	Support disabling log colours and dates For usecases such as a pure-syslog, allow disabling of dates or colours in the log messages (separately).	2019-07-10 22:17:23 -04:00
Joshua Boniface	32a6369de2	Add nicer message when live migrate fails	2019-07-10 17:42:24 -04:00
Joshua Boniface	8a28738bff	Use consistent terminology in fence message	2019-07-10 11:54:56 -04:00
Joshua Boniface	8f160abf90	Handle cancelling flushes when new ones run Store the flush_thread of a node as a class object. Before starting a new flush thread (either flush or unflush), stop the existing one if it exists to prevent further migrations, then start the new thread. Set the object to None on init and again once the task actually finishes. Remove the inflush flag as this is not required when using these threads and functionally does nothing any longer, but add the flush_stopper flag to trigger cancellation of the current job.	2019-07-10 11:54:34 -04:00
Joshua Boniface	c7c8c8bcbb	Fix bug with flush	2019-07-10 00:43:55 -04:00
Joshua Boniface	7a8aee9fe7	Remove flush locking functionality This just seemed like more trouble that it was worth. Flush locks were originally intended as a way to counteract the weird issues around flushing that were mostly fixed by the code refactoring, so this will help test if those issues are truly gone. If not, will look into a cleaner solution that doesn't result in unchangeable states.	2019-07-09 23:59:17 -04:00
Joshua Boniface	ad284b13bc	Fix bugs with fencing	2019-07-09 19:17:53 -04:00
Joshua Boniface	7df200ac44	Improve ZK connection loss handling	2019-07-09 19:17:32 -04:00
Joshua Boniface	47f86475f8	Handle failures of Ceph commands gradefully If these commands fail, catch the error, print a message, and set up empty lists. Also handle later data parsing in this case.	2019-07-09 16:43:38 -04:00
Joshua Boniface	1a8e7509f7	Support run_os_command timeout; use timeouts	2019-07-09 15:09:13 -04:00
Joshua Boniface	83a4140703	Allow enabling debug mode in config Makes debugging easier without modifying code.	2019-07-09 14:59:00 -04:00
Joshua Boniface	8eeba9bc9b	Make Ceph commands time out if needed	2019-07-09 14:35:53 -04:00
Joshua Boniface	19701c66e4	Move fencing to after keepalive output Just makes the messages a little easier to read when triggered.	2019-07-09 14:24:31 -04:00
Joshua Boniface	17dfaf43c5	Move hypervisor selection out to common	2019-07-09 14:20:58 -04:00
Joshua Boniface	b551b54642	Rename message when contending	2019-07-09 14:03:48 -04:00
Joshua Boniface	4249d5d982	Always load and store IPMI on daemon start Without this, the IPMI information set during initial node creation can never be changed, which can cause issues later. Instead, always set it fresh on each node boot.	2019-07-09 14:00:31 -04:00
Joshua Boniface	7f828a27a5	Free RBD locks when fencing node	2019-07-09 10:59:31 -04:00
Joshua Boniface	bc54ea2449	Log message when starting or stopping API client	2019-07-08 19:29:49 -04:00
Joshua Boniface	cda690e94f	Set RADOS df information in ZK	2019-07-08 10:19:56 -04:00
Joshua Boniface	d9ebd04264	Fix missing dom_uuid values in data reads	2019-07-07 15:30:28 -04:00
Joshua Boniface	b82ccaa84d	Improve flush handling Similar to recent client changes, don't replace the previous node record of an already-migrated VM. Wait for shutdown if required. Use a continue statement instead of a needless else block.	2019-07-07 15:27:37 -04:00
Joshua Boniface	0d398f663b	Rename "Domain" to "VM" in various class names The name "Domain", though technically correct from a Libvirt perspective, was unnecessarily confusing. Call the class instances what they are, VMs.	2019-07-07 15:20:37 -04:00
Joshua Boniface	8216125b02	Enable autostart of API client on Primary Adds a config flag that turns on the API client following the Primary coordinator. The retcode of the start/stop commands is ignore so this can fail gracefully if e.g. the client isn't installed.	2019-07-06 02:42:56 -04:00
Joshua Boniface	e6012965f1	Add YAML header to sample config files	2019-07-06 02:24:35 -04:00
Joshua Boniface	3e591bd09e	Remove extra whitespaces on blank lines	2019-06-25 22:33:23 -04:00
Joshua Boniface	08cb16bfbc	Revamp VM migration handling This was very old code that was hard to follow and quite fragile, with failures and infinite loops occurring fairly frequently. These changes make the code more robust, including the addition of timeouts, some code cleanup, and some improvements to the logical flow. Also forces the libvirt migration to occur on the cluster network, which couples to changes in the libvirtd listen (via pvc-ansible) and in Daemon.py via the previous commit.	2019-06-25 22:23:48 -04:00
Joshua Boniface	d336fce253	Connect to actual IP not localhost for Libvirt	2019-06-25 22:09:32 -04:00
Joshua Boniface	75d0e7f989	Revert "Only perform fencing duties on primary" This reverts commit `464c69aac6`. Actually, yea, this made sense - if the primary fails, it can't fence itself.	2019-06-25 12:36:48 -04:00
Joshua Boniface	85a5a8a0c9	Disable tx offloading on bridge interfaces Reference: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=717215#68 Without this, DHCP fails when traversing only the local bridge, for Debian Jessie or earlier (and possibly other OSes as well), due to the missing UDP checksums. This disables the offload and hence reenables the checksums even on the software-only bridge. Also rearranged the steps and added comments arround this section to better clarify what each command is doing.	2019-06-25 12:36:37 -04:00
Joshua Boniface	464c69aac6	Only perform fencing duties on primary There was really no need for this to be shared among all the coordinators, which seemed more fragile. This way only the primary will try to fence dead nodes.	2019-06-24 20:17:51 -04:00
Joshua Boniface	249611b161	Remove duplicate import	2019-06-24 20:14:43 -04:00
Joshua Boniface	ef272b0b7d	Add removal confirmations and zap disk before add	2019-06-21 15:52:28 -04:00
Joshua Boniface	867ad1fc1b	Support human-readable biconversion and in volumes	2019-06-21 09:23:52 -04:00
Joshua Boniface	ddedb1a992	Set image features to supported values	2019-06-19 15:19:36 -04:00
Joshua Boniface	0f15e7cda5	Set shutdown state after final keepalive	2019-06-19 14:52:47 -04:00
Joshua Boniface	0060c0313b	Put daemonstate to shutdown when stopping This way it isn't "run" all the way until it shuts down.	2019-06-19 14:23:07 -04:00
Joshua Boniface	9a0554fdbe	Remove all volumes from pool on removal Technically not needed, but otherwise random errors may be thrown, so best to be explicit.	2019-06-19 12:49:03 -04:00
Joshua Boniface	87907d4ce8	Remove size field from volume objects This data is just in the stats anyways.	2019-06-19 10:45:14 -04:00
Joshua Boniface	09562fdc06	Output in json format instead	2019-06-19 10:32:01 -04:00
Joshua Boniface	a940d03959	Fix some bugs and add RBD volume stats	2019-06-19 10:25:22 -04:00
Joshua Boniface	db0b382b3d	Don't bother with snapshot management by Daemon This is definitely not needed in the end, and just uses RAM for no conceivable purpose. Snapshots are fully client-managed.	2019-06-19 09:43:04 -04:00
Joshua Boniface	1c9f606480	Implement volume and snapshot handling by daemon This seems like a super-gross way to do this, but at the moment I don't have a better way. Maybe just remove this component since none of the volume/snapshot stuff is dynamic; will see as this progresses.	2019-06-19 09:40:32 -04:00
Joshua Boniface	784b428ed0	Add creation of volume and snapshot lists	2019-06-19 09:29:36 -04:00
Joshua Boniface	064e6455bc	Correct some more bugs	2019-06-19 00:29:21 -04:00
Joshua Boniface	a4ab3075ab	Correct some bugs around new code	2019-06-19 00:23:25 -04:00
Joshua Boniface	01959cb9e3	Implementation of RBD volumes and snapshots Adds the ability to manage RBD volumes (add/remove) and RBD snapshots (add/remove). (Working) list functions to come.	2019-06-19 00:12:44 -04:00
Joshua Boniface	2bbbda3da5	Only trigger pool updates on primary	2019-06-18 21:26:05 -04:00
Joshua Boniface	612f5ab52c	Strip pv_block from stdout	2019-06-18 20:34:25 -04:00
Joshua Boniface	1622226c32	Add more logging during OSD creation/deletion	2019-06-18 20:31:04 -04:00
Joshua Boniface	3adeef6fdd	Use the fsid to activate new OSDs	2019-06-18 20:22:28 -04:00
Joshua Boniface	443108f53d	Add support for enable/disable keepalive detail	2019-06-18 19:54:42 -04:00
Joshua Boniface	79f284a0a9	Pass logger into run_command	2019-06-18 13:45:59 -04:00
Joshua Boniface	080ca3201c	Correct actual problem with this_node	2019-06-18 13:43:54 -04:00
Joshua Boniface	d076f9f4eb	Use self.this_node everywhere	2019-06-18 13:25:16 -04:00
Joshua Boniface	aee078f3eb	Support disabling keepalive logging	2019-06-18 12:44:07 -04:00
Joshua Boniface	b0411e8e1a	Remove "error" message from Ceph commands This triggeres at every node start and isn't useful.	2019-06-18 12:41:38 -04:00
Joshua Boniface	8d9007f697	Remove OSD stat collection if count is zero Otherwise, ceph osd df will hang indefinitely trying to get data for the zero OSDs.	2019-06-18 12:36:53 -04:00
Joshua Boniface	5a327dc41a	Clean up Ceph pipeline and add more debug logs	2019-06-18 11:19:03 -04:00
Joshua Boniface	46a416bc78	Use a proper variable for vni_mtu	2019-06-18 00:01:12 -04:00
Joshua Boniface	1f92b90a3e	Don't encode initial data as we're using zkhander	2019-06-17 23:53:16 -04:00
Joshua Boniface	d4ebe63d9b	Rename network device field It seems much nicer and more consistent as "device" rather than as "name".	2019-06-17 23:44:41 -04:00
Joshua Boniface	1d3f868206	Unify network devices and addresses in config The old way of doing this was a little cumbersome, with an upper YAML tree split between "devices" (name and MTU) and addresses. This commit unifies these under the root "networking" section to make this section clearer.	2019-06-17 23:41:07 -04:00
Joshua Boniface	e70255dbd6	Support configurable interface MTUs MTUs were hardcoded at 9000, which breaks if the underlying interface or network switch does not support jumbo frames, a possible deployment limitation. This has non-obvious consequences due to MTU mismatches for certain services (Ceph, Zookeeper, etc.). This commit adds support for configurable MTUs for each interface, set in pvcd.yaml. The example has been updated to reflect this, with a default of 1500 (the Ethernet standard). This commit also adds autoconfiguration of the VNI device MTU based on the `vni_mtu` value, the same for bridge networks and minus 50 (rather than 200 from the hardcoded value, based on the following resource [1]) for VXLAN networks. [1] http://ipengineer.net/2014/06/vxlan-mtu-vs-ip-mtu-consideration/	2019-06-17 23:34:48 -04:00
Joshua Boniface	c583ee1709	Revert "Wait a little longer" This reverts commit `bd7a55e9e1`. This is not really needed, but do keep the 5s wait	2019-06-17 21:56:06 -04:00
Joshua Boniface	bd7a55e9e1	Wait a little longer	2019-06-17 12:14:13 -04:00
Joshua Boniface	23994f8a11	Increase wait time for daemons and log message	2019-06-17 10:30:46 -04:00
Joshua Boniface	fe654aa5a2	Correct typo in daemon	2019-06-16 19:27:20 -04:00
Joshua Boniface	14e9ba892c	Wait on both sides for 30s Still finding issues with the flush	2019-05-24 01:23:18 -04:00
Joshua Boniface	ae37afcf75	Wait 10 seconds when starting pvc-flush Without waiting the unflush will trigger too soon, before the daemon is fully ready and such it fails in odd ways.	2019-05-23 23:35:01 -04:00
Joshua Boniface	e8b666708c	Add one final keepalive update before exiting	2019-05-23 23:23:03 -04:00
Joshua Boniface	4c5ce9b995	Perform additional tweaks to units Use RemainAfterExit to avoid pvc-flush from auto-stopping immediately. Use PartOf to tie services to the target itself. Use --wait on flush to avoid daemon stopping before flush is complete.	2019-05-23 23:18:28 -04:00
Joshua Boniface	e46aa22989	Remove invalid Restart in pvc-flush.service	2019-05-23 22:51:36 -04:00
Joshua Boniface	7c6132f7dd	Add node autoflush service and target Add a systemd service to manage node flush/unflush, useful during system startup and shutdown to avoid requiring administrator intervention for this to occur. This is optional and the service is not enabled by default, and the postinst script informs the administrator of this. Also adds a systemd target to collect the two service units together and provide an easy way to flush+shutdown or startup+unflush the entire PVC system. Closes #28	2019-05-23 22:42:51 -04:00

... 6 7 8 9 10 ...

853 Commits