Document/Fix RAM safety issues #104
As it stands, PVC has little in the way of helpful "bounds-checking" or "safety checking" of node RAM usage and VM RAM allocations. As a result, a node can quite easily end up stuck in a heavy swapping state, or crash outright.
Document the deficiencies and advise on keeping the node memory usage carefully controlled.
Over time, develop safeguards in PVC against this. As a first step, warn the cluster when total memory usage is greater than the n-1 capacity, i.e. the total cluster memory minus that of the largest server.
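To illustrate the proposed warning, here is a minimal sketch of an n-1 check. It assumes "n-1 of the largest server" means the total cluster RAM minus the RAM of the largest node; the `Node` class, its field names, and the example figures are hypothetical and are not taken from the PVC codebase.

```python
from dataclasses import dataclass


@dataclass
class Node:
    # Hypothetical record; not PVC's actual node schema.
    name: str
    total_mb: int  # physical RAM on the node
    used_mb: int   # RAM currently in use on the node


def n_minus_1_capacity(nodes: list[Node]) -> int:
    """Cluster RAM that would remain if the largest node were lost."""
    total = sum(n.total_mb for n in nodes)
    largest = max(n.total_mb for n in nodes)
    return total - largest


def over_n_minus_1(nodes: list[Node]) -> bool:
    """True when cluster-wide usage exceeds the n-1 capacity."""
    used = sum(n.used_mb for n in nodes)
    return used > n_minus_1_capacity(nodes)


nodes = [
    Node("hv1", total_mb=65536, used_mb=40000),
    Node("hv2", total_mb=65536, used_mb=45000),
    Node("hv3", total_mb=131072, used_mb=50000),
]

if over_n_minus_1(nodes):
    print("WARNING: cluster memory usage exceeds n-1 capacity")
```

In this example the cluster is using 135000 MB against an n-1 capacity of 131072 MB, so losing hv3 would leave too little RAM to hold the current workload.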
changed milestone to %4
Add some sub-issues:
First two solutions have been implemented.
An additional node memory field, "provisioned", tracks the total provisioned memory of all VMs on the node, both running (as in "allocated") and non-running. It is shown in the node list and node details, and in the CLI it is shown in yellow, like the allocated memory, if the limit is violated.
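The difference between the two counters can be sketched as follows. The `VM` class and its fields are hypothetical. The limit being compared against is also an assumption, taken here to be the node's total RAM, since the issue does not state it.

```python
from dataclasses import dataclass


@dataclass
class VM:
    # Hypothetical record; not PVC's actual VM schema.
    name: str
    memory_mb: int
    running: bool


def allocated_mb(vms: list[VM]) -> int:
    """Memory assigned to running VMs only."""
    return sum(vm.memory_mb for vm in vms if vm.running)


def provisioned_mb(vms: list[VM]) -> int:
    """Memory assigned to all VMs, whether running or not."""
    return sum(vm.memory_mb for vm in vms)


def over_limit(value_mb: int, node_total_mb: int) -> bool:
    """Assumed rule: flag the value when it exceeds the node's total RAM."""
    return value_mb > node_total_mb


vms = [
    VM("web1", memory_mb=16384, running=True),
    VM("db1", memory_mb=32768, running=True),
    VM("batch1", memory_mb=32768, running=False),
]
node_total_mb = 65536

print("allocated:", allocated_mb(vms), over_limit(allocated_mb(vms), node_total_mb))
print("provisioned:", provisioned_mb(vms), over_limit(provisioned_mb(vms), node_total_mb))
```

Here the allocated figure fits within the node, but the provisioned figure does not: starting the stopped VM would overcommit the node, which is the situation the new field is meant to expose.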
The "mem" migration selector has been modified to use this "provisioned" memory count, instead of "allocated", to better take into account situations where some VM(s) are stopped/non-running during a migration.
Next steps:
The next step has been implemented as well. This makes things much safer in theory.
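For reference, the effect of switching the memory-based migration selector from "allocated" to "provisioned" can be sketched as below. This assumes the selector picks the candidate node with the most RAM left after subtracting the chosen counter; the `NodeMem` class, the `select_target` function, and the figures are hypothetical and do not reflect PVC's actual implementation.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class NodeMem:
    # Hypothetical record; not PVC's actual node schema.
    name: str
    total_mb: int
    allocated_mb: int    # memory of running VMs
    provisioned_mb: int  # memory of all VMs, running or not


def select_target(nodes: list[NodeMem], exclude: str,
                  use_provisioned: bool = True) -> Optional[str]:
    """Pick the node with the most free RAM, skipping the source node."""
    candidates = [n for n in nodes if n.name != exclude]
    if not candidates:
        return None

    def free_mb(n: NodeMem) -> int:
        counted = n.provisioned_mb if use_provisioned else n.allocated_mb
        return n.total_mb - counted

    return max(candidates, key=free_mb).name


nodes = [
    NodeMem("hv1", total_mb=65536, allocated_mb=50000, provisioned_mb=50000),
    NodeMem("hv2", total_mb=65536, allocated_mb=20000, provisioned_mb=60000),
    NodeMem("hv3", total_mb=65536, allocated_mb=30000, provisioned_mb=32000),
]

print(select_target(nodes, exclude="hv1", use_provisioned=False))
print(select_target(nodes, exclude="hv1", use_provisioned=True))
```

Counting only allocated memory makes hv2 look like the emptiest node even though most of its RAM is promised to stopped VMs; counting provisioned memory selects hv3 instead.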
removed milestone
changed milestone to %5
closed via commit 9bfcab5e2b
Between the warning now implemented and the documentation changes, I consider this issue resolved. If administrators override the warning or ignore the documented advice, undefined behaviour can be expected.