Just-in-Time Workload Clusters
Nova can dynamically manage workload clusters to reduce cost and provide capacity when it is needed.
This includes placing idle clusters into standby, resuming clusters when required, and optionally creating new clusters from existing workload clusters.
When to Use This
Use this pattern when:
- Workload demand is variable
- Idle cloud clusters should be minimized to reduce cost
- Additional workload cluster capacity may be needed on demand
- Clusters should be made available only when scheduling requires them
How Nova Helps
Nova can align workload cluster availability with scheduling demand.
Depending on configuration, Nova can:
- Put idle workload clusters into standby
- Resume standby clusters when needed
- Delete and recreate clusters
- Create new clusters by cloning existing workload clusters
Standby Modes
Suspend/Resume
In suspend/resume mode, Nova scales cluster node pools to zero while keeping the cluster control plane intact.
- Faster recovery (~2 minutes)
- Lower cost savings compared to full deletion
Delete/Recreate
In delete/recreate mode, Nova deletes the cluster entirely and recreates it when needed.
- Greater cost savings
- Longer startup time (~3–15 minutes)
Cluster Creation
When enabled, Nova can create new workload clusters by cloning an existing cluster.
This is used when:
- No existing cluster satisfies placement requirements
- Additional capacity is required
- No existing workload cluster is running an autoscaler recognized by Nova (such as Elotl Luna or Kubernetes Cluster Autoscaler) that can provision the required capacity
Cluster creation is driven by the source cluster configuration. When an autoscaler is available, Nova will prefer to use it for provisioning and scaling before creating a new workload cluster.
Behavior with Autoscaling
When an autoscaler is available, Nova will leverage it for node provisioning and scaling, and will prefer using the autoscaler over just-in-time cluster creation when sufficient capacity can be provisioned this way. This includes integrations with autoscalers such as Elotl Luna and Kubernetes Cluster Autoscaler, though their presence is optional and not required for just-in-time provisioning.
Considerations
Just-in-time cluster behavior depends on:
- Cloud provider support
- Cluster creation and startup latency
- Nova configuration
- Whether candidate clusters already have autoscaling enabled
- Whether an existing cluster is available to clone