Upcoming Features
This section lists some of the new features that will be coming soon to Nova.
Adding DRA support to Nova Capacity-based Scheduling
Dynamic Resource Allocation (DRA) is a Kubernetes feature for flexibly requesting, configuring, and sharing specialized devices like GPUs. DRA is GA and is on by default in K8s 1.34. NVIDIA has a DRA Driver for GPUs that works with its GPU Operator.
Nova currently supports capacity-based placement for GPUs via resource limit specification, e.g., nvidia.com/gpu and amd.com/gpu, and Nova respects nodeSelector constaints which can be used to specify node characteristics. Nova also supports simple and mixed mode NVIDIA MIG. Adding DRA support to Nova capacity-based scheduling will provide a powerful new option for expressing workload GPU resource requirements.
Interoperation with Kueue
We are investigating how Nova can interoperate with Kueue, particularly in the area of available resources policy.
Sound interesting?
If any of these features are of interest to you, please write to us at info@elotl.co. We would love to learn more about your specific use-case and explore how best to help you.