GPU Workloads with Nova
Overview
Nova supports running GPU-accelerated workloads using both NVIDIA and AMD GPUs. Nova also respects nodeSelector constraints which can be used to specify GPU characteristics. In addition, Nova supports both simple and mixed mode NVIDIA MIG.
Prerequisites
- Nova workload cluster with GPU-enabled nodes
- Appropriate GPU drivers, such as the
gpu-operatorinstalled on the workload clusters
GPU Resource Types
Nova recognizes the following GPU resource types:
- NVIDIA GPUs:
nvidia.com/gpu - AMD GPUs:
amd.com/gpu - NVIDIA mixed mode MIG GPUs:
nvidia.com/mig-*
GPU Resource Monitoring
Available GPU resources on Nova workload clusters can be monitored using the novactl command (or nova kubectl plugin):
kubectl --context=nova get clusters -o wide
This will display GPU, CPU and Memory resources:
NAME K8S-VERSION K8S-CLUSTER NOVA-CREATED PROVIDER REGION ZONE AVAIL-CPU AVAIL-MEM AVAIL-NVIDIAGPU AVAIL-AMDGPU READY IDLE STANDBY
wlc-1 1.33 worklc-12232 false azure eastus eastus-2 16019m 102957284Ki 3 0 True False False
wlc-2 1.33 worklc-30337 false azure eastus eastus-2 12516m 91274704Ki 3 0 True False False