Home > Workload Solutions > Container Platforms > Red Hat OpenShift Container Platform > Archive > Implementation Guide—Red Hat OpenShift Container Platform 4.10 on AMD-powered Dell Infrastructure > NVIDIA GPU operator
NVIDIA GPU drivers are required to provision GPU on OpenShift Container Shift Platform. These drivers enable CUDA and allow workloads to consume GPU. The Dell engineering team provisioned NVIDIA A100 GPU on a compute node to validate the deployment.
This chapter describes the prerequisites and steps for installing the NVIDIA GPU operator.
The NFD operator is a prerequisite for the NVIDIA GPU operator.
Install the NFD Operator using the Red Hat OperatorHub.
Figure 51. Installing NFD operator
The recommended namespace, openshift-nfd, is created during operator installation.
Figure 52. NFD pod status
The NFD operator uses vendor PCI IDs to identify hardware in a node. NVIDIA uses the PCI ID 10de.
Figure 53. Checking the node labels
As a cluster administrator, install the NVIDIA GPU operator from the OpenShift Container Platform CLI or the web console.
In the OpenShift web console:
Figure 54. Installing the NVIDIA GPU operator
The suggested namespace to use is nvidia-gpu-operator.
Figure 55. GPU operator pod status
During the installation, a CRD for a ClusterPolicy is created. The ClusterPolicy configures the GPU stack, including the image, repository, and pod restrictions and credentials.
The platform assigns the default name gpu-cluster-policy.
Figure 56. Creating ClusterPolicy
The GPU operator installs all the required components to set up the NVIDIA GPUs in the cluster.
The status of the deployed ClusterPolicy gpu-cluster-policy for the NVIDIA GPU operator changes to State:ready.
Figure 57. ClusterPolicy status
The following figure shows the command output:
Figure 58. Output of nvidia-smi command