Home > AI Solutions > Artificial Intelligence > White Papers > White Paper - Virtualizing GPUs for AI with VMware and NVIDIA Based on Dell Infrastructure > VMware
VMware vSphere 8 includes the following features to support AI and machine learning workloads:
This validated design requires the vSphere Enterprise Plus editions. NVIDIA vGPU and distributed virtual switches (required for load balancing in Tanzu) require the Enterprise Plus edition.
vSphere with Tanzu enables administrators to transform vSphere into a platform for running Kubernetes workloads natively on the hypervisor layer. When enabled on a vSphere cluster, vSphere with Tanzu provides the capability to run Kubernetes workloads directly on ESXi hosts and to create upstream Kubernetes clusters in dedicated resource pools.
vSphere administrators can enable existing vSphere clusters for Workload Management, to create a Tanzu Kubernetes cluster in the ESXi hosts that are part of the cluster. The Tanzu Kubernetes cluster is a full distribution of the open-source Kubernetes container orchestration platform that is built, signed, and supported by VMware. Tanzu Kubernetes Grid (TKG) Service provisions and operates Tanzu Kubernetes cluster on vSphere.
Tanzu Kubernetes Grid (TKG), available with VMware vSphere 8, supports virtualizing NVIDIA GPUs through NVIDIA AI Enterprise. With TKG, virtual GPUs are automatically provisioned and configured on the Tanzu Kubernetes Cluster worker nodes and made available to AI workload containers.
VMware vSphere with Tanzu can be licensed through vSphere+ or Tanzu Kubernetes Operations. For more information, see the VMware vSphere Product Line Comparison and VMware Tanzu for Kubernetes Operations Documentation.
VMware offers several products under the Tanzu portfolio to enhance the capabilities of vSphere on Tanzu. These products enable administrators to build, run, and manage the AI workload along with modern applications and continuously deliver value to customers. Depending on the Tanzu edition, these software products are bundled with VMware vSphere with Tanzu and are fully supported by VMware. Some key products that are applicable to this validated design include:
NSX Advanced Load Balancer provides network access and load balancing for Tanzu Kubernetes clusters. You can use it to load balance AI use cases such as Machine Learning Operation applications or inference workloads.
The following additional software is available from VMware to manage and orchestrate container workloads. These software tools address general-purpose application development and are not validated as part of this validated design.
Tanzu Data Services is a portfolio of on-demand caching, messaging, and database software on VMware Tanzu for development teams building modern applications.
vSAN is a software-defined storage solution from VMware, built from the ground up for vSphere VMs. It abstracts and aggregates locally attached disks in a vSphere cluster to create a storage solution that you can provision and manage from vCenter and the vSphere client. vSAN is embedded in the hypervisor, therefore, storage and compute for VMs are delivered from the same x86 server platform running the hypervisor.
vSAN is the market leader in HCI infrastructure. Traditional applications such as Microsoft SQL Server and SAP HANA, and next-generation applications such as AI workloads can run on vSAN. Paradigms associated with traditional infrastructure deployment, operations, and maintenance include various disaggregated tools and often specialized skill sets. The hyperconverged approach of vSphere and vSAN simplifies these tasks using familiar tools to deploy, operate, and manage private-cloud infrastructure.
vSAN 8 Express Storage Architecture (ESA) is the latest major enhancement available for vSphere 8 clusters. vSAN 8 ESA uses a file system that is optimized to take full advantage of certified NVMe storage devices and 25 Gbps+ networking to greatly improve performance and capacity over previous versions. vSAN 7 is now referred to as Original Storage Architecture (OSA).
VMware vSAN is licensed per CPU socket. It is available in the following editions: Standard, Advanced, Enterprise, and Enterprise Plus. For this validated design, we recommend vSAN Enterprise license. vSphere Enterprise Plus, and VMware Tanzu Standard are required to use the Data Persistence platform. The Data Persistence platform is available in vSAN Enterprise and Enterprise Plus only.