Home > AI Solutions > Gen AI > White Papers > Dell Scalable Architecture for Retrieval-Augmented Generation (RAG) with NVIDIA Microservices > Solution design
The solution consists of a cloud-like model with Dell Technologies providing the infrastructure of PowerEdge with NVIDIA GPUs, running Ubuntu Server. Kubernetes simplifies management and connects to PowerScale using the Container Storage Interface driver to provide persistent LLM and RAG volumes. NVIDIA Cloud Stack and NVIDIA AI Enterprise simplify model management and deployment at scale.