Home > AI Solutions > Gen AI > Guides > Design Guide—Generative AI in the Enterprise - Inferencing > Document purpose
This guide describes the Dell Validated Design for Generative AI Inferencing with NVIDIA.
It describes the design and reference architecture for a modular and scalable platform for generative AI in the enterprise. The guide focuses specifically on inferencing, which is the process of serving a trained model to generate predictions, make decisions, or produce outputs based on input data for production outcomes.
This design guide can be read along with the associated white paper Generative AI in the Enterprise. The white paper provides an overview of generative AI, including its underlying principles, benefits, architectures, and techniques; the various types of generative AI models and how they are used in real-world applications; the challenges and limitations of generative AI; and descriptions of the various Dell and NVIDIA hardware and software components used in the architecture. We also recommend reading the related design guide Generative AI in the Enterprise – Model Customization, which focuses on Inferencing and deployment of pretrained models and the technical white paper Generative AI in the Enterprise – Training, which focuses on training new models from scratch.