Home > AI Solutions > Artificial Intelligence > White Papers > Digital Assistant with Red Hat OpenShift AI on Dell APEX Cloud Platform for Red Hat OpenShift > Physical architecture
The following diagram presents the physical architecture overview of the solution.
Figure 5. Digital assistant physical architecture
Dell APEX Cloud Platform for Red Hat OpenShift compute layer setup is configured with four Dell APEX MC-760 nodes for running AI workloads. Dell APEX MC-760 server is a 2U, two-socket fully featured enterprise rack server powered by Intel Xeon processor family, which is designed to optimize even the most demanding workloads such as AI and ML. Compute nodes are powered by NVIDIA L40S GPUs. The NVIDIA L40S GPU is a powerful GPU for the data center, which comes with 48GB GDDR6 and supports PCIe Gen4 x16: 64GB/s bi-directional interconnect interface, delivering end-to-end acceleration for the next generation of AI workloads.
Four dedicated PowerFlex nodes are configured as storage nodes to provide persistent block storage for workloads. Dell PowerFlex is a software-defined storage infrastructure that delivers consistent predictable outcomes at large scale for the most demanding mission-critical environments.
Dell ObjectScale is a high-performance containerized object storage that is designed for scalable and cost-effective data management and provides an ideal solution for housing massive LLMs, large datasets, and pipeline artifacts. As shown in the solution architecture, Dell ObjectScale is deployed separately which hosts Llama 2 model, other datasets, and Red Hat OpenShift AI pipeline artifacts.
Two 25 GbE Dell PowerSwitch network switches are being used for data plane connectivity, and two 10 GbE Dell PowerSwitch network switches are being used for management plane connectivity. Bastion node is used to run VMs such as AD, DNS, NTP, and client VMs.