Direct from Development - Acceleration over Ethernet for Dell EMC PowerEdge MX7000
Mon, 09 Nov 2020 23:18:01 -0000
Summary
Many of today’s demanding applications require GPU resources. This reference architecture incorporates GPUs into the PowerEdge MX infrastructure, utilizing the PowerEdge MX Scalable Fabric, the Dell EMC DSS 8440 GPU server, and Liqid Command Center software.
Request a remote demo of this reference architecture or a quote from Dell Technologies Design Solutions Experts at the Design Solutions Portal.
Background
Emerging workloads such as AI represent a highly uneven series of compute processes, such as data-heavy ingest and GPU-heavy training. Coupled with the fact that these workloads can demand even more resources over time, it becomes clear that this complex new paradigm demands a new type of IT infrastructure.
The Dell EMC PowerEdge MX7000 modular chassis simplifies the deployment and management of today’s challenging workloads by allowing IT to dynamically assign, move, and scale shared pools of compute, storage, and networking. It lets IT deliver fast results instead of spending time managing and reconfiguring infrastructure to meet ever-changing needs. Composable GPU Infrastructure from Liqid, powered by Dell Technologies, expands the promise of software-defined composability for today’s AI-driven compute environments and high-value applications.
GPU Acceleration for MX7000
For unique workloads like AI that require accelerated computing, the addition of GPU acceleration within the MX7000 is paramount. With Liqid, supported GPUs can be quickly added to any new or existing MX7000 compute sled, delivering the resources needed to handle each step of the AI workflow, including data ingest, cleaning/tagging, training, and inference. Spin up new bare-metal servers with the exact number of GPUs required, and dynamically add or remove them as needed via Liqid software.
Essential PowerEdge Components and Ethernet Cabling
Liqid Command Center Software
The first step in the GPU expansion process is to install up to 16x HHHL or 10x FHFL GPUs into a Dell EMC DSS 8440 server. As noted in Table 1, this solution supports several GPU device options. The next step is to connect the DSS 8440 to Fabric A on the MX7000 via 100 GbE.
Liqid Command Center software resides on the fabric; it discovers the GPU devices in the DSS 8440 and enables them for use by the MX7000 compute nodes. Users can distribute GPU-centric jobs from any compute sled on the MX7000 to GPUs located within the DSS 8440.
Accelerator Performance
To demonstrate the performance of GPU-accelerated MX7000 compute sleds, we tested them against a DSS 8440 server with local GPUs and measured minimal to no overhead. The deep learning benchmark tests were run on the following networks: ResNet-50, ResNet-152, Inception V3, and VGG-16. The DSS 8440 was outfitted with 8x NVIDIA RTX 8000 GPUs. The results clearly demonstrate that a GPU-enabled MX7000 delivers unrestricted performance on various industry-standard benchmarks, using accelerator-optimized Dell PowerEdge infrastructure.
In Conclusion
GPU expansion for the MX7000 unlocks the ability to handle the most demanding compute workloads for both new and existing AI and HPC deployments. Liqid Command Center on Dell EMC PowerEdge Servers accelerates applications by dynamically composing GPU resources directly to workloads without a power cycle on the compute sled.
Related Blog Posts
Reference Architecture: Acceleration over PCIe for Dell EMC PowerEdge MX7000
Thu, 12 Nov 2020 19:31:58 -0000
Summary
Many of today's demanding applications require GPU resources. Our reference architecture incorporates GPUs into the PowerEdge MX infrastructure, utilizing the PowerEdge MX Scalable Fabric, Dell EMC DSS 8440 GPU Server, and Liqid Command Center Software. Request a remote demo of this reference architecture or a quote from Dell Technologies Design Solutions Experts at the Design Solutions Portal.
Background
The Dell EMC PowerEdge MX7000 Modular Chassis simplifies the deployment and management of today’s most challenging workloads by allowing IT administrators to dynamically assign, move, and scale shared pools of compute, storage, and networking resources. It gives IT administrators the ability to deliver fast results, eliminating the need to manage and reconfigure infrastructure to meet the ever-changing needs of their end users. The addition of PCIe infrastructure to this managed pool of resources, using Liqid technology designed on the Dell EMC MX7000, expands the promise of software-defined composability for today’s AI-driven compute environments and high-value applications.
GPU Acceleration for PowerEdge MX7000
For workloads like AI that require parallel accelerated computing, the addition of GPU acceleration within the PowerEdge MX7000 is paramount. With Liqid technology and management software, GPUs of any form factor can be quickly added to any new or existing MX compute sled via the management interface, quickly delivering the resources needed to manage each step of the machine learning workflow, including data ingest, cleansing, training, and inferencing. Spin up new bare-metal servers with the exact number of accelerators required, and then dynamically add or remove them as workload needs change.
Table 1. GPU Expansion Over PCIe
Compute Sleds | Up to 8x compute sleds per chassis
GPU Chassis | PCIe expansion chassis
Interconnect | PCIe Gen3 x4 per compute sled
GPU Expansion | 20x GPUs (FHFL)
GPUs Supported | V100, A100, RTX, T4, others
OS Supported | Linux, Windows, VMware, and others
Devices Supported | GPU, FPGA, and NVMe storage
Form Factor | 14U total = MX7000 (7U) + PCIe expansion chassis (7U)
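To put the interconnect row above in perspective, the theoretical one-direction bandwidth of the PCIe Gen3 x4 link to each compute sled can be worked out from the PCIe spec constants (a back-of-the-envelope sketch, not a measured figure):

```python
# Rough theoretical-bandwidth check for the PCIe Gen3 x4 link per
# compute sled. Constants come from the PCIe Gen3 spec, not measurement.

GT_PER_S_PER_LANE = 8.0          # PCIe Gen3 raw rate: 8 GT/s per lane
ENCODING_EFFICIENCY = 128 / 130  # Gen3 uses 128b/130b encoding
LANES = 4                        # x4 link per compute sled

def pcie_gen3_bandwidth_gbytes(lanes: int) -> float:
    """Theoretical one-direction bandwidth in GB/s for a Gen3 link."""
    gbits = GT_PER_S_PER_LANE * ENCODING_EFFICIENCY * lanes
    return gbits / 8  # bits -> bytes

if __name__ == "__main__":
    # Roughly 3.94 GB/s per direction for the x4 link
    print(f"PCIe Gen3 x{LANES}: ~{pcie_gen3_bandwidth_gbytes(LANES):.2f} GB/s")
```

This is the ceiling per sled before protocol overhead; actual throughput depends on workload and transfer sizes.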
Implementing GPU Expansion for MX
GPUs are installed into the PCIe expansion chassis. Next, U.2-to-PCIe Gen3 x4 adapters are added to each compute sled that requires GPU acceleration, and the sleds are connected to the expansion chassis (Figure 1). Liqid Command Center software enables discovery of all GPUs, making them ready to be added to a server over native PCIe. FPGA and NVMe storage devices can also be added to compute nodes in tandem. The PCIe expansion chassis and software are available from the Dell Design Solutions team.
Software Defined Composability
Once PCIe devices are connected to the MX7000, Liqid Command Center software enables the dynamic allocation of GPUs to MX compute sleds at the bare-metal level. Any number of resources can be added to the compute sleds, in any ratio, via the Liqid Command Center GUI or RESTful API (GPU hot-plug is supported). To the operating system, the GPUs are presented as local resources directly connected to the MX compute sled over PCIe (Figure 3). All major operating systems are supported, including Linux, Windows, and VMware. As workload needs change, resources, including NVMe SSD and FPGA devices, can be added or removed on the fly via software (Table 1).
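A composition request driven through a RESTful API could look roughly like the sketch below. The endpoint path, hostname, and payload field names here are hypothetical placeholders for illustration only, not the documented Liqid Command Center API; consult the Liqid API reference for the real calls.

```python
# Illustrative sketch of composing GPUs to an MX compute sled via a
# RESTful API. Endpoint, hostname, and field names are HYPOTHETICAL,
# not the documented Liqid Command Center API.
import json
from urllib import request

LIQID_HOST = "liqid-director.example.com"  # assumed hostname

def build_compose_request(sled_id: str, gpu_count: int) -> dict:
    """Build a (hypothetical) compose-request payload."""
    return {
        "machine": sled_id,    # target MX compute sled
        "devices": {
            "gpu": gpu_count,  # number of GPUs to attach
        },
    }

def compose(sled_id: str, gpu_count: int) -> request.Request:
    """Wrap the payload in a POST request (not sent here)."""
    payload = json.dumps(build_compose_request(sled_id, gpu_count)).encode()
    return request.Request(
        f"https://{LIQID_HOST}/api/compose",  # hypothetical endpoint
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# req = compose("mx740c-sled-3", 4)
# request.urlopen(req)  # would submit the compose request
```

The point of the sketch is the workflow shape: build a machine/device mapping, submit it, and the fabric attaches the GPUs without a power cycle on the sled.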
Enabling GPU Peer-2-Peer Capability
A key feature included with the PCIe expansion solution for the PowerEdge MX7000 is RDMA Peer-2-Peer between GPU devices. Direct RDMA transfers have a massive impact on both throughput and latency for the highest-performing GPU-centric applications; up to a 10x performance improvement has been achieved with RDMA Peer-2-Peer enabled. Figure 4 provides an overview of how PCIe Peer-2-Peer functions.
Bypassing the x86 processor and enabling direct RDMA communication between GPUs yields a dramatic improvement in bandwidth as well as a reduction in latency. Table 2 outlines the performance expected for GPUs that are composed to a single node with GPU RDMA Peer-2-Peer enabled.
Application Level Performance
RDMA Peer-2-Peer is a key feature in GPU scaling for artificial intelligence, specifically machine-learning-based applications. Figure 5 outlines performance data measured on mainstream AI/ML applications on the MX7000 with GPU expansion over PCIe. It further demonstrates the performance scaling from 1 GPU to 8 GPUs for a single MX740c compute sled. High scaling efficiency is observed for ResNet152, VGG16, Inception V3, and ResNet50 on the MX7000 with composable PCIe GPUs, measured with Peer-2-Peer enabled. These results indicate a near-linear growth pattern, and with the current capabilities of the Liqid 7U PCIe expansion chassis, up to 20 GPUs can be allocated to an application running on a single node.
Conclusion
Liqid PCIe expansion for the Dell EMC PowerEdge MX7000 unlocks the ability to manage the most demanding workloads in which accelerators are required, for both new and existing deployments. Liqid collaborated with Dell Technologies Design Solutions to accelerate applications through the addition of GPUs to Dell EMC MX compute sleds over PCIe.
Learn More | See a Demo | Get a Quote
This reference architecture is available as part of the Dell Technologies Design Solutions.
Dell Technologies PowerEdge MX 100 GbE solution with external Fabric Switching Engine
Mon, 26 Jun 2023 20:31:38 -0000
The Dell PowerEdge MX platform is advancing its position as the leading high-performance data center infrastructure by introducing a 100 GbE networking solution. This evolved networking architecture not only provides the benefit of 100 GbE speed but also increases the number of MX7000 chassis within a Scalable Fabric. The 100 GbE networking solution brings a new type of architecture, starting with an external Fabric Switching Engine (FSE).
PowerEdge MX 100 GbE solution design example
The diagram shows only one connection on each MX8116n for simplicity. See the port-mapping section in the networking deployment guide.
Figure 1. 100 GbE solution example topology
Components for 100 GbE networking solution
The key hardware components for 100 GbE operation within the MX platform are briefly described below.
Dell Networking MX8116n Fabric Expander Module
The MX8116n FEM includes two QSFP56-DD interfaces, each providing up to 4x 100 Gbps connections to the chassis, 8x 100 GbE internal server-facing ports for 100 GbE NICs, and 16x 25 GbE ports for 25 GbE NICs.
The MX7000 chassis supports up to four MX8116n FEMs in Fabric A and Fabric B.
Figure 2. MX8116n FEM
The following MX8116n FEM components are labeled in the preceding figure:
- Express service tag
- Power and indicator LEDs
- Module insertion and removal latch
- Two QSFP56-DD fabric expander ports
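The MX8116n fan-out described above can be sanity-checked with simple arithmetic: two QSFP56-DD fabric interfaces at 4x 100 Gbps each exactly match the eight 100 GbE server-facing ports.

```python
# Sanity check of the MX8116n fan-out: two QSFP56-DD fabric interfaces,
# each carrying up to 4x 100 Gbps, versus the internal server-facing ports.

QSFP56_DD_PORTS = 2
LINKS_PER_PORT = 4        # 4x 100 Gbps per QSFP56-DD interface
LINK_SPEED_GBPS = 100

uplink_gbps = QSFP56_DD_PORTS * LINKS_PER_PORT * LINK_SPEED_GBPS
server_facing_gbps = 8 * 100   # 8x 100 GbE internal server-facing ports

print(uplink_gbps)         # 800 Gbps of fabric-facing capacity
print(server_facing_gbps)  # 800 Gbps server-facing: non-blocking for 100 GbE NICs
```

With 100 GbE NICs, fabric-facing and server-facing capacity balance at 800 Gbps, so the FEM introduces no oversubscription.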
Dell PowerEdge MX760c compute sled
- The MX760c is ideal for dense virtualization environments and can serve as a foundation for collaborative workloads.
- Businesses can install up to eight MX760c sleds in a single MX7000 chassis and combine them with compute sleds from different generations.
- Single or dual CPU (up to 56 cores per processor/socket with 4x UPI @ 24 GT/s) and 32x DDR5 DIMM slots with eight memory channels.
- 8x E3.S NVMe (Gen5 x4) SSDs, or 6x 2.5" SAS/SATA SSDs, or 6x NVMe (Gen4) SSDs, plus iDRAC9 with Lifecycle Controller.
Note: The 100 GbE Dual Port Mezzanine card is also available on the MX750c.
Figure 3. Dell PowerEdge MX760c sled with eight E3.S SSD drives
Dell PowerSwitch Z9432F-ON external Fabric Switching Engine
The Z9432F-ON provides state-of-the-art, high-density 100/400 GbE ports and a broad range of functionality to meet the growing demands of modern data center environments. The compact 1RU design offers an industry-leading density of 32 ports of 400 GbE in QSFP56-DD, 128 ports of 100 GbE, or up to 144 ports of 10/25/50 GbE through breakout. Its up to 25.6 Tbps non-blocking (full duplex) switching fabric delivers line-rate performance under full load. The switch supports L2 multipath using Virtual Link Trunking (VLT) and Routed VLT, along with scalable L2 and L3 Ethernet switching with QoS and a full complement of standards-based IPv4 and IPv6 features, including OSPF and BGP routing support.
Figure 4. Dell PowerSwitch Z9432F-ON
Note: Mixed dual port 100 GbE and quad port 25 GbE mezzanine cards connecting to the same MX8116n are not a supported configuration.
100 GbE deployment options
There are four deployment options for the 100 GbE solution, and every option requires servers with a dual port 100 GbE mezzanine card. You can install the mezzanine card in mezzanine slot A, slot B, or both. When you use the Broadcom 575 KR dual port 100 GbE mezzanine card, you should set the Z9432F-ON port-group to unrestricted mode and configure the port mode for 100g-4x.
PowerSwitch CLI example:
port-group 1/1/1
profile unrestricted
port 1/1/1 mode Eth 100g-4x
port 1/1/2 mode Eth 100g-4x
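When several port-groups need the same breakout treatment, the configuration above can be generated rather than typed by hand. The helper below is just an illustrative convenience that emits the CLI syntax shown in this section; it is not a Dell-provided tool.

```python
# Small helper that emits the Z9432F-ON port-group configuration shown
# above for a list of ports. Illustrative only, not a Dell tool; the CLI
# syntax mirrors the example in this section.

def breakout_config(port_group: str, ports: list[str]) -> str:
    """Render an unrestricted port-group with 100g-4x port modes."""
    lines = [f"port-group {port_group}", "profile unrestricted"]
    lines += [f"port {p} mode Eth 100g-4x" for p in ports]
    return "\n".join(lines)

print(breakout_config("1/1/1", ["1/1/1", "1/1/2"]))
```

Running this reproduces the example configuration for port-group 1/1/1 and can be extended to additional port-groups as more MX8116n uplinks are cabled.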
Note: In a 100 GbE solution deployment, a maximum of 14 chassis are supported in a single fabric, and a maximum of 7 chassis are supported in a dual fabric, using the same pair of FSEs.
Single fabric
In a single fabric deployment, two MX8116n FEMs can be installed in either Fabric A or Fabric B, and the 100 GbE mezzanine card is installed in the corresponding sled slot (slot A or slot B).
Figure 5. 100 GbE Single Fabric
Dual fabric combined fabrics
In this option, four MX8116n FEMs (2x in Fabric A and 2x in Fabric B) can be installed and combined to connect to the Z9432F-ON external FSE.
Figure 6. 100 GbE Dual Fabric combined Fabrics
Dual fabric separate fabrics
In this option, four MX8116n FEMs (2x in Fabric A and 2x in Fabric B) can be installed and connected to two different networks. In this case, the MX760c server module has two mezzanine cards, with each card connected to a separate network.
Figure 7. 100 GbE Dual Fabric separate Fabrics
Dual fabric, single MX8116n in each fabric, separate fabrics
In this option, two MX8116n FEMs (1x in Fabric A and 1x in Fabric B) can be installed and connected to two different networks. In this case, the MX760c server module has two mezzanine cards, each connected to a separate network.
Figure 8. 100 GbE Dual Fabric single FEM in separate Fabrics
References
Dell PowerEdge Networking Deployment Guide
A chapter about the 100 GbE solution with an external Fabric Switching Engine