Our results are based on single-node Dell PowerEdge XE9680, XE8640, and R760xa servers with a combination of four NVIDIA GPUs:
We also submitted with previous generation PowerEdge servers, such as the PowerEdge XE8545 and R750xa servers with NVIDIA A100 GPUs, to help compare versions.
We submitted the DLRMv2 results on the Dell PowerEdge XE9680 and XE8640 servers with NVIDIA H100 and NVIDIA A100 GPUs.
We ran tests on different operating systems to show the performance difference. The testing shows that Dell PowerEdge servers and NVIDIA accelerators work well for different workloads and models.
Our MLPerf Training v3.0 submission includes multinode and single-node systems. This whitepaper describes the performance of the latest generation single-node systems.
Note: A future whitepaper will describe the performance of multinode systems.
The following general syntax is used for the system name:
<Number of servers> x < Dell server name> x <number of accelerators> x <NVIDIA accelerator name>
To identify a single-node system, note that there are no entries for the number of servers or the number of servers is equal to one. For example, R750xax4A100-PCIE-80GB is a single-node system.
The following table lists the single-node systems to show performance improvement rendered by the newer generation servers.
Note: All systems include NVMe drives.
Table 3. Dell single-node systems
MLPerf ID | MLPerf system | Operating system | CPU | Memory | GPU | GPU form factor | GPU TDP
| GPU count | Software stack |
3.0-2053 | XE9680x8H100-SXM-80GB | Ubuntu 22.04.2 LTS
| Intel Xeon Platinum 8480+
| 1.024 TB | NVIDIA H100-SXM5-80GB | SXM5
| 700 W
| 8
| NGC MXNet 23.04 NGC PyTorch 23.04 NGC HugeCTR 23.04
|
3.0-2052 | XE9680x8A100-SXM-80GB | 2.048 TB
| NVIDIA A100-SXM-80GB CTS | ||||||
3.0-2051 | XE8640x4H100-SXM-80GB | NVIDIA H100-SXM5-80GB CTS | 500 W | 4
| |||||
3.0-2048 | R760xax4H100-PCIE-80GB | Ubuntu 20.04.6 LTS | 1.0 TB
| NVIDIA H100-PCIe-80GB | PCIe (Gen 5 on server) | 350 W | |||
3.0-2050 | XE8545x4A100-SXM-80GB | Ubuntu 20.04.4 | AMD EPYC 7763 64-Core Processor | NVIDIA A100-SXM-80GB CTS | SXM4 | 500 W | NGC MXNet 22.09 NGC PyTorch 22.09 NGC TensorFlow 22.09-tf1 | ||
3.0-2047 | R750xax4H100-PCIE-80GB | Ubuntu 20.04.6 LTS | Intel Xeon Gold 6338 | NVIDIA H100-PCIe-80GB | PCIe (Gen 4 on server) | 310 W | NGC MXNet 23.04 NGC PyTorch 23.04 NGC HugeCTR 23.04 |