PowerEdge R760 HiBench- K-Means test report
Thu, 14 Dec 2023 18:12:20 -0000
Summary
Companies should always be looking for ways to better serve their customers. Customers are overwhelmed with information and often make buying decisions based on existing relationships. Companies looking to expand their relationships with customers can benefit from combining Machine Learning technologies with Data Mining to better understand their customers’ needs and to tailor their offerings to those needs.
Earlier this year, Dell and Intel conducted testing to determine how the new PowerEdge Server family utilizing Intel® 4th Generation Xeon® Scalable Processors could improve a company’s Data Mining efforts with Machine Learning technologies.
HiBench is a big data benchmark suite that helps evaluate different big data frameworks in terms of speed, throughput, and system resource utilization. Part of the HiBench framework focuses on Machine Learning and uses Bayesian Classification and K-Means Clustering to measure the relative performance of systems in a Machine Learning environment. The information below highlights the performance differences between a Dell PowerEdge R750 server with 3rd Generation Intel® Xeon® Scalable processors and the new Dell PowerEdge R760 with 4th Generation Intel® Xeon® Scalable processors.
All testing was conducted in Dell Labs by Intel and Dell Engineers in January of 2023.
Solution Overview
One of the primary benefits of the new 4th Generation Intel® Xeon® Scalable processors is core count. The previous generation of processors offered a maximum of 40 cores while the new processor family scales up to 56 cores. For the testing outlined in this report, we decided to use the new Intel® Xeon® Platinum 8470 processor which provides 52 cores. For the previous generation processor, we chose the Intel® Xeon® Platinum 8380 which provides 40 cores.
In addition to increased core count, the 4th Generation processors also support faster memory. The Dell R750 system we tested was configured with 512GB of memory (16x32GB DDR4) running at 3200MT/s. The new Dell R760 system was also configured with 512GB of memory (16x32GB DDR5), which operates at 4800MT/s.
Our testing used the K-Means element of the HiBench suite. This algorithm aims to partition n observations into k clusters, assigning each observation to the cluster with the nearest mean.
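HiBench drives this workload through Spark MLlib at cluster scale; purely as an illustration of the algorithm being measured (not the benchmark code), here is a minimal plain-Python sketch of Lloyd's K-Means with a simple deterministic initialization:

```python
def kmeans(points, k, iterations=10):
    """Partition n observations into k clusters (Lloyd's algorithm)."""
    centers = [points[i] for i in range(k)]  # simple deterministic initialization
    clusters = []
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest center.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k),
                          key=lambda i: sum((a - b) ** 2 for a, b in zip(p, centers[i])))
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its assigned points.
        for i, members in enumerate(clusters):
            if members:
                centers[i] = tuple(sum(d) / len(members) for d in zip(*members))
    return centers, clusters
```

Given two well-separated groups of points and k=2, the centers converge to the two group means after a few iterations; Spark parallelizes exactly these assignment and update steps across the cluster, which is why core count and memory bandwidth matter for this workload.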
Methodology
Each system was configured with the same number of processors, the same amount of memory, and the same hard drive configuration. Each test bed was then subjected to two “warm-up” cycles before running three iterations of the benchmark. The results of the three iterations were averaged to determine processing time.
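The warm-up-then-average protocol can be sketched generically; the harness below is illustrative only (`workload` stands in for a benchmark run, and none of these names come from HiBench):

```python
import time

def benchmark(workload, warmups=2, iterations=3):
    """Run `workload` after warm-up cycles; return the average wall time."""
    for _ in range(warmups):
        workload()                              # warm-up runs: results discarded
    timings = []
    for _ in range(iterations):
        start = time.perf_counter()
        workload()
        timings.append(time.perf_counter() - start)
    return sum(timings) / len(timings)          # average processing time in seconds
```

Discarding the warm-up runs avoids measuring one-time costs such as JVM warm-up and cache population, and averaging several timed iterations smooths out run-to-run variance.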
Hardware Configurations tested
| | PowerEdge R750 | PowerEdge R760 |
|---|---|---|
| CPU | 2x Intel® Xeon® Platinum 8380 40-core processors | 2x Intel® Xeon® Platinum 8470 52-core processors |
| Base frequency | 2.3GHz | 2.0GHz |
| Turbo frequency | 3.4GHz | 3.8GHz |
| All-core turbo frequency | 3.0GHz | 3.0GHz |
| Network card | Intel® E810-C dual-port 100Gb/s | Intel® E810-C dual-port 100Gb/s |
| Boot drives | 1x 1.6TB Dell Ent NVMe | 1x 1.6TB Dell Ent NVMe |
| Primary storage | 6x 3.2TB NVMe Solidigm* D7-P5620 | 6x 3.2TB NVMe Solidigm* D7-P5620 |

*D7-P5620 drives supplied by Solidigm (formerly Intel)
Software Configuration
| | All nodes |
|---|---|
| OS | Red Hat® Enterprise Linux 8.6 |
| Toolkit | HiBench-7.1.1, 3.1.1 |
| JNI | Netlib-java 1.1 |
| BLAS libraries | OpenBLAS 0.3.15 |
| Hadoop distribution | Cloudera 7.1.7 |
| Compute engine | Spark 3.1.1 |
Test Results
Key takeaways:
- 78% performance gain with the 4th Generation Intel® Xeon® 8470 compared to the 3rd Generation Intel® Xeon® 8380 for the Spark K-Means algorithm using the OpenBLAS library
- 4th Generation Intel® Xeon® Scalable processor benefits are the result of:
  - Innovative CPU microarchitecture, providing up to a 37% performance boost
  - Increased parallelism (30% more cores)
Conclusion
Implementing Machine Learning technologies with Big Data can help companies better serve their customers. As shown in the testing above, the new Dell PowerEdge R760 with 4th Generation Intel® Xeon® Scalable processors can significantly reduce processing times, leading to faster decision making.
Related Documents
Launch Flexible Machine Learning Models Quickly with cnvrg.io® on Red Hat OpenShift
Wed, 17 Jan 2024 14:11:31 -0000
Summary
Data scientists bear a high degree of responsibility for supporting the decision-making processes and strategies of their companies. To this end, data scientists extract insights from large amounts of heterogeneous data through a set of iterative tasks: cleaning and formatting the available data, building training and testing datasets, mining data for patterns, deciding on the type of data analysis to apply and the ML methods to use, evaluating and interpreting the results, refining ML algorithms, and possibly even managing infrastructure. To ensure that data scientists can deliver the most impactful insights for their companies efficiently and effectively, cnvrg.io provides a unified platform to operationalize the full machine learning (ML) lifecycle from research to production.
As the leading data-science platform for ML model operationalization (MLOps) and management, cnvrg.io is a pioneer in building cutting-edge ML development solutions that provide data scientists with all the tools they need in one place to streamline their processes. In addition, by deploying MLOps on Red Hat OpenShift, data scientists can launch flexible, container-based jobs and pipelines that can easily scale to deliver better efficiency in terms of compute resource utilization and cost. Infrastructure teams can also manage and monitor ML workloads in a single managed and cloud-native environment. For infrastructure architects who are deploying cnvrg.io on Dell PowerEdge servers and Intel® components, this document provides recommended hardware bill of materials (BoM) configurations to help get them started.
Key considerations
Key considerations for using the recommended hardware BoMs for deploying cnvrg.io on Red Hat OpenShift include:
- Provision external storage. When deploying cnvrg.io on Red Hat OpenShift, local storage is used only for container images and ephemeral volumes. External persistent storage volumes should be provisioned on a storage array or on another solution that you already have in place. If you do not already have a persistent storage solution, contact your Dell Technologies representative for guidance.
- Use high-performance object storage. The hardware BoMs below assume that you use an in-cluster solution based on MinIO for object storage. The number of drives and the capacity for MinIO object storage depends on the dataset size and performance requirements. An alternative object store would be an external S3-compatible object store such as Elastic Cloud Storage (ECS) or Dell PowerScale (Isilon), powered by high-capacity Solidigm SSDs.
- Scale object storage independently. Object storage capacity can be scaled independently of worker nodes by deploying additional storage nodes. Both high-performance configurations (with NVM Express [NVMe] Solidigm solid-state drives [SSDs]) and high-capacity configurations (with rotational hard-disk drives [HDDs]) can be used. All nodes using NVMe drives should be configured with 100 Gbps network interface controllers (NICs) to take full advantage of the drives’ I/O throughput.
Recommended configurations
Controller nodes (3 nodes required) and worker nodes
Table 1. PowerEdge R660-based, up to 10 NVMe drives, 1RU
| Feature | Control-plane (master) nodes | ML/Artificial Intelligence (AI) CPU cluster (worker) nodes: base configuration | ML/AI CPU cluster (worker) nodes: plus configuration |
|---|---|---|---|
| Platform | Dell R660 supporting 10x 2.5” drives with NVMe backplane (direct connection) | Dell R660 supporting 10x 2.5” drives with NVMe backplane (direct connection) | Dell R660 supporting 10x 2.5” drives with NVMe backplane (direct connection) |
| CPU | 2x Xeon® Gold 6426Y (16c @ 2.5GHz) | 2x Xeon® Gold 6448Y (32c @ 2.1GHz) | 2x Xeon® Platinum 8468 (48c @ 2.1GHz) |
| DRAM | 128GB (8x 16GB DDR5-4800) | 256GB (16x 16GB DDR5-4800) | 512GB (16x 32GB DDR5-4800) |
| Boot device | Dell BOSS-N1 with 2x 480GB M.2 NVMe SSD (RAID1) | Dell BOSS-N1 with 2x 480GB M.2 NVMe SSD (RAID1) | Dell BOSS-N1 with 2x 480GB M.2 NVMe SSD (RAID1) |
| Storage[1] | 1x 1.6TB Solidigm[2] D7-P5620 SSD (PCIe Gen4, mixed-use) | 2x 1.6TB Solidigm[2] D7-P5620 SSD (PCIe Gen4, mixed-use) | 2x 1.6TB Solidigm[2] D7-P5620 SSD (PCIe Gen4, mixed-use) |
| Object storage[3] | N/A | 4x (up to 10x) 1.92TB, 3.84TB, or 7.68TB Solidigm D7-P5520 SSD (PCIe Gen4, read-intensive) | 4x (up to 10x) 1.92TB, 3.84TB, or 7.68TB Solidigm D7-P5520 SSD (PCIe Gen4, read-intensive) |
| Shared storage[4] | N/A | External | External |
| NIC[5] | Intel® X710-T4L for OCP3 (quad-port 10Gb) | Intel® X710-T4L for OCP3 (quad-port 10Gb), or Intel® E810-CQDA2 PCIe add-on card (dual-port 100Gb) | Intel® X710-T4L for OCP3 (quad-port 10Gb), or Intel® E810-CQDA2 PCIe add-on card (dual-port 100Gb) |
| Additional NIC for external storage[6] | N/A | Intel® X710-T4L for OCP3 (quad-port 10Gb), or Intel® E810-CQDA2 PCIe add-on card (dual-port 100Gb) | Intel® X710-T4L for OCP3 (quad-port 10Gb), or Intel® E810-CQDA2 PCIe add-on card (dual-port 100Gb) |
Optional – Dedicated storage nodes
Table 2. PowerEdge R660/R760-based, up to 10 NVMe drives or 12 SAS drives, 1RU
| Feature | High performance | High capacity |
|---|---|---|
| Platform | Dell R660 supporting 10x 2.5” drives with NVMe backplane | Dell R760 supporting 12x 3.5” drives with SAS/SATA backplane |
| CPU | 2x Xeon® Gold 6442Y (24c @ 2.6GHz) | 2x Xeon® Gold 6426Y (16c @ 2.5GHz) |
| DRAM | 128GB (8x 16GB DDR5-4800) | 128GB (8x 16GB DDR5-4800) |
| Storage controller | None | HBA355e adapter |
| Boot device | Dell BOSS-N1 with 2x 480GB M.2 NVMe SSD (RAID1) | Dell BOSS-N1 with 2x 480GB M.2 NVMe SSD (RAID1) |
| Object storage[3] | Up to 10x 1.92TB, 3.84TB, or 7.68TB Solidigm D7-P5520 SSD (PCIe Gen4, read-intensive) | Up to 12x 8TB/16TB/22TB 3.5” 12Gbps SAS HDD, 7.2k RPM |
| NIC[5] | Intel® E810-CQDA2 PCIe add-on card (dual-port 100Gb) | Intel® E810-XXV for OCP3 (dual-port 25Gb) |
Learn more
Contact your Dell or Intel account team at 1-877-289-3355 for a customized quote.
[1] Local storage used only for container images and ephemeral volumes; persistent volumes should be provisioned on an external storage system.
[2] Formerly Intel
[3] The number of drives and capacity for MinIO object storage depends on the dataset size and performance requirements.
[4] External shared storage required for Kubernetes persistent volumes.
[5] 100 Gb NICs are recommended for higher throughput.
[6] Optional, required only if a dedicated storage network for external storage system is necessary.
Powering Kafka with Kubernetes and Dell PowerEdge Servers with Intel® Processors
Mon, 29 Jan 2024 23:33:38 -0000
Kafka with Kubernetes
At the top of this webpage are three PDF files outlining test results and reference configurations for Dell PowerEdge servers using both 3rd Generation and 4th Generation Intel® Xeon® processors. All testing was conducted in Dell Labs by Intel and Dell engineers in October and November of 2023.
- “Dell DfD Kafka ICX” – highlights the recommended configurations for Dell PowerEdge servers using 3rd generation Intel® Xeon® processors.
- “Dell DfD Kafka SPR” – highlights the recommended configurations for Dell PowerEdge servers using 4th generation Intel® Xeon® processors.
- “Dell DfD Kafka Kubernetes Test Report” – highlights the results of performance testing on both configurations, with comparisons that demonstrate the performance differences between them.
Solution Overview
The Apache® Software Foundation developed Kafka as an open-source solution providing distributed event-store and stream-processing capabilities. Apache Kafka uses a publish-subscribe model to enable efficient data sharing across multiple applications. Applications can publish messages to a pool of message brokers, which then distribute the data to multiple subscriber applications in real time.
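As a sketch of the publish-subscribe model itself (a toy in-memory broker for illustration, not the Kafka client API):

```python
from collections import defaultdict

class Broker:
    """Toy message broker: producers publish to topics; subscribers get fan-out."""
    def __init__(self):
        self.subscribers = defaultdict(list)   # topic -> list of callbacks

    def subscribe(self, topic, callback):
        self.subscribers[topic].append(callback)

    def publish(self, topic, message):
        # Fan the message out to every application subscribed to the topic.
        for callback in self.subscribers[topic]:
            callback(message)

# Two downstream applications subscribe to the same topic.
broker = Broker()
inventory, billing = [], []
broker.subscribe("orders", inventory.append)
broker.subscribe("orders", billing.append)
broker.publish("orders", {"order_id": 1})   # both subscribers receive the record
```

In real Kafka deployments the brokers are a distributed, persistent cluster and subscribers pull records at their own pace, but the decoupling shown here, where the producer never addresses consumers directly, is the same.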
Kafka is often deployed for mission-critical applications and streaming analytics along with other use cases. These types of workloads require leading-edge performance which places significant demand on hardware.
There are five major APIs in Kafka[i]:
- Producer API – Permits an application to publish streams of records.
- Consumer API – Permits an application to subscribe to topics and process streams of records.
- Connect API – Runs reusable connectors (producers and consumers) that link Kafka topics to existing applications.
- Streams API – Transforms input streams into output streams and produces the results.
- Admin API – Used to manage Kafka topics, brokers, and other Kafka objects.
Kafka with Dell PowerEdge and Intel processor benefits
The introduction of new server technologies allows customers to deploy solutions using newly introduced functionality, but it also provides an opportunity to review their current infrastructure and determine whether the new technology might increase performance and efficiency. Dell and Intel recently tested Kafka performance in a Kubernetes environment, measuring two different compression engines on the new Dell PowerEdge R760 with 4th generation Intel® Xeon® Scalable processors and comparing the results to the same solution running on the previous-generation R750 with 3rd generation Intel® Xeon® Scalable processors, to determine whether customers could benefit from a transition.
Some of the key changes incorporated into 4th generation Intel® Xeon® Scalable processors include:
- Intel® QuickAssist Technology (QAT) to accelerate data compression and encryption
- Support for 4800 MT/s DDR5 memory
Raw performance: As noted in the report, our tests showed a 72% decrease in producer latency with gzip compression and a 62% decrease with zstd compression.
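The latency differences stem largely from the CPU cost of compressing each message batch on the producer side. As a rough standard-library illustration of that compression step (gzip only, since zstd is not in the Python standard library; this is not the harness used in the report):

```python
import gzip
import json

# A Kafka-like batch of messages; repetitive payloads compress well.
batch = json.dumps(
    [{"event": "click", "user": i % 100} for i in range(1000)]
).encode()

compressed = gzip.compress(batch, compresslevel=6)   # producer-side compression
restored = gzip.decompress(compressed)               # broker/consumer side

assert restored == batch
print(f"{len(batch)} bytes -> {len(compressed)} bytes")
```

Every batch a producer sends pays this compress step before going on the wire, so any hardware that speeds up compression, such as faster cores or offload via QAT, shows up directly as lower producer latency.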
Conclusion
Choosing the right combination of Server and Processor can increase performance and reduce time, allowing customers to react faster and process more data. As this testing demonstrated, the Dell PowerEdge R760 with 4th Generation Intel® Xeon® CPUs significantly outperformed the previous generation.
- The Dell PowerEdge R760 with 4th Generation Intel® Xeon® Scalable processors delivered:
  - 62% faster processing using zstd compression
  - 72% faster processing using gzip compression
- 4th Generation Intel® Xeon® Scalable processor benefits are the result of:
  - Innovative CPU microarchitecture providing a performance boost
  - Introduction of DDR5 memory support
[i] https://en.wikipedia.org/wiki/Apache_Kafka