Home Integrated Products Microsoft HCI Solutions from Dell Technologies Blogs

Evaluating Performance Capabilities of Dell EMC Solutions for Azure Stack HCI

Wed, 16 Jun 2021 13:35:49 -0000

Read Time: 0 minutes

Sam Lucido

Just the facts:

A Dell EMC Storage Spaces Direct four-node cluster was tested with VM Fleet in a 100 random-read workload and achieved 2,953,095 IOPS with an average read latency of 242 microseconds.
A Dell EMC Storage Spaces Direct four-node cluster was tested with VM Fleet in a 100 percent random-write workload and achieved 818,982 IOPS at an average write latency of 4 milliseconds.
A Dell EMC Storage Space Direct four-node cluster was tested with VM Fleet in a 100 percent sequential-read workload and achieved 63 GB/s and with a 100 percent sequential-write workload 9 GB/s

The details:

User experience is everything. In today’s world, fast and intuitive applications are a necessity, and anything less might be labeled slow and not very useful. Once an application is labeled slow, it’s hard to change that impression with end users. Thus, architecting a system for performance is a key consideration in ensuring a good application experience.

In this blog, we explore a Dell EMC Storage Spaces Direct solution that delivered amazing performance in our internal tests. Storage Spaces Direct is part of Azure Stack HCI and enables customers to use industry-standard servers with locally attached drives to create high-performance and high-availability storage. Azure Stack HCI enables the IT organization to run virtual machines with cloud services on-premises. Benefits include:

The capability to consolidate data center applications with software-defined compute, storage, and networking.
Using virtual machines to drive greater operational efficiencies while accelerating performance with Storage Spaces Direct. Support for Non-Volatile Memory Express (NVMe) drives enables software-defined storage to reach new levels of performance.
Improved high availability with clustering and distributed software resiliency.

Database and other storage-intensive applications could benefit from the faster NVMe drives. NVMe is an open logical device specification that has been designed for low latency and internal parallelism of solid-state storage devices. The result is a significant boost in storage performance because data can be accessed faster and with less I/O overhead.

In our labs, we created a Storage Spaces Direct performance cluster consisting of four Dell EMC PowerEdge R640 nodes. Each storage node had two Intel 6248 Cascade Lake processors, ten P4510 Intel NVMe drives, and one Mellanox CX5 dual-port 100 GbE adapter. Networking between the nodes consisted of a Dell EMC S5232 switch that supports up to thirty-two 100 GbE ports. Our goal was to drive simplicity in the configuration while showing performance value.

We used Storage Spaces Direct three-way mirroring because this configuration offers the greatest performance and protection. Protection does have a cost in terms of capacity. The capacity efficiency of a three-way mirror is 33 percent, meaning 3 TB equates to 1 TB of usable storage space. The data protection benefit with three-way mirroring is that the storage cluster can safely tolerate at least two hardware problems—for example, the loss of a drive and server at the same time. The following diagram is a simple representation of the four-node performance configuration of the Storage Spaces Direct cluster.

Figure 1: Storage Spaces Direct Cluster with four PowerEdge R640 nodes

We ran VM Fleet on the storage cluster to test performance, and the results were impressive! Here is the first test configuration:

Block size: 4 KB
Thread count: 2
Outstanding I/O counts: 32
Write ratio: 0
Pattern: Random

Thus, this VM Fleet test used 4 KB block sizes, 100 percent reads, and a random-access pattern. This Storage Spaces Direct configuration achieved 2,953,095 IOPS with an average read latency of 242 microseconds. A microsecond is equal to one-millionth of a second. This is the kind of performance that can really accelerate online transaction processing (OLTP) workloads and make enterprise applications highly responsive to the end users.

We also tested a 100 percent random-write workload on the storage cluster. All the VM Fleet configuration settings remained the same, except the write ratio was 100. With 100 percent writes, the storage cluster achieved 818,982 IOPS at an average write latency of 4 milliseconds. We could have been less aggressive in our internal tests and delivered even lower write latency, but the goal was to push the storage cluster in terms of performance. Both these tests were done internally in our Dell EMC labs, and it’s important to note that results will vary.

Figure 2: Summary of internal test findings for 100 percent read and write workloads for IOPS and latency

Some applications, such as business intelligence and decision support systems, and some analytical workloads are more dependent on throughput. Throughput is defined by the amount of data that is delivered over a fixed period. The greater the throughput the more data that can be read and the faster the analysis or report. Our labs used the following VM Fleet configuration to test throughput:

Block size: 512 KB
Thread count: 2
Outstanding I/O counts: 2
Write ratio: 0
Pattern: Sequential

The throughput test configuration uses larger blocks at 512 KB, 100 percent reads, and a sequential read pattern that is like scanning large datasets. The storage cluster sustained 63 gigabytes per second (GB/s). This throughput could enable faster analytics for the business and provide the capability to make timely decisions.

We also ran the same test with 100 percent writes, which simulates a data load activity such as streaming data from an IoT gateway to an internal database. In this test case, the storage cluster sustained a throughput of 9 GB/s for writes. Both the read and write throughput tests show the strength of this all-NVMe configuration from Dell EMC.

Figure 3: Summary of internal test findings for 100 percent read and write workloads for throughput

If performance is what you need, then Dell EMC can use NVMe technology to accelerate your applications. But flexibility is another factor that can be equally important. Not every application requires high IOPS and very low latencies. Dell EMC offers an expanded portfolio of Storage Spaces Direct nodes that can meet most any business requirements. A great resource for reviewing the Dell EMC Storage Spaces Direct options is the Azure Stack HCI certification pages. The following table summarizes all the Dell EMC options but doesn’t contain CPU, RAM, and other details that can be found on the certification pages.

	PowerEdge R440	PowerEdge R640	PowerEdge R740xd	PowerEdge R740xd2
Intel Optane SSD Cache + SDD		✔
All-NVMe		✔
SDD	✔	✔	✔
NVMe + HDD		✔	✔
NVMe (AIC) + HDD			✔
SDD + HDD	✔	✔	✔	✔

Start with a minimal configuration using the R440 Ready Nodes, which can have up to 44 cores, 1 TB of RAM, and 19.2 TB of storage. Or go big with the R740xd2 hybrid with up to 44 cores, 384 GB of RAM, and 240 TB of storage capacity. The range of options provides you with the flexibility to configure a Storage Spaces Direct solution to meet your business needs.

The Dell EMC Ready Nodes have been configured to work with Windows 2019, so they are future-ready. For example, the Ready Nodes integrate with Windows Admin Center, so you can tier storage, implement resiliency, provision VMs and storage, configure networking, and monitor health and performance, all with just a few clicks. With your Windows Server 2019 Datacenter licenses, no separate hypervisor license is needed for VMs. You can create unlimited VMs, achieve high-availability clusters, and secure your tenants or applications with shielded VMs.

Dell EMC Storage Spaces Direct nodes have been designed to make storage in your Azure Stack HCI easy. If you are interested in learning more, see Dell EMC Cloud for Microsoft Azure Stack HCI and contact a Dell EMC expert.

Tags:

Cluster Resources	Without Intel Optane PM	With Intel Optane PM
Number of nodes	4	4
CPU	2 x Intel 6248 CPU @ 2.50 GHz (3.90 GHz with TurboBoost)	2 x Intel 6248 CPU @ 2.50 GHz (3.90 GHz with TurboBoost)
Memory	384 GB RAM	384 GB RAM
Disks	10 x 2.5 in. 1.92 TB Intel S4510 RI SATA SSD	10 x 2.5 in. 1.92 TB Intel S4510 RI SATA SSD
NICs	Mellanox ConnectX-5 EX Dual Port 100 GbE	Mellanox ConnectX-5 EX Dual Port 100 GbE
Persistent memory	None	12 x 128 GB Intel Optane PM per node

	R440 S2D RN	R640 S2D RN	R740xd S2D RN	R740xd2 S2D RN
Best For	Edge/ROBO and space (depth) constrained locations	Density optimized node for applications needing balance of high-performance storage and compute	Capacity and performance optimized node for applications needing balance of compute and storage	Capacity optimized node for data intensive applications and use cases such as backup and archive
Storage Configurations	Hybrid & All-Flash	Hybrid, All-Flash, All-NVMe including Intel Optane DC Persistent Memory	Hybrid, All-Flash, and All-NVMe	Hybrid with SSDs and 3.5” HDDs

Your Browser is Out of Date

Evaluating Performance Capabilities of Dell EMC Solutions for Azure Stack HCI

Related Blog Posts

Boost Performance on Dell EMC HCI Solutions for Microsoft Server using Intel Optane Persistent Memory

Dell EMC Solutions for Azure Stack HCI Furthers Customer Value