Home > Storage > PowerScale (Isilon) > Industry Solutions and Verticals > Analytics > Multi-Cloud Data Services for Dell PowerScale in AWS: Amazon EMR for Data Analytics Solutions > Overview
This document describes a unified data analytics platform for massive-scale data engineering and collaborative data analytics workloads. This solution is built on Multi-Cloud Data Services for Dell PowerScale for Amazon Web Services (AWS) powered by Faction, a multi-cloud data services provider.
Amazon EMR on Multi-Cloud Data Services for PowerScale enables customers to easily deploy an EMR Hadoop cluster on fully managed PowerScale scale-out file storage-as-a-service. This end-to-end solution delivers a data platform for customers to realize their business objectives with agility, flexibility, and at higher cost efficiencies.
Hadoop tiered storage enables customer to easily expand storage of existing Amazon EMR Hadoop clusters with PowerScale. This capability enables you to supply immediate capacity, provide better storage efficiency, and reduce the total cost of ownership.
There are two components to this unified, integrated, data-analytics solution. The first component is Multi-Cloud Data Services for AWS services, which is built on a scale-out NAS platform powered by PowerScale OneFS. The second component is AWS EMR, the industry-leading cloud big data platform for processing vast amounts of data using open-source-tools as a compute platform for operational flexibility. This solution offers a high-bandwidth (up to 80 Gbps) and low-latency connection (as low as 1.2 milliseconds) from PowerScale storage to the AWS cloud using AWS Direct Connect Local. Faction powers this integrated solution, which provides a fully managed cloud-data-services platform. This platform also includes its patented low-latency, high-throughput network connectivity that can deliver ultra-high performance from PowerScale systems that are hosted next to the AWS cloud.