Home > Storage > PowerScale (Isilon) > Industry Solutions and Verticals > Analytics > Multi-Cloud Data Services for Dell PowerScale in AWS: Amazon EMR for Data Analytics Solutions > Introduction
Note: For simplicity, this document uses the PowerScale (CCV) IP address provided by Faction as the data endpoint. The Amazon EMR cluster services can connect to this endpoint using the HDFS protocol and read or write data into PowerScale. If the PowerScale system is configured with a DNS and SmartConnect, you can set a Fully Qualified Domain Name (FQDN) to the HDFS access zone. You can also use the FQDN instead of the IP address to read and write data to and from the PowerScale system.
This section demonstrates using the unified data analytics platform for massive-scale data engineering and collaborative analytics solutions using Amazon EMR YARN, HIVE, and Spark services.
The raw data is fetched from the PowerScale system in the on-premises data center. This data is actively replicated into the PowerScale system in the Faction cloud data center. The same data is also made available to the Amazon EMR cluster on AWS public cloud for in-place data analytics.