PowerScale as the Hadoop tiered storage for Amazon EMR
The main objective of this solution is to install an Amazon EMR big data cluster and connect the EMR cluster services to a PowerScale HDFS cluster, which supports cross-namespace analytics. The solution can use both direct-attached storage (DAS) and remote PowerScale HDFS storage, and it can run analytics jobs and toolsets across data that spans these storage tiers.
Before you launch an Amazon EMR cluster, ensure you complete the tasks in Setting Up Amazon EMR.
To set up the Amazon EMR cluster, see Getting Started with Amazon EMR. All the steps remain the same, except “Prepare Storage for Cluster Input and Output.” Instead of using an Amazon S3 bucket, you can use the PowerScale HDFS cluster to store scripts, input datasets, and output datasets. To do so, point to the PowerScale HDFS cluster through the HDFS protocol with a URI such as hdfs://powerscale_fqdn:8020/.
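As a minimal sketch of what pointing at PowerScale can look like, the following Python snippet builds hdfs:// URIs and places them in a step definition shaped like the ones the EMR AddJobFlowSteps API accepts. The host name powerscale_fqdn and port 8020 come from the example URI above. The directory layout under /emr, the step name, and the helper functions are hypothetical and only illustrate the idea; adjust them to your environment.

```python
# Placeholder values: powerscale_fqdn and 8020 come from the example URI in the text.
POWERSCALE_FQDN = "powerscale_fqdn"
HDFS_PORT = 8020


def powerscale_uri(path, host=POWERSCALE_FQDN, port=HDFS_PORT):
    """Return an hdfs:// URI for an absolute path on the PowerScale HDFS cluster."""
    if not path.startswith("/"):
        path = "/" + path
    return f"hdfs://{host}:{port}{path}"


def spark_step(name, script, input_path, output_path):
    """Build an EMR step dict that runs a Spark script stored on PowerScale.

    The script, input, and output locations (hypothetical paths) all resolve
    to the PowerScale HDFS cluster instead of an Amazon S3 bucket.
    """
    return {
        "Name": name,
        "ActionOnFailure": "CONTINUE",
        "HadoopJarStep": {
            "Jar": "command-runner.jar",
            "Args": [
                "spark-submit",
                "--deploy-mode", "cluster",
                powerscale_uri(script),
                powerscale_uri(input_path),
                powerscale_uri(output_path),
            ],
        },
    }


# Hypothetical layout under /emr on the PowerScale cluster.
step = spark_step(
    "Sample Spark job",
    "/emr/scripts/job.py",
    "/emr/input/data.csv",
    "/emr/output",
)
for arg in step["HadoopJarStep"]["Args"]:
    print(arg)
```

A dictionary like this could be supplied wherever the Getting Started tutorial submits a step with S3 locations, for example in the Steps list of a boto3 add_job_flow_steps call. Whether the EMR nodes can resolve and reach the PowerScale host name depends on your network and DNS setup, which this sketch does not cover.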