Tuning OneFS for HDFS operations
The default OneFS TCP stack requires tuning for Hadoop and 40 GbE connectivity. Apply the tuning from the CLI, directly on the PowerScale cluster. A tcptune.sh script for this purpose is available on GitHub.
To make the changes on the PowerScale OneFS cluster, run:
sh ./tcptune.sh Max
On a PowerScale cluster, the default HDFS block size is 128 MB, which optimizes performance for most use cases. Aligning HDFS client block size with OneFS HDFS block size lets PowerScale nodes read and write in large blocks. This can decrease drive-seek operations and increase performance for MapReduce jobs.
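As a minimal sketch of aligning the client side, the following snippet converts 128 MB to bytes and builds a per-command block size option for a Hadoop client. It assumes the standard HDFS client property dfs.blocksize. The helper name block_size_bytes, the local file, and the HDFS path are hypothetical placeholders. The command is printed rather than executed so that it can be reviewed first.

```shell
#!/bin/sh
# Convert a block size given in MB to bytes.
# (Hypothetical helper, for illustration only.)
block_size_bytes() {
    echo $(( $1 * 1024 * 1024 ))
}

# 128 MB matches the default OneFS HDFS block size.
BLOCK_BYTES=$(block_size_bytes 128)

# Print (do not run) a client command that sets the block size for one upload.
# ./localfile and /hypothetical/path are placeholders, not real paths.
echo "hdfs dfs -D dfs.blocksize=${BLOCK_BYTES} -put ./localfile /hypothetical/path"
```

To apply the value to all jobs instead of a single command, the same byte value can be set for dfs.blocksize in the client's hdfs-site.xml.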
Run the isi statistics command to obtain statistics for client connections, the file system, and protocols. For HDFS protocol statistics, run isi statistics pstat --protocol=hdfs.
By analyzing the columns titled NetIn and NetOut, you can determine whether HDFS connections are predominantly reading or writing data. Looking at the distribution of input and output across the entire cluster shows whether Hadoop is using all the nodes for a MapReduce job.
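The comparison described above can be sketched as a small helper. It assumes that NetIn is traffic entering the cluster (clients writing) and NetOut is traffic leaving it (clients reading), and that both values are integers in the same unit, copied by hand from the command output. The function name classify_hdfs_io and the sample numbers are hypothetical and are not taken from a real cluster.

```shell
#!/bin/sh
# classify_hdfs_io NET_IN NET_OUT
# Hypothetical helper: compares two integer throughput values (same unit)
# and reports which direction dominates.
# Assumption: NetIn = data written to the cluster, NetOut = data read from it.
classify_hdfs_io() {
    net_in=$1
    net_out=$2
    if [ "$net_in" -gt "$net_out" ]; then
        echo "write-heavy"
    elif [ "$net_out" -gt "$net_in" ]; then
        echo "read-heavy"
    else
        echo "balanced"
    fi
}

# Made-up sample values, for illustration only.
classify_hdfs_io 900 120
classify_hdfs_io 80 640
```

Running the same check against per-node values gives a rough view of whether the load is spread across all nodes or concentrated on a few.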