Home > Workload Solutions > Data Analytics > White Papers > GenAI on Dell APEX File Storage for AWS using Databricks, Hugging Face, and MosaicML > Step 2: Prepare Data
Fine tuning the dataset ensures preparation of data to suit the requirements of the model. This is out of the scope of this validation, and pretuned data from the raw dataset is already placed in the storage.
The dataset is placed in Dell APEX File Storage; the data access pattern is S3A.
The access setup and exposing endpoint of the storage clusters to Compute cluster is provided by AWS network setup. Make sure necessary user authorization and authentication are handled appropriately.