Home > Data Protection > Data Protection (general) > Dell Hadoop Application Agent: Hadoop Protection > Performing Hadoop HDFS backup
The Dell Hadoop app agent supports full and subsequent (synthetic full) level backups for HDFS filesystem backups. The following figure shows the backup workflow for an HDFS full level backup.
The workflow for subsequent (synthetic full backup) is similar to the full backup with the addition of the Snap diff operation. In this step, the previous snapshot is compared with the new snapshot and only the changes are backed up.
Some points to note:
Before running the backup, use the following command to enable HDFS snapshots on the directories to be backup up:
#hdfs dfsadmin -allowSnapshot <source>.
Run the configuration checklist script dlp-config-check.sh to perform the mandatory prerequisite checks and ascertain any errors that might occur. Run the script to perform the backups successfully.
Run the dlp-config-check.sh script from the bin subdirectory of the installation directory.
./dlp-config-check.sh -z ../config/dlpm-env.cfg
The dlp-config-check.sh script performs seven checks and displays for each check the message PASSED or FAILED.
From the bin subdirectory, run the following command:
./dlp-admin.sh -b -z ../config/dlpm-env.cfg
The command for full and subsequent backup is the same as the Hadoop app agent and will automatically determine whether there is an existing full backup in the PowerProtect DD target directory and perform a subsequent (synthetic full) level backup. It follows an incremental forever strategy.
From the bin subdirectory, run the following command:
./dlp-admin.sh -b -z ../config/dlpm-env.cfg