The model training runtime improvement for each data science pipeline used in this test with Granulate (sAgent enabled) compared with baseline (sAgents deactivated) is shown in Figure 3. The results show that an improvement of at least 10% was always obtained. The average runtime improvement across all data science pipelines is 23%, while the maximum improvement was 37% for the LogR data science pipeline.
All the improvements were obtained by enabling the sAgents in the systems. No application code changes were made, and no fine-tuning of the platform or operating system settings was required.
Figure 3. Training time improvement with Granulate compared to baseline
A separate test was run to optimize cluster CPU utilization metrics during model training followed by serving of the various data science pipelines. Granulate’s sAgent reduced CPU utilization up to 40% for the Sales forecasting pipeline. The lowest improvement on CPU utilization was 11% for Customer segmentation. See Figure 4 for the various data science pipelines that were optimized using Granulate.
In a production environment, this optimization would result in:
Figure 4. CPU Utilization reduction for different data science pipelines
Table 2. System under test characteristics
| Controller node | Compute node |
Server | One PowerEdge R650 | 10 PowerEdge R650 |
Processors | Two Intel Xeon Gold 6348 @2.60GHz | Two Intel Xeon Gold 6348 @2.60GHz |
Memory | 256 GiB | 512 GiB |
Storage controller | One PERC H755 RAID controller | None |
Storage device | Two 480 GB BOSS; Four 3.84 TB SAS SSD | Two 480 GB BOSS; Four 3.2 TB Ent P5600 MU NVMe |
Network controller | One Intel Ethernet 25G 2-port E810-XXV OCP NIC | One Intel Ethernet 25G 2-port E810-XXV OCP NIC |
Connectivity | One Dell PowerSwitch S5212-ON 100/25 (cluster interconnect) | |
Operating system | Red Hat Enterprise Linux 8.4 | |
Framework | Cloudera CDP Private Cloud Base Edition Version 7.1.7 |