GenAI workloads are unpredictable and must be high-performance and congestion-free. To achieve this, monitoring of the environment is critical and must be part of the solution.
With Augtera network AI, observability spans end-to-end from the GPU to the network fabric. Augtera is a light footprint application deployed within the environment, and it brings visibility into the environment and leverages AI to:
- Prevent and lower incidents and fabric degradation
- Maximize utilization while minimizing congestion
- Automate corrective actions through Ansible playbooks
In addition to providing visibility, Augtera's network AI also provides root cause, remediation, and impact analysis as a comprehensive offering.
Figure 20 shows the type of observability Augtera’s network AI can provide to an operator while implementing a GenAI workload achieving optimum performance.
The integration of Dell Enterprise SONiC with Augtera AI adds an intelligence layer as part of a solution across the different elements that make up the Ethernet powered infrastructure portfolio for GenAI workloads by Dell Technologies.