Predictive Maintenance for IT Operations architecture
Dell servers include Integrated Dell Remote Access Controller (iDRAC) modules, which transmit telemetry to external predictive analytics agents:
- The telemetry enables the agents to assess the likelihood of current and future failures.
- The agents act to mitigate failures before they surface.
This type of maintenance aims to lengthen Mean Time to Failure (MTTF) and Mean Time Between Failures (MTBF) and to shorten Mean Time to Recovery (MTTR).
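To make the relationship between these metrics concrete, the following minimal sketch computes MTBF and steady-state availability from MTTF and MTTR. It assumes one common convention for repairable systems, MTBF = MTTF + MTTR, and availability = MTTF / (MTTF + MTTR). The function names and the hour figures are hypothetical and chosen only for illustration.

```python
def mtbf(mttf_hours: float, mttr_hours: float) -> float:
    """MTBF under the assumed convention MTBF = MTTF + MTTR."""
    return mttf_hours + mttr_hours


def availability(mttf_hours: float, mttr_hours: float) -> float:
    """Steady-state availability: the fraction of time the system is up."""
    return mttf_hours / (mttf_hours + mttr_hours)


# Hypothetical figures, not measurements from any real system.
baseline = availability(mttf_hours=1000.0, mttr_hours=10.0)
improved = availability(mttf_hours=1500.0, mttr_hours=5.0)

print(f"baseline availability: {baseline:.4%}")
print(f"improved availability: {improved:.4%}")
```

Under this convention, a longer MTTF and a shorter MTTR each raise availability, which is why the three metrics are usually tracked together.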
The iDRAC telemetry specification is enriched as new failure models emerge:
- Relevant data becomes available for external agents to consume.
- The agents analyze the data for correlated anomalies and make forecasting decisions that predict individual or collective IT infrastructure failure.
- The agents use classified fault predictions to triage actions that keep the infrastructure safe, secure, reliable, highly available, and error-free.
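The analyze-then-triage flow described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the method used by any Dell agent: it uses a simple per-metric z-score as a stand-in for anomaly detection, does not model correlation across metrics, and every name in it (Prediction, analyze, triage, the metric names, the thresholds, and the action labels) is hypothetical.

```python
from dataclasses import dataclass
from statistics import mean, stdev


@dataclass
class Prediction:
    component: str
    metric: str
    z_score: float
    action: str


def z_score(history: list[float], latest: float) -> float:
    """Number of standard deviations `latest` lies from the history mean."""
    if len(history) < 2:
        return 0.0
    sigma = stdev(history)
    if sigma == 0:
        return 0.0
    return (latest - mean(history)) / sigma


def triage(z: float) -> str:
    """Map anomaly magnitude to a hypothetical action class."""
    magnitude = abs(z)
    if magnitude >= 6.0:
        return "migrate-workload"
    if magnitude >= 3.0:
        return "open-ticket"
    return "none"


def analyze(component: str, readings: dict[str, list[float]]) -> list[Prediction]:
    """Flag metrics whose newest sample is anomalous against its own history."""
    predictions = []
    for metric, samples in readings.items():
        if len(samples) < 3:
            continue  # not enough data to estimate a spread
        *history, latest = samples
        z = z_score(history, latest)
        action = triage(z)
        if action != "none":
            predictions.append(Prediction(component, metric, z, action))
    return predictions


# Hypothetical telemetry samples for one server.
readings = {
    "fan_rpm": [5000, 5010, 4990, 5005, 4995, 5000],
    "inlet_temp_c": [22.0, 22.1, 21.9, 22.0, 22.1, 30.0],
}
for p in analyze("server-01", readings):
    print(p.component, p.metric, p.action)
```

A production agent would likely replace the z-score with a trained failure model and weigh several metrics together, but the shape stays the same: consume telemetry, score it, classify the result, and choose an action.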
In this control system, the IT infrastructure components form the substrate, which continuously sends data. The agents analyze that data to forecast and mitigate substrate failures.
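As a rough illustration of this feedback loop, the sketch below simulates a substrate that emits a drifting temperature reading and an agent that extrapolates the trend and acts before a limit is reached. Everything here is an assumption made for the example: the Substrate and Agent classes, the linear forecast, the 60 degree limit, and the fixed cooling effect of a mitigation are all hypothetical.

```python
from collections import deque


class Substrate:
    """Hypothetical component whose temperature drifts upward until cooled."""

    def __init__(self, temp_c: float = 40.0, drift_c: float = 1.5):
        self.temp_c = temp_c
        self.drift_c = drift_c
        self.mitigations = 0

    def emit(self) -> float:
        """Advance one time step and return a telemetry sample."""
        self.temp_c += self.drift_c
        return self.temp_c

    def mitigate(self) -> None:
        """Stand-in for a real action such as raising fan speed."""
        self.temp_c -= 10.0
        self.mitigations += 1


class Agent:
    """Forecasts a few steps ahead from a short window of samples."""

    def __init__(self, limit_c: float = 60.0, horizon: int = 3, window: int = 4):
        self.limit_c = limit_c
        self.horizon = horizon
        self.samples = deque(maxlen=window)

    def observe(self, sample: float) -> None:
        self.samples.append(sample)

    def forecast(self) -> float:
        """Linear extrapolation; call only after at least one observe()."""
        if len(self.samples) < 2:
            return self.samples[-1]
        slope = (self.samples[-1] - self.samples[0]) / (len(self.samples) - 1)
        return self.samples[-1] + slope * self.horizon

    def failure_predicted(self) -> bool:
        return self.forecast() >= self.limit_c


def run(steps: int = 40) -> tuple[float, int]:
    """Run the loop and return (peak observed temperature, mitigation count)."""
    substrate, agent = Substrate(), Agent()
    peak = float("-inf")
    for _ in range(steps):
        sample = substrate.emit()          # substrate sends data
        peak = max(peak, sample)
        agent.observe(sample)              # agent works on the data
        if agent.failure_predicted():      # forecast crosses the limit
            substrate.mitigate()           # act before the failure surfaces
            agent.samples.clear()          # old trend no longer applies
    return peak, substrate.mitigations


peak, mitigations = run()
print(f"peak temperature: {peak:.1f} C, mitigations: {mitigations}")
```

The point of the sketch is the ordering: the agent acts on the forecast rather than on the current reading, so in this simulation the observed temperature stays below the limit.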