Home > APEX > Compute & HCI > White Papers > Optimize Machine Learning Through MLOps with Dell APEX and cnvrg.io > Implementing MLOps
MLOps is a defined process and life cycle for ML data, models, and coding. The MLOps life cycle begins with data extraction and preparation as the dataset is massaged into a structure that can effectively feed the model. Then, experimentation on the data occurs as data scientists conduct short experiments to initially gauge the usefulness of the outcomes. Model training follows as the algorithm is given data to process and “learn.” Next, the outputs are evaluated and validated against the business objectives and expected outcomes. When the models are adjusted sufficiently, they can be deployed to start processing the data. Constant monitoring occurs along the way to ensure that the process is running smoothly. Automatic retraining can be implemented to help adjust the deployed process and tighten up the results with each iteration. (See the following figure.)
The primary goals of MLOps are:
Note: Source: ml-ops.org.
A key difference between ML and traditional IT workloads is that ML tends to be more hardware- and platform-driven. ML, especially in DL use cases, has a wider range of underlying hardware types that can include more discrete compute elements. These compute elements include:
Because of this difference, MLOps must comprehend both the classic data and compute models, and more of the many compute elements that can underly each stage of the ML pipeline.
Another difference is that MLOps must go beyond the infrastructure and application focus of DevOps. A proper MLOps structure needs to address all three domains of machine learning: data, ML models, and code.