Machine Learning-based System Development Process Model
(Redirected from ML Workflow)
Jump to navigation
Jump to search
A Machine Learning-based System Development Process Model is a software development process model to develop machine learning-based systems.
- Context:
- It can (typically) be referenced by a Machine Learning Development Team.
- It can (typically) include ML Systems Analysis, ML Software Design, ML Software Development, ML Software Testing, ML Software Documentation, SML oftware Evaluation, an ML Software Maintenance.
- It can be associated to a Machine Learning Development Methodology.
- It can be associated to a Data Science Workflow.
- It can be supported by a Machine Learning Workflow System, such as Spark MLlib Pipeline, TensorFlow ML Workflow, Uber's Michelangelo, and Kubeflow.
- It can (often) produce an ML Model Training System that creates trained ML models.
- It can (often) contain Model Data Preparation Code.
- …
- Example(s)
- Counter-Example(s):
- See: Business Process Model, ML Software Design, ML Software Development, ML Software Maintenance, Production Data Store, ML Pipeline.
References
2019b
- https://www.kubeflow.org/
- QUOTE: ... The Kubeflow project is dedicated to making deployments of machine learning (ML) workflows on Kubernetes simple, portable and scalable.
2017
- https://developers.google.com/machine-learning/guides/text-classification/
- QUOTE: Here’s a high-level overview of the workflow used to solve machine learning problems:
- QUOTE: Here’s a high-level overview of the workflow used to solve machine learning problems:
2016
- http://databricks.com/blog/2016/05/31/apache-spark-2-0-preview-machine-learning-model-persistence.html
- QUOTE: So far, we have only looked at saving and loading a single ML model. In practice, ML workflows consist of many stages, from feature extraction and transformation to model fitting and tuning. MLlib provides Pipelines to help users construct these workflows. (See this notebook for a tutorial on ML Pipelines analyzing a bike sharing dataset.)
MLlib allows users to save and load entire Pipelines. Let’s look at how this is done on an example Pipeline with these steps:
- Feature extraction: Binarizer to convert images to black and white
- Model fitting: Random Forest Classifier to take images and predict digits 0–9
- Tuning: Cross-Validation to tune the depth of the trees in the forest
- QUOTE: So far, we have only looked at saving and loading a single ML model. In practice, ML workflows consist of many stages, from feature extraction and transformation to model fitting and tuning. MLlib provides Pipelines to help users construct these workflows. (See this notebook for a tutorial on ML Pipelines analyzing a bike sharing dataset.)