Machine Learning (ML) Pipeline

Context:
- It can (typically) include ML data preparation jobs, ML training jobs, ML inference jobs, and/or online ML model evaluation jobs.
- It can (typically) be the result of Machine Learning Development Task.
- It can (often) follow a Machine Learning-based System Development Process Model.
- It can (often) include ML Pipeline Monitoring and ML Pipeline Alerting.
- It can range from being a Batch-based ML Pipeline to being a Real-Time ML Pipeline.
- It can range from being a Non-Scalable ML Pipeline to being a Scalable ML Pipeline.
- It can be produced within an ML Platform.
- …
Example(s):
- the one used at T-Mobile to Predict Customer Churn.
- the one used at PlayStation to Predict Personalized Game-Play Relevance.
- …
Counter-Example(s):
- a Data Warehouse ETL Pipeline.
See: ETL Pipeline, ML Development Process Model, ML Feature Creation System, Training Data Creation, ML Model Training, ML Model Deployment.

References

Semi Koen. (2019c). “Architecting a Machine Learning Pipeline." In: Medium
- QUOTE: Architecting a ML Pipeline: Traditionally, pipelines involve overnight batch processing, i.e. collecting data, sending it through an enterprise message bus and processing it to provide pre-calculated results and guidance for next day’s operations. Whilst this works in some industries, it is really insufficient in others, and especially when it comes to ML applications.
  The following diagram shows a ML pipeline applied to a real-time business problem where features and predictions are time sensitive (e.g. Netflix’s recommendation engines, Uber’s arrival time estimation, LinkedIn’s connections suggestions, Airbnb’s search engines etc).

"Building a Reproducible Machine Learning Pipeline." In: arXiv
- QUOTE: ... Many open-source tools exist for the individual tasks in a machine learning pipeline. For example, Git or Subversion for version control of software code, Scikit-Learn or MLlib for building models, and Docker for containerization. ...

https://conferences.oreilly.com/strata/strata-eu-2017/public/schedule/detail/60680
- QUOTE: ... explore use cases from the BMW Group where novel machine-learning pipelines (such as those based on XGBoost and convolutional neural nets, for example) support a broad variety of business stakeholders. ...