Machine Learning (ML) Model Deployment System
A Machine Learning (ML) Model Deployment System is a software deployment system that can solve an ML model deployment task (i.e., converting a trained ML model into a deployed ML model).
- Context:
- It can range from being an Online ML Model Deployment System to being an Offline ML Model Deployment System.
- It can be based on an ML Model Deployment Platform, such as MLflow or Seldon.
- …
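The core task above can be illustrated with a minimal, hedged sketch (all names here are hypothetical, not from any specific deployment platform): a trained model is serialized into an artifact, then loaded in a separate serving environment where it answers predictions.

```python
import pickle

# Hypothetical "trained model": a tiny linear classifier with fixed weights.
class TrainedModel:
    def __init__(self, weights, bias):
        self.weights = weights
        self.bias = bias

    def predict(self, features):
        score = sum(w * x for w, x in zip(self.weights, features)) + self.bias
        return 1 if score > 0 else 0

# "Deployment": persist the trained artifact, then load it in a serving process.
model = TrainedModel(weights=[0.5, -0.25], bias=0.1)
artifact = pickle.dumps(model)      # export step (development side)
deployed = pickle.loads(artifact)   # load step (serving side)

print(deployed.predict([1.0, 2.0]))  # the deployed model answers a prediction
```

Real platforms such as MLflow or Seldon replace the ad-hoc pickle step with versioned model artifacts and a managed serving endpoint, but the export-then-serve shape is the same.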
- Example(s):
- a Seldon System.
- a Kubeflow System.
- an MLCapsule System?
- …
- Counter-Example(s):
- See: Machine Learning System, Software Deployment, Software Development System, Application Programming Interface.
References
2018a
- (Lai & Suda, 2018) ⇒ Liangzhen Lai, and Naveen Suda. (2018). “Rethinking Machine Learning Development and Deployment for Edge Devices.” arXiv:1806.07846
- QUOTE: Current ML development and deployment approach is shown in Figure 1. Application developers use the ML frameworks to construct their NN models, perform training and evaluate the accuracy. Based on the evaluation results, they can go back and refine or optimize their models. After this development stage, a trained model is generated by the framework. The deployment tool, typically offered by the deployment platform vendor, takes this trained model as the input and constructs a deployable solution that runs on the target platform.
Figure 1: Current ML development and deployment approach. The shaded blocks are vendor-specific parts.
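The current-approach flow in the quote (framework trains a model; a vendor-supplied deployment tool converts it for the target platform) can be sketched as follows; every function name is a hypothetical stand-in, not a real API:

```python
# Sketch of the current approach (Figure 1), with hypothetical names:
# framework trains -> trained model -> vendor deployment tool -> deployable solution.

def train_in_framework(data):
    """Stand-in for the ML framework's training loop: fits a mean threshold."""
    threshold = sum(data) / len(data)
    return {"type": "threshold", "threshold": threshold}  # the "trained model"

def vendor_deployment_tool(trained_model):
    """Stand-in for the vendor tool that converts the model for the platform."""
    t = trained_model["threshold"]
    return lambda x: 1 if x > t else 0  # the "deployable solution"

trained = train_in_framework([1.0, 2.0, 3.0])   # development stage
solution = vendor_deployment_tool(trained)       # deployment stage
print(solution(2.5))                             # runs on the "target platform"
```

Note the one-way hand-off: the framework's validation results and the deployed solution's behavior can diverge, which is the gap the proposed approach below addresses.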
(...) we proposed the ML development and deployment approach shown in Fig. 2. Unlike current approach, the users, which are the solution developers, will specify their network models using the deployment tool with an operator library from the deployment platform vendor. The deployment tool will generate a deployable but untrained solution, i.e., a model that can run on their platform, but without the trained parameters. As part of the deployment tool, the vendor should also be responsible to create an operator model library that can represent their operators and be used to create a trainable model inside the ML framework. This trainable model should have a forward inference path that behaves exactly the same as the implemented operators, and a backward path for the ML framework to perform training in order to generate the appropriate weights and parameters. In this approach, the validation results from the ML framework will be the same as the deployment results, so that the user can use these results to refine or further optimize their network models.
Figure 2: Proposed ML development and deployment approach. The shaded blocks are vendor-specific parts.
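The proposed approach can likewise be sketched (hypothetical names throughout): the deployment tool emits an untrained but deployable model built from vendor operators, and training inside the framework uses the same operators, so the forward path during validation matches the deployed behavior exactly.

```python
# Sketch of the proposed approach (Figure 2), with hypothetical names.

def vendor_relu(x):
    """Operator implemented by the deployment platform vendor."""
    return x if x > 0 else 0.0

def make_deployable_model():
    """Untrained deployable model: structure is fixed, parameters are empty."""
    params = {"w": None, "b": None}
    def infer(x):
        # Forward path uses the vendor operator, same as on the target platform.
        return vendor_relu(params["w"] * x + params["b"])
    return params, infer

def framework_train(params):
    """Stand-in for training in the ML framework over the same operator model."""
    params["w"], params["b"] = 2.0, -1.0  # "learned" weights and bias

params, infer = make_deployable_model()  # deployable but untrained
framework_train(params)                  # fill in trained parameters
print(infer(1.5))                        # validation result == deployment result
```

Because `infer` is the same function in both settings, there is no framework-versus-platform mismatch to reconcile.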
2018b
- (Hanzlik et al., 2018) ⇒ Lucjan Hanzlik, Yang Zhang, Kathrin Grosse, Ahmed Salem, Max Augustin, Michael Backes, and Mario Fritz. (2018). “MLCapsule: Guarded Offline Deployment of Machine Learning As a Service.” arXiv:1808.00590
- QUOTE: Machine learning as a service (MLaaS) has become increasingly popular during the past five years. Leading Internet companies, such as Google[1] Amazon[2] and Microsoft[3] have deployed their own MLaaS. It offers a convenient way for a service provider to deploy a machine learning (ML) model and equally an instant way for a user/client to make use of the model in various applications. Such setups range from image analysis over translation to applications in the business domain.
2018c
- (Wei & Cai, 2018) ⇒ Kewei Wei, and Guanjun Cai. (2018). “Bring Intelligence to Where Critical Transactions Run – An Update from Machine Learning for z/OS.” Released: June 7, 2018.
- QUOTE: To support such a broadscale collaboration, your machine learning platform must have a well-defined machine learning workflow, from data ingestion and model training to model deployment, as shown below.
2016
- (Tramer et al., 2016) ⇒ Florian Tramer, Fan Zhang, Ari Juels, Michael K. Reiter, and Thomas Ristenpart. (2016). “Stealing Machine Learning Models via Prediction APIs.” In: Proceedings of the 25th USENIX Security Symposium. ISBN:978-1-931971-32-4. arXiv:1609.02943
- QUOTE: Machine learning (ML) aims to provide automated extraction of insights from data by means of a predictive model. A predictive model is a function that maps feature vectors to a categorical or real-valued output. In a supervised setting, a previously gathered data set consisting of possibly confidential feature-vector inputs (e.g., digitized health records) with corresponding output class labels (e.g., a diagnosis) serves to train a predictive model that can generate labels on future inputs. Popular models include support vector machines (SVMs), logistic regressions, neural networks, and decision trees.
ML algorithms’ success in the lab and in practice has led to an explosion in demand. Open-source frameworks such as PredictionIO and cloud-based services offered by Amazon, Google, Microsoft, BigML, and others have arisen to broaden and simplify ML model deployment.
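The quote's notion of a predictive model (a function from a feature vector to a categorical or real-valued output) can be made concrete with a hedged sketch; the weights and labels here are arbitrary illustrations, not from the paper:

```python
import math

# Hypothetical predictive model: a fixed logistic regression mapping a
# feature vector to a categorical label, per the definition in the quote.
WEIGHTS = [1.2, -0.7]
BIAS = 0.05

def predict_label(features):
    score = sum(w * x for w, x in zip(WEIGHTS, features)) + BIAS
    prob = 1.0 / (1.0 + math.exp(-score))    # sigmoid squashes score to (0, 1)
    return "positive" if prob >= 0.5 else "negative"

print(predict_label([1.0, 0.5]))
```

In a supervised setting, training would fit `WEIGHTS` and `BIAS` from labeled data; a prediction API as in Tramer et al. exposes only `predict_label`, which is exactly what their model-stealing attacks query.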
1998
- (Kohavi & Provost, 1998) ⇒ Ron Kohavi, and Foster Provost. (1998). “Glossary of Terms.” In: Machine Learning 30(2-3).
- QUOTE: Model deployment: The use of a learned model. Model deployment usually denotes applying the model to real data.