GCP AI Platform Prediction
Jump to navigation
Jump to search
A GCP AI Platform Prediction is a ML prediction platform that is a component of [[]].
- See: Seldon Framework.
References
2020
- https://venturebeat.com/2020/09/25/google-launches-ai-platform-prediction-in-general-availability/
- QUOTE: ... AI Platform Prediction ostensibly makes it easier to deploy models trained using frameworks like XGBoost and scikit, courtesy of an engine that selects compatible cloud hardware (e.g., AI accelerator chips) automatically. On supported virtual machines, it shows metrics like graphics card, processor, RAM, and network usage, as well as things like model replica count over time. And on the security side, AI Platform Prediction ships with tools that allow users to define parameters and deploy models that only have access to resources and services within a defined network perimeter.
Beyond this, AI Platform Prediction provides information about model predictions and a visualization tool to help elucidate those predictions. Moreover, it continuously evaluates live models based on ground-truth labeling of requests sent to the model, providing an opportunity to improve performance through retraining. ...
- QUOTE: ... AI Platform Prediction ostensibly makes it easier to deploy models trained using frameworks like XGBoost and scikit, courtesy of an engine that selects compatible cloud hardware (e.g., AI accelerator chips) automatically. On supported virtual machines, it shows metrics like graphics card, processor, RAM, and network usage, as well as things like model replica count over time. And on the security side, AI Platform Prediction ships with tools that allow users to define parameters and deploy models that only have access to resources and services within a defined network perimeter.
2019
- https://venturebeat.com/2019/10/25/google-rolls-out-updates-to-ai-platform-prediction-and-ai-platform-training/
- QUOTE: ... Now, AI Platform Prediction — the component of AI Platform that enables model serving for online predictions in a serverless environment — lets developers choose from a set of machine types in Google’s Compute Engine service to run a model. Thanks to a new backend built on Google Kubernetes Engine, they’re able to add graphics chips like Nvidia’s T4 and have AI Platform Prediction handle provisioning, scaling, and serving. (Online Prediction previously only allowed you to choose from one or four vCPU machine types.)
Additionally, prediction requests and responses can now be logged to Google’s BigQuery, where they can be analyzed to detect skew and outliers. ...
- QUOTE: ... Now, AI Platform Prediction — the component of AI Platform that enables model serving for online predictions in a serverless environment — lets developers choose from a set of machine types in Google’s Compute Engine service to run a model. Thanks to a new backend built on Google Kubernetes Engine, they’re able to add graphics chips like Nvidia’s T4 and have AI Platform Prediction handle provisioning, scaling, and serving. (Online Prediction previously only allowed you to choose from one or four vCPU machine types.)