Accelerating High-Performance Machine Learning at Scale in Kubernetes
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Explore a hands-on guide to productionizing optimized machine learning models in cloud native ecosystems with production-ready open source frameworks in this 36-minute conference talk from KubeCon + CloudNativeCon North America 2022. Follow a practical use case that deploys the GPT-2 NLP model in Kubernetes, running on ONNX Runtime through the Seldon Core Triton server, and learn how to build a scalable production NLP microservice for intelligent text generation applications. Discover key challenges in the MLOps space and see how the various tools interoperate across the production machine learning lifecycle. The talk is presented by Alejandro Saucedo and Elena Neroslavskaya.
Syllabus
Accelerating High-Performance Machine Learning at Scale i... Alejandro Saucedo & Elena Neroslavskaya
Taught by
CNCF [Cloud Native Computing Foundation]