Reconciling Accuracy, Cost, and Latency of Inference Serving Systems
AI Institute at UofSC - #AIISC via YouTube
Overview
In this 1-hour, 21-minute talk, Professor Pooyan Jamshidi from the AI Institute at UofSC (#AIISC) examines the challenge of balancing accuracy, cost, and latency in inference serving systems. The talk covers how these three competing factors must be reconciled when deploying machine learning models in production, and discusses approaches to optimizing the trade-offs among them.
Syllabus
Reconciling Accuracy, Cost, and Latency of Inference Serving Systems: Prof. Pooyan Jamshidi
Taught by
AI Institute at UofSC - #AIISC