Overview
Explore practical strategies for optimizing AI model efficiency in this 18-minute conference talk from the AI Engineer World's Fair. Learn to identify key sources of inefficiency in AI models and discover actionable techniques to reduce latency and resource consumption in large-scale deployment. Examine the challenges posed by resource-intensive models and by the growing on-device AI movement, which imposes physical constraints on model deployment. Dive into practical solutions including model architecture selection, quantization techniques, and prompt optimization strategies. Gain insights from Dr. Shelby Heinecke, who leads an AI research team at Salesforce developing cutting-edge AI for products and academic research, with expertise spanning AI agents, LLMs, on-device AI, entity resolution, and recommendation systems.
Syllabus
A Practical Guide to Efficient AI: Shelby Heinecke
Taught by
AI Engineer