Overview
Watch a 17-minute presentation on how Baseten built a platform that helps AI companies ship products quickly without sacrificing performance. The talk covers Baseten's infrastructure for deploying and serving models, which integrates NVIDIA's TensorRT-LLM on AWS to balance performance, scalability, and cost-efficiency. It also shows how companies use the platform to strengthen their AI development while keeping performance high and costs under control in production.
Syllabus
Scaling Open Source AI Models in Production
Taught by
AWS Events