Evaluating Gemini Models Using Vertex AI Auto Side-by-Side Comparison
Google Cloud Events via YouTube
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn how to effectively evaluate and compare Gemini's performance against other language models using Vertex AI Auto SxS in this 35-minute technical presentation. Explore automated model evaluation techniques that go beyond basic recall metrics, with detailed demonstrations of both human-in-the-loop and fully automated comparison workflows. Master the practical implementation of Vertex AI Auto SxS through comprehensive demos, understand key evaluation metrics for generative AI, and discover best practices for assessing model performance specific to your use cases. Gain valuable insights during an interactive Q&A session covering model evaluation strategies and implementation challenges.
Syllabus
Evaluating Gen AI
Model Evaluation for Gen AI
Get started with Auto SxS
No human in the loop
Human in the loop
Key takeaways
Q&A
Taught by
Google Cloud Events