Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore advanced methods for evaluating and enhancing the performance of sophisticated GenAI agentic systems in this 19-minute conference talk. Discover how modern AI agents have evolved beyond simple information retrieval to handle complex multi-turn dialogues and execute autonomous multi-step tasks. Learn practical techniques for assessing LLM-driven agentic system behavior, identifying performance bottlenecks, and creating more reliable and better-aligned AI agents. Gain actionable insights into systematic approaches for improving agentic AI systems that can manage sophisticated workflows and deliver consistent results across various applications.
Syllabus
Sponsored by: Snorkel AI | Evaluating and Improving Performance of Agentic Systems
Taught by
Databricks