Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMs
Association for Computing Machinery (ACM) via YouTube
Lead AI Strategy with UCSB's Agentic AI Program — Microsoft Certified
NY State-Licensed Certificates in Design, Coding & AI — Online
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Explore a 16-minute conference presentation from SIGIR 2024 that examines how user feedback impacts both crowdworkers and Large Language Models (LLMs) in dialogue system evaluation. Delve into research findings presented by Clemencia Siro, Mohammad Aliannejadi, and Maarten de Rijke as they challenge traditional evaluation methods for conversational AI systems. Learn about the complex interplay between human evaluators and automated systems, and understand how incorporating user feedback can reshape our approach to assessing dialogue system performance. Gain insights into the evolving landscape of AI evaluation methodologies through this ACM-sponsored talk that bridges the gap between human and machine-based assessment techniques.
Syllabus
SIGIR 2024 W1.3 [fp] Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback
Taught by
Association for Computing Machinery (ACM)