GPT-3 vs Humans: Rethinking Evaluation of Natural Language Generation
Center for Language & Speech Processing (CLSP), JHU via YouTube
Overview
This 44-minute talk by Wei Xu, hosted by the Center for Language & Speech Processing at JHU, examines how natural language generation systems are evaluated. It compares GPT models with human performance on constrained text generation tasks, focusing on paraphrase generation and text simplification. The talk introduces the Rank-and-Rate evaluation framework and uses it to compare GPT-3.5 against fine-tuned T5 and human writers. It then covers the limitations of existing automatic evaluation metrics and presents LENS, a learnable evaluation metric that outperforms current methods in both automatic evaluation and minimal risk decoding for text generation.
Syllabus
GPT-3 vs Humans: Rethinking Evaluation of Natural Language Generation - Wei Xu - February 2023
Taught by
Center for Language & Speech Processing (CLSP), JHU