GPT-3 vs Humans: Rethinking Evaluation of Natural Language Generation
Center for Language & Speech Processing (CLSP), JHU via YouTube
Overview
Explore the challenges and advancements in evaluating natural language generation systems in this 44-minute talk by Wei Xu, presented at the Center for Language & Speech Processing at JHU. The talk compares GPT models with human performance on constrained text generation tasks, focusing on paraphrase generation and text simplification. Learn about the Rank-and-Rate evaluation framework and see how GPT-3.5 compares to fine-tuned T5 and to human writers. Examine the limitations of existing automatic evaluation metrics and the potential of LENS, a learnable evaluation metric that outperforms current methods both as an automatic evaluation metric and as the objective for minimum risk decoding in text generation.
Syllabus
GPT-3 vs Humans: Rethinking Evaluation of Natural Language Generation - Wei Xu - February 2023
Taught by
Center for Language & Speech Processing (CLSP), JHU