Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore a comprehensive analysis of code retrieval systems and their limitations in this 31-minute video examining recent research findings. Discover how current code retrievers heavily depend on textual features rather than understanding functional code semantics, revealing that Code RAG systems don't actually find code sequences as intended. Learn about the significant performance degradation of both embedding-based retrievers and LLM-based rerankers under normalization settings, particularly in full normalization scenarios. Understand the transition from traditional RAG (Retrieval-Augmented Generation) to RACG (Retrieval-Augmented Code Generation) as a more effective approach for code retrieval tasks. Examine the research findings from Carnegie Mellon University's study "SACL: Understanding and Combating Textual Bias in Code Retrieval with Semantic-Augmented Reranking and Localization" by Dhruv Gupta, Gayathri Ganesh Lakshmy, and Yiqing Xie, which demonstrates how semantic-augmented approaches can address textual bias in code retrieval systems.
Syllabus
AI: SWE Code-RAG at 9% Best (RAG to RACG)
Taught by
Discover AI