RoPE: Rotary Position Embedding for Extended Context Lengths in Transformers
Discover AI via YouTube
Get 20% off all career paths from fullstack to AI
Lead AI Strategy with UCSB's Agentic AI Program — Microsoft Certified
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Learn about Rotary Position Embedding (RoPE) in this 40-minute technical video that breaks down complex concepts into simple terms for understanding how self-attention works in Transformers with relative position encoding. Explore the mathematical foundations and practical applications of RoPE that enable Large Language Models (LLMs) to handle extended context lengths up to 100K tokens. Dive into the key concepts from the RoFormer paper, examining how rotary position embeddings enhance transformer architectures for improved performance in natural language processing tasks. Gain valuable insights into this advanced AI research topic through clear explanations and detailed breakdowns of the underlying mechanisms.
Syllabus
RoPE Rotary Position Embedding to 100K context length
Taught by
Discover AI