Power BI Fundamentals - Create visualizations and dashboards from scratch
Google Data Analytics, IBM AI & Meta Marketing — All in One Subscription
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Dive into an in-depth video interview with Junyang Lin, author of the recently announced Qwen 2 LLM. Explore the development process, key features, and innovations behind this large language model. Learn about the Qwen team at Alibaba, the model's architecture, tokenizer, training methodology, and pretraining data. Gain insights into the extended context length capabilities, post-training techniques, benchmarks, safety considerations, and future directions for Qwen 2. Discover how this LLM compares to others in the field and understand its potential impact on natural language processing applications.
Syllabus
00:00:00 - Intro
00:00:32 - Hyperstack GPUs sponsored
00:02:13 - Junyang & Qwen team at Alibaba
00:07:05 - Qwen2
00:13:05 - Tokenizer
00:15:55 - Training Qwen2
00:23:30 - Pretraining data
00:30:50 - Model Architecture changes?
00:37:35 - Context length
00:50:00 - Post-training
00:57:50 - Benchmarks/safety
00:59:20 - Future work
Taught by
Aleksa Gordić - The AI Epiphany