Overview
Learn how to implement a complete sharding and parallelism solution in JAX in this 13-minute tutorial, Part 3 of Google's three-part series on scaling machine learning models. It covers the training loop, data loading, checkpointing, and a practical Transformer block example.
Syllabus
Scaling Up (Part 3)
Taught by
Google Developers