FlexPipe - Maximizing Training Efficiency for Transformer-based Models with Variable-Length Inputs

In this 14-minute conference presentation from USENIX ATC '25, learn about FlexPipe, a distributed framework designed to maximize training efficiency for transformer-based models with variable-length inputs. Discover how researchers from Jilin University and the University of California, Riverside address the inefficiencies of static partitioning in distributed frameworks, which arise because variable-length inputs make computation and memory requirements fluctuate substantially across training iterations. Explore the first flexible pipeline framework that dynamically adjusts pipeline parallelism through a live flexibility mechanism without compromising training loss. It formulates a novel optimization problem that maximizes training throughput by adjusting parallel configurations, and it solves that problem with an efficient heuristic algorithm. Examine experimental results showing that FlexPipe improves training throughput by 1.25× on average over state-of-the-art methods, and see how it moves beyond traditional single-iteration optimizations to address system-level challenges in variable-length transformer training across distributed environments.

Syllabus

USENIX ATC '25 - FlexPipe: Maximizing Training Efficiency for Transformer-based Models with...

Taught by

USENIX

CrossPipe - Towards Optimal Pipeline Schedules for Cross-Datacenter Training

Obscura - Concealing Recomputation Overhead in Training of Large Language Models with Bubble-filling Pipeline Transformation

Universal Checkpointing - A Flexible and Efficient Distributed Checkpointing System for Large-Scale DNN Training with Reconfigurable Parallelism

WLB-LLM - Workload-Balanced 4D Parallelism for Large Language Model Training

Optimus - Accelerating Large-Scale Multi-Modal LLM Training by Bubble Exploitation
