HyCache - Hybrid Caching for Accelerating DNN Input Preprocessing Pipelines

Overview

HyCache is a hybrid caching runtime system designed to accelerate deep neural network (DNN) input preprocessing pipelines, presented in this 14-minute conference talk from USENIX ATC '25. Advances in modern GPUs have shifted the bottleneck in end-to-end DNN training from model computation to CPU-based data loading and preprocessing. Existing caching approaches rely on either memory or storage alone, and can cache only the complete output of a single preprocessing stage. HyCache removes both constraints: it caches subsets of preprocessed data from multiple intermediate stages, across memory and storage simultaneously. An integer linear programming (ILP) formulation determines the optimal caching strategy automatically, balancing recomputation cost against caching benefit without manual intervention. The reported results show raw pipeline throughput of 1.11× to 10.1× that of state-of-the-art approaches across a range of preprocessing pipelines. The authors are affiliated with the Indian Institute of Science, the University of Southern California, and independent research.
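
To make the trade-off concrete, here is a minimal sketch of the kind of decision described above: how many samples to cache at which intermediate stage and in which tier. It is not HyCache's formulation and it does not call an ILP solver. It is a toy exhaustive search over a coarse grid, and every stage name, size, cost, and capacity in it is a made-up assumption for illustration.

```python
from itertools import product

# All numbers below are hypothetical, chosen only to illustrate the trade-off.
NUM_SAMPLES = 1000
FULL_COST_MS = 10.0  # CPU time to preprocess one sample from scratch

# stage name -> (output size per sample in KB, CPU ms still needed after this stage)
STAGES = {"decoded": (200, 5.0), "normalized": (500, 0.5)}

# tier name -> (capacity in KB, read time in ms per KB)
TIERS = {"memory": (130_000, 0.001), "storage": (150_000, 0.004)}


def plan_cache(steps=10):
    """Exhaustively search partial-caching plans at a granularity of 1/steps."""
    options = [(stage, tier) for stage in STAGES for tier in TIERS]
    chunk = NUM_SAMPLES // steps  # samples represented by one step
    best_cost, best_plan = NUM_SAMPLES * FULL_COST_MS, {}
    for counts in product(range(steps + 1), repeat=len(options)):
        if sum(counts) > steps:  # each sample is served from at most one place
            continue
        used = {tier: 0 for tier in TIERS}
        # Samples that are not cached anywhere pay the full preprocessing cost.
        cost = (steps - sum(counts)) * chunk * FULL_COST_MS
        for (stage, tier), k in zip(options, counts):
            size_kb, remaining_ms = STAGES[stage]
            used[tier] += k * chunk * size_kb
            # Cached samples pay the read time plus the stages still to run.
            cost += k * chunk * (size_kb * TIERS[tier][1] + remaining_ms)
        if any(used[tier] > TIERS[tier][0] for tier in TIERS):
            continue  # plan does not fit in memory or storage
        if cost < best_cost:
            best_cost = cost
            best_plan = {opt: k * chunk for opt, k in zip(options, counts) if k}
    return best_cost, best_plan


if __name__ == "__main__":
    cost, plan = plan_cache()
    print(f"epoch preprocessing cost: {cost:.0f} ms (uncached: {NUM_SAMPLES * FULL_COST_MS:.0f} ms)")
    for (stage, tier), n in plan.items():
        print(f"  cache {n} samples of '{stage}' output in {tier}")
```

With these invented numbers, the cheapest plan splits the dataset across both stages and both tiers instead of caching one complete stage output in one place, which is the behaviour the talk attributes to hybrid partial caching. A real system would replace the grid search with an ILP solver and use measured sizes and costs.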

Syllabus

USENIX ATC '25 - HyCache: Hybrid Caching for Accelerating DNN Input Preprocessing Pipelines

Taught by

USENIX

Pecan: Cost-Efficient ML Data Preprocessing with Automatic Transformation Ordering and Hybrid Placement

Centimani - Enabling Fast AI Accelerator Selection for DNN Training

Universal Checkpointing - A Flexible and Efficient Distributed Checkpointing System for Large-Scale DNN Training with Reconfigurable Parallelism

mTuner - Accelerating Parameter-Efficient Fine-Tuning on Multi-GPU Servers with Elastic Tensor

AutoCCL - Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training
