StreamBox - A Lightweight GPU Sandbox for Serverless Inference Workflow

Explore a conference talk on StreamBox, a lightweight GPU sandbox designed for serverless inference workflows. Learn about the challenges that dynamic workloads and latency-sensitive DNN inference pose in serverless computing environments. Discover how StreamBox addresses the limitations of existing serverless inference systems through three mechanisms: fine-grained, auto-scaling memory management; transparent and efficient intra-GPU communication across functions; and PCIe bandwidth sharing among concurrent streams. See the reported improvements, including up to an 82% reduction in GPU memory footprint and a 6.7x increase in throughput compared with state-of-the-art systems. Gain insight into what this approach could mean for scalable DNN inference serving and for serverless computing on GPU-intensive tasks.

Syllabus

USENIX ATC '24 - StreamBox: A Lightweight GPU SandBox for Serverless Inference Workflow

Taught by

USENIX



Torpor - GPU-Enabled Serverless Computing for Low-Latency, Resource-Efficient Inference

Colocating ML Inference and Training with Fast GPU Memory Handover

No More GPU Cold Starts - Making Serverless ML Inference Truly Real-Time

PPipe - Efficient Video Analytics Serving on Heterogeneous GPU Clusters via Pool-Based Pipeline Parallelism

SAVE - Software-Implemented Fault Tolerance for Model Inference against GPU Memory Bit Flips
