Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore how Ray Data has evolved into one of the most widely used libraries in the Ray ecosystem through this 30-minute conference talk from Ray Summit 2025. Learn from Balaji Veeramani of Anyscale as he demonstrates how Ray Data differs from traditional data processing engines by being purpose-built for multimodal, accelerator-native, and AI-centric pipelines. Discover the core capabilities of Ray Data and examine the major features added over the past year that support large-scale batch inference across GPUs and clusters, distributed training data preparation and ingestion for massive models, and high-performance multimodal data processing spanning images, video, text, and more. Gain insights into how Ray Data powers modern AI at scale, whether you're building LLM pipelines, multimodal training workflows, or high-throughput inference systems, and understand why this library has become essential for the new generation of AI workloads.
Syllabus
How Ray Data Powers Scalable AI Workloads | Ray Summit 2025
Taught by
Anyscale