Scaling XGBoost With Spark Connect ML on Grace Blackwell

Learn how to optimize XGBoost performance using NVIDIA's Grace Blackwell super chip architecture in this 36-minute conference talk from Databricks. Discover how XGBoost's distributed out-of-core implementation leverages the ultra-high bandwidth of NVLink-C2C connections between CPU and GPU to overcome memory limitations that typically constrain gradient boosting algorithms when working with large tabular datasets. Explore the technical implementation that enables XGBoost to scale up to over 1.2TB of data processing capacity on a single node without performance degradation, taking advantage of the fast chip-to-chip communication enabled by the Grace Blackwell architecture. Understand how this approach extends to GPU clusters using Spark, allowing XGBoost to efficiently handle terabytes of data across distributed systems. See a practical demonstration of integrating XGBoost's out-of-core algorithms with Spark 4.0's latest Connect ML framework for large-scale model training workflows, presented by NVIDIA engineers Bobby Wang and Jiaming Yuan who share their optimization work and real-world implementation strategies.

Syllabus

Scaling XGBoost With Spark Connect ML on Grace Blackwell

Taught by

Databricks

Reviews

Start your review of Scaling XGBoost With Spark Connect ML on Grace Blackwell

Learn Python with Generative AI - Self Paced Online

Python, Prompt Engineering, Data Science — Build the Skills Employers Want Now

Taught by

The Fastest Way to Become a Backend Developer Online

NVIDIA vGPU Support on Grace Blackwell Superchip - Architecture, Design, Upstreaming Status

Tracing the Path of a Row Through a GPU-Enabled Query Engine on the Grace-Blackwell Architecture

Spark RAPIDS ML - GPU Accelerated Distributed Machine Learning in Spark Clusters

Scaling Distributed XGBoost and Parallel Data Ingestion with Ray - FlightAware Case Study

Power BI Fundamentals - Create visualizations and dashboards from scratch Ad

9 Best System Design Courses for 2026: From Coding to Architecting

14 Best Machine Learning Courses for 2026: Scikit-learn, TensorFlow, and more

12 Best Applied AI & ML Courses for 2026

AI for Good: A DeepLearning.AI Course Review

Unveiling the Mathematical Beauty of Machine Learning: A Review of Steve Brunton’s Course

Never Stop Learning.