Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Low-Latency Strix Halo Cluster with RDMA - RoCE/Intel E810 and vLLM, Framework Desktop Boards

Donato Capitella via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn to build a high-performance 2-node Strix Halo cluster using RDMA networking and vLLM tensor parallelism in this technical video tutorial. Explore the hardware architecture featuring Framework Desktop motherboards with 128 GB unified memory each, connected via Intel E810 cards configured for RoCE (RDMA over Converged Ethernet). Discover the critical hardware considerations including direct-attached RDMA setup, custom cooling solutions for the E810 network cards, and understand why running x16 NICs in PCIe x4 slots doesn't significantly impact inference performance. Compare RDMA latency advantages over standard Ethernet and learn why low latency is essential for effective tensor parallelism. Navigate the software implementation using vLLM with Ray, including troubleshooting the main challenge of missing RCCL support for gfx1151 in upstream ROCm. Follow the detailed walkthrough of patching RCCL to enable multi-node tensor parallelism on Strix Halo architecture, complete with configuration tutorials and performance benchmarks. Access provided toolboxes and guides to reproduce the entire setup, making this advanced clustering approach accessible for high-performance AI inference applications.

Syllabus

– Introduction
– The Hardware
– RDMA / RoCE Network Card
– Custom Cooling for Intel E810
– PCIe Lane Caveat x16 to x4
– ROCm / RCCL gfx1151 Support
– Configuration Tutorial
– Benchmarks
– Conclusion

Taught by

Donato Capitella

Reviews

Start your review of Low-Latency Strix Halo Cluster with RDMA - RoCE/Intel E810 and vLLM, Framework Desktop Boards

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.