Holmes - Localizing Irregularities in LLM Training with Mega-scale GPU Clusters

Holmes - Localizing Irregularities in LLM Training with Mega-scale GPU Clusters

USENIX via YouTube Direct link

NSDI '25 - Holmes: Localizing Irregularities in LLM Training with Mega-scale GPU Clusters

1 of 1

1 of 1

NSDI '25 - Holmes: Localizing Irregularities in LLM Training with Mega-scale GPU Clusters

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Holmes - Localizing Irregularities in LLM Training with Mega-scale GPU Clusters

Automatically move to the next video in the Classroom when playback concludes

  1. 1 NSDI '25 - Holmes: Localizing Irregularities in LLM Training with Mega-scale GPU Clusters

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.