Taming the Wild West of Research Computing - How Policies Saved Us a Thousand Headaches
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Learn how to implement effective governance and resource management for Kubernetes clusters in research computing environments in this 24-minute conference talk from CNCF. Discover practical strategies for controlling chaotic research workloads using policy-as-code approaches with Kyverno, Kueue, and Argo CD. Explore real-world solutions for preventing common issues such as abandoned interactive GPU pods, CPU jobs misallocated to GPU nodes, and resource waste in bare-metal accelerated discovery clusters. Master techniques for enforcing fine-grained policies, implementing fair-share GPU scheduling, and automating governance without custom code or complex workarounds. Gain insights into applying GitOps discipline and smart policy design to maintain order in high-performance computing environments, whether you manage research workloads or simply want to prevent cluster chaos.
Syllabus
Taming the Wild West of Research Computing: How Policies Saved Us a Thousand... Alessandro Pomponio
Taught by
CNCF [Cloud Native Computing Foundation]