Taming the Wild West of Research Computing - How Policies Saved Us a Thousand Headaches
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
This 24-minute CNCF conference talk covers governance and resource management for Kubernetes clusters in research computing environments. It presents practical strategies for controlling chaotic research workloads with policy-as-code, using Kyverno, Kueue, and Argo CD. The talk walks through real-world fixes for common problems in bare-metal accelerated discovery clusters: abandoned interactive GPU pods, CPU jobs misallocated to GPU nodes, and wasted resources. Topics include enforcing fine-grained policies, implementing fair-share GPU scheduling, and automating governance without custom code or complex workarounds. The talk also shows how GitOps discipline and careful policy design help maintain order in high-performance computing environments, both for research workloads and for general-purpose clusters.
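To make the idea of a policy that keeps CPU jobs off GPU nodes more concrete, here is a minimal sketch in plain Python. It is not Kyverno's API and not the speaker's implementation: Kyverno policies are declared as Kubernetes resources, and the talk's actual rules are not given in this listing. The label key `node-type`, its value `gpu`, and the resource name `nvidia.com/gpu` are assumptions chosen for illustration.

```python
from typing import Tuple

# Assumed names for illustration only; a real cluster may use different ones.
GPU_RESOURCE = "nvidia.com/gpu"   # assumed extended resource name for GPUs
GPU_NODE_LABEL = "node-type"      # hypothetical node label key
GPU_NODE_VALUE = "gpu"            # hypothetical node label value


def requests_gpu(pod: dict) -> bool:
    """Return True if any container in the pod sets a positive GPU limit."""
    for container in pod.get("spec", {}).get("containers", []):
        limits = container.get("resources", {}).get("limits", {})
        if int(limits.get(GPU_RESOURCE, 0)) > 0:
            return True
    return False


def targets_gpu_node(pod: dict) -> bool:
    """Return True if the pod's nodeSelector pins it to GPU nodes."""
    selector = pod.get("spec", {}).get("nodeSelector", {})
    return selector.get(GPU_NODE_LABEL) == GPU_NODE_VALUE


def validate(pod: dict) -> Tuple[bool, str]:
    """Reject pods that target GPU nodes without requesting a GPU."""
    if targets_gpu_node(pod) and not requests_gpu(pod):
        return False, "CPU-only pods may not be scheduled on GPU nodes"
    return True, "allowed"


# A CPU-only pod that tries to land on a GPU node.
cpu_pod_on_gpu_node = {
    "spec": {
        "nodeSelector": {GPU_NODE_LABEL: GPU_NODE_VALUE},
        "containers": [{"name": "train", "resources": {"limits": {"cpu": "4"}}}],
    }
}

# A pod that targets a GPU node and actually requests one GPU.
gpu_pod = {
    "spec": {
        "nodeSelector": {GPU_NODE_LABEL: GPU_NODE_VALUE},
        "containers": [
            {"name": "train", "resources": {"limits": {GPU_RESOURCE: "1"}}}
        ],
    }
}

cpu_result = validate(cpu_pod_on_gpu_node)
gpu_result = validate(gpu_pod)
```

In a policy-as-code setup, a rule like this would live in version control as a declarative policy and be applied to the cluster through a GitOps tool, rather than run as a standalone script.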
Syllabus
Taming the Wild West of Research Computing: How Policies Saved Us a Thousand... Alessandro Pomponio
Taught by
CNCF [Cloud Native Computing Foundation]