Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Accelerating Scientific Applications with Automatic BLAS GPU Offload on NVIDIA Grace-Hopper

MuST Program for Disordered Materials via YouTube

Overview

Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Learn about an innovative automatic GPU offloading technique for scientific applications in this technical presentation from Dr. Junjie Li at Texas Advanced Computing Center. Explore SCILIB-Accel, a groundbreaking tool that enables automatic BLAS offload on NVIDIA Grace-Hopper platforms without requiring code modifications. Discover how the tool leverages unified memory architectures and cache-coherent NVLink C2C interconnect to eliminate traditional GPU programming bottlenecks. Understand the GPU First-Use data movement policy, inspired by OpenMP First-Touch, and see how it minimizes CPU-GPU data transfers in scientific computing applications. Examine real-world performance results, including a 3x speedup achieved in quantum physics codes like the LSMS method in the MuST suite when comparing Grace-Hopper (1 CPU + 1 GPU) to Grace-Grace (dual CPU) configurations. Gain insights into how this pioneering tool is making high-performance automatic BLAS offload practical for scientific applications.

Syllabus

Accelerating Scientific Applications with Automatic BLAS GPU Offload on NVIDIA Grace-Hopper

Taught by

MuST Program for Disordered Materials

Reviews

Start your review of Accelerating Scientific Applications with Automatic BLAS GPU Offload on NVIDIA Grace-Hopper

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.