Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Efficient Modular Arithmetic using Vectorization on CPUs and GPUs

Simons Institute via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn how to accelerate modular arithmetic operations using vectorization techniques on both CPUs and GPUs in this 33-minute conference talk from the Simons Institute. Explore the computational challenges of modular multiplication in large prime fields, which forms the backbone of cryptographic and scientific algorithms and often dictates overall algorithm runtime. Discover a new RNS Montgomery multiplication method that simplifies existing approaches and eliminates unnecessary elementwise modular multiplications through vectorization. Understand how this breakthrough enables significant latency reduction for single modular multiplications under generic primes, moving beyond the traditional approach of only accelerating batches of independent operations. Examine the implications for number-theoretic cryptography, where virtually all operations can benefit from vectorization speedups. Gain insights into the technical details of this simplified RNS structure and its potential to transform computational efficiency across cryptographic applications.

Syllabus

Efficient Modular Arithmetic using Vectorization on CPUs and GPUs

Taught by

Simons Institute

Reviews

Start your review of Efficient Modular Arithmetic using Vectorization on CPUs and GPUs

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.