Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Running Speech-to-Speech Models on Mac or GPU

Trelis Research via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn how to run speech-to-speech AI models on Mac or GPU in this comprehensive 37-minute tutorial. Explore the process of building models like GPT-4o, dive into the Llama 3 Speech-to-Speech Model, and utilize HuggingFace's Speech-to-Speech repository. Follow step-by-step instructions for running these models on your Mac and remote GPU (CUDA) environments. Discover techniques to reduce latency using UDP ports instead of TCP. Access valuable resources, including GitHub repositories, slides, and research papers, to further enhance your understanding of speech-to-speech AI technology.

Syllabus

Introduction to Speech to Speech AI Models like GPT-4o
Video Overview
How to build speech-to-speech models like GPT-4o
Llama 3 Speech-to-Speech Model
HuggingFace Speech-to-Speech
Running speech to speech on your Mac
Running speech-to-speech on a remote GPU CUDA
Reducing latency with UDP ports instead of TCP
Video Resources

Taught by

Trelis Research

Reviews

Start your review of Running Speech-to-Speech Models on Mac or GPU

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.