Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn how to run speech-to-speech AI models on Mac or GPU in this comprehensive 37-minute tutorial. Explore the process of building models like GPT-4o, dive into the Llama 3 Speech-to-Speech Model, and utilize HuggingFace's Speech-to-Speech repository. Follow step-by-step instructions for running these models on your Mac and remote GPU (CUDA) environments. Discover techniques to reduce latency using UDP ports instead of TCP. Access valuable resources, including GitHub repositories, slides, and research papers, to further enhance your understanding of speech-to-speech AI technology.
Syllabus
Introduction to Speech to Speech AI Models like GPT-4o
Video Overview
How to build speech-to-speech models like GPT-4o
Llama 3 Speech-to-Speech Model
HuggingFace Speech-to-Speech
Running speech to speech on your Mac
Running speech-to-speech on a remote GPU CUDA
Reducing latency with UDP ports instead of TCP
Video Resources
Taught by
Trelis Research