Overview
Learn how to run speech-to-speech AI models on a Mac or a remote GPU in this 37-minute tutorial. It covers how models like GPT-4o are built, examines the Llama 3 Speech-to-Speech Model, and uses HuggingFace's Speech-to-Speech repository. Step-by-step instructions show how to run these models on a Mac and in a remote GPU (CUDA) environment, and how to reduce latency by using UDP ports instead of TCP. Linked resources include GitHub repositories, slides, and research papers.
Syllabus
Introduction to Speech-to-Speech AI Models like GPT-4o
Video Overview
How to build speech-to-speech models like GPT-4o
Llama 3 Speech-to-Speech Model
HuggingFace Speech-to-Speech
Running speech-to-speech on your Mac
Running speech-to-speech on a remote GPU (CUDA)
Reducing latency with UDP ports instead of TCP
Video Resources
Taught by
Trelis Research