AI, Data Science & Cloud Certificates from Google, IBM & Meta
Free courses from frontend to fullstack and AI
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Learn to rapidly develop a real-time voice-to-voice translator using open-source tools in this 29-minute hands-on workshop. Build an AI agent that processes speech-to-text, performs language translation, and synthesizes speech responses while optimizing for minimal latency. Master speech-to-text processing with open-source models, implement language translation using LLM-powered tools, and create speech synthesis capabilities for real-time responses. Discover techniques for optimizing latency to ensure seamless interaction and explore deployment strategies using open-source frameworks and APIs. Follow along with a live coding demonstration where you'll build alongside the instructor, with access to a provided GitHub repository for reproducible learning and further iteration.
Syllabus
Speed Building an AI-Powered Voice-to-Voice Translator with Open-Source Tools by Grace Deng
Taught by
Open Data Science