Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn to create a real-time AI voice agent in this 13-minute tutorial that combines DeepSeek R1's 7B model with AssemblyAI for speech-to-text and ElevenLabs for text-to-speech capabilities. Master the implementation of real-time speech transcription using AssemblyAI, integrate DeepSeek R1 through Ollama for AI response generation, and convert text responses into natural-sounding speech using ElevenLabs. Follow along with practical demonstrations and step-by-step instructions for building a low-latency voice assistant capable of natural, real-time interactions. Starting with installation requirements and progressing through the Python implementation, discover how to create a seamless conversational AI system that can listen, process, and respond with human-like speech patterns.
Syllabus
00:00 - Intro
01:00 - Demo
01:49 - Installing AssemblyAI, Ollama for DeepSeek R1 and Elevenlabs
03:38 - Building the AI voice agent in python
12:35 - Demo
Taught by
AssemblyAI