Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Microsoft

Let Your Agentic Apps Talk with Azure Speech - BRK198

Microsoft via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore the next generation of Azure Speech capabilities in this 45-minute conference talk from Microsoft Ignite 2025. Discover how to build voice-enabled applications, intelligent customer-service AI agents, and multimodal translation tools using cutting-edge speech technologies. Learn about the generally available Voice Live API for creating real-time voice agents and explore new APIs that leverage large language models to unlock advanced capabilities. Examine partnerships utilizing Azure Speech for enhanced voice experiences and understand the launch of the LLM Speech API with improved transcription and context understanding. Master the selection of generative AI models and prompt setup for voice agents while witnessing demonstrations of custom speech models, audio enhancements, and voice options. Investigate AI assistant improvements featuring natural conversation and adaptive behavior, and discover the introduction of 41 upgraded Neural HD voices supporting 100 locales. Learn techniques for creating branded voices and custom avatars using new HD voice models, addressing audio interruptions, and ensuring clear voice recognition in various scenarios including healthcare applications. Gain insights into future expansion and innovation plans, including real-time translation capabilities that will shape the next wave of voice-enabled applications.

Syllabus

0:00 - Partnerships using Azure Speech for voice experiences
00:10:50 - Launch of LLM Speech API: improved transcription and context understanding
00:17:20 - Selecting generative AI models and prompt setup for voice agent
00:19:30 - Demonstration of custom speech models, audio enhancements, and voice options
00:25:00 - AI assistant improvements with natural conversation and adaptive behavior
00:25:33 - Introduction of 41 upgraded Neural HD voices supporting 100 locales
00:28:40 - Creation of branded voices and custom avatars using new HD voice models
00:36:10 - Addressing audio interruptions and ensuring clear patient voice recognition
00:44:34 - Future expansion and innovation plans including real-time translation capabilities

Taught by

Microsoft Ignite

Reviews

Start your review of Let Your Agentic Apps Talk with Azure Speech - BRK198

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.