Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

From Pixels to Words: Exploring Multi-Modal AI with Python

Data Science Conference via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore the transformative potential of Multi-Modal Learning with Python in this comprehensive conference talk from DSC EUROPE 24. Nandana Sreeraj demonstrates how combining visual, audio, and textual data creates more powerful AI systems through real-world examples like self-driving cars that simultaneously process road visuals, honking sounds, and traffic sign text. Learn practical Python implementations for integrating diverse data streams in AI applications, with examples accessible to both experienced data scientists and newcomers to artificial intelligence. The presentation provides an interactive journey through multi-modal learning techniques, offering inspiration and practical knowledge for enhancing AI projects by leveraging multiple data types together. This technical session was presented at the Data Science Conference in Belgrade on November 18th, 2024.

Syllabus

From Pixels to Words: Exploring Multi-Modal AI with Python | Nandana Sreeraj | DSC EUROPE 24

Taught by

Data Science Conference

Reviews

Start your review of From Pixels to Words: Exploring Multi-Modal AI with Python

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.