Free Online

Multimodal AI Courses and Certifications

Build AI systems that process text, images, audio, and video simultaneously using GPT-4, Llama, and cutting-edge neural architectures. Learn through hands-on tutorials on YouTube and Coursera, creating applications that combine vision, language, and speech for real-world multimodal solutions.

272 courses
Showing 272 courses
Filter by
Filters
  1. Level
  2. Duration
  3. Language
    • Coursera
    • 4 weeks, 10 hours a week
    • On-Demand
    • Paid Course
    • 3 courses
    • Udemy
    • 9 hours 24 minutes
    • On-Demand
    • Paid Course
    • YouTube
    • 1 hour 6 minutes
    • On-Demand
    • Free Video
    • Udemy
    • 20 hours 8 minutes
    • On-Demand
    • Paid Course
    • Udemy
    • 8 hours 7 minutes
    • On-Demand
    • Paid Course
    • Coursera
    • 13 hours 9 minutes
    • On-Demand
    • Paid Course
    • Coursera
    • 2 hours 57 minutes
    • On-Demand
    • Paid Course
    • Coursera
    • 7 hours 43 minutes
    • On-Demand
    • Paid Course
    • Coursera
    • 1 day 3 hours 16 minutes
    • On-Demand
    • Paid Course

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.