Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Google Cloud

Deploy and Scale AI Models with Cloud Run

Google Cloud via Coursera

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
AI inference is the process of using a trained machine learning model to make predictions on new, unseen data by applying learned patterns. This course is designed for developers, data scientists, and ML engineers interested in quickly deploying AI inference services on Cloud Run. It is useful for those familiar with cloud-based serverless application deployment solutions, but who may not have experience with running AI inference using Google Cloud serverless products. The course includes examples that deploys a model for AI inference with GPUs and integrates gen AI apps with data storage services.

Syllabus

  • A brief introduction to Cloud Run
    • An introduction to Cloud Run and its capabilities.
  • AI inference on Cloud Run
    • Deploy gen AI apps to Cloud Run for AI inference using machine learning models.

Taught by

Google Cloud Training

Reviews

Start your review of Deploy and Scale AI Models with Cloud Run

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.