
Google

Cloud Run for AI Inference

Google via Google Skills

Overview

AI inference is the process of using a trained machine learning model to make predictions on new, unseen data by applying learned patterns. This course is designed for developers, data scientists, and ML engineers who want to quickly deploy AI inference services on Cloud Run. It is useful for those who are familiar with cloud-based serverless application deployment but may not have experience running AI inference with Google Cloud serverless products. The course includes examples that deploy a model for AI inference with GPUs and integrate gen AI apps with data storage services.

Syllabus

  • A brief introduction to Cloud Run
    • What's in it for me?
    • What is Cloud Run?
    • Quiz
  • AI inference on Cloud Run
    • Use GPUs for AI inference on Cloud Run
    • Deploy lightweight language models on Cloud Run
    • Optimize performance and cost-efficiency
    • Integrate AI inference services
    • Quiz
    • What did I walk away with?
  • Course resources
    • LoRA adapters
    • Course PDF
  • Your Next Steps
    • Course Badge
