Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Google

AI Infrastructure: Deployment Types

Google via Google Skills

Overview

Build a Learning Habit
Download Class Central's free printable study calendar
Download for Free
This course provides a comprehensive guide to deploying, managing, and optimizing AI and high-performance computing (HPC) workloads on Google Cloud. Through a series of lessons and practical demonstrations, you’ll explore diverse deployment strategies, ranging from highly customizable environments using Google Compute Engine (GCE) to managed solutions like Google Kubernetes Engine (GKE). Specifically, you’ll learn how to create clusters and deploy GKE for inference.

Syllabus

  • Course overview
    • What's in it for me?
  • Cluster creation process
    • Process overview
    • Choosing a machine type
    • Choosing the consumption option
    • Choosing the deployment options
    • Choosing an orchestrator
    • Choosing an image
    • Quiz
  • Creating a cluster with Compute Engine
    • Choosing a GCE machine type
    • Choosing a GCE deployment option
    • Networking for GCE instances
    • Reference architecture
    • Quiz
  • Building with Google Kubernetes Engine (GKE)
    • Containerizing an AI application
    • Networking for GKE cluster deployments
    • Optimizing an AI workload
    • GPU sharing strategies
    • Quiz
  • Deploying with GKE for inference
    • Architecting for inference on GKE
    • GKE inference reference architecture
    • Optimizing inference with GKE Inference Gateway
    • Congratulations and next steps
    • Quiz
  • Course resources
    • Course resources
  • Your Next Steps
    • Claim credential

Reviews

Start your review of AI Infrastructure: Deployment Types

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.