We Tried to Jailbreak Our AI - and Model Armor Stopped It

Google Cloud Tech via YouTube

Overview

Watch a 10-minute technical demonstration in which Google Cloud experts Martin Omander and Aron Eidelman showcase Model Armor, Google's security solution for AI applications. Learn about the OWASP Top 10 AI security risks and see live demonstrations of common attack vectors, including jailbreaking attempts, sensitive data extraction, and malicious URL injection. Discover how Model Armor acts as a protective layer that, through simple API calls, intercepts and blocks these threats before they reach your AI model. Examine actual code for sanitizing user prompts, filtering model responses, and redacting sensitive information such as Social Security Numbers using Data Loss Prevention (DLP) techniques. Explore the technical architecture behind Model Armor's input and output validation, and understand why a dedicated security layer is more effective than relying solely on built-in model guardrails or on an additional LLM for protection. Get insights into configuring security policies for different applications, pricing considerations, and practical strategies for securing AI applications in production.
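The input check and output check described above can be illustrated with a small local sketch. This is not Model Armor's API: the names `check_prompt`, `redact_response`, and `guarded_call`, the marker phrases, and the SSN regular expression are all hypothetical stand-ins, assuming a simple keyword test in place of a real jailbreak detector and a regex in place of a real DLP service.

```python
import re
from dataclasses import dataclass
from typing import Callable

# Assumption: SSNs appear in the common 123-45-6789 form.
SSN_PATTERN = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")

# Assumption: a toy phrase list standing in for a real jailbreak classifier.
JAILBREAK_MARKERS = (
    "ignore previous instructions",
    "ignore all previous instructions",
)


@dataclass
class Verdict:
    allowed: bool
    text: str
    reason: str = ""


def check_prompt(prompt: str) -> Verdict:
    """Input check: block prompts that contain a known jailbreak phrase."""
    lowered = prompt.lower()
    for marker in JAILBREAK_MARKERS:
        if marker in lowered:
            return Verdict(False, "", "possible jailbreak attempt")
    return Verdict(True, prompt)


def redact_response(response: str) -> Verdict:
    """Output check: redact SSN-like strings instead of blocking the reply."""
    redacted, count = SSN_PATTERN.subn("[REDACTED SSN]", response)
    reason = f"redacted {count} SSN(s)" if count else ""
    return Verdict(True, redacted, reason)


def guarded_call(prompt: str, model: Callable[[str], str]) -> str:
    """Wrap a model call with the input check and the output check."""
    verdict = check_prompt(prompt)
    if not verdict.allowed:
        return f"Request blocked: {verdict.reason}"
    return redact_response(model(prompt)).text
```

Here `model` is any function that takes a prompt string and returns a response string, so the wrapper can be tried with a stub before wiring it to a real model. A hosted service would replace both checks with API calls, but the control flow (check the prompt, call the model, check the response) stays the same.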

Syllabus

00:00 - Why AI apps need a "bodyguard"
00:57 - What are the top AI security risks? OWASP Top 10
01:46 - [Demo] Trying to jailbreak our AI app
02:25 - [Demo] Stopping sensitive data (SSN) leaks
03:23 - [Demo] Redacting data instead of blocking (DLP)
04:06 - [Demo] Blocking malicious URLs
04:50 - How it works: A simple API call
05:11 - Code: Sanitizing user prompts (input check)
05:21 - Code: Sanitizing model responses (output check)
06:19 - Code: Redact sensitive data
08:11 - Q&A: Don't models already have guardrails?
07:23 - Q&A: Why not use another LLM to protect my LLM?
07:58 - Q&A: Configuring policies for different apps
08:50 - Q&A: How much does Model Armor cost?
09:10 - Final thoughts

Taught by

Google Cloud Tech
