Power BI Fundamentals - Create visualizations and dashboards from scratch
The Fastest Way to Become a Backend Developer Online
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
This talk by Jacob Hilton from the Alignment Research Center explores how to establish probabilistic safety guarantees for large language models by examining their internal mechanisms. Learn about innovative approaches to ensuring AI safety through model internals analysis, as presented at the Simons Institute's Safety-Guaranteed LLMs event. The 46-minute presentation delves into technical methods for creating more reliable safety assurances in advanced AI systems.
Syllabus
Probabilistic Safety Guarantees Using Model Internals
Taught by
Simons Institute