Statistics-Powered ML - Reliable Black-Box Inference from Untrusted Data
HUJI Machine Learning Club via YouTube
Power BI Fundamentals - Create visualizations and dashboards from scratch
Learn AI, Data Science & Business — Earn Certificates That Get You Hired
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Explore how fundamental statistical principles can enhance machine learning systems to provide reliable black-box inference from untrusted data in this conference talk. Learn about two critical challenges in high-stakes AI applications: data scarcity and test-time distribution shift, and discover how these issues can lead to misleading conclusions and unexpected failures. Examine a framework that safely enhances sample efficiency of statistical inference procedures like conformal prediction and hypothesis testing by adaptively leveraging synthetic data from generative models, with distribution-free error control guarantees requiring no assumptions about synthetic data quality. See practical applications across diverse domains including protein structure prediction and win-rate evaluation of large reasoning models. Understand a new approach to test-time training grounded in sequential statistical testing, featuring conformal betting martingales for principled data drift detection and an anti-drift correction mechanism based on optimal transport principles. Discover how this mechanism forms the foundation of a self-training scheme that promotes invariance to dynamically changing environments, combining theoretical rigor with practical utility for trustworthy AI systems.
Syllabus
Thursday, November 6th, 2025, 10:30 AM, room C221
Taught by
HUJI Machine Learning Club