The Most Addictive Python and SQL Courses
Power BI Fundamentals - Create visualizations and dashboards from scratch
Overview
Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Learn how to build and deploy a generative AI system for creating health insurance appeals from denials in this conference talk from GOTO Chicago 2024. Explore the complete journey from initial model fine-tuning attempts through production deployment, covering the technical challenges and solutions encountered when developing a real-world machine learning application. Discover the iterative process of model development, including what approaches worked and failed during fine-tuning experiments, and examine the middleware architecture that integrates both in-house and external models. Understand the critical privacy protection measures implemented in the system and see the frontend implementation challenges. Follow along with a live demonstration of the appeal generation system in action, and gain insights into optimization strategies for future model iterations. The presentation addresses the practical aspects of scaling an ML system from concept to production, including model serving infrastructure, configuration management, and deployment considerations. Whether interested in the healthcare application domain or the broader technical challenges of productionizing generative AI systems, gain valuable insights into the end-to-end development process of a mission-critical AI application designed to help individuals navigate insurance claim denials.
Syllabus
00:00 Intro
00:25 Content warning
03:38 What's the problem?
06:13 How to use computers to fix the problem?
07:13 What's needed to tune a LLM?
10:36 Putting it together
11:38 Downsides
13:04 What models did we make?
14:21 Model fine tune config
15:40 Model serving
18:38 Frontend?
19:46 Demo
25:18 How to do better for the next model?
27:55 Links
28:40 Outro
Taught by
GOTO Conferences