Overview
Learn to evaluate and compare four different local Large Language Models using MLflow and Ollama for financial news sentiment analysis tasks. Set up a comprehensive evaluation framework using a custom financial news dataset to determine which model performs best for your specific use case. Begin by exploring the dataset and configuring your notebook environment, then design effective LLM prompts and implement structured output formatting using Pydantic for consistent model responses. Develop evaluation metrics and create an automated evaluation loop to systematically test each model's performance on sentiment classification tasks. Conclude by reviewing and analyzing the experimental results through MLflow's interface to identify the optimal model for financial sentiment analysis applications.
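As a rough illustration of the evaluation loop described above, the following sketch scores several "models" on a tiny labelled set and picks the one with the highest accuracy. It is not the course's code. Everything in it is an assumption made for illustration: the three label names, the sample headlines, the JSON reply shape, and the stand-in model functions. To keep it runnable with only the standard library, a dataclass plus `json` replaces the Pydantic schema, plain Python callables replace calls to local models served by Ollama, and a returned dictionary replaces logging to MLflow.

```python
import json
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

# Assumed label set; the course's dataset may use different labels.
LABELS = {"positive", "negative", "neutral"}


@dataclass(frozen=True)
class SentimentPrediction:
    """Stand-in for a Pydantic model with a single `sentiment` field."""
    sentiment: str


def parse_prediction(raw: str) -> Optional[SentimentPrediction]:
    """Parse a model's JSON reply; return None if it is malformed or off-label."""
    try:
        label = str(json.loads(raw)["sentiment"]).strip().lower()
    except (json.JSONDecodeError, KeyError, TypeError):
        return None
    return SentimentPrediction(label) if label in LABELS else None


def evaluate(
    models: Dict[str, Callable[[str], str]],
    dataset: List[Tuple[str, str]],
) -> Dict[str, float]:
    """Return accuracy per model; unparseable replies count as wrong."""
    scores: Dict[str, float] = {}
    for name, generate in models.items():
        correct = 0
        for headline, expected in dataset:
            prediction = parse_prediction(generate(headline))
            if prediction is not None and prediction.sentiment == expected:
                correct += 1
        scores[name] = correct / len(dataset)
    return scores


# Hypothetical stand-ins for local LLMs: each takes a headline, returns raw text.
def keyword_model(headline: str) -> str:
    text = headline.lower()
    if "surge" in text or "beats" in text:
        label = "positive"
    elif "falls" in text or "misses" in text:
        label = "negative"
    else:
        label = "neutral"
    return json.dumps({"sentiment": label})


def always_neutral_model(headline: str) -> str:
    return '{"sentiment": "neutral"}'


def broken_model(headline: str) -> str:
    # Simulates a model that ignores the requested JSON format.
    return "I think it is positive"


# Made-up sample headlines, not taken from the course dataset.
DATASET = [
    ("Shares surge after company beats earnings forecast", "positive"),
    ("Stock falls as retailer misses revenue target", "negative"),
    ("Central bank leaves interest rates unchanged", "neutral"),
    ("Chipmaker rallies on strong demand outlook", "positive"),
]

MODELS = {
    "keyword": keyword_model,
    "always_neutral": always_neutral_model,
    "broken": broken_model,
}

scores = evaluate(MODELS, DATASET)
best = max(scores, key=scores.get)
print(scores, "best:", best)
```

Counting unparseable replies as errors, rather than skipping them, means a model that often breaks the output format is penalized in the comparison. In a real setup each per-model accuracy would typically be recorded as a metric in its own experiment run so the results can be compared side by side.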
Syllabus
00:00 - Welcome
01:12 - Notebook setup and dataset review
04:16 - LLM prompt and structured output with Pydantic
07:50 - Evaluation metrics and loop
13:47 - Review experiments in MLflow
Taught by
Venelin Valkov