Future-Proof Your Career: AI Manager Masterclass
PowerBI Data Analyst - Create visualizations and dashboards from scratch
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Learn to evaluate and compare four different local Large Language Models using MLflow and Ollama for financial news sentiment analysis tasks. Set up a comprehensive evaluation framework using a custom financial news dataset to determine which model performs best for your specific use case. Begin by exploring the dataset and configuring your notebook environment, then design effective LLM prompts and implement structured output formatting using Pydantic for consistent model responses. Develop evaluation metrics and create an automated evaluation loop to systematically test each model's performance on sentiment classification tasks. Conclude by reviewing and analyzing the experimental results through MLflow's interface to identify the optimal model for financial sentiment analysis applications.
Syllabus
00:00 - Welcome
01:12 - Notebook setup and dataset review
04:16 - LLM prompt and structured output with Pydantic
07:50 - Evaluation metrics and loop
13:47 - Review experiments in MLflow
Taught by
Venelin Valkov