Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Test-Time Compute Explained - Benchmarking and Optimizing AI Agents

Nvidia via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn how to benchmark and optimize AI agents using test-time compute scaling techniques through NVIDIA's NeMo Agent Toolkit in this comprehensive tutorial. Discover how agents can enhance performance by sampling tools, testing multiple trajectories, and re-running functions to improve workflow outcomes. Explore the open-source NeMo Agent Toolkit, a framework-agnostic library that provides profiling, evaluation, and optimization capabilities for connected AI agent systems. Master the step-by-step process of implementing the test-time compute module, including searching, editing, scoring, and selection strategies. Understand the fundamentals of test-time compute and get an overview of the test-time compute module's architecture and capabilities. Learn to write custom scoring, searching, editing, and selection strategies tailored to your specific use cases. Discover how to make your custom strategies discoverable within the NeMo Agent Toolkit ecosystem for seamless integration. Follow along with a practical RAG (Retrieval-Augmented Generation) example that demonstrates real-world application of these concepts. Explore the prebuilt test-time compute functions available in the toolkit to accelerate your development process. Gain hands-on experience with configuring existing LLMs and tools to leverage these advanced optimization techniques for improved AI agent performance.

Syllabus

00:00 – Test Time Compute Explained
02:12 – Overview of Test Time Compute Module
3:18 – How to Write a Custom Scoring, Searching, Editing or Selection Strategy
6:13 – Make Your Strategy Discoverable in NeMo Agent Toolkit
7:43 – RAG Example
10:00 - Prebuilt TTC Functions

Taught by

NVIDIA Developer

Reviews

Start your review of Test-Time Compute Explained - Benchmarking and Optimizing AI Agents

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.