Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Synthetic Data Generation via SDG-Hub - Random Samples

Neural Magic via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Join this 55-minute AI seminar from Neural Magic's "Random Samples" series where experts introduce SDG Hub, an open-source toolkit developed at Red Hat for customizing language models through synthetic data generation. Learn what synthetic data means for LLMs and explore SDG Hub's architecture including prompts, blocks, and flows. Discover strategies for selecting appropriate teacher models based on specific use cases like reasoning or translation. Follow along with two practical demonstrations: building a document-grounded skill using pre-built pipelines and customizing a reasoning model by creating new components. The session concludes with a demonstration of SDG Hub's new graphical interface that enables non-experts to visually construct synthetic data pipelines. Access the original InstructLab Paper at https://arxiv.org/abs/2403.01081 and explore the code repository at https://github.com/Red-Hat-AI-Innovation-Team/sdg_hub.

Syllabus

Random Samples: Synthetic Data Generation via SDG-Hub [May 2, 2025]

Taught by

Neural Magic

Reviews

Start your review of Synthetic Data Generation via SDG-Hub - Random Samples

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.