Overview
Join this 55-minute AI seminar from Neural Magic's "Random Samples" series, where experts introduce SDG Hub, an open-source toolkit developed at Red Hat for customizing language models through synthetic data generation. Learn what synthetic data means for LLMs and explore SDG Hub's architecture, including prompts, blocks, and flows. Discover strategies for selecting an appropriate teacher model for a specific use case, such as reasoning or translation. Follow along with two practical demonstrations: building a document-grounded skill using pre-built pipelines, and customizing a reasoning model by creating new components. The session concludes with a demonstration of SDG Hub's new graphical interface, which enables non-experts to construct synthetic data pipelines visually. Access the original InstructLab paper at https://arxiv.org/abs/2403.01081 and explore the code repository at https://github.com/Red-Hat-AI-Innovation-Team/sdg_hub.
Syllabus
Random Samples: Synthetic Data Generation via SDG-Hub [May 2, 2025]
Taught by
Neural Magic