Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Hugging Face Course - Loading and Managing Custom Datasets - Chapter 5

HuggingFace via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn to work with custom datasets in the Hugging Face ecosystem through this comprehensive tutorial covering dataset loading, manipulation, and management techniques. Master the process of loading custom datasets from various sources and formats, then discover how to slice and dice your data efficiently for specific use cases. Explore the seamless integration between Hugging Face Datasets and popular DataFrame libraries like Pandas to leverage familiar data manipulation workflows. Understand memory mapping and streaming capabilities that enable you to work with large datasets without overwhelming your system resources. Gain practical skills in saving and reloading datasets for reproducible workflows and learn how to upload your processed datasets to the Hugging Face Hub for sharing with the community. Conclude by implementing text embeddings and semantic search functionality to unlock advanced natural language processing capabilities with your datasets.

Syllabus

Loading a custom dataset
Slice and dice a dataset
Datasets + DataFrames = ❤️
Saving and reloading a dataset
Memory mapping & streaming
Uploading a dataset to the Hub
Text embeddings & semantic search

Taught by

Hugging Face

Reviews

Start your review of Hugging Face Course - Loading and Managing Custom Datasets - Chapter 5

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.