Overview
Explore HuggingFace Datasets for building efficient NLP pipelines in this 34-minute tutorial. Learn how to import, load, select, and write datasets from a repository of over 1,400 language-focused datasets. Cover the core data-processing functions, including modifying dataset features, troubleshooting, batching, tokenization, and filtering. The tutorial is aimed at NLP practitioners who want to streamline their natural language processing workflows.
Syllabus
Intro
Importing Datasets
Loading Datasets
Selecting Datasets
Writing Datasets
Dataset Features
Dataset Example
Modifying Dataset Features
Troubleshooting
Batching
Tokenization
Filtering
Taught by
James Briggs