Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore the powerful capabilities of HuggingFace Datasets for building efficient NLP pipelines in this 34-minute tutorial. Learn how to import, load, select, and write datasets from the extensive repository of over 1,400 high-quality language-focused datasets. Discover essential functions for data processing, including modifying dataset features, troubleshooting, batching, tokenization, and filtering. Gain practical insights into leveraging this essential tool for NLP practitioners to streamline your natural language processing workflows and enhance your projects.
Syllabus
Intro
Importing Datasets
Loading Datasets
Selecting Datasets
Writing Datasets
Dataset Features
Dataset Example
Modifying Dataset Features
Troubleshooting
Batching
Tokenization
Filtering
Taught by
James Briggs