Hugging Face Course - Fast Tokenizers and Token Classification Pipelines - Chapter 6
HuggingFace via YouTube
Lead AI Strategy with UCSB's Agentic AI Program — Microsoft Certified
Stuck in Tutorial Hell? Learn Backend Dev the Right Way
Overview
Syllabus
Why are fast tokenizers called fast?
Fast tokenizer superpowers
Inside the Token classification pipeline (PyTorch)
Inside the Token classification pipeline (TensorFlow)
Inside the Question answering pipeline (PyTorch)
Inside the Question answering pipeline (TensorFlow)
Training a new tokenizer
What is normalization?
What is pre-tokenization?
Byte Pair Encoding Tokenization
WordPiece Tokenization
Unigram Tokenization
Building a new tokenizer
Taught by
Hugging Face