Hugging Face Course - Fast Tokenizers and Token Classification Pipelines - Chapter 6
HuggingFace via YouTube
Syllabus
Why are fast tokenizers called fast?
Fast tokenizer superpowers
Inside the Token classification pipeline (PyTorch)
Inside the Token classification pipeline (TensorFlow)
Inside the Question answering pipeline (PyTorch)
Inside the Question answering pipeline (TensorFlow)
Training a new tokenizer
What is normalization?
What is pre-tokenization?
Byte Pair Encoding Tokenization
WordPiece Tokenization
Unigram Tokenization
Building a new tokenizer
Taught by
Hugging Face