Overview
Learn what happens inside Hugging Face's pipeline function and build a solid foundation in transformer models and tokenization, with parallel PyTorch and TensorFlow implementations throughout. This tutorial shows what goes on behind the scenes when you call a pipeline, how to instantiate a transformer model from scratch, and how word-based, character-based, and subword-based tokenization approaches differ. It then walks through the full tokenization pipeline and covers practical techniques for batching inputs together in both frameworks, giving you the foundational knowledge needed to work with transformer models on natural language processing tasks.
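The three stages hidden inside a pipeline call (tokenize, model forward pass, post-process) can be sketched in plain Python. This is a toy illustration only: the vocabulary and the stand-in "model" below are invented for the example, whereas the real library uses pretrained tokenizer and model classes.

```python
import math

# Invented toy vocabulary for illustration; real tokenizers load a
# pretrained vocabulary with tens of thousands of entries.
VOCAB = {"[UNK]": 0, "i": 1, "love": 2, "this": 3, "movie": 4}

def tokenize(text):
    # Stage 1: word-based tokenization into integer input IDs
    return [VOCAB.get(word, VOCAB["[UNK]"]) for word in text.lower().split()]

def dummy_model(input_ids):
    # Stage 2: stand-in for a transformer forward pass producing raw logits
    score = sum(input_ids) / max(len(input_ids), 1)
    return [score, -score]  # two-class logits

def softmax(logits):
    # Stage 3: post-processing, converting logits to probabilities
    exps = [math.exp(x) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

ids = tokenize("I love this movie")
probs = softmax(dummy_model(ids))
```

The same preprocess → forward → postprocess structure underlies both the PyTorch and TensorFlow lessons in the syllabus below; only the tensor framework changes.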
Syllabus
What happens inside the pipeline function? (PyTorch)
What happens inside the pipeline function? (TensorFlow)
Instantiate a Transformers model (PyTorch)
Instantiate a Transformers model (TensorFlow)
Tokenizers Overview
Word-based tokenizers
Character-based tokenizers
Subword-based tokenizers
The tokenization pipeline
Batching inputs together (PyTorch)
Batching inputs together (TensorFlow)
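The batching lessons above center on one idea: sequences of different lengths must be padded to a common length, with an attention mask marking which positions are real tokens. A minimal sketch of that idea, using invented token IDs (a real tokenizer produces this when called with padding enabled):

```python
# Toy sketch of batching variable-length token sequences.
# PAD_ID and the example IDs are invented for illustration.
PAD_ID = 0

def pad_batch(sequences):
    """Pad every sequence to the longest length in the batch.

    Returns padded input IDs and an attention mask where
    1 marks a real token and 0 marks padding.
    """
    max_len = max(len(seq) for seq in sequences)
    input_ids = [seq + [PAD_ID] * (max_len - len(seq)) for seq in sequences]
    attention_mask = [
        [1] * len(seq) + [0] * (max_len - len(seq)) for seq in sequences
    ]
    return input_ids, attention_mask

ids, mask = pad_batch([[101, 7, 8, 102], [101, 9, 102]])
```

The mask lets the model's attention layers ignore padding positions, so padded and unpadded runs of the same sentence produce the same result.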
Taught by
Hugging Face