Class Central

Rankings
Career Certificates

Subjects

View all

Technology
Business
Creative
STEM
Health & Wellness
People & Society
Personal Growth & Lifestyle

View all Subjects

Universities
The Report

Courses from 1000+ universities

Rankings

Best Courses

Best of All Time
Best of the Year

Most Popular Courses

Most Popular of All Time
Most Popular of the Year

Career Certificates

Generative AI
UX Design
Data Science
Finance
DevOps
Project Management
View all Career Certificates

Technology

Computer Science
Artificial Intelligence
Data Science
Web Development
Programming
Databases
IT & Networking
DevOps
UX/UI Design
Blockchain & Cryptocurrency
Extended Reality (XR)
Generative AI
Cloud Computing
Cybersecurity
Game Development
Product Management
View all Technology

Business

Management & Leadership
Finance
Entrepreneurship
Marketing
Strategic Management
Project Management
Sales
Business Software
Real Estate
Professional Development
Accounting
Supply Chain Management
Business Strategy
Human Resources
View all Business

Creative

Music
Digital Media
Visual Arts
Design
Crafts
Performing Arts
AI Art
View all Creative

STEM

Mathematics
Engineering
Science
Statistics & Probability
Chemistry
Physics
Environmental Science
Astronomy
Biology
Agriculture
View all STEM

Health & Wellness

Disease & Disorders
Public Health
Health Care
Nursing
Mental Health
Continuing Medical Education (CME)
Medicine
Wellness
Nutrition
View all Health & Wellness

People & Society

Humanities
Education & Teaching
Social Sciences
History
Literature
Sociology
Economics
Psychology
Anthropology
Political Science
Law
Language Learning
Writing
Philosophy
Religion
View all People & Society

Personal Growth & Lifestyle

Personal Development
Sports & Recreation
Personal Finance
Parenting & Family
Food & Drink
Self-Defense & Martial Arts
Gardening
Productivity & Time Management
Games
Study Skills
Travel
Pets & Pet Care
Beauty & Makeup
Home & Living
Critical Thinking
View all Personal Growth & Lifestyle

The Report

Banning Telegram Can’t Stop Paper Leaks: Inside India’s NEET Exam Scandal

India banned Telegram after the NEET paper leak led to a retest for 2.28 million students. Class Central studied the scam, the money trail, and other platforms the leaks could move to.

Manoj Sharma Jul 29, 2026

Latest

5 Best AWS Security Courses for 2026: IAM to SCS-C03
10 Best Embroidery Courses for 2026: Begin With Basics, Expand Your Skills
Top 9 AI Governance Courses: Safe, Ethical, and Legal AI Deployment
Coursera Lays Off ~150 Staff After Udemy Merger
9 Best React Native Courses for 2026: Learn the Expo Workflow

Write for The Report

Visit The Report

600 Free Google Certifications
Trending

Most common

Popular subjects

Artificial Intelligence
34,635 courses
Language Learning
3,555 courses
Data Analysis
13,181 courses

Popular courses

Mathematical and Computational Methods
Georgetown University
AP® Microeconomics
Massachusetts Institute of Technology
Competitive Strategy
Ludwig-Maximilians-Universität München

Organize and share your learning with Class Central Lists.

View our Lists Showcase

0 Reviews

Start learning

Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Technology
Artificial Intelligence
Machine Learning

Technology
Artificial Intelligence
Natural Language Processing (NLP)

Technology
Artificial Intelligence
Machine Learning
Model Evaluation
Model Selection

Technology
Artificial Intelligence
Machine Learning
Fine-Tuning

Technology
Data Science
Data Analysis

Technology
Artificial Intelligence
Machine Learning

Deconstructing Text Embedding Models - Understanding Tokenizers and Model Selection

EuroPython Conference via YouTube

Write review

Start learning Write review

Details

Start learning

Provider

YouTube
Pricing

Free Video
Languages

English
Effort

44 minutes
Sessions

Self-Paced
Level

Advanced

Found in

Machine Learning Courses
Natural Language Processing (NLP) Courses
Model Selection Courses
Fine-Tuning Courses
Data Analysis Courses

Explore the intricacies of text embedding models in this 44-minute EuroPython Conference talk. Delve into the critical role of tokenizers in model selection, moving beyond reliance on benchmarks like the Massive Text Embedding Benchmark (MTEB). Learn to assess model suitability for specific datasets based on tokenizer performance, and discover strategies for optimizing tokenizers during the fine-tuning process of embedding models. Gain insights into making informed decisions when choosing text embedding models for unique data characteristics.

Syllabus

Deconstructing the text embedding models — Kacper Łukawski

Taught by

EuroPython Conference

Related Courses

Ad

AI, Data Science & Cloud Certificates from Google, IBM & Meta

Learn More →
Fine-tuning Tiny LLM for Sentiment Analysis - TinyLlama and LoRA on a Single GPU
Fine-tuning Multi-modal Video and Text Models
Learn Hugging Face by Building a Custom AI Model
Pretraining LLMs
Generative AI Text and Multimodal Embedding Models for Real-World Use Cases