

Unicode Normalization for NLP in Python

James Briggs via YouTube

Overview

This 15-minute video covers Unicode normalization for natural language processing (NLP) in Python. It shows how to handle font variants, social media text, and diacritics from European languages, all of which can trip up NLP models. It looks at the hidden properties of characters like 'Ç' and how they affect text processing, then uses Unicode normalization to make input text more consistent for NLP applications.
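As a rough illustration of the "hidden properties" of 'Ç' mentioned above, here is a minimal sketch using Python's standard `unicodedata` module. It is not taken from the video; the variable names are illustrative assumptions. It shows that 'Ç' can be stored either as one code point or as 'C' plus a combining cedilla, and that normalization makes the two forms compare equal.

```python
import unicodedata

# Two ways to store what renders as the same character.
composed = "\u00C7"      # 'Ç' as a single code point
decomposed = "C\u0327"   # 'C' followed by COMBINING CEDILLA

# Without normalization the strings differ, even though they look identical.
looks_equal = composed == decomposed

# NFC composes to the single code point; NFD decomposes into base + mark.
nfc_equal = unicodedata.normalize("NFC", decomposed) == composed
nfd_equal = unicodedata.normalize("NFD", composed) == decomposed

# The stored lengths differ: one code point versus two.
lengths = (len(composed), len(decomposed))
```

Normalizing all input to a single form before tokenization is one way to keep such variants from being treated as different tokens.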

Syllabus

Intro
Diacritics
Decomposition
Conversion
Normal Form
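The topics above can be loosely sketched with the four standard Unicode normal forms (NFC, NFD, NFKC, NFKD) exposed by `unicodedata.normalize`. This is an assumption about how the ideas fit together, not the video's own code; the helper `normal_forms` and the sample string are hypothetical. The sample uses mathematical bold letters, a common kind of "font variant" in social media text.

```python
import unicodedata

FORMS = ("NFC", "NFD", "NFKC", "NFKD")

def normal_forms(text: str) -> dict[str, str]:
    """Return the text under each of the four Unicode normal forms."""
    # Hypothetical helper for illustration only.
    return {form: unicodedata.normalize(form, text) for form in FORMS}

# "Hello" written with MATHEMATICAL BOLD letters.
fancy = "\U0001D407\U0001D41E\U0001D425\U0001D425\U0001D428"

forms = normal_forms(fancy)

# Canonical forms (NFC, NFD) leave the font variant untouched;
# compatibility forms (NFKC, NFKD) map it to plain letters.
plain = forms["NFKC"]
```

For text that mixes font variants and diacritics, NFKC is a common choice because it applies compatibility mappings and then recomposes characters, but the right form depends on the application.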

Taught by

James Briggs
