Google AI Professional Certificate - Learn AI Skills That Get You Hired
Get 20% off all career paths from fullstack to AI
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore machine learning techniques for handling non-curated data in this 43-minute EuroPython Conference talk. Delve into practical solutions for two common dirty-data problems: missing values and non-normalized entries. Learn how to implement standard machine learning tools like scikit-learn when dealing with these data errors. Discover the importance of imputation and adding missingness indicators for handling missing values, and understand how to create vectorial representations for non-normalized categories. Gain insights from theoretical analyses and recent machine learning publications to improve your data science workflow and efficiency when working with imperfect datasets.
Syllabus
Gael Varoquaux - Machine learning on non curated data
Taught by
EuroPython Conference