Why Pay Per Course When You Can Get All of Coursera for 40% Off?
10,000+ courses, Google, IBM & Meta certificates, one annual plan at 40% off. Upgrade now.
Get Full Access
Learn to identify and handle outliers in data science through a comprehensive lecture covering multiple detection and mitigation strategies. Explore the fundamental three-step approach of fitting models, calculating residuals, and pruning large residuals, while understanding when this method fails. Master robust statistical techniques including robust mean estimation, eigenvector pruning, geometric median, Tukey median, and median-of-means approaches. Discover advanced outlier detection methods such as DB-Scan clustering for outlier identification and reverse nearest neighbor algorithms to effectively manage anomalous data points in your datasets.