Advanced Dimensionality Reduction - t-SNE vs UMAP vs PCA Deep Dive
DigitalSreeni via YouTube
-
19
-
- Write review
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore advanced dimensionality reduction techniques in this comprehensive tutorial that moves beyond basic PCA to tackle complex data visualization challenges. Master the theory and implementation of t-SNE and UMAP for revealing hidden patterns in high-dimensional datasets through non-linear methods that outperform traditional linear approaches. Learn the fundamental concepts of probability distributions and neighbor embedding in t-SNE, including critical perplexity parameter tuning within the 5-50 range and identifying optimal scenarios for cluster separation. Dive deep into UMAP's topological data analysis foundations, understanding how n_neighbors and min_dist parameters affect results and why UMAP uniquely preserves both local and global data structure. Apply these techniques to a practical breast cancer classification case study using 569 tumor samples with 30 features, with methods applicable to diverse domains including customer segmentation, genomics, image analysis, text mining, and social networks. Gain hands-on experience with complete Python implementations featuring parameter tuning with visual comparisons, clustering performance evaluation using silhouette scores, side-by-side method comparisons, runtime and scalability analysis, and creation of publication-quality visualizations.
Syllabus
371 - Advanced Dimensionality Reduction: t-SNE vs UMAP vs PCA Deep Dive
Taught by
DigitalSreeni