Overview
Syllabus
01:08 - Download Dataset
01:43 - Solving Big Data Problems with GPU Processing
02:46 - Google Colab Setup with Free T4 GPU
03:02 - Local Setup with NVIDIA GPU
03:43 - RAPIDS Installation Guide
05:07 - Solving Jupyter Kernel Crash with cuDF Pandas
05:29 - Handling Missing Values
05:53 - Detect Missing Values
06:29 - Replace with Zero
07:31 - Replace with Mean
08:57 - Investigate Columns with Ambiguous Names
11:21 - Drop Columns If No Other Option
12:01 - Split Data For Training & Testing
12:07 - Shuffle Data
13:39 - Features & Targets Split
14:02 - Train & Test Split
16:20 - Load XGBoost Model on GPU
17:55 - Train XGBoost Model
18:08 - Test XGBoost Model and Get Predictions
18:45 - Solve ValueError: DataFrame.dtypes must be int, float, bool or category
20:15 - Evaluate Trained Model
22:39 - Data Optimization & Anomalies
22:41 - Detect Data Anomalies with Aggregation
23:47 - Solve XGBoostError: No GPU Memory Left (with RMM)
25:04 - Handle Negative Charges and Unrealistic Distances
28:19 - Detect and Handle Unrealistic Transactions
30:28 - Second Train Run on Optimized Data
31:45 - Best Practices
31:45 - Plot Training Results & Feature Importance
32:17 - Hyperparameter Tuning
32:49 - Date Extraction: From String to Int or Category
33:05 - K-Fold Validation
33:45 - Thanks for Watching!
Taught by
Python Simplified