Overview
In this 11-minute video presentation, Columbia PhD student Zachary Huang explains how SQL can be used to train machine learning models directly over data as it is organized in a database. He introduces JoinBoost, a lightweight Python library that rewrites tree training algorithms over normalized databases as pure SQL queries. The talk describes the mismatch between the data organization ML typically requires and the way databases structure data across multiple tables, and presents JoinBoost as a way to keep training inside a single, simplified data stack. It also covers JoinBoost's compatibility with various DBMSes and data stacks, and its reported speed and scalability advantages over specialized ML libraries such as LightGBM for random forests and gradient boosting.
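To make the idea of "tree training as SQL queries" concrete, here is a minimal sketch that picks one regression-tree split using only SQL aggregates over two normalized tables. It uses Python's built-in sqlite3 module and is not JoinBoost's API or its actual query plan: the tables (sales, stores), their columns, and the best_split helper are hypothetical names chosen for illustration, and the query joins the tables directly rather than using any optimization the library may apply.

```python
import sqlite3

# Hypothetical normalized schema: the target (revenue) lives in a fact
# table, while the feature (size) lives in a separate dimension table.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE stores (store_id INTEGER PRIMARY KEY, size INTEGER);
CREATE TABLE sales  (store_id INTEGER, revenue REAL);
INSERT INTO stores VALUES (1, 10), (2, 20), (3, 30);
INSERT INTO sales  VALUES (1, 1.0), (1, 2.0), (2, 3.0), (3, 10.0), (3, 12.0);
""")

# For every candidate threshold on stores.size, compute the count, sum,
# and sum of squares of the target for rows with size <= threshold.
SPLIT_STATS = """
SELECT t.size                     AS threshold,
       COUNT(*)                   AS n_left,
       SUM(s.revenue)             AS sum_left,
       SUM(s.revenue * s.revenue) AS sq_left
FROM (SELECT DISTINCT size FROM stores) AS t
JOIN stores AS d ON d.size <= t.size
JOIN sales  AS s ON s.store_id = d.store_id
GROUP BY t.size
ORDER BY t.size
"""


def best_split(conn):
    """Return (threshold, sse) minimizing the summed squared error of both sides."""
    n, total, sq = conn.execute(
        "SELECT COUNT(*), SUM(revenue), SUM(revenue * revenue) FROM sales"
    ).fetchone()
    best = None
    for threshold, n_l, s_l, q_l in conn.execute(SPLIT_STATS):
        n_r = n - n_l
        if n_r == 0:
            continue  # every row falls on the left, so this is not a split
        s_r = total - s_l
        q_r = sq - q_l
        # Squared error around the mean of each side: sum(y^2) - sum(y)^2 / n
        sse = (q_l - s_l * s_l / n_l) + (q_r - s_r * s_r / n_r)
        if best is None or sse < best[1]:
            best = (threshold, sse)
    return best


threshold, sse = best_split(conn)
print(threshold, sse)
```

The database does all of the aggregation, and Python only compares a handful of numbers per candidate threshold. That is why the data never needs to be exported or flattened into a single table before training.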
Syllabus
Introduction
Background
Example
Problem Statement
Taught by
Snorkel AI