Explore large-scale geospatial indexing and analysis using Apache Spark, covering challenges, frameworks, data structures, and practical applications in cloud-first architecture with Databricks.
Explore an open-source platform for scalable model serving and monitoring using KFServing, Hopsworks Feature Store, and Spark Streaming. Learn continuous data drift detection and see a live demonstration.
Learn to perform funnel analysis at scale using Apache Spark, Druid, and DataSketches. Discover techniques for measuring campaign effectiveness and analyzing user behavior in chronological order for large-scale advertising campaigns.
Introduction to Dagster: A data orchestrator for the entire application lifecycle. Covers principles, development, deployment, and monitoring. Includes demo and code snippets showcasing Dagit UI and Dagster programming model.
Learn to efficiently distribute hyperparameter tuning workloads using Apache Spark, avoiding common pitfalls and optimizing performance. Explore best practices with Hyperopt and joblib-spark for scaling ML model training.
Explore Zillow's centralized platform for data quality, enabling stakeholders to define expectations, perform validations, and monitor data health across complex organizations.
Learn how small files impact data pipeline performance, understand how cloud storage and Apache Spark interact, and discover Delta Lake solutions for handling massive datasets more efficiently.
Explore how Databricks leverages Amundsen for efficient data discovery, improving productivity by surfacing relevant datasets, SQL dashboards, and metadata programmatically.
Explore automated testing for chatbots and voice assistants, covering key challenges, best practices, and hands-on experience with Botium to ensure AI-powered conversational interfaces perform as intended.
Explore enterprise-level security practices for Azure Databricks, covering RBAC, network isolation, and Azure-specific features to deploy and manage secure analytics and AI environments.
Discover a CI/CD-driven approach to automated metadata management in data lakes, focusing on balancing development speed, governance, and schema evolution for rapidly growing organizations.
Explore Data Mesh architecture, its principles, benefits, and implementation challenges. Learn about open-source tools like Apache Spark for building effective Data Mesh systems in real-world scenarios.
Master essential SQL skills for data science: statistics, data prep, advanced filtering, window functions, and analytics tool integration.
Discover how Samsung SmartSSD® with Xilinx FPGAs accelerates Spark, offering 2x-8x performance gains without code changes. Learn about the technology, benchmarks, and TCO savings.
Explore Photon, a C++ component for efficient query execution in Databricks Lakehouse. Learn about its vectorized processing, compatibility with Apache Spark, and performance enhancements for data-driven decision-making.