Courses from 1000+ universities
$7.2 billion in combined revenue since 2020. $8 billion in lost market value. This merger marks the end of an era in online education.
600 Free Google Certifications
Marketing
Cybersecurity
Machine Learning
Circuits and Electronics 1: Basic Circuit Analysis
Academic Writing Made Easy
Nutrition, Exercise and Sports
Organize and share your learning with Class Central Lists.
View our Lists Showcase
Explore best practices for optimizing Apache Spark jobs, including resource allocation, parameter tuning, and performance enhancement techniques. Learn to identify bottlenecks and implement solutions for improved efficiency.
Explore real-time forecasting at scale using Delta Lake and Delta Caching. Learn efficient data sampling, storage, and caching techniques for handling massive datasets and achieving rapid forecast response times.
Explore building a scalable, fault-tolerant streaming microservice architecture using Apache Spark, Kafka, gRPC, and Delta Lake for real-time data processing and reliable insights.
Explore productionizing machine learning using Apache Spark, MLflow, and ONNX with SQL Server. Learn model management, lifecycle orchestration, and leveraging SQL Server for ML model storage and deployment.
Explore Mars Petcare's data platform using Delta Lake and Spark ETL pipeline 'Kyte'. Learn about advantages over Azure Data Factory and leveraging Delta Lake for ETL configurations and data science.
Aprenda a aplicar Deep Reinforcement Learning em escala usando Spark e MLflow para personalização de jogos e aumento de engajamento, com dicas práticas para superar desafios de produção.
Explore Uber's Zeus: a highly scalable, distributed shuffle service powering data processing. Learn its architecture, integration with Spark, and performance advantages over traditional external shuffle methods.
Leveraging Apache Spark for predictive maintenance in aviation, enabling efficient data processing, anomaly detection, and component health scoring to improve fleet readiness and prevent failures.
Explore Spark's Catalyst Optimizer challenges, including UDF issues, codegen limitations, and JVM tuning. Learn to diagnose and solve complex query problems for improved Spark SQL performance.
Optimize data integration pipelines using Spark-Jobserver for faster execution, context reuse, and efficient resource management. Learn configuration, security, and API integration for improved performance.
Practical methods for balancing data encryption and analytics in Apache Spark, addressing CCPA and governance requirements while maintaining analytical value.
Explore Zalando's journey from centralized Data Lake to distributed Data Mesh architecture, focusing on data ownership, quality, and accessibility in large-scale e-commerce environments.
Learn to run Apache Spark jobs on Kubernetes, unifying analytics and data science on a cloud-native architecture. Discover how this approach simplifies infrastructure, enables on-demand deployment, and eliminates big data cluster overhead.
Explore LinkedIn's journey scaling Apache Spark, addressing infrastructure challenges, optimizing resource management, and enhancing user productivity through automated analysis and efficiency improvements.
Explore end-to-end machine learning with MLflow on Databricks: from data engineering to model deployment, using health data to predict life expectancy through Spark, hyperopt, and interactive dashboards.
Get personalized course recommendations, track subjects and courses with reminders, and more.