Courses from 1000+ universities
$7.2 billion in combined revenue since 2020. $8 billion in lost market value. This merger marks the end of an era in online education.
600 Free Google Certifications
Marketing
Cybersecurity
Machine Learning
Circuits and Electronics 1: Basic Circuit Analysis
Academic Writing Made Easy
Nutrition, Exercise and Sports
Organize and share your learning with Class Central Lists.
View our Lists Showcase
Optimize exabyte-scale data lakes with open-source caching framework. Improve performance, reduce costs, and tackle challenges in complex data environments using Hadoop, Parquet, Hudi, and Alluxio.
Explore challenges and design of snapshot feature for object storage systems, focusing on Apache Ozone. Learn about benefits over object versioning and implementation details.
Explore four key technical value drivers of data lakehouses and how Apache Iceberg enhances these capabilities for data practitioners across various roles.
Explore Ozone's self-healing capabilities for managing massive data storage, from single bit flips to complete node failures, ensuring data integrity at every stage.
Explore key metrics for monitoring Kafka at scale, focusing on lag performance. Learn to interpret indicators, create dashboards, and set up proactive alerts for effective troubleshooting.
Explore Apache NiFi's latest features, processors, and best practices. Build efficient data flows using cutting-edge techniques, with tips and guides for optimal implementation.
Harness Generative AI Functions in Apache Kafka and Pulsar for real-time AI applications. Explore Vector Search, Computing Embeddings, and Chat Completion use cases with a live demonstration.
Explore efficient data streaming into medallion architecture using Apache Hudi. Learn about record-level indexing, fast upserts, and database-style change data capture for optimized incremental processing.
Optimize travel using real-time transit data from NYC's MTA. Learn to process and analyze public data feeds with Apache tools for efficient route planning and decision-making.
Learn to deploy and scale machine learning models efficiently using Apache Beam for distributed inference on CPUs and GPUs, with practical insights on parallelizing workloads.
Unpack critiques of Apache Airflow, analyze its strengths and weaknesses, debunk myths, and compare with competitors. Make informed decisions about workflow management platforms.
Explore unstructured data processing using ML, LLMs, and Apache tools. Learn to handle audio, images, and text with vector embeddings for natural language analysis in data engineering pipelines.
Explore the creation of an exabyte-scale Data Lakehouse using Apache Ozone. Learn about integration efforts, scalability benefits, and optimizations for high-performance queries and reduced costs.
Explore Apache Iceberg's REST catalog for unified, secure data access. Learn its benefits, migration process, and extended functionality beyond Spark and Flink. Gain insights for implementing REST catalog in your data infrastructure.
Explore semantic layers for efficient data management. Learn to define business views, query measures, and optimize performance using Apache Calcite's advanced features.
Get personalized course recommendations, track subjects and courses with reminders, and more.