Apache Spark on Kubernetes - Lessons Learned from Launching Millions of Spark Executors
Databricks via YouTube
AI, Data Science & Business Certificates from Google, IBM & Microsoft
Power BI Fundamentals - Create visualizations and dashboards from scratch
Overview
Syllabus
Intro
Data Platform
Elastic Self Service Spark
Code to Deployment
Security
Monitoring
Orchestration Architecture
Varying Workload Pattern
One Interface over Multi-Cloud
Optimize Kubernetes for Spark Workload
Granular Concurrency Check at Orchestration
Avoid Partially Running Applications
Timeout Partially Running Applications
Mitigate Cluster Storage Stress
Utilization-based Allocation Recommendation
Dynamic Allocation
Push-button Cloud Management
Scale up Spark on Kubernetes
Taught by
Databricks