Optimizing Spark SQL Jobs with Parallel and Asynchronous IO

Databricks via YouTube

Details

Provider: YouTube
Pricing: Free Video
Languages: English
Effort: 21 minutes
Sessions: Self-Paced
Level: Advanced

Found in

Apache Spark Courses
Artificial Intelligence Courses
Databases Courses
Big Data Courses
Data Processing Courses
Performance Tuning Courses
Cluster Computing Courses
Parquet Courses

This 21-minute Databricks conference talk covers techniques for optimizing Spark SQL jobs on large-scale big data clusters using parallel and asynchronous I/O. It walks through file-level and row-group-level parallel reads, asynchronous spill optimization, and a Parquet column family design, including how each feature is implemented. The techniques are presented as accelerating Apache Spark jobs on exabyte-scale (EB-level) data platforms, potentially improving end-to-end performance by 5% to 30%.
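To make the sequential versus parallel read idea concrete, here is a minimal Python sketch. It is not the implementation from the talk and does not use Spark or Parquet: the fixed-size chunks in a temporary file stand in for row groups, and the helper names (`write_demo_file`, `read_range`, `read_sequential`, `read_parallel`, `demo`) are hypothetical, chosen only for illustration.

```python
import os
import tempfile
from concurrent.futures import ThreadPoolExecutor


def write_demo_file(path, num_groups=4, group_size=1024):
    """Write fixed-size chunks (stand-ins for row groups).

    Returns the (offset, length) of each chunk.
    """
    ranges = []
    with open(path, "wb") as f:
        for i in range(num_groups):
            ranges.append((f.tell(), group_size))
            f.write(bytes([i]) * group_size)
    return ranges


def read_range(path, offset, length):
    # Each call opens its own handle so concurrent reads
    # never share a file position.
    with open(path, "rb") as f:
        f.seek(offset)
        return f.read(length)


def read_sequential(path, ranges):
    # One chunk at a time: each read waits for the previous one.
    return [read_range(path, off, ln) for off, ln in ranges]


def read_parallel(path, ranges, max_workers=4):
    # Hand all ranges to a thread pool so reads can overlap;
    # map() returns results in input order.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(lambda r: read_range(path, *r), ranges))


def demo():
    # Both strategies should return identical chunks.
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "demo.bin")
        ranges = write_demo_file(path)
        return read_sequential(path, ranges) == read_parallel(path, ranges)
```

On a local disk the two versions will likely perform about the same. The overlap matters mainly when each read has high latency, as with remote or distributed storage.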

Syllabus

Introduction
Why Does IO Matter
Parquet
Spiral Circles
Sequential vs Parallel IO
Group Level Parallel IO
Column Family Parallel IO
Asynchronous Spill

Taught by

Databricks

Related Courses

Recent Parquet Improvements in Apache Spark - Vectorized Complex Types and Column Index Support
The Apache Spark File Format Ecosystem - Optimizing Storage for Performance
Accelerating Data Processing in Spark SQL with Pandas UDFs - Optimization Techniques
Optimizing Apache Spark and SQL for Improved Performance
Optimizing Spark and Cloud Data Storage for Analytics