Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

University of Colorado Boulder

BiteSize Python: NumPy and Pandas

University of Colorado Boulder via Coursera

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
This course delves into advanced data structures in Python, focusing on the powerful capabilities of the NumPy and Pandas libraries. It introduces the ndarray, a multidimensional array object provided by NumPy, enabling efficient storage and manipulation of large datasets. Additionally, learners will explore the Series and DataFrame structures offered by Pandas, which facilitate data analysis and manipulation in a more user-friendly manner. Throughout the course, students will engage in practical exercises and case studies to reinforce their understanding of how these advanced data structures can be applied in real-world scenarios.

Syllabus

  • ndarray
    • This module introduces the ndarray, the core data structure of the NumPy library that allows for efficient manipulation of large, multi-dimensional arrays. It begins with an overview of what an ndarray is and compares its capabilities to Python's built-in list data structure. The module then covers how to create ndarray objects, access and manipulate both 1D and 2D arrays, and perform various operations on these arrays. By the end of this module, learners will gain a solid understanding of how to effectively use ndarray for numerical and data analysis tasks.
  • NumPy
    • This module delves deeper into the NumPy library, focusing on its powerful features and functionalities. It covers universal functions (ufuncs) that allow for element-wise operations on ndarray, enabling efficient computation across large datasets. The module also explores various statistical methods available in NumPy, linear algebra operations for solving mathematical problems, random number generation for simulations and modeling, and masking techniques for filtering data. By the end of this module, learners will be equipped with the skills to leverage NumPy's capabilities for advanced numerical analysis.
  • Series
    • This module introduces the Series data structure in Pandas, which is a one-dimensional labeled array capable of holding any data type. It begins by defining what a Series is and its significance in data analysis. The module covers various methods to create a Series, including using lists, dictionaries, and NumPy arrays. Learners will also explore how to access and manipulate elements within a Series, as well as perform mathematical operations on Series data. By the end of this module, students will understand how to utilize Series for effective data manipulation and analysis.
  • DataFrame
    • This module introduces the DataFrame data structure in Pandas, which is a two-dimensional labeled data structure that can hold heterogeneous data types. The module begins by defining what a DataFrame is and its significance in data analysis and manipulation. Learners will explore various methods to create DataFrames from sources such as dictionaries, lists, and external files (e.g., CSV). The module covers how to access data within a DataFrame using labels and indices, manipulate rows and columns, and perform operations such as merging and concatenating multiple DataFrames. By the end of this module, students will be proficient in utilizing DataFrames for data manipulation tasks.
  • Pandas
    • This module provides an in-depth exploration of the Pandas library, which is essential for data manipulation and analysis in Python. It starts with an overview of what Pandas is and its significance in data science. The module highlights useful functionalities within Pandas, including data loading, cleaning, and preparation. Learners will examine how to generate descriptive statistics for both numerical and categorical columns, use the groupby() method for data aggregation, and handle missing and duplicate values effectively. By the end of this module, students will have a solid understanding of how to leverage Pandas for comprehensive data analysis.

Taught by

Di Wu

Reviews

Start your review of BiteSize Python: NumPy and Pandas

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.