Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Coursera

Data Representation and Serialization

Packt via Coursera

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
This course features Coursera Coach! A smarter way to learn with interactive, real-time conversations that help you test your knowledge, challenge assumptions, and deepen your understanding as you progress through the course. Unlock the power of structured data in this hands-on course on data representation and serialization. You'll learn to confidently navigate a range of data formats like CSV, JSON, YAML, and XML, understanding when and how to use each one effectively. Whether you’re managing datasets or building data-driven applications, mastering these formats is essential for clean, efficient, and interoperable code. The course begins with CSV, diving into the basics before progressing to Python’s csv module and third-party tools like pandas and tablib. Next, you’ll explore JSON, comparing its structure to CSV, then manipulating JSON using Python's standard library and alternatives like simplejson and ujson. Each section provides hands-on experience to help reinforce the material. You’ll then move into YAML, exploring its relationship to JSON and learning to read and write YAML using the PyYAML package. Finally, you’ll tackle XML and HTML, using multiple Python tools including xml, lxml, xmltodict, and BeautifulSoup to parse, write, and analyze structured documents, along with important security considerations. This course is ideal for developers, data professionals, and students looking to build a strong foundation in working with structured data formats. Basic Python knowledge is recommended.

Syllabus

  • CSV
    • In this module, we will dive into the world of CSV files, exploring their structure, how to read and write them using Python’s built-in csv module, and tools like pandas and tablib for more efficient handling and conversion of CSV data.
  • JSON
    • In this module, we will explore JSON as a data representation format, learning how to create, manipulate, and store JSON files using Python's standard libraries and external packages like pandas and tablib.
  • YAML
    • In this module, we will dive into YAML, comparing it with other formats like JSON, and explore tools like PyYAML and tablib to work with YAML files and convert between different data formats.
  • XML and HTML
    • In this module, we will explore XML and HTML formats, learning how to work with them in Python through modules like xml, lxml, and xmltodict, as well as leveraging BeautifulSoup to handle HTML data.

Taught by

Packt - Course Instructors

Reviews

Start your review of Data Representation and Serialization

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.