This course features Coursera Coach!
A smarter way to learn with interactive, real-time conversations that help you test your knowledge, challenge assumptions, and deepen your understanding as you progress through the course.
Unlock the power of structured data in this hands-on course on data representation and serialization. You'll learn to confidently navigate a range of data formats like CSV, JSON, YAML, and XML, understanding when and how to use each one effectively. Whether you’re managing datasets or building data-driven applications, mastering these formats is essential for clean, efficient, and interoperable code.
The course begins with CSV, diving into the basics before progressing to Python’s csv module and third-party tools like pandas and tablib. Next, you’ll explore JSON, comparing its structure to CSV, then manipulating JSON using Python's standard library and alternatives like simplejson and ujson. Each section provides hands-on experience to help reinforce the material.
You’ll then move into YAML, exploring its relationship to JSON and learning to read and write YAML using the PyYAML package. Finally, you’ll tackle XML and HTML, using multiple Python tools including xml, lxml, xmltodict, and BeautifulSoup to parse, write, and analyze structured documents, along with important security considerations.
This course is ideal for developers, data professionals, and students looking to build a strong foundation in working with structured data formats. Basic Python knowledge is recommended.
Overview
Syllabus
- CSV
- In this module, we will dive into the world of CSV files, exploring their structure, how to read and write them using Python’s built-in csv module, and tools like pandas and tablib for more efficient handling and conversion of CSV data.
- JSON
- In this module, we will explore JSON as a data representation format, learning how to create, manipulate, and store JSON files using Python's standard libraries and external packages like pandas and tablib.
- YAML
- In this module, we will dive into YAML, comparing it with other formats like JSON, and explore tools like PyYAML and tablib to work with YAML files and convert between different data formats.
- XML and HTML
- In this module, we will explore XML and HTML formats, learning how to work with them in Python through modules like xml, lxml, and xmltodict, as well as leveraging BeautifulSoup to handle HTML data.
Taught by
Packt - Course Instructors