Gain hands-on experience with Python to collect data from websites and automate the processing of many pages using loops. This course covers HTML and CSS basics, core Python concepts, practical web scraping activities, data storage and scheduling methods, and real-world case studies showing how scraped data can be used effectively.
Overview
Syllabus
Section 1
Introduction to HTML/CSS (Online)
- How a website is served
- Anatomy of a web page
- HTML tags
- IDs
- CSS classes
Python Part 1
- Integers + Floats
- Variables
- Strings
- Methods + Functions
Section 2
Webscraping Exercises Part 1
- Introduction to using packages
- The Requests package
- Performing a GET request
- String manipulation
- Debugging a Python script
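As a rough illustration of the string-manipulation step, the sketch below pulls a page title out of raw HTML text. The sample HTML and the helper name extract_between are assumptions made for this example; in the exercise, the text would typically come from a GET request, for instance requests.get(url).text.

```python
# Sample HTML standing in for the text of a GET response (hypothetical page).
html = "<html><head><title>Daily Prices</title></head><body><p>Hello</p></body></html>"


def extract_between(text, start_marker, end_marker):
    """Return the text between two markers, or None if either is missing."""
    start = text.find(start_marker)
    if start == -1:
        return None
    start += len(start_marker)  # move past the opening marker
    end = text.find(end_marker, start)
    if end == -1:
        return None
    return text[start:end].strip()


title = extract_between(html, "<title>", "</title>")
print(title)
```

Returning None when a marker is missing, rather than slicing with a -1 index, avoids a silent bug that often comes up when debugging a first scraper.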
Python Part 2
- Booleans
- Conditional Statements
- Lists
- Indexing into strings + lists
- Looping over lists
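The topics in this part fit together naturally in a small cleaning loop. The following is only a sketch with made-up price strings, not material taken from the course.

```python
# Made-up sample values, as they might look after scraping a page.
prices = ["$4.50", "$12.00", "N/A", "$7.25"]

cleaned = []
for raw in prices:
    # Indexing into a string gives its first character; the comparison is a boolean.
    is_price = raw[0] == "$"
    if is_price:
        # Slice off the dollar sign and convert the rest to a float.
        cleaned.append(float(raw[1:]))
    else:
        print("Skipping", raw)

# Indexing into a list: first and last items.
first = cleaned[0]
last = cleaned[-1]
print(cleaned, first, last)
```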
Section 3
Webscraping Exercises Part 2
- Beautiful Soup package
- Homing in on data
- Common webscraping pitfalls
- Using loops in webscraping
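A minimal sketch of how these pieces might combine, assuming the Beautiful Soup package (bs4) is installed. The HTML snippet, tag names, and class names are invented for illustration; a real page would need its own selectors.

```python
from bs4 import BeautifulSoup

# Invented HTML standing in for a downloaded page.
html = """
<ul>
  <li class="item"><span class="name">Tea</span> <span class="price">$4.50</span></li>
  <li class="item"><span class="name">Coffee</span> <span class="price">$6.00</span></li>
  <li class="item"><span class="name">Mug</span></li>
</ul>
"""

soup = BeautifulSoup(html, "html.parser")

rows = []
# Loop over every list item that carries the "item" CSS class.
for item in soup.find_all("li", class_="item"):
    name = item.find("span", class_="name")
    price = item.find("span", class_="price")
    # A common pitfall: find() returns None when a tag is missing,
    # so check before calling methods on the result.
    if name is None or price is None:
        continue
    rows.append((name.get_text(strip=True), price.get_text(strip=True)))

print(rows)
```

The third item has no price, so the loop skips it instead of raising an AttributeError.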
Storage and Scheduling
- Text files
- CSVs
- Scheduling a Python script
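One possible way to store scraped rows as a CSV uses Python's built-in csv module, sketched below. The rows, column names, and file name are hypothetical, and the file is written to a temporary directory so the example leaves nothing behind in the working folder.

```python
import csv
import tempfile
from pathlib import Path

# Hypothetical rows, as a scraper might have collected them.
rows = [("Tea", "4.50"), ("Coffee", "6.00")]

# Write into a temporary directory; a real script would pick its own path.
out_dir = Path(tempfile.mkdtemp())
csv_path = out_dir / "prices.csv"

# newline="" lets the csv module control line endings itself.
with open(csv_path, "w", newline="", encoding="utf-8") as f:
    writer = csv.writer(f)
    writer.writerow(["name", "price"])  # header row
    writer.writerows(rows)

# Read the file back to confirm what was saved.
with open(csv_path, newline="", encoding="utf-8") as f:
    saved = list(csv.reader(f))

print(saved)
```

A script like this could then be run on a timer by an operating system scheduler such as cron or Windows Task Scheduler.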
Taught by
Garfield Stinvil, Colin Jaffe, and Brian McClain