Stanford University

Addressing Challenges of Public Web Data

Stanford University via YouTube

Overview

This Stanford HAI seminar examines the challenges facing public web data accessibility and features the Common Crawl Foundation's work on preserving and widening access to the web's accumulated knowledge. It introduces Common Crawl's free public web dataset, a resource available since 2008, and presents findings from the foundation's latest data product, which uses metadata to examine robots.txt exclusions, legal demands, and emerging "bot defenses." The talk also covers the foundation's advocacy for greater transparency in web data practices and its proposed solutions for keeping public web information accessible. The session consists of a lecture followed by a Q&A on web crawling, data preservation, and digital rights.

Syllabus

00:00:00 Introduction
00:01:01 Lecture
00:48:53 Q&A

Taught by

Stanford HAI
