Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

SpaCy for Digital Humanities with Python

Python Tutorials for Digital Humanities via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn natural language processing (NLP) techniques using spaCy, a powerful Python library designed for digital humanities applications. Master the fundamentals of spaCy through hands-on tutorials covering installation, data cleaning, and text preprocessing. Explore core NLP concepts including sentence segmentation, named entity recognition (NER), part-of-speech tagging, and lemmatization to extract meaningful information from textual data. Discover how to identify and extract specific linguistic elements such as nouns, noun chunks, verbs, and verb phrases from your texts. Develop skills in data visualization using DisplaCy to create visual representations of linguistic structures and customize these visualizations for your specific research needs. Delve into advanced named entity recognition techniques, including machine learning approaches and custom entity training using spaCy's EntityRuler. Understand the principles of word vectors and learn to generate custom word embeddings using Gensim, then integrate them into spaCy models for enhanced text analysis. Build custom NER pipelines tailored to specific domains, such as Holocaust studies and Classical Latin texts, while mastering precision and recall evaluation metrics. Navigate the transition from spaCy 2.x to 3.x, learning new pipeline architectures, custom factories, and configuration file management. Explore specialized applications including text classification with Classy Classification, coreference resolution, weak supervision techniques with Skweak, and prepositional phrase extraction. Master advanced features like SpanCat for span categorization, document serialization with DocBin, and the integration of multiple NLP models into comprehensive processing pipelines. Gain practical experience with real-world projects and annotation workflows using tools like Prodigy for efficient model training and evaluation.

Syllabus

Introduction to SpaCy and Cleaning Data (SpaCy and Python Tutorials for DH - 01)
How to Install SpaCy and Models (Spacy and Python Tutorial for DH 02)
How to Separate Sentences in SpaCy (SpaCy and Python Tutorials for DH - 03)
Spacy and Named Entity Recognition NER (Spacy and Python Tutorial for DH 04)
Finding Parts of Speech (SpaCy and Python Tutorial for DH 05)
Extracting Nouns and Noun Chunks (SpaCy and Python Tutorial for DH 06)
Extracting Verbs and Verb Phrases (SpaCy and Python Tutorial for DH 07)
Lemmatization: Finding the Roots of Words (Spacy and Python Tutorial for DH 08)
Data Visualization with DisplaCy (Spacy and Python Tutorial for DH 09)
Customizing DisplaCy Render Data Visualization (Spacy and Python Tutorial for DH 10)
Finding Quotes in Sentences (SpaCy and Python Tutorial for DH 11)
Introduction to Named Entity Recognition (NER for DH 01)
Machine Learning NER with Python and spaCy (NER for DH 03 )
How to Use spaCy's EntityRuler (Named Entity Recognition for DH 04 | Part 01)
How to Use spaCy to Create an NER training set (Named Entity Recognition for DH 04 | Part 02)
How to Train a spaCy NER model (Named Entity Recognition for DH 04 | Part 03)
Examining a spaCy Model in the Folder (Named Entity Recognition for DH 05)
What are Word Vectors (Named Entity Recognition for DH 06)
How to Generate Custom Word Vectors in Gensim (Named Entity Recognition for DH 07)
How to Load Custom Word Vectors into spaCy Models (Named Entity Recognition for DH 08)
Getting the Data for Custom Labels (Holocaust NER for DH 09.01)
How to Add a Custom NER Pipe in spaCy and a Custom Label (NER for DH 09.02 )
How to Training Custom Entities into spaCy Models (Named Entity Recognition for DH 09 03) - spaCy 2
How to Add and Place Pipes from other Models into a New Model (NER for DH 09 04)
How to Add Custom Functions to spaCy Pipeline (NER for DH 09.05)
Precision vs. Recall and Adding PERSON to Holocaust NER Pipeline (Named Entity Recognition DH 09.06)
Finalizing the Holocaust NER Pipeline (Named Entity Recognition for DH 09.07)
Classical Latin Named Entity Recognition (NER for DH 10.01)
How to Package spaCy Models (Even with Custom Factories) (NER for DH 11)
How to Add a Pipe in SpaCy 3x (SpaCy 3x Tutorials)
How to Add Cutom Names to Pipes and Position them in a Pipeline in spaCy 3x SpaCy 3x Tutorials
How to Add Custom Factories with Language component in spaCy 3x (SpaCy 3x Tutorials)
How to Convert spaCy 2x Training Data to 3x (Named Entity Recognition in spaCy Tutorials)
How to Create a Config.cfg File in spaCy 3x for Named Entity Recognition (NER)
How to Train an NER Model in spaCy 3x
How to Structure an Informal NER Test with spaCy 3 (Named Entity Recognition Tutorials)
How to Structure a Formal Test with Confusion Matrix in spaCy 3 for NER Models (NER for DH)
How to package a spacy model with custom component or factory
How to Create and Add an EntityRuler in spaCy 3
How to Load Custom Word Vectors into a spaCy Model
Adding ENT_TYPE patterns into an EntityRuler in spaCy
The EASIEST! way to do Text Classification with spaCy and Classy Classification
How to Easily Add a Coreference Resolution Model into a spCy Pipeline with Crosslingual Coreference
Weak Supervision NLP with spaCy and Skweak
How to Extract Prepositional Phrases from Texts using spaCy - Bonus Content (Intro to Set_Extension)
How to Serialize (Save) spaCy Doc Containers to Disk with DocBin and Pickle
The Easiest Way to do Coreference Resolution with spaCy with spaCy-Experimental
When to use NER, EntityRuler, SpanCat, or SpanRuler in spaCy
LatinCy | How to use spaCy for Latin NLP in Python #nlp #spacy
SpanCat with spaCy on Real Data | Part 01 - The Project and Cultivating Data for Annotation
How to Prepare Annotations in Prodigy for Training a SpanCat Model in spaCy (Part 2 | SpanCat) #nlp
Training a spaCy SpanCat Model to Annotate in Texts more quickly in Prodigy | SpanCat 03

Taught by

Python Tutorials for Digital Humanities

Reviews

Start your review of SpaCy for Digital Humanities with Python

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.