Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Topic Modeling and Text Classification with Python for Digital Humanities

Python Tutorials for Digital Humanities via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn topic modeling and text classification techniques specifically designed for digital humanities applications through this comprehensive tutorial series that covers both fundamental concepts and practical implementation. Master the essential terminology and concepts including topics, clusters, bigrams, trigrams, and TF-IDF before progressing to hands-on implementation using industry-standard Python libraries. Explore rules-based topic modeling methods using TF-IDF with Scikit Learn, including data visualization with Matplotlib and K-Means clustering techniques. Dive into machine learning-based approaches through Latent Dirichlet Allocation (LDA) using Gensim, learning to create, optimize, and save topic models while handling bigrams, trigrams, and frequent word removal. Develop text classification skills using spaCy 3.x, focusing on dataset cultivation, training set generation with word vectors, and best practices for digital humanities research. Gain exposure to advanced libraries including TensorFlow and Keras, and discover modern approaches like Top2Vec for efficient topic modeling, all while building practical skills applicable to digital humanities research and text analysis projects.

Syllabus

Why use Topic Modeling (Topic Modeling in Python for DH 01.01)
What are Topics and Clusters (Topic Modeling in Python for DH 01.02)
What are Bigrams and Trigrams (Topic Modeling and Python for DH 01.03)
What is TF-IDF for Beginners (Topic Modeling in Python for DH 02.01)
What is Scikit Learn and How to Install Scikit Learn (Topic Modeling in Python for DH 02.02)
TF-IDF in Python with Scikit Learn (Topic Modeling for DH 02.03)
Plotting TF-IDF and K-Means Data with Matplotlib (Topic Modeling in Python for DH 02.04)
What is Laten Dirichlet Allocation LDA (Topic Modeling for Digital Humanities 03.01)
Libraries for LDA Topic Modeling - Gensim and JupyterLab (Topic Modeling for DH 03.02)
How to Create an LDA Topic Model in Python with Gensim (Topic Modeling for DH 03.03)
How to Create Bigrams and Trigrams and Remove Frequent Words (Topic Modeling for DH 03.04)
How to Save and Load LDA Models with Gensim in Python (Topic Modeling for DH 03.05)
What is Text Classification (Topic Modeling in Python for DH 04.01)
Creating a text classification model in spacy 3x (Topic Modeling in Python for DH 04.02)
How to Cultivate Good Datasets for Text Classification (Topic Modeling in Python for DH 04.03)
How to Use Word Vectors to Generate a Text Classification Training Set (Topic Modeling for DH 04.04)
The Best Way to do Topic Modeling in Python - Top2Vec Introduction and Tutorial

Taught by

Python Tutorials for Digital Humanities

Reviews

Start your review of Topic Modeling and Text Classification with Python for Digital Humanities

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.