Data-Intensive Text Processing with MapReduce
Center for Language & Speech Processing(CLSP), JHU via YouTube
Master Agentic AI, GANs, Fine-Tuning & LLM Apps
Google, IBM & Meta Certificates — 40% Off for a Limited Time
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore data-intensive text processing techniques using the MapReduce programming model in this comprehensive lecture from Johns Hopkins University's Center for Language & Speech Processing. Learn how to handle large-scale text data processing challenges through distributed computing approaches, understanding the fundamental concepts of MapReduce and its applications in natural language processing and computational linguistics. Discover practical strategies for implementing text processing algorithms that can scale to handle massive datasets, including techniques for parallel processing, data distribution, and efficient computation across clusters. Gain insights into the architectural principles behind MapReduce and how it enables researchers and practitioners to process text corpora that would be intractable with traditional single-machine approaches. Understand the trade-offs and considerations involved in designing MapReduce-based solutions for various text processing tasks, from basic word counting to more complex linguistic analysis operations.
Syllabus
Jimmy Lin: Data-Intensive Text Processing with MapReduce
Taught by
Center for Language & Speech Processing(CLSP), JHU