Data-Intensive Text Processing with MapReduce
Center for Language & Speech Processing(CLSP), JHU via YouTube
35% Off Finance Skills That Get You Hired - Code CFI35
AI Adoption - Drive Business Value and Organizational Impact
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore data-intensive text processing techniques using the MapReduce programming model in this comprehensive lecture from Johns Hopkins University's Center for Language & Speech Processing. Learn how to handle large-scale text data processing challenges through distributed computing approaches, understanding the fundamental concepts of MapReduce and its applications in natural language processing and computational linguistics. Discover practical strategies for implementing text processing algorithms that can scale to handle massive datasets, including techniques for parallel processing, data distribution, and efficient computation across clusters. Gain insights into the architectural principles behind MapReduce and how it enables researchers and practitioners to process text corpora that would be intractable with traditional single-machine approaches. Understand the trade-offs and considerations involved in designing MapReduce-based solutions for various text processing tasks, from basic word counting to more complex linguistic analysis operations.
Syllabus
Jimmy Lin: Data-Intensive Text Processing with MapReduce
Taught by
Center for Language & Speech Processing(CLSP), JHU