An electronic copy of book is available for Library Members Sign in to view the book
This book presents a practical, implementation-oriented introduction to processing massive text datasets using the MapReduce programming model, especially for natural-language processing (NLP) and information-retrieval tasks. It covers MapReduce program design, distributed data processing, and shows how classical NLP tasks (e.g. word counts, indexing, statistical language modeling) can be scaled to large datasets. The goal is to enable researchers and engineers to build scalable, data-intensive text-processing systems using the MapReduce paradigm on distributed platforms.
Sub Title:
Edition:
Volume:
Publisher: Morgan & Claypool Publishers
Publishing Year: 2010
ISBN: 978-1-60845-342-9
Pages: 252