Data-Intensive Text Processing with MapReduce

Nonfiction, Computers, Advanced Computing, Natural Language Processing, Artificial Intelligence, Reference & Language, Language Arts, Linguistics
Cover of the book Data-Intensive Text Processing with MapReduce by Jimmy Lin, Chris Dyer, Morgan & Claypool Publishers
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: Jimmy Lin, Chris Dyer ISBN: 9781608453436
Publisher: Morgan & Claypool Publishers Publication: October 10, 2010
Imprint: Morgan & Claypool Publishers Language: English
Author: Jimmy Lin, Chris Dyer
ISBN: 9781608453436
Publisher: Morgan & Claypool Publishers
Publication: October 10, 2010
Imprint: Morgan & Claypool Publishers
Language: English

Our world is being revolutionized by data-driven methods: access to large amounts of data has generated new insights and opened exciting new opportunities in commerce, science, and computing applications. Processing the enormous quantities of data necessary for these advances requires large clusters, making distributed computing paradigms more crucial than ever. MapReduce is a programming model for expressing distributed computations on massive datasets and an execution framework for large-scale data processing on clusters of commodity servers. The programming model provides an easy-to-understand abstraction for designing scalable algorithms, while the execution framework transparently handles many system-level details, ranging from scheduling to synchronization to fault tolerance. This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only intends to help the reader "think in MapReduce", but also discusses limitations of the programming model as well. Table of Contents: Introduction / MapReduce Basics / MapReduce Algorithm Design / Inverted Indexing for Text Retrieval / Graph Algorithms / EM Algorithms for Text Processing / Closing Remarks

View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

Our world is being revolutionized by data-driven methods: access to large amounts of data has generated new insights and opened exciting new opportunities in commerce, science, and computing applications. Processing the enormous quantities of data necessary for these advances requires large clusters, making distributed computing paradigms more crucial than ever. MapReduce is a programming model for expressing distributed computations on massive datasets and an execution framework for large-scale data processing on clusters of commodity servers. The programming model provides an easy-to-understand abstraction for designing scalable algorithms, while the execution framework transparently handles many system-level details, ranging from scheduling to synchronization to fault tolerance. This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only intends to help the reader "think in MapReduce", but also discusses limitations of the programming model as well. Table of Contents: Introduction / MapReduce Basics / MapReduce Algorithm Design / Inverted Indexing for Text Retrieval / Graph Algorithms / EM Algorithms for Text Processing / Closing Remarks

More books from Morgan & Claypool Publishers

Cover of the book Survive and Thrive by Jimmy Lin, Chris Dyer
Cover of the book 3D Scientific Visualization with Blender by Jimmy Lin, Chris Dyer
Cover of the book Electromagnetics in Magnetic Resonance Imaging by Jimmy Lin, Chris Dyer
Cover of the book Relativity, Symmetry, and the Structure of Quantum Theory, Volume 2 by Jimmy Lin, Chris Dyer
Cover of the book Web Indicators for Research Evaluation by Jimmy Lin, Chris Dyer
Cover of the book Selective Photonic Disinfection by Jimmy Lin, Chris Dyer
Cover of the book Classical Theory of Free-Electron Lasers by Jimmy Lin, Chris Dyer
Cover of the book Web Corpus Construction by Jimmy Lin, Chris Dyer
Cover of the book Outside the Research Lab, Volume 2 by Jimmy Lin, Chris Dyer
Cover of the book Women and Physics by Jimmy Lin, Chris Dyer
Cover of the book An Introduction to Planetary Nebulae by Jimmy Lin, Chris Dyer
Cover of the book AdS/CFT Correspondence in Condensed Matter by Jimmy Lin, Chris Dyer
Cover of the book A Concise Introduction to Quantum Mechanics by Jimmy Lin, Chris Dyer
Cover of the book Concepts in Physical Metallurgy by Jimmy Lin, Chris Dyer
Cover of the book The Midlife Crisis of the Nuclear Nonproliferation Treaty by Jimmy Lin, Chris Dyer
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy