Text Data Management and Analysis

A Practical Introduction to Information Retrieval and Text Mining

Nonfiction, Computers, Database Management, Information Storage & Retrievel, General Computing
Cover of the book Text Data Management and Analysis by ChengXiang Zhai, Sean Massung, Association for Computing Machinery and Morgan & Claypool Publishers
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: ChengXiang Zhai, Sean Massung ISBN: 9781970001181
Publisher: Association for Computing Machinery and Morgan & Claypool Publishers Publication: June 30, 2016
Imprint: ACM Books Language: English
Author: ChengXiang Zhai, Sean Massung
ISBN: 9781970001181
Publisher: Association for Computing Machinery and Morgan & Claypool Publishers
Publication: June 30, 2016
Imprint: ACM Books
Language: English

Recent years have seen a dramatic growth of natural language text data, including web pages, news articles, scientific literature, emails, enterprise documents, and social media such as blog articles, forum posts, product reviews, and tweets. This has led to an increasing demand for powerful software tools to help people analyze and manage vast amounts of text data effectively and efficiently. Unlike data generated by a computer system or sensors, text data are usually generated directly by humans, and are accompanied by semantically rich content. As such, text data are especially valuable for discovering knowledge about human opinions and preferences, in addition to many other kinds of knowledge that we encode in text. In contrast to structured data, which conform to well-defined schemas (thus are relatively easy for computers to handle), text has less explicit structure, requiring computer processing toward understanding of the content encoded in text. The current technology of natural language processing has not yet reached a point to enable a computer to precisely understand natural language text, but a wide range of statistical and heuristic approaches to analysis and management of text data have been developed over the past few decades. They are usually very robust and can be applied to analyze and manage text data in any natural language, and about any topic.

This book provides a systematic introduction to all these approaches, with an emphasis on covering the most useful knowledge and skills required to build a variety of practically useful text information systems. The focus is on text mining applications that can help users analyze patterns in text data to extract and reveal useful knowledge. Information retrieval systems, including search engines and recommender systems, are also covered as supporting technology for text mining applications. The book covers the major concepts, techniques, and ideas in text data mining and information retrieval from a practical viewpoint, and includes many hands-on exercises designed with a companion software toolkit (i.e., MeTA) to help readers learn how to apply techniques of text mining and information retrieval to real-world text data and how to experiment with and improve some of the algorithms for interesting application tasks. The book can be used as a textbook for a computer science undergraduate course or a reference book for practitioners working on relevant problems in analyzing and managing text data.

View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

Recent years have seen a dramatic growth of natural language text data, including web pages, news articles, scientific literature, emails, enterprise documents, and social media such as blog articles, forum posts, product reviews, and tweets. This has led to an increasing demand for powerful software tools to help people analyze and manage vast amounts of text data effectively and efficiently. Unlike data generated by a computer system or sensors, text data are usually generated directly by humans, and are accompanied by semantically rich content. As such, text data are especially valuable for discovering knowledge about human opinions and preferences, in addition to many other kinds of knowledge that we encode in text. In contrast to structured data, which conform to well-defined schemas (thus are relatively easy for computers to handle), text has less explicit structure, requiring computer processing toward understanding of the content encoded in text. The current technology of natural language processing has not yet reached a point to enable a computer to precisely understand natural language text, but a wide range of statistical and heuristic approaches to analysis and management of text data have been developed over the past few decades. They are usually very robust and can be applied to analyze and manage text data in any natural language, and about any topic.

This book provides a systematic introduction to all these approaches, with an emphasis on covering the most useful knowledge and skills required to build a variety of practically useful text information systems. The focus is on text mining applications that can help users analyze patterns in text data to extract and reveal useful knowledge. Information retrieval systems, including search engines and recommender systems, are also covered as supporting technology for text mining applications. The book covers the major concepts, techniques, and ideas in text data mining and information retrieval from a practical viewpoint, and includes many hands-on exercises designed with a companion software toolkit (i.e., MeTA) to help readers learn how to apply techniques of text mining and information retrieval to real-world text data and how to experiment with and improve some of the algorithms for interesting application tasks. The book can be used as a textbook for a computer science undergraduate course or a reference book for practitioners working on relevant problems in analyzing and managing text data.

More books from Association for Computing Machinery and Morgan & Claypool Publishers

Cover of the book The VR Book by ChengXiang Zhai, Sean Massung
Cover of the book Communities of Computing by ChengXiang Zhai, Sean Massung
Cover of the book Frontiers of Multimedia Research by ChengXiang Zhai, Sean Massung
Cover of the book A Framework for Scientific Discovery through Video Games by ChengXiang Zhai, Sean Massung
Cover of the book Reactive Internet Programming by ChengXiang Zhai, Sean Massung
Cover of the book An Architecture for Fast and General Data Processing on Large Clusters by ChengXiang Zhai, Sean Massung
Cover of the book Embracing Interference in Wireless Systems by ChengXiang Zhai, Sean Massung
Cover of the book Smarter Than Their Machines by ChengXiang Zhai, Sean Massung
Cover of the book Shared-Memory Parallelism Can be Simple, Fast, and Scalable by ChengXiang Zhai, Sean Massung
Cover of the book The Handbook of Multimodal-Multisensor Interfaces, Volume 3 by ChengXiang Zhai, Sean Massung
Cover of the book Verified Functional Programming in Agda by ChengXiang Zhai, Sean Massung
Cover of the book The Sparse Fourier Transform by ChengXiang Zhai, Sean Massung
Cover of the book Declarative Logic Programming by ChengXiang Zhai, Sean Massung
Cover of the book Ada's Legacy by ChengXiang Zhai, Sean Massung
Cover of the book The Handbook of Multimodal-Multisensor Interfaces, Volume 2 by ChengXiang Zhai, Sean Massung
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy