Bad Data Handbook

Cleaning Up The Data So You Can Get Back To Work

Nonfiction, Computers, Database Management
Author: Q. Ethan McCallum
ISBN: 9781449324971
Publisher: O'Reilly Media
Publication: November 7, 2012
Imprint: O'Reilly Media
Language: English

What is bad data? Some people consider it a technical phenomenon, like missing values or malformed records, but bad data includes a lot more. In this handbook, data expert Q. Ethan McCallum has gathered 19 colleagues from every corner of the data arena to reveal how they’ve recovered from nasty data problems.

From cranky storage to poor representation to misguided policy, there are many paths to bad data. Bottom line? Bad data is data that gets in the way. This book explains effective ways to get around it.

Among the many topics covered, you’ll discover how to:

  • Test-drive your data to see if it’s ready for analysis
  • Work spreadsheet data into a usable form
  • Handle encoding problems that lurk in text data
  • Develop a successful web-scraping effort
  • Use NLP tools to reveal the real sentiment of online reviews
  • Address cloud computing issues that can impact your analysis effort
  • Avoid policies that create data analysis roadblocks
  • Take a systematic approach to data quality analysis
