Data Science on the Google Cloud Platform

Implementing End-to-End Real-Time Data Pipelines: From Ingest to Machine Learning

Nonfiction, Computers, Database Management, Data Processing, Advanced Computing, Programming, Data Modeling & Design
Cover of the book Data Science on the Google Cloud Platform by Valliappa Lakshmanan, O'Reilly Media
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: Valliappa Lakshmanan ISBN: 9781491974513
Publisher: O'Reilly Media Publication: December 12, 2017
Imprint: O'Reilly Media Language: English
Author: Valliappa Lakshmanan
ISBN: 9781491974513
Publisher: O'Reilly Media
Publication: December 12, 2017
Imprint: O'Reilly Media
Language: English

Learn how easy it is to apply sophisticated statistical and machine learning methods to real-world problems when you build on top of the Google Cloud Platform (GCP). This hands-on guide shows developers entering the data science field how to implement an end-to-end data pipeline, using statistical and machine learning methods and tools on GCP. Through the course of the book, you’ll work through a sample business decision by employing a variety of data science approaches.

Follow along by implementing these statistical and machine learning solutions in your own project on GCP, and discover how this platform provides a transformative and more collaborative way of doing data science.

You’ll learn how to:

  • Automate and schedule data ingest, using an App Engine application
  • Create and populate a dashboard in Google Data Studio
  • Build a real-time analysis pipeline to carry out streaming analytics
  • Conduct interactive data exploration with Google BigQuery
  • Create a Bayesian model on a Cloud Dataproc cluster
  • Build a logistic regression machine-learning model with Spark
  • Compute time-aggregate features with a Cloud Dataflow pipeline
  • Create a high-performing prediction model with TensorFlow
  • Use your deployed model as a microservice you can access from both batch and real-time pipelines
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

Learn how easy it is to apply sophisticated statistical and machine learning methods to real-world problems when you build on top of the Google Cloud Platform (GCP). This hands-on guide shows developers entering the data science field how to implement an end-to-end data pipeline, using statistical and machine learning methods and tools on GCP. Through the course of the book, you’ll work through a sample business decision by employing a variety of data science approaches.

Follow along by implementing these statistical and machine learning solutions in your own project on GCP, and discover how this platform provides a transformative and more collaborative way of doing data science.

You’ll learn how to:

More books from O'Reilly Media

Cover of the book XMPP: The Definitive Guide by Valliappa Lakshmanan
Cover of the book Maintainable JavaScript by Valliappa Lakshmanan
Cover of the book Microsoft Project 2010: The Missing Manual by Valliappa Lakshmanan
Cover of the book OpenStack Operations Guide by Valliappa Lakshmanan
Cover of the book Monad (AKA PowerShell) by Valliappa Lakshmanan
Cover of the book iPhone Open Application Development by Valliappa Lakshmanan
Cover of the book Streaming Architecture by Valliappa Lakshmanan
Cover of the book Web Audio API by Valliappa Lakshmanan
Cover of the book Beautiful Security by Valliappa Lakshmanan
Cover of the book grep Pocket Reference by Valliappa Lakshmanan
Cover of the book Office 2007: The Missing Manual by Valliappa Lakshmanan
Cover of the book iOS 11 Swift Programming Cookbook by Valliappa Lakshmanan
Cover of the book eBay Hacks by Valliappa Lakshmanan
Cover of the book We the Media by Valliappa Lakshmanan
Cover of the book Functional Thinking by Valliappa Lakshmanan
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy