Spark

Big Data Cluster Computing in Production

Nonfiction, Computers, Database Management
Author: Ema Orhian, Ilya Ganelin, Kai Sasaki, Brennon York
ISBN: 9781119254058
Publisher: Wiley
Publication: March 28, 2016
Imprint: Wiley
Language: English

Production-targeted Spark guidance with real-world use cases

Spark: Big Data Cluster Computing in Production goes beyond general Spark overviews to provide targeted guidance on using lightning-fast big data cluster computing in production. Written by an expert team well known in the big data community, this book walks you through the challenges of moving from proof-of-concept or demo Spark applications to live Spark in production. Real use cases provide deep insight into common problems, limitations, challenges, and opportunities, while expert tips and tricks help you get the most out of Spark performance. Coverage includes Spark SQL, Tachyon, Kerberos, MLlib, YARN, and Mesos, with clear, actionable guidance on resource scheduling, database connectors, streaming, security, and much more.

Spark has become the tool of choice for many big data problems, with more active contributors than any other Apache Software Foundation project. General introductory books abound, but this book is the first to provide deep insight and real-world advice on using Spark in production. Specific guidance, expert tips, and invaluable foresight make this guide an incredibly useful resource for real production settings.

  • Review Spark hardware requirements and estimate cluster size
  • Gain insight from real-world production use cases
  • Tighten security, schedule resources, and fine-tune performance
  • Overcome common problems encountered using Spark in production

Spark works with other big data tools, including MapReduce and Hadoop, and supports languages you already know, such as Java, Scala, Python, and R. Lightning speed makes Spark too good to pass up, but understanding its limitations and challenges in advance goes a long way toward easing actual production implementation. Spark: Big Data Cluster Computing in Production tells you everything you need to know, with real-world production insight and expert guidance, tips, and tricks.

More books from Wiley

Introducing Revit Architecture 2009
Asymmetric Synthesis II
Strategic Marketing For Health Care Organizations
The Little Book of Being Brilliant
Probiotic Dairy Products
Fisher Investments on Utilities
Essentials of KTEA-3 and WIAT-III Assessment
Sparks
The FX Bootcamp Guide to Strategic and Tactical Forex Trading
What's Stopping You? Being More Confident
No More Consultants
How the Immune System Works
The Encyclopaedia of Sports Medicine, Genetic and Molecular Aspects of Sports Performance
Handbook of Market Risk
Facebook Application Development For Dummies