Learning Real-time Processing with Spark Streaming

Nonfiction, Computers, Database Management, Data Processing, Programming, Programming Languages
Cover of the book Learning Real-time Processing with Spark Streaming by Sumit Gupta, Packt Publishing
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: Sumit Gupta ISBN: 9781783987672
Publisher: Packt Publishing Publication: September 28, 2015
Imprint: Packt Publishing Language: English
Author: Sumit Gupta
ISBN: 9781783987672
Publisher: Packt Publishing
Publication: September 28, 2015
Imprint: Packt Publishing
Language: English

Building scalable and fault-tolerant streaming applications made easy with Spark streaming

About This Book

  • Process live data streams more efficiently with better fault recovery using Spark Streaming
  • Implement and deploy real-time log file analysis
  • Learn about integration with Advance Spark Libraries GraphX, Spark SQL, and MLib.

Who This Book Is For

This book is intended for big data developers with basic knowledge of Scala but no knowledge of Spark. It will help you grasp the basics of developing real-time applications with Spark and understand efficient programming of core elements and applications.

What You Will Learn

  • Install and configure Spark and Spark Streaming to execute applications
  • Explore the architecture and components of Spark and Spark Streaming to use it as a base for other libraries
  • Process distributed log files in real-time to load data from distributed sources
  • Apply transformations on streaming data to use its functions
  • Integrate Apache Spark with the various advance libraries like MLib and GraphX
  • Apply production deployment scenarios to deploy your application

In Detail

Using practical examples with easy-to-follow steps, this book will teach you how to build real-time applications with Spark Streaming.

Starting with installing and setting the required environment, you will write and execute your first program for Spark Streaming. This will be followed by exploring the architecture and components of Spark Streaming along with an overview of libraries/functions exposed by Spark. Next you will be taught about various client APIs for coding in Spark by using the use-case of distributed log file processing. You will then apply various functions to transform and enrich streaming data. Next you will learn how to cache and persist datasets. Moving on you will integrate Apache Spark with various other libraries/components of Spark like Mlib, GraphX, and Spark SQL. Finally, you will learn about deploying your application and cover the different scenarios ranging from standalone mode to distributed mode using Mesos, Yarn, and private data centers or on cloud infrastructure.

Style and approach

A Step-by-Step approach to learn Spark Streaming in a structured manner, with detailed explanation of basic and advance features in an easy-to-follow Style. Each topic is explained sequentially and supported with real world examples and executable code snippets that appeal to the needs of readers with the wide range of experiences.

View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

Building scalable and fault-tolerant streaming applications made easy with Spark streaming

About This Book

Who This Book Is For

This book is intended for big data developers with basic knowledge of Scala but no knowledge of Spark. It will help you grasp the basics of developing real-time applications with Spark and understand efficient programming of core elements and applications.

What You Will Learn

In Detail

Using practical examples with easy-to-follow steps, this book will teach you how to build real-time applications with Spark Streaming.

Starting with installing and setting the required environment, you will write and execute your first program for Spark Streaming. This will be followed by exploring the architecture and components of Spark Streaming along with an overview of libraries/functions exposed by Spark. Next you will be taught about various client APIs for coding in Spark by using the use-case of distributed log file processing. You will then apply various functions to transform and enrich streaming data. Next you will learn how to cache and persist datasets. Moving on you will integrate Apache Spark with various other libraries/components of Spark like Mlib, GraphX, and Spark SQL. Finally, you will learn about deploying your application and cover the different scenarios ranging from standalone mode to distributed mode using Mesos, Yarn, and private data centers or on cloud infrastructure.

Style and approach

A Step-by-Step approach to learn Spark Streaming in a structured manner, with detailed explanation of basic and advance features in an easy-to-follow Style. Each topic is explained sequentially and supported with real world examples and executable code snippets that appeal to the needs of readers with the wide range of experiences.

More books from Packt Publishing

Cover of the book Mastering Apache Solr 7.x by Sumit Gupta
Cover of the book Windows Azure programming patterns for Start-ups by Sumit Gupta
Cover of the book Zabbix 1.8 Network Monitoring by Sumit Gupta
Cover of the book Android Programming with Kotlin for Beginners by Sumit Gupta
Cover of the book Oracle APEX 4.0 Cookbook by Sumit Gupta
Cover of the book Getting started with Audacity 1.3 by Sumit Gupta
Cover of the book React Design Patterns and Best Practices by Sumit Gupta
Cover of the book NumPy Cookbook - Second Edition by Sumit Gupta
Cover of the book Clojure Data Structures and Algorithms Cookbook by Sumit Gupta
Cover of the book VMware vSphere 6.5 Cookbook by Sumit Gupta
Cover of the book IT Inventory and Resource Management with OCS Inventory NG 1.02 by Sumit Gupta
Cover of the book PHPList 2 E-mail Campaign Manager by Sumit Gupta
Cover of the book Getting Started with TensorFlow by Sumit Gupta
Cover of the book CCENT/CCNA: ICND1 100-105 Certification Guide by Sumit Gupta
Cover of the book Learning Bing Maps API by Sumit Gupta
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy