Beginning Apache Pig

Big Data Processing Made Easy

Nonfiction, Computers, Database Management, General Computing, Programming
Cover of the book Beginning Apache Pig by Balaswamy Vaddeman, Apress
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: Balaswamy Vaddeman ISBN: 9781484223376
Publisher: Apress Publication: December 10, 2016
Imprint: Apress Language: English
Author: Balaswamy Vaddeman
ISBN: 9781484223376
Publisher: Apress
Publication: December 10, 2016
Imprint: Apress
Language: English

Learn to use Apache Pig to develop lightweight big data applications easily and quickly. This book shows you many optimization techniques and covers every context where Pig is used in big data analytics. Beginning Apache Pig shows you how Pig is easy to learn and requires relatively little time to develop big data applications.

The book is divided into four parts: the complete features of Apache Pig; integration with other tools; how to solve complex business problems; and optimization of tools.

You'll discover topics such as MapReduce and why it cannot meet every business need; the features of Pig Latin such as data types for each load, store, joins, groups, and ordering; how Pig workflows can be created; submitting Pig jobs using Hue; and working with Oozie. You'll also see how to extend the framework by writing UDFs and custom load, store, and filter functions. Finally you'll cover different optimization techniques such as gathering statistics about a Pig script, joining strategies, parallelism, and the role of data formats in good performance.

What You Will Learn

• Use all the features of Apache Pig

• Integrate Apache Pig with other tools

• Extend Apache Pig

• Optimize Pig Latin code

• Solve different use cases for Pig Latin

Who This Book Is For

All levels of IT professionals: architects, big data enthusiasts, engineers, developers, and big data administrators

View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

Learn to use Apache Pig to develop lightweight big data applications easily and quickly. This book shows you many optimization techniques and covers every context where Pig is used in big data analytics. Beginning Apache Pig shows you how Pig is easy to learn and requires relatively little time to develop big data applications.

The book is divided into four parts: the complete features of Apache Pig; integration with other tools; how to solve complex business problems; and optimization of tools.

You'll discover topics such as MapReduce and why it cannot meet every business need; the features of Pig Latin such as data types for each load, store, joins, groups, and ordering; how Pig workflows can be created; submitting Pig jobs using Hue; and working with Oozie. You'll also see how to extend the framework by writing UDFs and custom load, store, and filter functions. Finally you'll cover different optimization techniques such as gathering statistics about a Pig script, joining strategies, parallelism, and the role of data formats in good performance.

What You Will Learn

• Use all the features of Apache Pig

• Integrate Apache Pig with other tools

• Extend Apache Pig

• Optimize Pig Latin code

• Solve different use cases for Pig Latin

Who This Book Is For

All levels of IT professionals: architects, big data enthusiasts, engineers, developers, and big data administrators

More books from Apress

Cover of the book Physics for JavaScript Games, Animation, and Simulations by Balaswamy Vaddeman
Cover of the book Programming 101 by Balaswamy Vaddeman
Cover of the book Microsoft Computer Vision APIs Distilled by Balaswamy Vaddeman
Cover of the book MicroPython for the Internet of Things by Balaswamy Vaddeman
Cover of the book Docker for Data Science by Balaswamy Vaddeman
Cover of the book The Definitive Guide to AdonisJs by Balaswamy Vaddeman
Cover of the book Rapid Game Development Using Cocos2d-JS by Balaswamy Vaddeman
Cover of the book Beginning Rails 4 by Balaswamy Vaddeman
Cover of the book Learn Sprite Kit for iOS Game Development by Balaswamy Vaddeman
Cover of the book Building a Virtual Assistant for Raspberry Pi by Balaswamy Vaddeman
Cover of the book Pro Android Wearables by Balaswamy Vaddeman
Cover of the book MATLAB Mathematical Analysis by Balaswamy Vaddeman
Cover of the book Expert ASP.NET Web API 2 for MVC Developers by Balaswamy Vaddeman
Cover of the book Running Mainframe z on Distributed Platforms by Balaswamy Vaddeman
Cover of the book Mastering 3D Printing by Balaswamy Vaddeman
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy