Parallel R

Data Analysis in the Distributed World

Nonfiction, Computers, Programming
Cover of the book Parallel R by Q. Ethan McCallum, Stephen Weston, O'Reilly Media
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: Q. Ethan McCallum, Stephen Weston ISBN: 9781449320331
Publisher: O'Reilly Media Publication: October 21, 2011
Imprint: O'Reilly Media Language: English
Author: Q. Ethan McCallum, Stephen Weston
ISBN: 9781449320331
Publisher: O'Reilly Media
Publication: October 21, 2011
Imprint: O'Reilly Media
Language: English

It’s tough to argue with R as a high-quality, cross-platform, open source statistical software product—unless you’re in the business of crunching Big Data. This concise book introduces you to several strategies for using R to analyze large datasets. You’ll learn the basics of Snow, Multicore, Parallel, and some Hadoop-related tools, including how to find them, how to use them, when they work well, and when they don’t.

With these packages, you can overcome R’s single-threaded nature by spreading work across multiple CPUs, or offloading work to multiple machines to address R’s memory barrier.

  • Snow: works well in a traditional cluster environment
  • Multicore: popular for multiprocessor and multicore computers
  • Parallel: part of the upcoming R 2.14.0 release
  • R+Hadoop: provides low-level access to a popular form of cluster computing
  • RHIPE: uses Hadoop’s power with R’s language and interactive shell
  • Segue: lets you use Elastic MapReduce as a backend for lapply-style operations
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

It’s tough to argue with R as a high-quality, cross-platform, open source statistical software product—unless you’re in the business of crunching Big Data. This concise book introduces you to several strategies for using R to analyze large datasets. You’ll learn the basics of Snow, Multicore, Parallel, and some Hadoop-related tools, including how to find them, how to use them, when they work well, and when they don’t.

With these packages, you can overcome R’s single-threaded nature by spreading work across multiple CPUs, or offloading work to multiple machines to address R’s memory barrier.

More books from O'Reilly Media

Cover of the book Building Web Apps with WordPress by Q. Ethan McCallum, Stephen Weston
Cover of the book QuickBase: The Missing Manual by Q. Ethan McCallum, Stephen Weston
Cover of the book Communicating Data with Tableau by Q. Ethan McCallum, Stephen Weston
Cover of the book Programming MapPoint in .NET by Q. Ethan McCallum, Stephen Weston
Cover of the book Production-Ready Microservices by Q. Ethan McCallum, Stephen Weston
Cover of the book Introduction to JavaScript Object Notation by Q. Ethan McCallum, Stephen Weston
Cover of the book Zero Configuration Networking: The Definitive Guide by Q. Ethan McCallum, Stephen Weston
Cover of the book Programming Python by Q. Ethan McCallum, Stephen Weston
Cover of the book HTML5 & CSS3 (Prags) by Q. Ethan McCallum, Stephen Weston
Cover of the book Mastering Azure Analytics by Q. Ethan McCallum, Stephen Weston
Cover of the book Das Sensor-Buch by Q. Ethan McCallum, Stephen Weston
Cover of the book Oracle PL/SQL Best Practices by Q. Ethan McCallum, Stephen Weston
Cover of the book Ethernet: The Definitive Guide by Q. Ethan McCallum, Stephen Weston
Cover of the book Twisted Network Programming Essentials by Q. Ethan McCallum, Stephen Weston
Cover of the book Fundamentals of Deep Learning by Q. Ethan McCallum, Stephen Weston
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy