Synopses & Reviews
Get the edge on processing big data using the Hadoop software library
Allowing for the distributed processing of large data sets across clusters of computers, Hadoop uses a simple programming model that is designed to scale up anything from single servers to thousands of machines. Advanced Hadoop Programming: Pushing the Limits is ideal reading for software architects, engineers, and data scientists who are looking to advance their knowledge of the Apache Hadoop software library to the current frontier of thinking on MapReduce and Hadoop programming.
- Shows experienced programmers how to maximize their usage of the Hadoop Software Library
- Includes coverage of YARN, MapReduce V2.0, MPI: Message Passing Interface programming model, BSP: Bulk Synchronous Parallel programming model, and other parallel programming models
- Discusses graph processing, Spark, Apache Mesos cluster manager, and more
- Addresses integrating Hadoop with NoSQL databases
Advanced Hadoop Programming: Pushing the Limits shows you how to take full advantage of Hadoop and its advanced functionalities.