Title: Hadoop with Python
Author: Donald Miner, Zachary Radka
License: Available for free by O’Reilly
Why This Book?
Hadoop is one of the most popular open-source distributed processing framework that store big data and manage data processing. Hadoop is mostly written in Java but there are scope of other programming languages too, such as Python.
Python can be used in Hadoop in distribute file system and it is what this book teaches you. You will also MapReduce, the Apache Pig platform and Pig Latin script, and the Apache Spark cluster-computing framework in Hadoop with Python.
Two authors tried their best to clear every concept excellently through the use of various examples.