Tag Archives: Big Data Books

Hadoop with Python


Title: Hadoop with Python Author: Donald Miner, Zachary Radka License: Available for free by O’Reilly Why This Book? Hadoop is one of the most popular open-source distributed processing framework that store big data and manage data processing. Hadoop is mostly written in Java but there are scope of other programming languages too, such as Python. […]

Disruptive Possibilities: How Big Data Changes Everything


Title: Disruptive Possibilities: How Big Data Changes Everything Author: Jeff Needham License Details: N/A Book Description: Big data has more disruptive potential than any information technology developed in the past 40 years. As author Jeffrey Needham points out in this revealing book, big data can provide unprecedented visibility into the operational efficiency of enterprises and agencies. […]

An Introduction to Data Science


Title: An Introduction to Data Science Author: Jefferey Stanton License Details: Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported: Book Description: In this Introduction to Data Science eBook, a series of data problems of increasing complexity is used to illustrate the skills and capabilities needed by data scientists. The open source data analysis program known as “R” and its […]

Hadoop and Kerberos: The Madness Beyond the Gate

Title: Hadoop and Kerberos: The Madness Beyond the Gate Author: Steve Laughran License Detail: Apache License 2.0 Book Description: Just as the infamous Necronomicon is a collection of notes scrawled in blood as a warning to others, this book is Incomplete. Based on experience and superstition, rather than understanding and insight. Contains information that will drive the reader insane. […]

Big Data on Real-World Applications


Book Title: Big Data on Real-World Applications Authors: Alberto Cano Jose Mari Luna Sebastian Ventura Soto License: Creative-Commons 3.0 Unported Book Description As technology advances, high volumes of valuable data are generated day by day in modern organizations. The management of such huge volumes of data has become a priority in these organizations, requiring new techniques […]