Data Engineering Skills - Hadoop Shell: A Comprehensive Guide to Hadoop FS Commands

Author:   Neeraj Malhotra
Publisher:   Createspace Independent Publishing Platform
ISBN:  

9781717577511


Pages:   138
Publication Date:   27 April 2018
Format:   Paperback
Availability:   Available To Order   Availability explained
We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately.

Our Price $105.57 Quantity:  
Add to Cart

Share |

Data Engineering Skills - Hadoop Shell: A Comprehensive Guide to Hadoop FS Commands


Add your own review!

Overview

"Hadoop is the most adopted distributed storage and processing framework for very large datasets in the world today. Although it had started as a small research project, less famously known as Apache Nutch, back in 2006 but later moved to a new subproject called Hadoop. Doug Cutting who was one of the founders of Hadoop, named it after his son's toy elephant. His son used to call the toy as hadoop, so that's how Hadoop got its name. The idea of Hadoop originated from a white paper that Google had published back in 2003 called ""Google File System"". This paper talked about specifically how Google designed its applications around a distributed storage and processing framework. Doug Cutting and Mike Cafarella took same concept and made it more generalized so it fits use cases of many other companies around the globe. Hadoop is famous for its distributed storage which is provided by its file system - commonly known as HDFS and distributed processing engine which is supported by something called - MapReduce. The MapReduce enabled processing of distributed datasets possible by running the code where data resides, which was a big paradigm shift compared to previous generations of processing engines. Earlier data needed to be transferred to machines where code is residing so further processing can be done on that data and results could be generated. But since data is usually bigger in size than actual code is, it used to take more time in setting the environment than actual processing would take. Hadoop adopted opposite approach where data doesn't move between machines much but code binaries are sent to machine where data is residing and then that code will locally run on that particular machine and return the results back. This approach provides obvious benefits in overall performance as setting time has reduced substantially and multiple processes can be ran on same data across distributed network of machines in parallel. I decided to write this book as the first in a series of books that I am planning to publish in future on various big data technologies. The goal of this book is to help data engineers build enough foundation in Hadoop before moving on to more high level technologies such as Spark, Hive, etc. This book is designed to be more hands on rather than plain theory. In this book, I will explain the Hadoop framework and how it works behind the scenes. Then we will shift our focus to learn specifically about Hadoop Shell. Hadoop comes with an inbuilt shell which is inspired from Linux Shell and has many similar concepts. To make our learning interesting, I have categorized various important shell commands in such a way that can be used to solve some real world like problems. These problems are inspired by real scenarios faced during several years of my working as a big data specialist."

Full Product Details

Author:   Neeraj Malhotra
Publisher:   Createspace Independent Publishing Platform
Imprint:   Createspace Independent Publishing Platform
Dimensions:   Width: 21.60cm , Height: 0.90cm , Length: 27.90cm
Weight:   0.458kg
ISBN:  

9781717577511


ISBN 10:   1717577512
Pages:   138
Publication Date:   27 April 2018
Audience:   General/trade ,  General
Format:   Paperback
Publisher's Status:   Active
Availability:   Available To Order   Availability explained
We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately.

Table of Contents

Reviews

Author Information

Tab Content 6

Author Website:  

Customer Reviews

Recent Reviews

No review item found!

Add your own review!

Countries Available

All regions
Latest Reading Guide

MRG2025CC

 

Shopping Cart
Your cart is empty
Shopping cart
Mailing List