|
![]() |
|||
|
||||
OverviewIf your organization is about to enter the world of big data, you not only need to decide whether Apache Hadoop is the right platform to use, but also which of its many components are best suited to your task. This field guide makes the exercise manageable by breaking down the Hadoop ecosystem into short, digestible sections. You ll quickly understand how Hadoop s projects, subprojects, and related technologies work together.Each chapter introduces a different topic such as core technologies or data transfer and explains why certain components may or may not be useful for particular needs. When it comes to data, Hadoop is a whole new ballgame, but with this handy reference, you ll have a good grasp of the playing field.Topics include: Core technologies Hadoop Distributed File System (HDFS), MapReduce, YARN, and SparkDatabase and data management Cassandra, HBase, MongoDB, and HiveSerialization Avro, JSON, and ParquetManagement and monitoring Puppet, Chef, Zookeeper, and OozieAnalytic helpers Pig, Mahout, and MLLibData transfer Scoop, Flume, distcp, and StormSecurity, access control, auditing Sentry, Kerberos, and KnoxCloud computing and virtualization Serengeti, Docker, and Whirr Full Product DetailsAuthor: Kevin Sitto , Marshall PresserPublisher: O'Reilly Media Imprint: O'Reilly Media ISBN: 9781336095755ISBN 10: 133609575 Pages: 132 Publication Date: 01 January 2015 Audience: General/trade , General Format: Electronic book text Publisher's Status: Active Availability: Available To Order ![]() We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately. Table of ContentsReviewsAuthor InformationTab Content 6Author Website:Countries AvailableAll regions |