|
![]() |
|||
|
||||
OverviewGet a solid grounding in Apache Oozie, the workflow scheduler system for managing Hadoop jobs. With this hands-on guide, two experienced Hadoop practitioners walk you through the intricacies of this powerful and flexible platform, with numerous examples and real-world use cases.Once you set up your Oozie server, you ll dive into techniques for writing and coordinating workflows, and learn how to write complex data pipelines. Advanced topics show you how to handle shared libraries in Oozie, as well as how to implement and manage Oozie s security capabilities.Install and configure an Oozie server, and get an overview of basic conceptsJourney through the world of writing and configuring workflowsLearn how the Oozie coordinator schedules and executes workflows based on triggersUnderstand how Oozie manages data dependenciesUse Oozie bundles to package several coordinator apps into a data pipelineLearn about security features and shared library managementImplement custom extensions and write your own EL functions and actionsDebug workflows and manage Oozie s operational details Full Product DetailsAuthor: Mohammad Kamrul Islam , Aravind SrinivasanPublisher: O'Reilly Media Imprint: O'Reilly Media ISBN: 9781449369774ISBN 10: 1449369774 Publication Date: 12 May 2015 Audience: General/trade , General Format: Electronic book text Publisher's Status: Active Availability: Available To Order ![]() We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately. Table of ContentsReviewsAuthor InformationMohammad Kamrul Islam is currently working at Uber in data engineering team as a Staff Software Engineer. Previously, he worked at Linkedin for more than two years as Staff Software Engineer in the Hadoop development team. Before that, he worked at Yahoo for nearly five years as an Oozie architect/technical lead. His fingerprints can befound all over Oozie and is a respected voice in the Oozie community. He has been intimately involved with the Apache Hadoop ecosystem since 2009. Mohammad has a Ph.D. in Computer Science with a specialization in parallel job scheduling from Ohio State University. He received his MSCS degree from Wright State University, Ohio andBSCS from Bangladesh University of Engineering and Technology (BUET). He is a Project Management Committee (PMC) member of both Apache Oozie and Apache TEZ and frequently contributes to Apache YARN/MapReduce and Apache Hive. He was elected as the PMC chair and Vice-President of Oozie as part of the Apache Software Foundation from 2013 through 2015. Tab Content 6Author Website:Countries AvailableAll regions |