
Apache Oozie: The Workflow Scheduler for Hadoop (Sách keo gáy, bìa mềm)
Categories:Computers - Networking
Year:2015
Edition:1
Language:english
Pages:271
Get a solid grounding in Apache Oozie, the workflow scheduler system
for managing Hadoop jobs. With this hands-on guide, two experienced
Hadoop practitioners walk you through the intricacies of this powerful
and flexible platform, with numerous examples and real-world use cases.
Once
you set up your Oozie server, you’ll dive into techniques for writing
and coordinating workflows, and learn how to write complex data
pipelines. Advanced topics show you how to handle shared libraries in
Oozie, as well as how to implement and manage Oozie’s security
capabilities.
Install and configure an Oozie server, and get an overview of basic concepts
Journey through the world of writing and configuring workflows
Learn how the Oozie coordinator schedules and executes workflows based on triggers
Understand how Oozie manages data dependencies
Use Oozie bundles to package several coordinator apps into a data pipeline
Learn about security features and shared library management
Implement custom extensions and write your own EL functions and actions
Debug workflows and manage Oozie’s operational details