Workflow Definition
A workflow definition is a DAG with control flow nodes (start, end, decision, fork, join, kill) or action nodes (map-reduce, pig, etc.), nodes are connected by transitions arrows.
The workflow definition language is XML based and it is called hPDL (Hadoop Process Definition Language).
Oozie does not support cycles in workflow definitions, workflow definitions must be a strict DAG.
Workflow Nodes
Workflow nodes are classified in control flow nodes and action nodes:
- Control flow nodes: nodes that control the start and end of the workflow and workflow job execution path.
- Action nodes: nodes that trigger the execution of a computation/processing task.
Node names and transitions must be conform to the following pattern =[a-zA-Z][\-_a-zA-Z0-0]*=, of up to 20 characters long.

Reference
https://oozie.apache.org/docs/4.0.1/WorkflowFunctionalSpec.html
本文详细介绍了Apache Oozie工作流定义的基本概念,包括工作流定义语言hPDL,控制流节点如开始、结束、决策等,以及行动节点如Map-Reduce任务。文章还强调了工作流定义必须遵循的DAG结构要求。
256

被折叠的 条评论
为什么被折叠?



