问题描述:
在集群运行过程中发现一旦主备切换后,原来正常运行的任务在新的主节点上不能自动从启
解决方法:
在yarn-site.xml中增加以下配置项:
<property>
<description>Enable RM to recover state after starting. If true, then yarn.resourcemanager.store.class must be specified</description>
<name>yarn.resourcemanager.recovery.enabled</name>
<value>true</value>
</property>
<property>
<description>The class to use as the persistent store.</description>
<name>yarn.resourcemanager.store.class</name>
<value>org.apache.hadoop.yarn.server.resourcemanager.recovery.ZKRMStateStore</value>
</property>