[ambari hdp]YarnSchedulerBackend$YarnSchedulerEndpoint: Container marked as failed

本文记录了在Ambari HDP2.6.3版本下提交Spark任务时遇到“Container marked as failed exitcode1”错误的解决过程。通过调整executor数量与内存分配,成功解决了内存不足导致的任务失败问题。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

最近在使用ambari hdp 2.6.3版本,过程中提交spark程序时报如下错误:

YarnSchedulerBackend$YarnSchedulerEndpoint: Container marked as failed exit code 1

看了官方的解释,链接如下,大概意思是说你在提交spark任务时的contanier的内存总大小(每个excutor个数乘上每个excutor的内存),超过了在ambari yarn中配置的container的总大小。

https://community.hortonworks.com/questions/42782/container-marked-as-failed-spark-yarn.html

 

我的集群配置如下:

spark 任务提交时的配置如下:

尽量减少executor的数量和增加每个excutor的内存,开始的时候我的executor的个数是3,每个executor的内存是500M,后来

修改executor的个数为1,每个executor的内存为800M就ok了,根据经验这个错误的出现也有可能是executor的内存台太小而任务需要的内存比较大,此时相应的将executor的内存设置的大些,就可以成功了运行任务,希望能对你有帮助,同时作为一次小计,供以后查阅。

 

2025-04-02 11:24:49,591 ERROR org.apache.spark.scheduler.cluster.YarnScheduler: Lost executor 5 on dominos-usdp-v3-pro02: Container from a bad node: container_e35_1709867558487_220002_02_000002 on host: dominos-usdp-v3-pro02. Exit status: 143. Diagnostics: [2025-04-02 11:24:49.169]Container killed on request. Exit code is 143 [2025-04-02 11:24:49.170]Container exited with a non-zero exit code 143. [2025-04-02 11:24:49.170]Killed by external signal . 2025-04-02 11:24:49,592 WARN org.apache.spark.scheduler.TaskSetManager: Lost task 0.0 in stage 2.0 (TID 5) (dominos-usdp-v3-pro02 executor 5): ExecutorLostFailure (executor 5 exited caused by one of the running tasks) Reason: Container from a bad node: container_e35_1709867558487_220002_02_000002 on host: dominos-usdp-v3-pro02. Exit status: 143. Diagnostics: [2025-04-02 11:24:49.169]Container killed on request. Exit code is 143 [2025-04-02 11:24:49.170]Container exited with a non-zero exit code 143. [2025-04-02 11:24:49.170]Killed by external signal . 2025-04-02 11:24:49,594 WARN org.apache.spark.scheduler.cluster.YarnSchedulerBackend$YarnSchedulerEndpoint: Requesting driver to remove executor 5 for reason Container from a bad node: container_e35_1709867558487_220002_02_000002 on host: dominos-usdp-v3-pro02. Exit status: 143. Diagnostics: [2025-04-02 11:24:49.169]Container killed on request. Exit code is 143 [2025-04-02 11:24:49.170]Container exited with a non-zero exit code 143. [2025-04-02 11:24:49.170]Killed by external signal . 2025-04-02 11:26:37,187 ERROR org.apache.spark.scheduler.cluster.YarnScheduler: Lost executor 6 on dominos-usdp-v3-pro03: Container from a bad node: container_e35_1709867558487_220002_02_000003 on host: dominos-usdp-v3-pro03. Exit status: 143. Diagnostics: [2025-04-02 11:26:36.792]Container killed on request. Exit code is 143 [2025-04-02 11:26:36.793]Container exited with a non-zero exit code 143.
最新发布
04-03
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值