工作中经常会出现 Spark SQL 执行很慢或者失败的情况,如果要排查问题,就必须要学会看 Spark Web UI。可以参考官网来学习:https://spark.apache.org/docs/3.2.1/web-ui.html#content。关于 Spark Web UI,上面有很多个 tab 页,后面逐一学习。
昨天已经学习了 Jobs,今天学习 Stages。
Stages Tab
The Stages tab displays a summary page that shows the current state of all stages of all jobs in the Spark application.
At the beginning of the page is the summary with the count of all stages by status (active, pending, completed, skipped, and failed)
Stages 选项卡显示一个摘要页面,该页面显示 Spark 应用程序中所有作业的所有阶段的当前状态。 页面开头是按状态(活动、待处理、已完成、已跳过和失败)统计所有阶段的摘要。
In Fair scheduling mode there is a table that displays pools properties
在公平调度模式下,有一个显示池属性的表。
After that are the details of stages per status (active, pending, completed, skipped, failed). In active stages, it’s possible to kill the stage with the