CDH6.3.0环境使用set hive.execution.engine=spark启用spark引擎执行hql报错
背景描述
此前在开发环境测试使用hudi将数据写入hive表时,在/opt/cloudera/parcels/CDH/lib/hive/lib路径下新增hudi相关jar包:hudi-hadoop-mr-bundle-0.11.1.jar
报错内容
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.spark.SparkTask. hbase-default.xml file seems to be for an older version of HBase (2.4.9), this version is 2.1.0-cdh6.3.0
从内容看是在task拆分阶段返回了报错,报错原因是hbase的版本过低。通过查阅网上资料最终把错误定位在/opt/cloudera/parcels/CDH/lib/hive/lib路径下的jar包里,排查发现hudi相关jar包:hudi-hadoop-mr-bundle-0.11.1.jar,尝试移除该jar包后,重试hive on spark可以正常执行hql