mysql explain的bug

最新推荐文章于 2025-09-11 20:53:34 发布

转载最新推荐文章于 2025-09-11 20:53:34 发布 · 93 阅读

0 ·

CC 4.0 BY-SA版权

原文链接：https://yq.aliyun.com/articles/434651

文章标签：

#数据库 #大数据 #运维

通过对Zabbix监控数据的分析，采用SQL查询优化的方法来评估Hadoop集群中Impala的内存使用情况。通过将子查询转换为多表连接的方式显著提高了查询效率。

最近在做hadoop集群的容量数据，主要依据zabbix的监控数据，因为要计算impala的内存使用情况，就使用了下面的sql

select a.host,avg(b.value) from

(select a.host,b.itemid,b.key_ from hosts a,items b where

a.hostid=b.hostid and a.host like '%hadoop-datanode%' and b.key_='impala.get[mem]')a
join

(select itemid,clock,value from history) b on a.itemid=b.itemid

and b.clock between unix_timestamp('2014-02-28 00:00:00') and

unix_timestamp('2014-03-06 00:00:00') group by a.host;

在使用explain时发现巨慢，一个生成执行计划的操作都这么慢？

考虑到sql的性能优化，就把上面的查询写成了3个表的join:

select

a.host,avg(c.value) from hosts a,items b,history c where

a.hostid=b.hostid and a.host like '%hadoop-datanode%' and

b.key_='impala.get[mem]'
and

b.itemid=c.itemid and c.clock between

unix_timestamp('2014-02-28 00:00:00') and unix_timestamp('2014-03-06 00:00:00') group by a.host;