
hive
diudiu2025
这个作者很懒,什么都没留下…
展开
-
hive初次使用报错
hive(元数据存储在mysql) 启动Exception in thread "main" java.lang.RuntimeException: Hive metastore database is not initialized. Please use schematool (e.g. ./schematool -initSchema -dbType ...) to create t转载 2016-07-12 17:02:39 · 9232 阅读 · 0 评论 -
Hive对有null值得一列做avg,count等操作时会过滤掉有NULL值的这一行
WITH tmp AS(SELECT null as col1union allSELECT 2 as col1union allSELECT 4 as col1)SELECT avg(1) from tmp结果是3;WITH tmp AS(SELECT null as col1union allSELECT 2 as col1union原创 2017-11-21 22:26:56 · 11225 阅读 · 0 评论 -
hive踩坑记录:count(distinct col1,col2) 遇见某列中有null值,结果不准
count(distinct col1,col2) 遇见某列中中有null值,结果不准SELECT count(DISTINCT col1,col2)from(SELECT 2 as col1,1 as col2union allSELECT null as col1,2 as col2union allSELECT null as col1,3 as col2un...原创 2017-12-28 10:47:53 · 3308 阅读 · 0 评论 -
hive对列按顺序转换为行
--创造数据create table persona.test_hz 已有数据1 1 a1 2 b1 3 c1 4 d1 5 e2 5 e2 4 d2 3 c2 2 b2 1 a3 1 a3 2 b3 3 c3 4 d3 5 e4 5 e4 4 d4 3 c4 2 b4 1 aselect id,collect_list(value) from (select * from persona.tes...原创 2018-06-27 19:52:33 · 3481 阅读 · 0 评论 -
Hive分析窗口函数(一) SUM,AVG,MIN,MAX
Hive中提供了越来越多的分析函数,用于完成负责的统计分析。抽时间将所有的分析窗口函数理一遍,将陆续发布。今天先看几个基础的,SUM、AVG、MIN、MAX。用于实现分组内所有和连续累积的统计。Hive版本为 apache-hive-0.13.1数据准备 CREATE EXTERNAL TABLE lxw1234 ( cookieid string, createt...转载 2019-01-23 10:43:35 · 364 阅读 · 0 评论