Hive数仓导入导出,分隔符
移除点击此处添加图片说明文字
首先看下导出吧,我们准备sql。那么首先我们看到,两张表给运营提供数据。另一方面,让我们看看hive的强大。
先看一下两张表数据量:
select count(dt) from d.d_safedata_base_day 数据量:10836008
select count(dt) from d.d_safedata_applist_day 数据量: 346756144

drop table testing.weiboapplist;
接着创建中间表,查询微博的应用用户使用情况。
create table testing.weiboapplist as
select
a.imei,
a.imsi,
a.uuid,
a.app_pakeage_name,
a.app_name ,
a.version_code,
a.version_name,
b.datetime,
a.dt,
b.mac,
b.mobile_brand,
b.product_code,
b.model
from
(
select
imei,imsi,uuid,app_pakeage_name,app_name,version_code,version_name,dt,timestamp
from d.d_safedata_applist_day where dt between '20170601' and '20170630' and uuid <> '' and uuid is not null and app_name='微博'
group by imei,imsi,uuid,app_pakeage_name,app_name,version_code,version_name,dt,timestamp
) a