hive 表创建及字段信息管理

原创

已于 2023-09-04 15:17:08 修改 · 1.2w 阅读

31 ·

CC 4.0 BY-SA版权

文章标签：

#hive

于 2020-11-25 16:31:07 首次发布

1. 分区表创建及数据导入

1.1 创建分区表

-- 以日期pt分区，字段用\t分隔，输入格式为txt,存储格式为orc
use db_name;
drop table if exists tablename;
CREATE TABLE IF NOT EXISTS tablename (
    aid string,
    gender int,                   --性别
    age string,                   --年龄
    num bigint,                
    value1 array<int>,      
    value2 array<string> 
) partitioned by (pt string comment "YYYY-MM-DD.HH_MM")
-- NULL DEFINED as 'null'
stored as orc  -- textfile
-- row format delimited fields terminated by '\t'
-- STORED AS INPUTFORMAT 'org.apache.hadoop.mapred.TextInputFormat'
-- OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat'
;

-- 存储格式亦可指定为txt
OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'

注意：内部表与外部表的区别