hive相关命令总结

最新推荐文章于 2023-12-13 08:45:14 发布

吃菜拌胡椒

最新推荐文章于 2023-12-13 08:45:14 发布

阅读量237

点赞数

分类专栏：大数据文章标签： hive

本文链接：https://blog.youkuaiyun.com/qq_42221092/article/details/104670978

版权

3 篇文章

订阅专栏

hive导出数据

hive -e “sql语句” > 路径

这个方法最为常见,sql的查询结果将直接保存到/home/output/out.txt中

hive -e "select user, login_timestamp from user_login" > /home/output/out.txt

当sql脚本过多时，也可以使用 -f sql文件名，按下面的方式执行查询，并保存结果

hive -f file.sql > /home/hadoop/output/cai/out.txt

hive导出数据默认分隔符为"\t"，需要转换成","
在执行语句后加入 | tr "\t" ","

hive -e "select * from table "  | tr "\t" ","> /home/output/out.csv

有些文件包含中文在导出csv后可能回出现乱码情况

增加：set hive.cli.print.header=true;

hive -e "set hive.cli.print.header=true; select user, login_timestamp from user_login" > /home/output/out.txt

可能原因是被更高资源的任务抢占了，导致失败次数超过设定的失败次数，进而报错。尝试通过下面代码解决。

set hive.vectorized.execution.enabled=false;

set hive.mapred.mode = strict;
set hive.mapred.mode = nonstrict;

set hive.cli.print.header=true;

set hive.cli.print.current.db=true;

select reflect("java.net.URLDecoder", "decode", "%E4%B8%AD%E5%9B%BD", "UTF-8");