hive 记录帖 待整理

本文展示了一个使用Hive SQL进行复杂查询的例子,包括使用窗口函数row_number()进行行号分配,并结合子查询和条件过滤来精确获取所需数据。此外,还详细列出了查询计划,帮助理解Hive如何执行SQL语句。
select loanno,amount,limitpaydate,curplannum,
       row_number() over(PARTITION  by loanno ORDER by curplannum) as rn
 from ods.o_m20_cf_plan_s t
      where dt='2016-04-21'
      and substr(limitpaydate,1,10)>='2016-04-21'
      and loanno='2015112010543834'; 

2015112010543834    4500.0    2016-04-21 00:00:00    5    1
2015112010543834    4500.0    2016-05-21 00:00:00    6    2
2015112010543834    4500.0    2016-06-21 00:00:00    7    3
2015112010543834    4500.0    2016-10-21 00:00:00    11    4
2015112010543834    4500.0    2016-07-21 00:00:00    8    1
2015112010543834    4500.0    2016-08-21 00:00:00    9    2
2015112010543834    4500.0    2016-09-21 00:00:00    10    3


STAGE DEPENDENCIES:
  Stage-1 is a root stage
  Stage-0 depends on stages: Stage-1

STAGE PLANS:
  Stage: Stage-1
    Map Reduce
      Map Operator Tree:
          TableScan
            alias: t
            filterExpr: ((substr(limitpaydate, 1, 10) >= '2016-04-21') and (loanno = '2015112010543834')) (type: boolean)
            Statistics: Num rows: 4792293 Data size: 10240206016 Basic stats: COMPLETE Column stats: NONE
            Filter Operator
              predicate: ((substr(limitpaydate, 1, 10) >= '2016-04-21') and (loanno = '2015112010543834')) (type: boolean)
              Statistics: Num rows: 798715 Data size: 1706699934 Basic stats: COMPLETE Column stats: NONE
              Reduce Output Operator
                key expressions: '2015112010543834' (type: string), curplannum (type: bigint)
                sort order: ++
                Statistics: Num rows: 798715 Data size: 1706699934 Basic stats: COMPLETE Column stats: NONE
                value expressions: '2015112010543834' (type: string), curplannum (type: bigint), limitpaydate (type: string), amount (type: double)
      Reduce Operator Tree:
        Extract
          Statistics: Num rows: 798715 Data size: 1706699934 Basic stats: COMPLETE Column stats: NONE
          PTF Operator
            Statistics: Num rows: 798715 Data size: 1706699934 Basic stats: COMPLETE Column stats: NONE
            Select Operator
              expressions: '2015112010543834' (type: string), _col10 (type: double), _col8 (type: string), _col7 (type: bigint), _wcol0 (type: int)
              outputColumnNames: _col0, _col1, _col2, _col3, _col4
              Statistics: Num rows: 798715 Data size: 1706699934 Basic stats: COMPLETE Column stats: NONE
              File Output Operator
                compressed: false
                Statistics: Num rows: 798715 Data size: 1706699934 Basic stats: COMPLETE Column stats: NONE
                table:
                    input format: org.apache.hadoop.mapred.TextInputFormat
                    output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
                    serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe

  Stage: Stage-0
    Fetch Operator
      limit: -1
      Processor Tree:
        ListSink
select * from (
SELECT loanno 
  ,amount 
  ,limitpaydate 
  ,curplannum 
  ,row_number() over(partition BY loanno ORDER BY curplannum ASC) AS rn
FROM ods.o_m20_cf_plan_s                                             t
WHERE dt='2016-04-21'
    AND SUBSTR(limitpaydate,1,10)>='2016-04-21'
) t1 where loanno='2015112010543834'
2015112010543834    4500.0  2016-04-21 00:00:00 5   1
2015112010543834    4500.0  2016-05-21 00:00:00 6   2
2015112010543834    4500.0  2016-06-21 00:00:00 7   3
2015112010543834    4500.0  2016-07-21 00:00:00 8   4
2015112010543834    4500.0  2016-08-21 00:00:00 9   5
2015112010543834    4500.0  2016-09-21 00:00:00 10  6
2015112010543834    4500.0  2016-10-21 00:00:00 11  7

STAGE DEPENDENCIES:
  Stage-1 is a root stage
  Stage-0 depends on stages: Stage-1

STAGE PLANS:
  Stage: Stage-1
    Map Reduce
      Map Operator Tree:
          TableScan
            alias: t
            filterExpr: ((dt = '2016-04-21') and (substr(limitpaydate, 1, 10) >= '2016-04-21')) (type: boolean)
            Statistics: Num rows: 4792293 Data size: 10240206016 Basic stats: COMPLETE Column stats: NONE
            Filter Operator
              predicate: (substr(limitpaydate, 1, 10) >= '2016-04-21') (type: boolean)
              Statistics: Num rows: 1597431 Data size: 3413402005 Basic stats: COMPLETE Column stats: NONE
              Reduce Output Operator
                key expressions: loanno (type: string), curplannum (type: bigint)
                sort order: ++
                Map-reduce partition columns: loanno (type: string)
                Statistics: Num rows: 1597431 Data size: 3413402005 Basic stats: COMPLETE Column stats: NONE
                value expressions: loanno (type: string), curplannum (type: bigint), limitpaydate (type: string), amount (type: double)
      Reduce Operator Tree:
        Extract
          Statistics: Num rows: 1597431 Data size: 3413402005 Basic stats: COMPLETE Column stats: NONE
          PTF Operator
            Statistics: Num rows: 1597431 Data size: 3413402005 Basic stats: COMPLETE Column stats: NONE
            Filter Operator
              predicate: (_col6 = '2015112010543834') (type: boolean)
              Statistics: Num rows: 798715 Data size: 1706699934 Basic stats: COMPLETE Column stats: NONE
              Select Operator
                expressions: '2015112010543834' (type: string), _col10 (type: double), _col8 (type: string), _col7 (type: bigint), _wcol0 (type: int)
                outputColumnNames: _col0, _col1, _col2, _col3, _col4
                Statistics: Num rows: 798715 Data size: 1706699934 Basic stats: COMPLETE Column stats: NONE
                File Output Operator
                  compressed: false
                  Statistics: Num rows: 798715 Data size: 1706699934 Basic stats: COMPLETE Column stats: NONE
                  table:
                      input format: org.apache.hadoop.mapred.TextInputFormat
                      output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
                      serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe

  Stage: Stage-0
    Fetch Operator
      limit: -1
      Processor Tree:
        ListSink
标题基于Python的自主学习系统后端设计与实现AI更换标题第1章引言介绍自主学习系统的研究背景、意义、现状以及本文的研究方法和创新点。1.1研究背景与意义阐述自主学习系统在教育技术领域的重要性和应用价值。1.2国内外研究现状分析国内外在自主学习系统后端技术方面的研究进展。1.3研究方法与创新点概述本文采用Python技术栈的设计方法和系统创新点。第2章相关理论与技术总结自主学习系统后端开发的相关理论和技术基础。2.1自主学习系统理论阐述自主学习系统的定义、特征和理论基础。2.2Python后端技术栈介绍DjangoFlask等Python后端框架及其适用场景。2.3数据库技术讨论关系型和非关系型数据库在系统中的应用方案。第3章系统设计与实现详细介绍自主学习系统后端的设计方案和实现过程。3.1系统架构设计提出基于微服务的系统架构设计方案。3.2核心模块设计详细说明用户管理、学习资源管理、进度跟踪等核心模块设计。3.3关键技术实现阐述个性化推荐算法、学习行为分析等关键技术的实现。第4章系统测试与评估对系统进行功能测试和性能评估。4.1测试环境与方法介绍测试环境配置和采用的测试方法。4.2功能测试结果展示各功能模块的测试结果和问题修复情况。4.3性能评估分析分析系统在高并发等场景下的性能表现。第5章结论与展望总结研究成果并提出未来改进方向。5.1研究结论概括系统设计的主要成果和技术创新。5.2未来展望指出系统局限性并提出后续优化方向。
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值