
hadoop
gdolphinw
这个作者很懒,什么都没留下…
展开
专栏收录文章
- 默认排序
- 最新发布
- 最早发布
- 最多阅读
- 最少阅读
-
python操作hive
安装pyhive from pyhive import hive cursor=hive.connect("10.3.1.8",database="house").cursor() cursor.execute("show tables") for cur in cursor: print cur import pandas as pd from pyhive import ...原创 2018-09-12 18:54:18 · 812 阅读 · 0 评论 -
hdfs fsck
fsck(file system check) ,用于检查HDFS上文件和目录的健康状态、获取文件的block块信息和位置信息等。 #hdfs dfs -ls /user/gewei/ #hdfs dfs -rm /user/gewei/houseApplyTest.csv hadoop fs -put houseApplyTest.csv hdfs:/user/gewei [dev@cd...原创 2019-02-28 16:49:26 · 685 阅读 · 0 评论 -
mapreduce
[dev@cdh1 python]$ vim genWordCount.py import random list =[] lineNum=0 with open("wordcount.txt","a") as w: for i in range(5000000): apply = random.randint(3, 10) word='' ...原创 2019-03-08 18:10:02 · 274 阅读 · 0 评论 -
oozie相关操作
HUE oozie创建Workflow和Scheduler 创建Workflow HUE查看Jobs 通过oozie查看jobs #oozie查看log信息 [dev@cdh1 oozie]$ oozie job -oozie http://cdh1:11000/oozie -info 0000063-190620162727859-oozie...原创 2019-06-21 14:40:54 · 256 阅读 · 0 评论 -
安装CDH5.14.2步骤
CDH 5 and Cloudera Manager 5 Requirements and Supported Versions 可以查看对JDK、数据库版本的要求 https://www.cloudera.com/documentation/enterprise/release-notes/topics/rn_consolidated_pcm.html https://blo...原创 2019-06-12 11:17:59 · 754 阅读 · 0 评论