
hadoop
gdolphinw
Working with Hive from Python
Install pyhive:
from pyhive import hive
cursor = hive.connect("10.3.1.8", database="house").cursor()
cursor.execute("show tables")
for cur in cursor:
    print(cur)
import pandas as pd
from pyhive import ...
Original · 2018-09-12 18:54:18 · 802 views · 0 comments
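The excerpt is cut off, so here is a minimal sketch of the same "show tables" query wrapped in two small functions. The names `list_tables` and `connect_to_hive` are made up for illustration and are not from the original post. The host and database are the ones shown in the excerpt, and `pyhive` is assumed to be installed and able to reach the HiveServer2 instance.

```python
def list_tables(cursor):
    """Run SHOW TABLES on a DB-API cursor and return the table names."""
    cursor.execute("show tables")
    # Each row comes back as a tuple such as ('table_name',); keep column 0.
    return [row[0] for row in cursor.fetchall()]


def connect_to_hive(host="10.3.1.8", database="house"):
    """Open a Hive connection (hypothetical helper, defaults from the excerpt)."""
    # Imported here so list_tables() can be used and tested without pyhive.
    from pyhive import hive
    return hive.connect(host, database=database)
```

Calling `list_tables(connect_to_hive().cursor())` should give the same names that the loop in the excerpt prints, as a plain list instead of one tuple per line.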
hdfs fsck
fsck (file system check) checks the health of files and directories on HDFS and reports which blocks make up a file and where those blocks are stored.
#hdfs dfs -ls /user/gewei/
#hdfs dfs -rm /user/gewei/houseApplyTest.csv
hadoop fs -put houseApplyTest.csv hdfs:/user/gewei
[dev@cd...
Original · 2019-02-28 16:49:26 · 672 views · 0 comments
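The excerpt stops before the fsck call itself, so the following is only a sketch of how such a check might be scripted from Python. `fsck_command` and `run_fsck` are hypothetical helper names. The flags `-files`, `-blocks` and `-locations` are standard `hdfs fsck` options, and each one only makes sense when the previous one is also given. Running it assumes the `hdfs` client is on the PATH.

```python
import subprocess


def fsck_command(path, files=True, blocks=True, locations=True):
    """Build the argument list for `hdfs fsck` on the given HDFS path."""
    cmd = ["hdfs", "fsck", path]
    # The options nest: -blocks needs -files, -locations needs -blocks.
    if files:
        cmd.append("-files")
        if blocks:
            cmd.append("-blocks")
            if locations:
                cmd.append("-locations")
    return cmd


def run_fsck(path, **options):
    """Run fsck and return its text report (requires a configured hdfs client)."""
    result = subprocess.run(
        fsck_command(path, **options), capture_output=True, text=True
    )
    return result.stdout
```

For the file uploaded in the excerpt, `run_fsck("/user/gewei/houseApplyTest.csv")` would return the report with per-file block IDs and the DataNodes holding each replica.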
mapreduce
[dev@cdh1 python]$ vim genWordCount.py
import random
list = []
lineNum = 0
with open("wordcount.txt", "a") as w:
    for i in range(5000000):
        apply = random.randint(3, 10)
        word = ''
        ...
Original · 2019-03-08 18:10:02 · 261 views · 0 comments
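The script is truncated right after `word=''`, so how the original builds each line is unknown. As a rough sketch of one way to generate word count test data, the version below assumes each word is a run of random lowercase letters whose length is drawn with `random.randint(3, 10)`, as in the excerpt. The function names, the `words_per_line` parameter and the `seed` parameter are assumptions added for illustration.

```python
import random
import string


def random_word(rng, min_len=3, max_len=10):
    """Return a random lowercase word with min_len..max_len letters."""
    length = rng.randint(min_len, max_len)
    return "".join(rng.choice(string.ascii_lowercase) for _ in range(length))


def write_words(path, num_lines, words_per_line=5, seed=None):
    """Write num_lines lines of space-separated random words to path."""
    rng = random.Random(seed)  # a fixed seed makes the output reproducible
    with open(path, "w") as out:
        for _ in range(num_lines):
            line = " ".join(random_word(rng) for _ in range(words_per_line))
            out.write(line + "\n")
```

`write_words("wordcount.txt", 5000000)` would produce a file of the size the excerpt aims for. A smaller `num_lines` is handy for trying a MapReduce job locally first.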
Oozie operations
Creating a Workflow and a Scheduler with Oozie in HUE
Creating a Workflow
Viewing jobs in HUE
Viewing jobs with oozie
#View log information with oozie
[dev@cdh1 oozie]$ oozie job -oozie http://cdh1:11000/oozie -info 0000063-190620162727859-oozie...
Original · 2019-06-21 14:40:54 · 251 views · 0 comments
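For repeated lookups, the `oozie job` call can be wrapped in a small Python helper. This is only a sketch: `oozie_job_command` and `run_oozie_job_command` are hypothetical names, the server URL is the one from the excerpt, and the job ID has to be supplied by the caller because the one in the excerpt is truncated. `-info` and `-log` are existing sub-options of `oozie job`.

```python
import subprocess

OOZIE_URL = "http://cdh1:11000/oozie"  # server URL as shown in the excerpt


def oozie_job_command(action, job_id, oozie_url=OOZIE_URL):
    """Build an `oozie job` command; action is 'info' or 'log'."""
    if action not in ("info", "log"):
        raise ValueError("action must be 'info' or 'log'")
    return ["oozie", "job", "-oozie", oozie_url, "-" + action, job_id]


def run_oozie_job_command(action, job_id, oozie_url=OOZIE_URL):
    """Run the command and return its output (requires the oozie client)."""
    result = subprocess.run(
        oozie_job_command(action, job_id, oozie_url),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if oozie exits with an error
    )
    return result.stdout
```

With a full workflow job ID in hand, `run_oozie_job_command("info", job_id)` returns the status summary and `run_oozie_job_command("log", job_id)` returns the job log.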
Steps for installing CDH 5.14.2
See "CDH 5 and Cloudera Manager 5 Requirements and Supported Versions" for the required JDK and database versions:
https://www.cloudera.com/documentation/enterprise/release-notes/topics/rn_consolidated_pcm.html
https://blo...
Original · 2019-06-12 11:17:59 · 730 views · 0 comments