hadoop use at first

本文详细介绍了如何使用Hadoop核心版本0.20.203.0.jar执行WordCount任务,包括编译Java代码、打包成jar文件、修改core-site.xml配置以改变HDFS目录、格式化NameNode、运行jar文件并准备输入数据。
1. compile the WordCount.java

javac -classpath ./hadoop-core-0.20.203.0.jar  -d ../hdp_test -s ../hdp_test  ../hdp_test/WordCo
unt.java -Xlint:deprecation

2. jar it  
jar -cvf ~/hdp_test/WordCount.jar .


3. change core-site.xml
add the dfs property,
because the format meta data of hdfs was save in the /tmp in default,
so it 's clear after reboot the machine
change the dfs proterty block:

    <property>
        <name>hadoop.tmp.dir</name>
        <value>/home/chenglun/tmp</value>
        <description>A base for other temporary directories</description>
    </property>

4.
hadoop namenode format

5. run it
5.0 prepare the input data
hadoop jar WordCount.jar  org.myorg.WordCount /home/chenglun/input /home/chenglun/output

Appendix: hdfs shell:

hadoop fs -ls  [path]
hadoop fs -cat [path]
hadoop fs -mkdir [dir]
hadoop fs -copyFromLocal local URI

lishu@node01:~/hadoop-3.3.6$ sbin/start-all.sh WARNING: Attempting to start all Apache Hadoop daemons as lishu in 10 seconds. WARNING: This is not a recommended production deployment configuration. WARNING: Use CTRL-C to abort. Starting namenodes on [node01] node01: namenode is running as process 16137. Stop it first and ensure /tmp/hadoop-lishu-namenode.pid file is empty before retry. Starting datanodes 2025-11-10 13:11:24,978 ERROR conf.Configuration: error parsing conf core-site.xml com.ctc.wstx.exc.WstxEOFException: Unexpected EOF; was expecting a close tag for element <property> at [row,col,system-id]: [12,0,"file:/home/lishu/hadoop-3.3.6/etc/hadoop/core-site.xml"] at com.ctc.wstx.sr.StreamScanner.throwUnexpectedEOF(StreamScanner.java:701) at com.ctc.wstx.sr.BasicStreamReader.throwUnexpectedEOF(BasicStreamReader.java:5590) at com.ctc.wstx.sr.BasicStreamReader.nextFromTree(BasicStreamReader.java:2791) at com.ctc.wstx.sr.BasicStreamReader.next(BasicStreamReader.java:1121) at org.apache.hadoop.conf.Configuration$Parser.parseNext(Configuration.java:3405) at org.apache.hadoop.conf.Configuration$Parser.parse(Configuration.java:3191) at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:3084) at org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:3045) at org.apache.hadoop.conf.Configuration.loadProps(Configuration.java:2923) at org.apache.hadoop.conf.Configuration.getProps(Configuration.java:2905) at org.apache.hadoop.conf.Configuration.set(Configuration.java:1413) at org.apache.hadoop.conf.Configuration.set(Configuration.java:1385) at org.apache.hadoop.conf.Configuration.setBoolean(Configuration.java:1727) at org.apache.hadoop.util.GenericOptionsParser.processGeneralOptions(GenericOptionsParser.java:348) at org.apache.hadoop.util.GenericOptionsParser.parseGeneralOptions(GenericOptionsParser.java:581) at org.apache.hadoop.util.GenericOptionsParser.<init>(GenericOptionsParser.java:182) at org.apache.hadoop.util.GenericOptionsParser.<init>(GenericOptionsParser.java:164) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:76) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:97) at org.apache.hadoop.hdfs.tools.GetConf.main(GetConf.java:361) Exception in thread "main" java.lang.RuntimeException: com.ctc.wstx.exc.WstxEOFException: Unexpected EOF; was expecting a close tag for element <property> at [row,col,system-id]: [12,0,"file:/home/lishu/hadoop-3.3.6/etc/hadoop/core-site.xml"] at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:3101) at org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:3045) at org.apache.hadoop.conf.Configuration.loadProps(Configuration.java:2923) at org.apache.hadoop.conf.Configuration.getProps(Configuration.java:2905) at org.apache.hadoop.conf.Configuration.set(Configuration.java:1413) at org.apache.hadoop.conf.Configuration.set(Configuration.java:1385) at org.apache.hadoop.conf.Configuration.setBoolean(Configuration.java:1727) at org.apache.hadoop.util.GenericOptionsParser.processGeneralOptions(GenericOptionsParser.java:348) at org.apache.hadoop.util.GenericOptionsParser.parseGeneralOptions(GenericOptionsParser.java:581) at org.apache.hadoop.util.GenericOptionsParser.<init>(GenericOptionsParser.java:182) at org.apache.hadoop.util.GenericOptionsParser.<init>(GenericOptionsParser.java:164) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:76) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:97) at org.apache.hadoop.hdfs.tools.GetConf.main(GetConf.java:361) Caused by: com.ctc.wstx.exc.WstxEOFException: Unexpected EOF; was expecting a close tag for element <property> at [row,col,system-id]: [12,0,"file:/home/lishu/hadoop-3.3.6/etc/hadoop/core-site.xml"] at com.ctc.wstx.sr.StreamScanner.throwUnexpectedEOF(StreamScanner.java:701) at com.ctc.wstx.sr.BasicStreamReader.throwUnexpectedEOF(BasicStreamReader.java:5590) at com.ctc.wstx.sr.BasicStreamReader.nextFromTree(BasicStreamReader.java:2791) at com.ctc.wstx.sr.BasicStreamReader.next(BasicStreamReader.java:1121) at org.apache.hadoop.conf.Configuration$Parser.parseNext(Configuration.java:3405) at org.apache.hadoop.conf.Configuration$Parser.parse(Configuration.java:3191) at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:3084) ... 13 more Starting resourcemanagers on [] Starting nodemanagers
最新发布
11-11
评论
成就一亿技术人!
拼手气红包6.0元
还能输入1000个字符
 
红包 添加红包
表情包 插入表情
 条评论被折叠 查看
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值