【Hadoop实战02】在单机的Hadoop系统中运行WordCount程序

最新推荐文章于 2021-10-19 17:37:41 发布

悟空编程

最新推荐文章于 2021-10-19 17:37:41 发布

阅读量1.3k

点赞数

CC 4.0 BY-SA版权

分类专栏： hadoop 文章标签： hadoop

本文链接：https://blog.youkuaiyun.com/wukongcode/article/details/17535895

hadoop 专栏收录该内容

2 篇文章

订阅专栏

1、启动Hadoop

在上一篇文章中，已经搭建好了Hadoop环境，现在我们启动Hadoop，但是在启动Hadoop之前我们要做一些配置工作。

配置JAVA_HOME

进入到hadoop安装目录下的conf文件夹，这里为：/opt/hadoop-1.2.1/conf，编辑此文件夹中的hadoop-env.sh如下：

# Set Hadoop-specific environment variables here.

# The only required environment variable is JAVA_HOME.  All others are
# optional.  When running a distributed configuration it is best to
# set JAVA_HOME in this file, so that it is correctly defined on
# remote nodes.

# The java implementation to use.  Required.
export JAVA_HOME=/opt/jdk1.7.0_25

# Extra Java CLASSPATH elements.  Optional.
# export HADOOP_CLASSPATH=

# The maximum amount of heap to use, in MB. Default is 1000.
# export HADOOP_HEAPSIZE=2000

# Extra Java runtime options.  Empty by default.
# export HADOOP_OPTS=-server

接着我们配置下core-site.xml，如下：

<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>

<!-- Put site-specific property overrides in this file. -->

<configuration>
    <property>
           <name>fs.default.name</name>
           <value>hdfs://localhost:9000</value>
    </property>
    <property>
           <name>hadoop.tmp.dir</name>
           <value>/var/hadoop/hadoop-${user.name}</value>
    </property>
</configuration>

然后配置hdfs-site.xml：

<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>

<!-- Put site-specific property overrides in this file. -->

<configuration>
     <property>
        <name>dfs.replication</name>
        <value>1</value>
     </property>
</configuration>

然后配置mapred-site.xml

<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>

<!-- Put site-specific property overrides in this file. -->

<configuration>
         <property>
                <name>mapred.job.tracker</name>
                <value>localhost:9001</value>
        </property>
        <property>
                <name>mapred.child.tmp</name>
                <value>/opt/temp</value>
        </property>

</configuration>

配置完之后，我们就可以启动Hadoop了，在启动之前一定要先格式化namenode