1. Install the JDK
cd /usr/local
wget http://download.oracle.com/otn-pub/java/jdk/6u27-b07/jdk-6u27-linux-x64-rpm.bin
chmod a+x jdk-6u27-linux-x64-rpm.bin
sudo ./jdk-6u27-linux-x64-rpm.bin
2. Hadoop
(1) Download Hadoop into a directory, for example /home/hadoop:
wget http://mirror.bjtu.edu.cn/apache/hadoop/common/hadoop-0.20.2/hadoop-0.20.2.tar.gz
Extract the archive:
sudo tar -xzf hadoop-0.20.2.tar.gz
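A truncated download is a common reason the extract step fails. The sketch below lists the archive before unpacking it and derives the directory name the later steps rely on. The function names extracted_dir and safe_extract are hypothetical, and the name-version.tar.gz naming is an assumption.

```shell
#!/bin/sh
# Hypothetical helper: print the directory a release tarball is expected to
# unpack into, assuming the usual name-version.tar.gz naming.
extracted_dir() {
    basename "$1" .tar.gz
}

# Hypothetical helper: list the archive first so a damaged download fails
# here, then extract it into the target directory.
safe_extract() {
    archive=$1
    target=$2
    tar -tzf "$archive" > /dev/null || return 1
    tar -xzf "$archive" -C "$target"
}

# Example (not run here):
#   safe_extract hadoop-0.20.2.tar.gz /home/hadoop
#   cd "/home/hadoop/$(extracted_dir hadoop-0.20.2.tar.gz)"
```

Note that extracting with sudo leaves the files owned by root, so the user that later runs Hadoop may need ownership of the directory.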
(2) Set up passwordless SSH to the local machine:
ssh-keygen -t dsa -P '' -f ~/.ssh/id_dsa
cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys
After this, ssh 127.0.0.1 logs in without asking for a password.
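If ssh still asks for a password, file permissions are a likely cause: with sshd's default StrictModes setting, key login is refused when the key files are writable by other users. A minimal sketch follows; the function names are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: tighten permissions on an SSH directory and its
# authorized_keys file so that only the owner can read or write them.
fix_ssh_perms() {
    ssh_dir=$1
    chmod 700 "$ssh_dir" || return 1
    chmod 600 "$ssh_dir/authorized_keys"
}

# Hypothetical helper: succeed only if key-based login works with no prompt.
# BatchMode=yes makes ssh fail instead of asking for a password.
can_login_without_password() {
    ssh -o BatchMode=yes -o ConnectTimeout=5 "$1" true
}

# Example (not run here):
#   fix_ssh_perms ~/.ssh
#   can_login_without_password 127.0.0.1 && echo "passwordless login works"
```

The check also fails if the host key has not been accepted yet, so connect once by hand first.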
(3) In /home/hadoop/hadoop-0.20.2/conf, edit hadoop-env.sh:
export JAVA_HOME=/home/dream-victor/jdk1.6.0_21  # point this at your own JDK directory
export HADOOP_CLASSPATH="$HADOOP_CLASSPATH"
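A wrong JAVA_HOME only shows up later, when the daemons fail to start, so it may be worth checking the path right away. The sketch below assumes a JDK root is any directory with an executable bin/java; is_java_home is a hypothetical name.

```shell
#!/bin/sh
# Hypothetical helper: succeed only if the directory looks like a JDK root,
# that is, it contains an executable bin/java.
is_java_home() {
    [ -n "$1" ] && [ -x "$1/bin/java" ]
}

# Example (not run here):
#   is_java_home "$JAVA_HOME" || echo "JAVA_HOME does not point at a JDK" >&2
```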
(4) Configure core-site.xml
<configuration>
<property>
<name>fs.default.name</name>
<value>hdfs://IP-or-hostname:9000</value>
</property>
<property>
<name>hadoop.tmp.dir</name>
<value>/tmp/hadoop/hadoop-${user.name}</value>
</property>
</configuration>
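The file above can also be generated from the shell, which avoids typing the hostname by hand. This is only a sketch: write_core_site is a hypothetical name, the XML declaration on the first line is an addition, and the port and tmp directory are the ones used in this guide.

```shell
#!/bin/sh
# Hypothetical helper: write a core-site.xml for the given NameNode host.
# The backslash in \${user.name} keeps the shell from expanding it, so the
# literal ${user.name} reaches the file for Hadoop to substitute.
write_core_site() {
    host=$1
    out=$2
    cat > "$out" <<EOF
<?xml version="1.0"?>
<configuration>
<property>
<name>fs.default.name</name>
<value>hdfs://${host}:9000</value>
</property>
<property>
<name>hadoop.tmp.dir</name>
<value>/tmp/hadoop/hadoop-\${user.name}</value>
</property>
</configuration>
EOF
}

# Example (not run here):
#   write_core_site "$(hostname)" /home/hadoop/hadoop-0.20.2/conf/core-site.xml
```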
(5) Configure hdfs-site.xml
<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>
(6) Configure mapred-site.xml
<configuration>
<property>
<name>mapred.job.tracker</name>
<value>IP-or-hostname:9001</value>
</property>
</configuration>
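Before going on, it can help to read the values back from the three files and confirm they hold what you expect. The sketch below is not a real XML parser: it assumes each <name> and its <value> sit on separate lines, as in the files above. get_property is a hypothetical name.

```shell
#!/bin/sh
# Hypothetical helper: print the value of one property from a Hadoop
# *-site.xml file. Assumes <name> and <value> are on separate lines.
get_property() {
    awk -v key="$1" '
        index($0, "<name>" key "</name>") { found = 1; next }
        found && /<value>/ {
            sub(/.*<value>/, "")
            sub(/<\/value>.*/, "")
            print
            exit
        }
    ' "$2"
}

# Example (not run here):
#   get_property mapred.job.tracker /home/hadoop/hadoop-0.20.2/conf/mapred-site.xml
```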
(7) In the masters and slaves files, replace the contents with this machine's IP address or hostname.
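For a single machine both files end up holding the same one line. A minimal sketch, assuming the files live in the conf directory and one host per line; write_node_files is a hypothetical name.

```shell
#!/bin/sh
# Hypothetical helper: overwrite masters and slaves in the given conf
# directory so that each contains only the given host.
write_node_files() {
    conf_dir=$1
    host=$2
    printf '%s\n' "$host" > "$conf_dir/masters" &&
    printf '%s\n' "$host" > "$conf_dir/slaves"
}

# Example (not run here):
#   write_node_files /home/hadoop/hadoop-0.20.2/conf "$(hostname)"
```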
(8) Format a new distributed file system:
In /home/hadoop/hadoop-0.20.2/bin, run: ./hadoop namenode -format
(9) Start Hadoop:
In the bin directory, run: bash start-all.sh
Then check the web interfaces: NameNode - http://server-ip:50070/
JobTracker - http://server-ip:50030/
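Besides the web pages, the JDK's jps tool lists running Java processes, which gives a quick check that all daemons came up. The sketch below reads jps output and prints any expected daemon that is absent. missing_daemons is a hypothetical name, and the daemon list assumes a single-node setup started with start-all.sh.

```shell
#!/bin/sh
# Hypothetical helper: read `jps` output ("PID ClassName" per line) on stdin
# and print each expected Hadoop daemon that does not appear in it.
missing_daemons() {
    jps_output=$(cat)
    for daemon in NameNode DataNode SecondaryNameNode JobTracker TaskTracker; do
        # Exact match on the second field, so NameNode is not satisfied
        # by a SecondaryNameNode line.
        printf '%s\n' "$jps_output" |
            awk -v d="$daemon" '$2 == d { found = 1 } END { exit !found }' ||
            printf '%s\n' "$daemon"
    done
}

# Example (not run here):
#   jps | missing_daemons
```

No output means all five daemons are running. For any name that is printed, the files under the logs directory are the place to look.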
To stop the services, run: bash stop-all.sh