What you need:
JDK 1.x: http://www.oracle.com/technetwork/java/javase/downloads/jdk8-downloads-2133151.html
Hadoop 1.2: http://hadoop.apache.org/releases.html#Download
Mahout: http://mahout.apache.org/general/downloads.html
This article sets up Mahout on a Hadoop 1.2 environment. On Hadoop 2.x it may fail to run; for details, see:
http://rgyq.blog.163.com/blog/static/31612538201471344610872/
http://www.linuxidc.com/Linux/2014-04/99856.htm
http://www.qkeye.com/blog-46-296850.html
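The blog posts above should be enough to resolve the various problems that keep Mahout from running on Hadoop 2.
Now let's start building the system: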
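1. Set up a distributed Hadoop environment
(1) Create the hadoop user: http://blog.youkuaiyun.com/scut_flyaway/article/details/42758507
(2) Change each node's hostname and hosts file:
(3) Set up passwordless SSH login between the nodes: http://blog.youkuaiyun.com/scut_flyaway/article/details/42722293
(4) Unpack Java and configure it
(5) Unpack Hadoop and configure it
(6) Unpack Mahout and configure it
(7) Initialize Hadoop and start it in distributed mode
(8) Run a Mahout example program to complete the test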
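As a rough illustration of step (2), here is a minimal Python sketch that renders the lines each node would need in /etc/hosts, plus the worker list that Hadoop 1.x reads from conf/slaves. The hostnames (master, slave1, slave2) and IP addresses are placeholders assumed for this example, not values from this article, and the helper names render_hosts and render_slaves are hypothetical.

```python
from typing import Dict, List

# Hypothetical cluster layout: these hostnames and IPs are placeholders,
# not values taken from the article. Replace them with your own nodes.
NODES: Dict[str, str] = {
    "master": "192.168.1.100",
    "slave1": "192.168.1.101",
    "slave2": "192.168.1.102",
}


def render_hosts(nodes: Dict[str, str]) -> str:
    """Return /etc/hosts lines ("IP<TAB>hostname"), one per node."""
    return "\n".join(f"{ip}\t{name}" for name, ip in nodes.items()) + "\n"


def render_slaves(nodes: Dict[str, str], master: str = "master") -> str:
    """Return conf/slaves content: every node name except the master."""
    workers: List[str] = [name for name in nodes if name != master]
    return "\n".join(workers) + "\n"


if __name__ == "__main__":
    # Print both fragments so they can be reviewed before being copied
    # into /etc/hosts and conf/slaves by hand.
    print(render_hosts(NODES), end="")
    print(render_slaves(NODES), end="")
```

The sketch only prints text and writes no files, so the output can be checked first and then appended to /etc/hosts on every node. Using the same names on all nodes keeps the passwordless SSH setup in step (3) consistent.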