最近在入门spark+hadoop,伪分布式安装,部署推荐这几个地址,不错。这边顺手记录一下自己用到的两个小程序。
推荐教程
http://www.powerxing.com/install-hadoop/
http://blog.youkuaiyun.com/yeruby/article/details/41042713
http://blog.youkuaiyun.com/tongxinzhazha/article/details/54346311
maven配置
<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
<modelVersion>4.0.0</modelVersion>
<groupId>com.landi</groupId>
<artifactId>testspark</artifactId>
<version>0.0.1-SNAPSHOT</version>
<name>testspark</name>
<description>testspark</description>
<properties>
<jdk.version>1.8</jdk.version>
<spark.version>2.1.0</spark.version>
<hadoop.version>2.6.5</hadoop.version>
</properties>
<dependencies>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-core_2.10</artifactId>
<version>${spark.version}</version>
</dependency>
<dependency>
<groupId>org.apache.hadoop</groupId>
<artifactId>hadoop-client</artifactId>
<version>${hadoop.version}</version>
</depen