- 博客(5)
- 收藏
- 关注
原创 MapReduce Streaming Python AWS 30分钟 (5)matrix multiplication
mapper.py#!/usr/bin/env pythonimport sysdef convert_line_to_iter(file_in): for line in file_in: yield line.strip().split(' ')def emit_key_value(): for k,
2013-05-26 23:24:07
562
原创 MapReduce Streaming Python AWS 30分钟 (4) Word Count & Inverted Index
https://class.coursera.org/datasci-001/assignment/view?assignment_id=5●Word Count 修改版的mapper与reducer,与原版没有区别。Mapper.py#!/usr/bin/env pythonimport sysdef convert_line_to_iter(file_in):
2013-05-26 21:26:20
397
原创 MapReduce Streaming Python AWS 30分钟 (3)
M3+ubuntu的镜像在mac上不知为何总崩溃,换回CentOS+CDH的组合继续,安装顺序如下。1.CentOS 6.02.JAVA JDK>Linux x86 - RPM Installer[root@localhost tmp]# chmod 777 jdk-xxxx-linux-xxxx-rpm.bin[root@localho
2013-05-25 22:32:35
325
原创 MapR M3 MapReduce Streaming Python AWS 30分钟 (2)
wordcount python例子.http://www.michael-noll.com/tutorials/writing-an-hadoop-mapreduce-program-in-python/hadoop命令如下:hadoop jar /opt/mapr/hadoop/hadoop-0.20.2/contrib/streaming/hadoop-0.20.2-dev-st
2013-05-25 01:22:34
484
原创 MapR M3 MapReduce Streaming Python AWS 30分钟 (1)
Hadoop配置本来就很麻烦,这次想用VM本地调试Python脚本,然后到Amazon Elastic MapReduce,M3+VMware Fusion。30分钟即可配置好local mode,又是ubuntu,非常方便。http://www.mapr.com/doc/display/MapR/A+Tour+of+the+MapR+Virtual+Machine这份
2013-05-25 00:13:56
481
空空如也
空空如也
TA创建的收藏夹 TA关注的收藏夹
TA关注的人