sequencefile格式转text格式
这里仅针对输入格式为<\Text, IntWritable>的键值对sequencefile文件,可根据实际需要修改,最终输出文本格式。
package org.apache.hadoop.examples;
import java.io.IOException;
import java.util.Iterator;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.SequenceFileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.util.GenericOptionsParser;
public class SequencefileToText {
public static class ReaderMapper extends Mapper<Text, IntWritable, Text, Text> {
protected void map(Text key, IntWritab