HBASE table导出到文件的方法

本文详细介绍了如何利用HBase自带的Driver包将数据导出到HDFS,包括命令使用、执行示例及执行后的HDFS文件情况。重点展示了通过设置参数进行性能优化的方法。
 

主要是介绍利用HBASE自带的org.apache.hadoop.hbase.mapreduce.Driver包现将HBASE TABLE中的数据导出到HDFS文件的功能

一,命令介绍

 [hadoop@M-172-16-73-194 bin]$ ./hbase org.apache.hadoop.hbase.mapreduce.Driver
An example program must be given as the first argument.
Valid program names are:
  CellCounter: Count cells in HBase table
  completebulkload: Complete a bulk data load.
  copytable: Export a table from local cluster to peer cluster
  export: Write table data to HDFS.-----------------------------从hbase table里面导出数据到HDFS文件,
  import: Import data written by Export.  ---------------------------------导入由export导出的文件到HBASE

 

 

[hadoop@M-172-16-73-194 bin]$ ./hbase org.apache.hadoop.hbase.mapreduce.Driver export
ERROR: Wrong number of arguments: 0
Usage: Export [-D <property=value>]* <tablename> <outputdir> [<versions> [<starttime> [<endtime>]] [^[regex pattern] or [Prefix] to filter]]

  Note: -D properties will be applied to the conf used.
  For example:
   -D mapred.output.compress=true
   -D mapred.output.compression.codec=org.apache.hadoop.io.compress.GzipCodec
   -D mapred.output.compression.type=BLOCK
  Additionally, the following SCAN properties can be specified
  to control/limit what is exported..
   -D hbase.mapreduce.scan.column.family=<familyName>
   -D hbase.mapreduce.include.deleted.rows=true
   -D hbase.mapreduce.scan.row.start=<ROWSTART>
   -D hbase.mapreduce.scan.row.stop=<ROWSTOP>
For performance consider the following properties:
   -Dhbase.client.scanner.caching=100
   -Dmapred.map.tasks.speculative.execution=false
   -Dmapred.reduce.tasks.speculative.execution=false
For tables with very wide rows consider setting the batch size as below:
   -Dhbase.export.scanner.batch=10

二,执行示例

./hbase org.apache.hadoop.hbase.mapreduce.Driver export useraction /tmp/useraction

[hadoop@M-172-16-73-194 bin]$ ./hbase org.apache.hadoop.hbase.mapreduce.Driver export useraction /tmp/useraction      
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/export/distributed/hbase/hbase-0.98.9-hadoop2/lib/slf4j-log4j12-1.6.4.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
2015-09-10 14:42:05,965 INFO  [main] mapreduce.Export: versions=1, starttime=0, endtime=9223372036854775807, keepDeletedCells=false
2015-09-10 14:42:07,342 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.apache.hadoop.hbase.HConstants, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/hbase-common-0.98.9-hadoop2.jar
2015-09-10 14:42:07,344 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.apache.hadoop.hbase.protobuf.generated.ClientProtos, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/hbase-protocol-0.98.9-hadoop2.jar
2015-09-10 14:42:07,346 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.apache.hadoop.hbase.client.Put, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/hbase-client-0.98.9-hadoop2.jar
2015-09-10 14:42:07,348 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.apache.hadoop.hbase.CompatibilityFactory, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/hbase-hadoop-compat-0.98.9-hadoop2.jar
2015-09-10 14:42:07,349 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.apache.hadoop.hbase.mapreduce.TableMapper, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/hbase-server-0.98.9-hadoop2.jar
2015-09-10 14:42:07,350 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.apache.zookeeper.ZooKeeper, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/zookeeper-3.4.6.jar
2015-09-10 14:42:07,352 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.jboss.netty.channel.ChannelFactory, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/netty-3.6.6.Final.jar
2015-09-10 14:42:07,353 DEBUG [main] mapreduce.TableMapReduceUtil: For class com.google.protobuf.Message, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/protobuf-java-2.5.0.jar
2015-09-10 14:42:07,354 DEBUG [main] mapreduce.TableMapReduceUtil: For class com.google.common.collect.Lists, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/guava-12.0.1.jar
2015-09-10 14:42:07,355 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.cloudera.htrace.Trace, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/htrace-core-2.04.jar
2015-09-10 14:42:07,356 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.cliffc.high_scale_lib.Counter, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/high-scale-lib-1.1.1.jar
2015-09-10 14:42:07,366 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.apache.hadoop.hbase.io.ImmutableBytesWritable, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/hbase-common-0.98.9-hadoop2.jar
2015-09-10 14:42:07,367 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.apache.hadoop.hbase.client.Result, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/hbase-client-0.98.9-hadoop2.jar
2015-09-10 14:42:07,368 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.apache.hadoop.hbase.mapreduce.TableInputFormat, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/hbase-server-0.98.9-hadoop2.jar
2015-09-10 14:42:07,370 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.apache.hadoop.io.LongWritable, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/hadoop-common-2.4.1.jar
2015-09-10 14:42:07,371 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.apache.hadoop.io.Text, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/hadoop-common-2.4.1.jar
2015-09-10 14:42:07,373 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.apache.hadoop.mapreduce.lib.output.TextOutputFormat, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/hadoop-mapreduce-client-core-2.4.1.jar
2015-09-10 14:42:07,374 DEBUG [main] mapreduce.TableMapReduceUtil: For class org.apache.hadoop.mapreduce.lib.partition.HashPartitioner, using jar /export/distributed/hbase/hbase-0.98.9-hadoop2/lib/hadoop-mapreduce-client-core-2.4.1.jar
2015-09-10 14:42:12,469 INFO  [main] zookeeper.RecoverableZooKeeper: Process identifier=hconnection-0x28afec63 connecting to ZooKeeper ensemble=172.16.73.76:2181,172.16.73.68:2181,172.16.73.194:2181
2015-09-10 14:42:12,479 INFO  [main] zookeeper.ZooKeeper: Client environment:zookeeper.version=3.4.6-1569965, built on 02/20/2014 09:09 GMT
2015-09-10 14:42:12,479 INFO  [main] zookeeper.ZooKeeper: Client environment:host.name=M-172-16-73-194
2015-09-10 14:42:12,479 INFO  [main] zookeeper.ZooKeeper: Client environment:java.version=1.7.0_67
2015-09-10 14:42:12,479 INFO  [main] zookeeper.ZooKeeper: Client environment:java.vendor=Oracle Corporation
2015-09-10 14:42:12,479 INFO  [main] zookeeper.ZooKeeper: Client environment:java.home=/usr/java/jdk1.7.0_67/jre
2015-09-10 14:42:12,479 INFO  [main] zookeeper.ZooKeeper: Client environment:java.class.path=/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../conf:/usr/java/jdk1.7.0_67//lib/tools.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/..:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/activation-1.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/aopalliance-1.0.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/asm-3.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/avro-1.7.4.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-beanutils-1.7.0.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-beanutils-core-1.8.0.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-cli-1.2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-codec-1.7.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-collections-3.2.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-compress-1.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-configuration-1.6.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-daemon-1.0.13.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-digester-1.8.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-el-1.0.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-httpclient-3.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-io-2.4.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-lang-2.6.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-logging-1.1.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-math-2.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/commons-net-3.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/findbugs-annotations-1.3.9-1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/gmbal-api-only-3.0.0-b023.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/grizzly-framework-2.1.2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/grizzly-http-2.1.2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/grizzly-http-server-2.1.2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/grizzly-http-servlet-2.1.2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/grizzly-rcm-2.1.2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/guava-12.0.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/guice-3.0.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/guice-servlet-3.0.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-annotations-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-auth-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-client-2.2.0.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-common-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-hdfs-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-mapreduce-client-app-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-mapreduce-client-common-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-mapreduce-client-core-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-mapreduce-client-hs-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-mapreduce-client-hs-plugins-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-mapreduce-client-jobclient-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-mapreduce-client-shuffle-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-mapreduce-examples-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-yarn-api-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-yarn-client-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-yarn-common-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-yarn-server-common-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-yarn-server-nodemanager-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-yarn-server-resourcemanager-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-yarn-server-tests-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hadoop-yarn-server-web-proxy-2.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hamcrest-core-1.3.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-annotations-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-checkstyle-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-client-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-common-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-common-0.98.9-hadoop2-tests.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-examples-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-hadoop2-compat-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-hadoop-compat-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-it-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-it-0.98.9-hadoop2-tests.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-prefix-tree-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-protocol-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-rest-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-server-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-server-0.98.9-hadoop2-tests.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-shell-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-testing-util-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/hbase-thrift-0.98.9-hadoop2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/high-scale-lib-1.1.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/htrace-core-2.04.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/httpclient-4.1.3.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/httpcore-4.1.3.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jackson-core-asl-1.8.8.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jackson-jaxrs-1.8.8.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jackson-mapper-asl-1.8.8.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jackson-xc-1.8.8.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jamon-runtime-2.3.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jasper-compiler-5.5.23.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jasper-runtime-5.5.23.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/javax.inject-1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/javax.servlet-3.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/javax.servlet-api-3.0.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jaxb-api-2.2.2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jaxb-impl-2.2.3-1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jcodings-1.0.8.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jersey-client-1.9.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jersey-core-1.8.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jersey-grizzly2-1.9.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jersey-guice-1.9.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jersey-json-1.8.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jersey-server-1.8.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jersey-test-framework-core-1.9.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jersey-test-framework-grizzly2-1.9.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jets3t-0.6.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jettison-1.3.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jetty-6.1.26.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jetty-sslengine-6.1.26.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jetty-util-6.1.26.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/joni-2.1.2.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jruby-complete-1.6.8.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jsch-0.1.42.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jsp-2.1-6.1.14.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jsp-api-2.1-6.1.14.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/jsr305-1.3.9.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/junit-4.11.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/libthrift-0.9.0.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/log4j-1.2.17.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/management-api-3.0.0-b012.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/metrics-core-2.2.0.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/netty-3.6.6.Final.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/paranamer-2.3.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/protobuf-java-2.5.0.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/servlet-api-2.5-6.1.14.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/slf4j-api-1.6.4.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/slf4j-log4j12-1.6.4.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/snappy-java-1.0.4.1.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/xmlenc-0.52.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/xz-1.0.jar:/export/distributed/hbase/hbase-0.98.9-hadoop2/bin/../lib/zookeeper-3.4.6.jar:/export/distributed/hadoop/hadoop-2.4.1/etc/hadoop:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/avro-1.7.4.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/xmlenc-0.52.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jsp-api-2.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/slf4j-log4j12-1.7.5.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jetty-6.1.26.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-compress-1.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/hadoop-auth-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/guava-11.0.2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jaxb-impl-2.2.3-1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jasper-compiler-5.5.23.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jersey-core-1.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-net-3.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/junit-4.8.2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jettison-1.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/log4j-1.2.17.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-collections-3.2.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/mockito-all-1.8.5.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jackson-core-asl-1.8.8.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-configuration-1.6.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jaxb-api-2.2.2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jackson-mapper-asl-1.8.8.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-digester-1.8.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/stax-api-1.0-2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jsr305-1.3.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/servlet-api-2.5.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jackson-jaxrs-1.8.8.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/zookeeper-3.4.5.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-lang-2.6.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jets3t-0.9.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/activation-1.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jsch-0.1.42.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jetty-util-6.1.26.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-el-1.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jersey-json-1.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/netty-3.6.2.Final.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-math3-3.1.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/protobuf-java-2.5.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-logging-1.1.3.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/httpcore-4.2.5.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-codec-1.4.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/hadoop-annotations-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jackson-xc-1.8.8.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-beanutils-core-1.8.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/asm-3.2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/slf4j-api-1.7.5.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/httpclient-4.2.5.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jersey-server-1.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-beanutils-1.7.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/xz-1.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-httpclient-3.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/paranamer-2.3.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/snappy-java-1.0.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-io-2.4.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/commons-cli-1.2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/jasper-runtime-5.5.23.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/lib/java-xmlbuilder-0.4.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/hadoop-common-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/hadoop-common-2.4.1-tests.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/common/hadoop-nfs-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/xmlenc-0.52.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/jsp-api-2.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/jetty-6.1.26.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/guava-11.0.2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/jersey-core-1.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/log4j-1.2.17.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/jackson-core-asl-1.8.8.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/commons-daemon-1.0.13.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/jackson-mapper-asl-1.8.8.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/jsr305-1.3.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/servlet-api-2.5.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/commons-lang-2.6.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/jetty-util-6.1.26.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/commons-el-1.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/netty-3.6.2.Final.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/protobuf-java-2.5.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/commons-logging-1.1.3.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/commons-codec-1.4.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/asm-3.2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/jersey-server-1.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/commons-io-2.4.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/commons-cli-1.2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/lib/jasper-runtime-5.5.23.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/hadoop-hdfs-nfs-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/hadoop-hdfs-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/hdfs/hadoop-hdfs-2.4.1-tests.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jetty-6.1.26.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/commons-compress-1.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/guava-11.0.2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jaxb-impl-2.2.3-1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jersey-core-1.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jettison-1.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/log4j-1.2.17.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/commons-collections-3.2.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jackson-core-asl-1.8.8.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jaxb-api-2.2.2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jackson-mapper-asl-1.8.8.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/stax-api-1.0-2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jsr305-1.3.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/servlet-api-2.5.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jackson-jaxrs-1.8.8.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/guice-3.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jersey-guice-1.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/zookeeper-3.4.5.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/commons-lang-2.6.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jline-0.9.94.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/activation-1.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jetty-util-6.1.26.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jersey-json-1.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/javax.inject-1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/leveldbjni-all-1.8.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/protobuf-java-2.5.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/aopalliance-1.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/commons-logging-1.1.3.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/commons-codec-1.4.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jackson-xc-1.8.8.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/asm-3.2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/guice-servlet-3.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jersey-server-1.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/jersey-client-1.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/xz-1.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/commons-httpclient-3.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/commons-io-2.4.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/lib/commons-cli-1.2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/hadoop-yarn-server-resourcemanager-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/hadoop-yarn-applications-distributedshell-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/hadoop-yarn-server-web-proxy-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/hadoop-yarn-server-nodemanager-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/hadoop-yarn-server-tests-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/hadoop-yarn-applications-unmanaged-am-launcher-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/hadoop-yarn-server-applicationhistoryservice-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/hadoop-yarn-server-common-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/hadoop-yarn-api-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/hadoop-yarn-client-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/yarn/hadoop-yarn-common-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/avro-1.7.4.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/commons-compress-1.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/jersey-core-1.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/log4j-1.2.17.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/jackson-core-asl-1.8.8.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/jackson-mapper-asl-1.8.8.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/guice-3.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/jersey-guice-1.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/junit-4.10.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/javax.inject-1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/netty-3.6.2.Final.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/protobuf-java-2.5.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/aopalliance-1.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/hadoop-annotations-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/asm-3.2.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/hamcrest-core-1.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/guice-servlet-3.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/jersey-server-1.9.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/xz-1.0.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/paranamer-2.3.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/snappy-java-1.0.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/lib/commons-io-2.4.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/hadoop-mapreduce-client-app-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/hadoop-mapreduce-client-hs-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-2.4.1-tests.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/hadoop-mapreduce-client-shuffle-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/hadoop-mapreduce-client-hs-plugins-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/hadoop-mapreduce-client-core-2.4.1.jar:/export/distributed/hadoop/hadoop-2.4.1/share/hadoop/mapreduce/hadoop-mapreduce-client-common-2.4.1.jar:/contrib/capacity-scheduler/*.jar:/export/distributed/hadoop/hadoop-2.4.1/etc/hadoop
2015-09-10 14:42:12,482 INFO  [main] zookeeper.ZooKeeper: Client environment:java.library.path=/export/distributed/hadoop/hadoop-2.4.1/lib/native
2015-09-10 14:42:12,482 INFO  [main] zookeeper.ZooKeeper: Client environment:java.io.tmpdir=/tmp
2015-09-10 14:42:12,482 INFO  [main] zookeeper.ZooKeeper: Client environment:java.compiler=<NA>
2015-09-10 14:42:12,482 INFO  [main] zookeeper.ZooKeeper: Client environment:os.name=Linux
2015-09-10 14:42:12,482 INFO  [main] zookeeper.ZooKeeper: Client environment:os.arch=amd64
2015-09-10 14:42:12,482 INFO  [main] zookeeper.ZooKeeper: Client environment:os.version=2.6.32-279.el6.x86_64
2015-09-10 14:42:12,482 INFO  [main] zookeeper.ZooKeeper: Client environment:user.name=hadoop
2015-09-10 14:42:12,482 INFO  [main] zookeeper.ZooKeeper: Client environment:user.home=/home/hadoop
2015-09-10 14:42:12,483 INFO  [main] zookeeper.ZooKeeper: Client environment:user.dir=/export/distributed/hbase/hbase-0.98.9-hadoop2/bin
2015-09-10 14:42:12,484 INFO  [main] zookeeper.ZooKeeper: Initiating client connection, connectString=172.16.73.76:2181,172.16.73.68:2181,172.16.73.194:2181 sessionTimeout=90000 watcher=hconnection-0x28afec63, quorum=172.16.73.76:2181,172.16.73.68:2181,172.16.73.194:2181, baseZNode=/hbase
2015-09-10 14:42:12,525 INFO  [main-SendThread(172.16.73.76:2181)] zookeeper.ClientCnxn: Opening socket connection to server 172.16.73.76/172.16.73.76:2181. Will not attempt to authenticate using SASL (unknown error)
2015-09-10 14:42:12,527 INFO  [main-SendThread(172.16.73.76:2181)] zookeeper.ClientCnxn: Socket connection established to 172.16.73.76/172.16.73.76:2181, initiating session
2015-09-10 14:42:12,539 INFO  [main-SendThread(172.16.73.76:2181)] zookeeper.ClientCnxn: Session establishment complete on server 172.16.73.76/172.16.73.76:2181, sessionid = 0x34f2b1a22070af7, negotiated timeout = 60000
2015-09-10 14:42:12,630 INFO  [main] util.RegionSizeCalculator: Calculating region sizes for table "useraction".
2015-09-10 14:42:13,109 DEBUG [main] util.RegionSizeCalculator: Region useraction,,1433920211620.32caaf1c48fca51d98543526a148e26a. has size 156237824
2015-09-10 14:42:13,109 DEBUG [main] util.RegionSizeCalculator: Region sizes calculated
2015-09-10 14:42:13,158 DEBUG [main] client.ClientSmallScanner: Finished with small scan at {ENCODED => 1588230740, NAME => 'hbase:meta,,1', STARTKEY => '', ENDKEY => ''}
2015-09-10 14:42:13,183 DEBUG [main] mapreduce.TableInputFormatBase: getSplits: split -> 0 -> HBase table split(table name: useraction, scan: , start row: , end row: , region location: i-659-39684-VM.cs293cloud.internal)
2015-09-10 14:42:14,468 INFO  [main] mapreduce.JobSubmitter: number of splits:1
2015-09-10 14:42:14,612 INFO  [main] Configuration.deprecation: dfs.socket.timeout is deprecated. Instead, use dfs.client.socket-timeout
2015-09-10 14:42:14,612 INFO  [main] Configuration.deprecation: io.bytes.per.checksum is deprecated. Instead, use dfs.bytes-per-checksum
2015-09-10 14:42:14,966 INFO  [main] mapreduce.JobSubmitter: Submitting tokens for job: job_1439537258552_30759
2015-09-10 14:42:15,239 INFO  [main] impl.YarnClientImpl: Submitted application application_1439537258552_30759
2015-09-10 14:42:15,288 INFO  [main] mapreduce.Job: The url to track the job: http://M-172-16-73-194:8088/proxy/application_1439537258552_30759/
2015-09-10 14:42:15,289 INFO  [main] mapreduce.Job: Running job: job_1439537258552_30759
2015-09-10 14:42:31,704 INFO  [main] mapreduce.Job: Job job_1439537258552_30759 running in uber mode : false
2015-09-10 14:42:31,901 INFO  [main] mapreduce.Job:  map 0% reduce 0%
2015-09-10 14:42:49,110 INFO  [main] mapreduce.Job:  map 100% reduce 0%
2015-09-10 14:42:49,124 INFO  [main] mapreduce.Job: Job job_1439537258552_30759 completed successfully
2015-09-10 14:42:49,418 INFO  [main] mapreduce.Job: Counters: 40
        File System Counters
                FILE: Number of bytes read=0
                FILE: Number of bytes written=124410
                FILE: Number of read operations=0
                FILE: Number of large read operations=0
                FILE: Number of write operations=0
                HDFS: Number of bytes read=100
                HDFS: Number of bytes written=165439063
                HDFS: Number of read operations=4
                HDFS: Number of large read operations=0
                HDFS: Number of write operations=2
        Job Counters 
                Launched map tasks=1
                Rack-local map tasks=1
                Total time spent by all maps in occupied slots (ms)=14584
                Total time spent by all reduces in occupied slots (ms)=0
                Total time spent by all map tasks (ms)=14584
                Total vcore-seconds taken by all map tasks=14584
                Total megabyte-seconds taken by all map tasks=14934016
        Map-Reduce Framework
                Map input records=137089
                Map output records=137089
                Input split bytes=100
                Spilled Records=0
                Failed Shuffles=0
                Merged Map outputs=0
                GC time elapsed (ms)=211
                CPU time spent (ms)=13610
                Physical memory (bytes) snapshot=179671040
                Virtual memory (bytes) snapshot=919109632
                Total committed heap usage (bytes)=203423744
        HBase Counters
                BYTES_IN_REMOTE_RESULTS=0
                BYTES_IN_RESULTS=157080777
                MILLIS_BETWEEN_NEXTS=7664
                NOT_SERVING_REGION_EXCEPTION=0
                NUM_SCANNER_RESTARTS=0
                REGIONS_SCANNED=1
                REMOTE_RPC_CALLS=0
                REMOTE_RPC_RETRIES=0
                RPC_CALLS=1373
                RPC_RETRIES=0
        File Input Format Counters 
                Bytes Read=0
        File Output Format Counters 
                Bytes Written=165439063
You have mail in /var/spool/mail/root
[hadoop@M-172-16-73-194 bin]$ hdfs dfs -ls /tmp/useraction
Found 2 items
-rw-r--r--   2 hadoop supergroup          0 2015-09-10 14:42 /tmp/useraction/_SUCCESS
-rw-r--r--   2 hadoop supergroup  165439063 2015-09-10 14:42 /tmp/useraction/part-m-00000
[hadoop@M-172-16-73-194 bin]$ ./hbase org.apache.hadoop.hbase.mapreduce.Driver export useraction /tmp/useraction

三,执行结果

 

[hadoop@M-172-16-73-194 bin]$ hdfs dfs -ls /tmp/useraction
Found 2 items
-rw-r--r--   2 hadoop supergroup          0 2015-09-10 14:42 /tmp/useraction/_SUCCESS
-rw-r--r--   2 hadoop supergroup  165439063 2015-09-10 14:42 /tmp/useraction/part-m-00000

./hbase org.apache.hadoop.hbase.mapreduce.Driver export useraction /tmp/useraction 1 '1450088362919' '1461446864979337'
 

 

<think>嗯,用户这次遇到了HBase导出数据到HDFS后出现乱码的问题,看来是字符编码导致的。结合用户提供的两条引用内容:第一条说明HBase不管理自身Zookeeper实例,第二条显示Hadoop环境变量配置路径——这些虽然与乱码问题没有直接关联,但说明用户正在处理分布式环境配置。用户可能是在数据迁移或导出过程中遇到的中文或其他非ASCII字符显示异常。考虑到HBase默认采用UTF-8编码,而导出文件出现乱码通常有三种典型场景:1.导出时未指定字符集导致二进制序列误解2.HDFS读取工具未使用正确编码解析3.源数据本身存在混合编码(如部分GBK部分UTF-8)应该分层次排查:首先确认源数据的存储状态。如果源数据已经是UTF-8格式,问题就容易解决很多。建议用户先用Hexdump检查原始文件头特征,毕竟很多"乱码"问题其实是错误识别UTF-8字节序列造成的。Hadoop的TextInputFormat默认处理UTF-8文本的特性可以利用起来。技术方案中要特别强调导出阶段的控制,Snapshot导出时通过DEFAULT_CODEC_MAP修改字符集映射是可靠方案。注意到用户环境是Hadoop集群,所以路径配置要明确用hdfs://前缀更安全。最后提供的验证方案里特意选择包含多字节字符的测试数据,这样能立即暴露编码问题。啊,用户没提具体HBase版本,但1.4+版本后快照导出支持Character编码参数比较稳定。相关建议应该能覆盖大多数情况。如果用户后续反馈仍存在乱码,可能需要深入讨论RowKey设计中的二进制存储问题——不过初次回复先聚焦典型解决方案为宜。</think> HBase导出到HDFS出现乱码的核心原因是**字符编码不一致**。以下是系统化的解决方案和排查步骤: --- ### **一、问题根本原因** 1. **导出过程编码缺失** HBase默认存储字节数组(byte[]),导出时若未指定字符集,会使用系统默认编码(如ISO-8859-1),与HDFS读取时UTF-8编码不匹配 $$ \text{乱码} = \text{ByteArray(UTF-8)} \xrightarrow{\text{错误解码}} \text{ISO-8859-1} $$ 2. **HDFS读取工具未指定编码** 使用`hadoop fs -cat`或代码读取时未显式声明字符集 3. **源数据混合编码** HBase表中同时存在UTF-8和GBK等编码数据 --- ### **二、解决方案** #### ▶ **方案1:强制指定导出编码(推荐)** ```bash hbase org.apache.hadoop.hbase.mapreduce.Export \ -D mapreduce.output.fileoutputformat.output.encoder=UTF-8 \ # 关键参数 -D hbase.export.scanner.rowbytes=10485760 \ # 防超时 <表名> <hdfs导出路径> ``` #### ▶ **方案2:修复HDFS读取方式** - **命令行验证**: ```bash hadoop fs -cat /path/file | iconv -f UTF-8 -t ISO-8859-1 # 反向解码测试 ``` - **JAVA代码示例**: ```java FSDataInputStream in = fs.open(path); BufferedReader reader = new BufferedReader( new InputStreamReader(in, "UTF-8") // 显式指定编码 ); ``` #### ▶ **方案3:源表数据清洗** ```java // 写入HBase时强制统一编码 Put put = new Put(Bytes.toBytes("rowkey")); put.addColumn(Bytes.toBytes("cf"), Bytes.toBytes("col"), "中文数据".getBytes(StandardCharsets.UTF_8)); // 指定UTF-8 ``` --- ### **三、辅助配置检查** 1. **确认Hadoop环境编码** 检查`core-site.xml`,确保已配置全局UTF-8: ```xml <property> <name>io.file.buffer.size</name> <value>65536</value> </property> <!-- 添加编码参数 --> <property> <name>io.encoding.override</name> <value>UTF-8</value> </property> ``` 2. **HBase服务端配置** 在`hbase-site.xml`中增加: ```xml <property> <name>hbase.regionserver.lease.period</name> <value>120000</value> <!--导出超时 --> </property> ``` --- ### **四、验证流程** 1. **生成测试数据**:写入含中文的UTF-8编码数据 2. **带参数导出**: ```bash hbase org.apache.hadoop.hbase.mapreduce.Export \ -D mapreduce.output.fileoutputformat.output.encoder=UTF-8 \ test_table /export_test ``` 3. **HDFS读取验证**: ```bash hadoop fs -cat /export_test/part-m-00000 | head -n 1 ``` --- ### **五、深度优化建议** 1. **SnapShot导出替代方案** ```bash hbase org.apache.hadoop.hbase.snapshot.ExportSnapshot \ -D hbase.snapshot.export.scanner.caching=5000 \ # 性能调优 -snapshot <快照名> -copy-to <hdfs_path> ``` 2. **使用HBase BulkLoad** 通过生成HFile直接加载(规避中间编码转换) > 📌 **关键提示**:若需兼容GBK等编码,可扩展`TextOutputFormat`重写编码逻辑,但会增加系统复杂度[^1]。 ---
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值