Hadoop - Chapter 1

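This article introduces the concept of distributed systems and the CAP theorem, then goes through the basic components of Hadoop, including HDFS, MapReduce, Hive, Sqoop, HBase, and Mahout, to give readers an overview of the distributed data processing framework.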

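I. Distributed systems

1. A distributed system spreads different business functions across different machines, whereas a cluster groups several servers together to provide the same business function.

2. A distributed system improves efficiency by shortening the execution time of a single task, whereas a cluster improves efficiency by increasing the number of tasks completed per unit of time.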
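The difference between the two goals can be made concrete with a little arithmetic. The sketch below is a toy model only: the 12-second task, the 4 servers, and both function names are hypothetical, and it assumes a task splits perfectly with no coordination overhead.

```python
# Toy model: one task takes 12 s on a single server (hypothetical numbers).
TASK_SECONDS = 12.0
SERVERS = 4


def distributed_latency(task_seconds: float, servers: int) -> float:
    """Split one task into equal sub-tasks that run in parallel, one per server.

    Returns the time to finish that single task, assuming perfect splitting
    and zero coordination overhead.
    """
    return task_seconds / servers


def cluster_throughput(task_seconds: float, servers: int) -> float:
    """Each server runs whole tasks independently.

    Returns the number of tasks completed per second; each individual task
    still takes `task_seconds` to finish.
    """
    return servers / task_seconds


print(distributed_latency(TASK_SECONDS, SERVERS))  # 3.0 (seconds for one task)
print(cluster_throughput(TASK_SECONDS, SERVERS))   # tasks finished per second
```

In the distributed case a single task finishes in a quarter of the time. In the cluster case each task still takes 12 seconds, but four of them finish in that window.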

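II. CAP theorem

C (Consistency): the data on all nodes is kept in sync at all times.

A (Availability): every request receives a response, whether that response reports success or failure.

P (Partition tolerance): the system keeps providing service even when messages inside the system are lost (a partition).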
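The CAP theorem is usually stated as a trade-off: when a partition occurs, a system has to give up either consistency or availability. The sketch below illustrates that choice with a toy two-replica key-value store. The class names, the `mode` flag, and the `partitioned` switch are all hypothetical and exist only for this illustration; real systems are far more involved.

```python
class Replica:
    """One node holding a copy of the data."""

    def __init__(self):
        self.data = {}


class TwoNodeStore:
    """Toy store with replicas a and b; `mode` is "CP" or "AP" (hypothetical)."""

    def __init__(self, mode: str):
        self.mode = mode
        self.a, self.b = Replica(), Replica()
        self.partitioned = False  # True when a and b cannot exchange messages

    def write(self, key, value) -> bool:
        """Write through replica a; returns True if the write was accepted."""
        if self.partitioned:
            if self.mode == "CP":
                return False  # refuse the write: replicas could not stay in sync
            self.a.data[key] = value  # AP: accept locally, replica b becomes stale
            return True
        self.a.data[key] = value
        self.b.data[key] = value
        return True

    def read_from_b(self, key):
        """Read the value as seen by replica b."""
        return self.b.data.get(key)


for mode in ("CP", "AP"):
    store = TwoNodeStore(mode)
    store.write("x", 1)          # replicated to both nodes
    store.partitioned = True     # the link between a and b goes down
    accepted = store.write("x", 2)
    print(mode, accepted, store.a.data["x"], store.read_from_b("x"))
# CP False 1 1
# AP True 2 1
```

In CP mode the store stays consistent but rejects the write, so it is not available. In AP mode the write succeeds, but the two replicas now disagree.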

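III. Basic components of Hadoop

1. HDFS (Hadoop Distributed File System) -> a distributed file system. It uses a master/slave architecture: the NameNode is the master and the DataNodes are the slaves.

2. MapReduce -> a software framework for easily writing applications that run on large clusters of thousands of commodity machines and process multi-terabyte data sets in parallel, in a reliable and fault-tolerant manner.

3. Hive -> a distributed data warehouse that manages data stored in HDFS and provides a SQL-based query language (HiveQL).

4. Sqoop -> a tool for transferring data in both directions between HDFS and relational databases.

5. HBase -> a distributed column-oriented database.

6. Mahout -> a distributed framework for machine learning and data mining.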
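To give a feel for the MapReduce programming model mentioned in item 2, here is a minimal single-process sketch of the classic word-count job. It does not use the Hadoop API at all: the function names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are made up for this illustration, and a real job would run these phases in parallel across many machines.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line: str):
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.split():
        yield (word.lower(), 1)


def shuffle(pairs):
    """Shuffle: group all emitted values by key, as the framework would do."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence inside one process."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle(mapped)
    return dict(reduce_phase(k, v) for k, v in grouped.items())


print(word_count(["hadoop hdfs", "hadoop mapreduce"]))
# {'hadoop': 2, 'hdfs': 1, 'mapreduce': 1}
```

Because each call to `map_phase` looks at one line only and each call to `reduce_phase` looks at one key only, both phases can be spread over many machines, which is what lets the model scale to large data sets.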