Spark 提交应用

最新推荐文章于 2024-11-22 20:25:50 发布

原创最新推荐文章于 2024-11-22 20:25:50 发布 · 181 阅读

CC 4.0 BY-SA版权

Spark Submitting Applications

Spark的bin目录中的Spark -submit脚本用于在集群上启动应用程序。它可以通过一个统一的接口使用所有Spark支持的集群管理器，这样您就不必配置您的应用程序，尤其是对每个应用程序dd

Bundling Your Application’s Dependencies

如果您的代码依赖于其他项目，您将需要将它们与应用程序一起打包，以便将代码分发到Spark集群中

Launching Applications with spark-submit

./bin/spark-submit \

--class <main-class> \

--master <master-url> \

--deploy-mode <deploy-mode> \

--conf <key>=<value> \

... # other options

<application-jar> \

[application-arguments]

Some of the commonly used options are:

--class: The entry point for your application (e.g. org.apache.spark.examples.SparkPi)
--master: The master URL for the cluster (e.g. spark://23.195.26.187:7077)
--deploy-mode: Whether to deploy your driver on the worker nodes (cluster) or locally as an external client (client) (default: client) †
--conf: Arbitrary Spark configuration property in key=value format. For values that contain spaces wrap “key=value” in quotes (as shown).
application-jar: Path to a bundled jar including your application and all dependencies. The URL must be globally visible inside of your cluster, for instance, an hdfs:// path or a file:// path that is present on all nodes.
application-arguments: Arguments passed to the main method of your main class, if any