
sparkCore
qwerdf@QAQ
present~
How Spark code is divided between the Driver and the Executors
package sparkStream import org.apache.spark.sql.SparkSession import org.apache.spark.streaming.{Seconds, StreamingContext} import org.apache.spark.streaming.dstream.{DStream, ReceiverInputDStream} object DriverAndExecutorCode { def main(args: Array[Str…
Original · 2020-08-10 00:05:40 · 1323 views · 2 comments
Spark broadcast variables
package sparkCore.BroadCastV1 import org.apache.spark.rdd.RDD import org.apache.spark.sql.SparkSession object BroadCastV1 { def main(args: Array[String]): Unit = { val spark = SparkSession .builder() .appName(this.getClass.getName)…
Original · 2022-05-16 00:09:17 · 441 views · 0 comments
Custom accumulators in Spark
package sparkCore.accumulator import org.apache.spark.sql.SparkSession object AccumulatorV1 { def main(args: Array[String]): Unit = { val spark = SparkSession .builder() .appName(this.getClass.getName) .master("local[*]") .g…
Original · 2022-05-15 21:35:45 · 519 views · 0 comments
Spark memory management and performance tuning
1. The Spark 2.x memory model 2. Memory used by shuffle: Shuffle Read and Shuffle Write
Original · 2020-09-27 00:42:03 · 385 views · 0 comments
Common Spark operators
1. map package sparkCore.rddTransform import org.apache.spark.sql.SparkSession object RDDTransformV1 { def main(args: Array[String]): Unit = { val spark = SparkSession .builder() .appName(this.getClass.getName) .master("local[*]")…
Original · 2022-05-12 01:27:41 · 439 views · 0 comments