1.数据说明(数据集是货品交易数据集)
每个订单可能包含多个货品,每个订单可以产生多次交易,不同的货品有不同的单价。
2.需求说明:
• 计算所有订单中每年的销售单数、销售总额
• 计算所有订单每年最大金额订单的销售额
• 计算所有订单中每年最畅销货品
3.具体实现
import org.apache.spark.SparkConf
import org.apache.spark.sql.{DataFrame, SaveMode, SparkSession}
case class tbStock(ordernumber: String, locationid: String, dateid: String) extends Serializable
case class tbStockDetail(ordernumber: String, rownum: Int, itemid: String, number: Int, price: Double, amount: Double) extends Serializable
case class tbDate(dateid: String, years: Int, theyear: Int, month: Int, day: Int, weekday: Int, week: Int, quarter: Int, period: Int, halfmonth: Int) extends Serializable
object Pra