When calling `map` on a DataFrame in Spark 2, the following error is reported:
Error:(34, 20) Unable to find encoder for type stored in a Dataset. Primitive types (Int, String, etc) and Product types (case classes) are supported by importing spark.implicits._ Support for serializing other types will be added in future releases.
mRecord.map(teenager => teenager(0)+"lina").show(false);
There are two ways to fix this.

Option 1:
val spark = SparkSession.builder
.master("local[4]")
.appName("test1")
.getOrCreate();
import spark.implicits._
That is, before the statement that calls `map`, add:
import spark.implicits._
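Putting the pieces together, a minimal runnable sketch (the single-column DataFrame `mRecord` with column `name` is an assumption reconstructed from the failing line above):

```scala
import org.apache.spark.sql.SparkSession

object EncoderDemo {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder
      .master("local[4]")
      .appName("test1")
      .getOrCreate()

    // Brings the built-in implicit Encoders (Encoder[String], case classes, ...)
    // into scope. Without this import, Dataset.map cannot find an Encoder
    // for its result type and fails with the compile error shown above.
    import spark.implicits._

    // Assumed sample data matching the original mRecord DataFrame.
    val mRecord = Seq("Justin", "Andy").toDF("name")

    // teenager(0) is the first column of each Row; the result is a
    // Dataset[String], whose encoder now comes from spark.implicits._
    mRecord.map(teenager => teenager(0) + "lina").show(false)

    spark.stop()
  }
}
```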
Option 2:
// No pre-defined encoders for Dataset[Map[K,V]], define explicitly
implicit val mapEncoder = org.apache.spark.sql.Encoders.kryo[Map[String, Any]]
// Primitive types and case classes can be also defined as
// implicit val stringIntMapEncoder: Encoder[Map[String, Any]] = ExpressionEncoder()

// row.getValuesMap[T] retrieves multiple columns at once into a Map[String, T]
teenagersDF.map(teenager => teenager.getValuesMap[Any](List("name", "age"))).collect()
// Array(Map("name" -> "Justin", "age" -> 19))
Following the official example, you can register an encoder yourself. You generally only need this second approach when `spark.implicits._` does not provide an encoder for the type you want to produce (such as `Map[String, Any]` above).
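The same Kryo-based approach works for arbitrary custom types. A sketch below uses a plain (non-case) class named `Person` as an assumed example, since `spark.implicits._` supplies encoders for case classes but not for ordinary classes:

```scala
import org.apache.spark.sql.{Encoder, Encoders, SparkSession}

// A plain (non-case) class: spark.implicits._ has no encoder for it.
class Person(val name: String, val age: Int)

object KryoEncoderDemo {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder
      .master("local[4]")
      .appName("kryoDemo")
      .getOrCreate()
    import spark.implicits._

    // Register a Kryo-backed encoder explicitly; Spark stores Person
    // rows as serialized binary rather than as typed columns.
    implicit val personEncoder: Encoder[Person] = Encoders.kryo[Person]

    // Assumed sample data.
    val df = Seq(("Justin", 19), ("Andy", 30)).toDF("name", "age")

    // Map each Row to a Person; the implicit personEncoder in scope
    // is what lets this map compile.
    val people = df.map(row => new Person(row.getString(0), row.getInt(1)))
    people.collect().foreach(p => println(s"${p.name}: ${p.age}"))

    spark.stop()
  }
}
```

A trade-off to note: a Kryo encoder serializes the whole object into a single binary column, so you lose columnar pruning and readable `show()` output; prefer case classes and the built-in encoders when possible.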