……未完待补充……
ref:
1. Spark官方文档
2. http://bbs.youkuaiyun.com/topics/391002544
在spark.ml.linalg里有两种vector——DenseVector 和 Sparse Vector,两者都继承于Vectors
1.两者区别
DenseVector: a value array
def:
Vectors.dense(values: Array[Double])
(直接把所有的元素都列出来了)
SparseVector : an index and a value array
def:
Vectors.sparse(size: Int, indices: Array[Int], values: Array[Double])
(存储元素的个数、以及非零元素的编号index和值value)
import org.apache.spark.mllib.linalg.{Vector, Vectors}
// Create a dense vector (1.0, 0.0, 3.0).
val dv: Vector = Vectors.dense(1.0,