Scala 并发编程与数据库交互:Futures 和 SQL 基础
1. 并行交叉验证
在数据处理中,交叉验证是评估模型性能的重要方法。下面是一段 Scala 代码,用于实现并行交叉验证:
val cvCalculator = new RandomSubsample(data.npoints, testSize)
// Start parallel CV loop
val cvErrors = cvCalculator.mapSamples(1000) {
(trainingIndices, testIndices) =>
val regressor = new LogisticRegression(
data.featureMatrix(trainingIndices, ::).toDenseMatrix,
data.target(trainingIndices).toDenseVector
)
// Predictions on test-set
val genderPredictions = regressor.classifyMany(
data.featureMatrix(testIndices, ::).toDenseMatrix
)
// Calculate number of mis-classified examples
val dist = manhattanDistance(
genderPredictions, data.target(testIndices)
)
// Calculate mis-classification rate
超级会员免费看
订阅专栏 解锁全文
1130

被折叠的 条评论
为什么被折叠?



