例如要把一组数据分成两个簇:
6个数据的簇标号分别是:
可视化:
参考:
http://stat.ethz.ch/R-manual/R-devel/library/stats/html/kmeans.html
|
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
|
> dataset = matrix(c(1,2,
+ 1.2,2,
+ 8,9,
+ 0.9,1.8,
+ 7,10,
+ 8.8,9.2), nrow=6, byrow=T)
> dataset
[,1] [,2]
[1,] 1.0 2.0
[2,] 1.2 2.0
[3,] 8.0 9.0
[4,] 0.9 1.8
[5,] 7.0 10.0
[6,] 8.8 9.2
> kmeans(dataset, 2, iter.max = 20)
K-means clustering with 2 clusters of sizes 3, 3
Cluster means:
[,1] [,2]
1 1.033333 1.933333
2 7.933333 9.400000
Clustering vector:
[1] 1 1 2 1 2 2
Within cluster
sum
of squares by cluster:
[1] 0.07333333 2.18666667
(between_SS / total_SS = 98.6 %)
Available components:
[1]
"cluster"
"centers"
"totss"
"withinss"
"tot.withinss"
"betweenss"
[7]
"size"
|
|
1
2
3
4
|
Cluster means:
[,1] [,2]
1 1.033333 1.933333
2 7.933333 9.400000
|
|
1
2
|
Clustering vector:
[1] 1 1 2 1 2 2
|
可视化:
|
1
2
|
> result = kmeans(dataset, 2, iter.max = 20)
> plot(c(dataset[,1]), c(dataset[,2]), col=result$cluster)
|
参考:
http://stat.ethz.ch/R-manual/R-devel/library/stats/html/kmeans.html
本文介绍如何使用R语言实现K均值聚类算法,通过具体实例展示了如何将一组数据分为两个簇,并详细说明了聚类中心、簇标号及可视化结果。
2309

被折叠的 条评论
为什么被折叠?



