函数功能2:
merge:可以将两个dataFrame连接在一起,和数据库中sql语句JOIN很相似。Dataframe a(with columns x, y, z) and b (with columns x1, x2, y,z)可以生成新的Dataframe c(with columns x, y, z, x1, x2).
merge(x, y, all.x = TRUE) # LEFT JOIN
merge(x, y, all.y = TRUE) # RIGHT JOIN
merge(x, y, all = TRUE) # OUTER JOIN
merge(a, b, by=c("y", "z"))
subset:从matrix,dataframe中选择合适的数据集。
subset(x, name=='abc', age>20, select=c(id, department, age))
split(x, f, drop=FALSE, …):可以将vector或者da
x : vector 或者da
f : factor, 定义分组
drop : 如果为TRUE,则在factor中没有出现的元素将被丢弃
Example:
For vector:
x<-c(1:10)
f <-factor (c(1,2,3,4,2,3,6,3,2, 7 ))
Split(x, f)的结果是:
‘1’: 1
‘2’: 2, 5, 9
‘3’: 3, 6, 8
‘4’: 4
‘6’: 7
‘7’: 10
For da
split(da, col(da)): 将da
split(ma, ma$x): 将da
unique(x, incomparable=FALSE, fromlast=FALSE): 可以将vector, da
x: vector, da
incomparable: 是否能被比较
fromlast: 是否从后往前比
(注:不能按照某一列unique这个da