20200329更新
原来Pandas里有根据列类型筛选的函数
select_dtypes
它有两个参数,incluse和exclude,既可以接受标量,也可以接受列表,返回一个DF类。
例如:
# 筛选列类型为int和float的列
df.select_dtypes(include=['int', 'float'])
# 筛选除了类型为O的列
df.select_dtypes(exclude='O')
# 筛选数值类型的列名
df.select_dtypes(exclude='O').columns
pandas里常用的筛选、索引工具主要有
- loc:按行列标签名称进行索引和筛选
- iloc:根据行列索引序号进行索引
- at:根据行列标签找到相应单元格的值
- iat:根据行列索引序号找到相应单元格的值
这里有个需求,以fifa19数据为例,列标签有多种数据类型:
>>> fifa19.dtypes
ID int64
Name object
Age int64
Nationality object
Overall int64
Potential int64
Club object
Value object
Wage object
Preferred Foot object
Position object
Crossing float64
Finishing float64
HeadingAccuracy float64
ShortPassing float64
Volleys float64
Dribbling float64
FKAccuracy float64
LongPassing float64
BallControl float64
Acceleration float64
SprintSpeed float64
Agility float64
Reactions float64
Balance float64
ShotPower float64
Jumping float64
Stamina float64
Strength float64
LongShots float64
Aggression float64
Interceptions float64
Positioning float64
Vision float64
Penalties float64
Composure float64
Marking float64
Sta