一、查看数据集的特征信息:info( )
#导入所需库
import pandas as pd
import numpy as np
#导入数据
data = pd.read_csv('Salary Data.csv')
print(data.info())
先导入数据,用info()函数查看数据属性的具体信息:数据集行数、属性列编号、属性名、非空列数、数据类型。导入工资预测数据集(https://www.datacastle.cn/dataset_description.html?type=dataset&id=2519),运行结果如下:
RangeIndex: 375 entries, 0 to 374
Data columns (total 6 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Age 373 non-null float64
1 Gender 373 non-null object