数据集:
https://www.kaggle.com/ronitf/heart-disease-uci
- 检查目标值
print(data.target.value_counts())
1 165
0 138
Name: target, dtype: int64
- 画出图像
sns.countplot(x='target',data=data)
- 求百分比
target1 = len(data[data['target']==1])
target0 = len(data[data['target']==0])
NoHD = target0/len(data['target'])
IsHD = target1/len(data['target'])
0.45544554455445546 0.5445544554455446
- 年龄跟心脏病的分布
pd.crosstab(data.age,data.