01.相关矩阵图的绘制
from pandas import read_csv
import matplotlib.pyplot as plt
import numpy as np
filename = 'Pima_Indians.csv'
names = ['preg','plas','pres','skin','test','mess','pedi','age','class']
data = read_csv(filename,names=names)
#corr()方法——计算属性相互影响的矩阵
correlations=data.corr()
#相关矩阵图法
fig = plt.figure()
ax = fig.add_subplot(111)
cax = ax.matshow(correlations,vmin=-1,vmax=1)
fig.colorbar(cax)
ticks = np.arange(0,9,1)
ax.set_xticks(ticks)
ax.set_yticks(ticks)
ax.set_xticklabels(names)
ax.set_yticklabels(names)
plt.show()

02.scatter_matrix() 方法绘制 散点矩阵图
from pandas import read_csv
import matplotlib.pyplot as plt
from pandas.plotting import scatter_matrix
filename = 'Pima_Indians.csv'
names = ['preg','plas','pres','skin','test','mess','pedi','age','class']
data = read_csv(filename,names=names)
# 02.scatter_matrix() 方法绘制 散点矩阵图
scatter_matrix(data)
plt.show()

本文介绍了使用Python的Pandas库和Matplotlib库来绘制数据集的相关矩阵图和散点矩阵图的方法。通过corr()方法计算属性间的相关性,并用matshow()函数绘制热力图,清晰展示各属性之间的相关性强度。同时,利用scatter_matrix()方法,生成散点矩阵图,直观呈现变量间的关系。
5569

被折叠的 条评论
为什么被折叠?



