机器学习入门(三)
#python生成分类数据集并存盘(读取)
#作者:冯德平(山野雪人)
#生成blobs数据
from sklearn.datasets import make_blobs
import matplotlib.pyplot as plt
from pandas import DataFrame
X, y = make_blobs(n_samples=150, centers=3, n_features=2,random_state=50)
df = DataFrame(dict(x=X[:,0], y=X[:,1], label=y))
print(df)
df.to_csv(r'D:\temp\sjflset.csv',index=False) #保存文件
colors = {0:'red', 1:'yellow', 2:'blue'} #颜色
fig, ax = plt.subplots()
grouped = df.groupby('label')
for key, group in grouped:
group.plot(ax=ax, kind='scatter', x='x', y='y', label=key, color=colors[key])
plt.show()
‘’’
从pandas可以很方便的读取所保存的数据:
import pandas as pd
df=pd.read_csv(r’D:\temp\sjflset.csv’)
print(df)
参考:
https://blog.youkuaiyun.com/htuhxf/article/details/82986440
https://blog.youkuaiyun.com/qq_36850381/article/details/100833192
https://blog.youkuaiyun.com/weixin_42782150/article/details/90716533
https://zhuanlan.zhihu.com/p/33097650
‘’’