通往Kaggle之路
付修磊
微信公众号 南极Python
展开
专栏收录文章
- 默认排序
- 最新发布
- 最早发布
- 最多阅读
- 最少阅读
-
(线性支持向量机)手写数字识别
from sklearn.datasets import load_digitsdigits=load_digits()digits.data.shapefrom sklearn.cross_validation import train_test_splitX_train,X_test,y_train,y_test=train_test_split(digits.data,digits...原创 2018-02-18 19:37:49 · 477 阅读 · 0 评论 -
(逻辑斯蒂回归(主要)+随机梯度)良/恶性乳腺肿瘤预测
# -*- coding: cp936 -*-import pandas as pdimport numpy as npcolumn_names=['Sample code number','Clump Thickness','Uniformity of Cell Size','Uniformity of Cell Shape','Marginal Adhesion','Single Epi...原创 2018-02-18 15:25:13 · 474 阅读 · 0 评论 -
(朴素贝叶斯)新闻文本分类
from sklearn.datasets import fetch_20newsgroupsnews=fetch_20newsgroups(subset='all')print len(news.data)print news.data[0]from sklearn.cross_validation import train_test_splitX_train,X_test,y_tr...原创 2018-02-19 17:05:12 · 845 阅读 · 0 评论 -
(KNN)iris种类预测
# -*- coding: cp936 -*-from sklearn.datasets import load_irisiris=load_iris()iris.data.shapeprint iris.DESCRfrom sklearn.cross_validation import train_test_splitX_train,X_test,y_train,y_test=tr...原创 2018-02-19 17:59:28 · 802 阅读 · 1 评论 -
(决策树)泰坦尼克号生还者简单预测
import pandas as pdtitanic=pd.read_csv('http://biostat.mc.vanderbilt.edu/wiki/pub/Main/DataSets/titanic.txt')X=titanic[['pclass','age','sex']]y=titanic['survived']X['age'].fillna(X['age'].mean(...原创 2018-02-19 21:08:56 · 1203 阅读 · 0 评论 -
【线性回归】波斯顿房价预测
# -*- coding: cp936 -*-from sklearn.datasets import load_bostonboston=load_boston()from sklearn.cross_validation import train_test_splitimport numpy as npX=boston.datay=boston.targetX_train,X...原创 2018-02-20 19:32:01 · 697 阅读 · 0 评论
分享