
python机器学习基础工具使用
一个人的旅行qiu
我原因用我10年的生命换一个闪耀的人生
展开
-
numpy矩阵的基础操作
import numpy#delimiter分隔符,dtype数据格式word_alcho = numpy.genfromtxt("D:\qiujiahao4.txt",delimiter=",",dtype="str")#print (type(word_alcho))print (word_alcho[0])['1' '2' '3' '4' 'qiu']print (word_alcho[原创 2017-03-12 13:24:57 · 633 阅读 · 0 评论 -
Matplotlib直方图和四方图
import pandas as pdimport matplotlib.pyplot as pltreviews = pd.read_csv("D:\\test\\fandango_scores.csv")clos = ["Fandango_Ratingvalue","RT_norm","IMDB_norm","RT_norm_round","Metacritic_norm_round"]原创 2017-03-15 18:32:17 · 699 阅读 · 0 评论 -
sklearn中基础库函数笔记
sklearn中的cross validation模块,最主要的函数是如下函数: sklearn.cross_validation.cross_val_score。他的调用形式是scores = cross_validation.cross_val_score(clf, raw data, raw target, cv=5, score_func=None) 参数解释: clf是不同的分类器,原创 2017-03-24 19:52:52 · 1554 阅读 · 0 评论 -
Matplotlib条形图与散点图
import pandas as pdimport numpy as npnum_info=pd.read_csv("D:/test/fandango_scores.csv")num_info.head(1) FILM RottenTomatoes RottenTomatoes_User Metacritic原创 2017-03-14 19:44:59 · 683 阅读 · 0 评论 -
Matplotlib画出折线图
import pandas as pdimport numpy as npnum_info=pd.read_csv("D:/test/UNRATE.csv")num_info["DATE"] = pd.to_datetime(num_info["DATE"]) #将其转化成时间的格式print (num_info.head(2)) DATE VALUE0 1948-01-原创 2017-03-14 18:51:48 · 2154 阅读 · 0 评论 -
Pandas核心数据结构
#series 是值的集合#DataFrame 是series的集合#Panel是DataFrame的集合import pandas as pdfandango = pd.read_csv("D:\\test\\fandango_score_comparison.csv")series_film=fandango["FILM"]series_rt=fandango["RottenToma原创 2017-03-13 19:55:04 · 665 阅读 · 0 评论 -
Pandas数据预处理与透视表
import pandas as pdimport numpy as npfood_info = pd.read_csv("D:\\test\\titanic_train.csv") #此处需要转义food_info.head(2) PassengerId Survived Pclass Name Sex原创 2017-03-13 18:42:12 · 1012 阅读 · 0 评论 -
Pandas数值计算与排序
import pandas as pdfood_info = pd.read_csv("D:\\test\\food_info.csv") #此处需要转义print(food_info.head(2)) NDB_No Shrt_Desc Water_(g) Energ_Kcal Protein_(g) \0 1001 BUTT原创 2017-03-12 20:35:35 · 689 阅读 · 0 评论 -
Pandas数据读取与显示2
import pandas as pdfood_info = pd.read_csv("D:\\test\\food_info.csv") #此处需要转义print (type(food_info)) first_row = food_info.head() #默认是前5行print (food_info.shape) (8618, 36)print (food_info.l原创 2017-03-12 19:58:00 · 1092 阅读 · 0 评论 -
Pandas数据读取与显示
import pandas as pdfood_info = pd.read_csv("D:\\test\\food_info.csv") #此处需要转义print (type(food_info))<class 'pandas.core.frame.DataFrame'>first_row = food_info.head() #默认是前5行print (first_row)first原创 2017-03-12 19:29:36 · 1059 阅读 · 0 评论 -
numpy矩阵基础操作4
import numpy as npa = np.sin(py.arange(12).reshape(3,4))print (a)[[ 0. 0.84147098 0.90929743 0.14112001] [-0.7568025 -0.95892427 -0.2794155 0.6569866 ] [ 0.98935825 0.41211849 -0.544原创 2017-03-12 16:25:56 · 568 阅读 · 0 评论 -
numpy矩阵的基础操作3
import numpy as npa = np.arange(3)print (a)print (np.exp(a)) #e的0,1,2次幂print (np.sqrt(a))#开根号[0 1 2][ 1. 2.71828183 7.3890561 ][ 0. 1. 1.41421356]a = np.floor(10*np.r原创 2017-03-12 14:58:27 · 655 阅读 · 0 评论 -
numpy矩阵的基础操作2
import numpy as npa = np.arange(15) #生成0到15的数组print (a)b= a.reshape(3,5) #生成3行5列的矩阵print (b)[ 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14][[ 0 1 2 3 4] [ 5 6 7 8 9] [10 11 12 13 14]]b.原创 2017-03-12 14:07:30 · 416 阅读 · 0 评论 -
Matplotlib数据集可视化图表细节
import pandas as pdimport matplotlib.pyplot as pltreviews = pd.read_csv("D:\\test\\percent-bachelors-degrees-women-usa.csv")plt.plot(reviews["Year"],reviews["Biology"])plt.show()plt.plot(reviews["Y原创 2017-03-15 19:19:12 · 828 阅读 · 0 评论