Repost: Implementing Hive window functions in pandas
groupby + rank (group, then rank within each group): df['row_num'] = df['a'].groupby(df['b']).rank(ascending=False, method='max')
Reposted from: https://www.cnblogs.com/fatcici2017/p/10348904.html
2019-02-02 18:00:00
397
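A minimal sketch of the groupby + rank one-liner from the entry above, run on a small made-up frame. The sample data and the extra `rank_*` columns are assumptions for illustration and are not part of the original post. It also shows how other `method` values roughly line up with Hive's RANK(), ROW_NUMBER() and DENSE_RANK().

```python
import pandas as pd

# Hypothetical data: 'b' is the partition key, 'a' is the value being ranked.
df = pd.DataFrame({
    "b": ["x", "x", "x", "y", "y"],
    "a": [10, 20, 20, 5, 7],
})

# The call from the post: tied values share the highest rank of their tie group.
df["row_num"] = df["a"].groupby(df["b"]).rank(ascending=False, method="max")

grouped = df.groupby("b")["a"]
# Roughly RANK() OVER (PARTITION BY b ORDER BY a DESC): ties share the lowest rank.
df["rank_min"] = grouped.rank(ascending=False, method="min")
# Roughly ROW_NUMBER(): every row in a group gets a distinct rank.
df["rank_first"] = grouped.rank(ascending=False, method="first")
# Roughly DENSE_RANK(): ties share a rank and no rank is skipped.
df["rank_dense"] = grouped.rank(ascending=False, method="dense")

print(df)
```

Which `method` is appropriate depends on how ties should be treated; `method='max'` as used in the post does not correspond exactly to any of the three Hive functions when ties are present.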
Repost: pandas: merge
sr=pd.read_csv('/Users/macui 1 2/Documents/hot_ifx.csv',names=['timestamp','request1','request2','account_id','api_key','idcard','name','source','confidence','compare_platform','image_type'],he...
2018-11-10 11:14:00
122
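The merge entry above is truncated before the merge call itself, so the following is only a hedged sketch of what a `merge` can look like. Both frames, their contents, and the choice of `account_id` as the join key are assumptions; only the column names `account_id`, `name` and `confidence` are borrowed from the snippet.

```python
import pandas as pd

# Hypothetical frames sharing an 'account_id' column.
requests = pd.DataFrame({
    "account_id": [1, 2, 3],
    "confidence": [0.91, 0.40, 0.77],
})
accounts = pd.DataFrame({
    "account_id": [1, 2, 4],
    "name": ["a", "b", "d"],
})

# Inner join: keep only account_ids present in both frames.
inner = requests.merge(accounts, on="account_id", how="inner")

# Left join: keep every request; indicator=True adds a '_merge' column
# that records whether each row found a match on the right side.
left = requests.merge(accounts, on="account_id", how="left", indicator=True)

print(inner)
print(left)
```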
Repost: Machine learning notes (1)
Andrew Ng's machine learning course, lecture 2. 1. Multivariate linear regression
Reposted from: https://www.cnblogs.com/fatcici2017/p/8284331.html
2018-01-14 20:59:00
120
Repost: Commonly used awk commands
awk -F ',' '{split ($2,a,"-");split ($3,b,"-"); S[a[1]"\t"b[1]]++}END{for (s in S) print s"\t"S[s]}' huanji_2016_ori-2.csv |sort -r -n -t$'\t' -k3 >huanji_brand_2016.txt
awk -F "," '{spl...
2017-05-19 14:30:00
142
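For readers less familiar with awk, the first command above can be read as: take the text before the first "-" in fields 2 and 3, count each pair, and print the pairs sorted by count in descending order. A rough Python equivalent follows. The sample rows are made up, and the assumption that fields 2 and 3 hold "brand-model" strings for the old and new phone is a guess from the file name, not something the post states.

```python
import csv
import io
from collections import Counter

# Hypothetical sample standing in for huanji_2016_ori-2.csv.
sample = io.StringIO(
    "u1,Apple-iPhone 6,Huawei-P9\n"
    "u2,Apple-iPhone 6s,Huawei-Mate 8\n"
    "u3,Xiaomi-Mi 5,Apple-iPhone 7\n"
)

counts = Counter()
for row in csv.reader(sample):
    # awk fields are 1-based, so $2 and $3 are row[1] and row[2] here.
    old_brand = row[1].split("-")[0]
    new_brand = row[2].split("-")[0]
    counts[(old_brand, new_brand)] += 1

# most_common() yields pairs by descending count, like `sort -r -n -k3`.
for (old_brand, new_brand), n in counts.most_common():
    print(f"{old_brand}\t{new_brand}\t{n}")
```

Note that `csv.reader` honours quoted fields, whereas `awk -F ','` splits on every comma, so the two can differ on quoted input.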
Repost: pandas: string operations
huanji[(huanji['from'].str.contains('金立')) & (huanji['h_month']<201701)].groupby(huanji['to'].str.partition('-').get(0))['uid'].agg({'uv':'count'}).sort_values(by='uv',ascending=0).to_csv(...
2017-04-10 15:29:00
152
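The two string operations in the entry above, `str.contains` for filtering and `str.partition` for taking the part before a separator, can be tried in isolation. This is a small sketch on made-up rows; the values in the frame are assumptions, while the column names follow the snippet.

```python
import pandas as pd

# Hypothetical rows; 'from' and 'to' hold "brand-model" strings.
huanji = pd.DataFrame({
    "uid": [1, 2, 3, 4],
    "from": ["金立-M5", "金立-S6", "Apple-iPhone 6", "金立-M5"],
    "to": ["Huawei-P9", "Huawei-Mate 8", "Huawei-P9", "OPPO-R9"],
    "h_month": [201610, 201612, 201611, 201702],
})

# Boolean mask: 'from' contains the substring and the month is before 2017-01.
mask = huanji["from"].str.contains("金立") & (huanji["h_month"] < 201701)

# str.partition('-') splits each value into (before, separator, after);
# column 0 is the text before the first '-'.
to_brand = huanji.loc[mask, "to"].str.partition("-")[0]

print(to_brand)
```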
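Repost: Getting started with pandas: pivot tables
pd.pivot_table(df5, index=['key1','key2'], values=['data1','data2'], aggfunc=[np.sum,np.mean], margins=True)
margins=True adds an 'All' row holding each aggregate computed over all of the data.
Reference article: http://python.jobbole.com/81212/ Reposted from: https://ww...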
2017-03-30 19:13:00
97
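A minimal sketch of the pivot table call from the entry above. The contents of `df5` are made up, and the aggregations are passed as the strings `"sum"` and `"mean"` rather than `np.sum` and `np.mean`, which is an assumption made to avoid deprecation warnings in recent pandas versions.

```python
import pandas as pd

# Hypothetical stand-in for df5 from the post.
df5 = pd.DataFrame({
    "key1": ["a", "a", "b", "b"],
    "key2": ["one", "two", "one", "two"],
    "data1": [1.0, 2.0, 3.0, 4.0],
    "data2": [10.0, 20.0, 30.0, 40.0],
})

table = pd.pivot_table(
    df5,
    index=["key1", "key2"],
    values=["data1", "data2"],
    aggfunc=["sum", "mean"],
    margins=True,  # appends an 'All' row aggregated over every input row
)

print(table)
```

The result has two column levels (aggregation name, then value column), and the final row is the 'All' margin.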
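Repost: Getting started with pandas: string filtering + groupby + sort
1. First select the rows whose 'from' column contains 'iphone 6s', then group those rows and sort the result in descending order. This is roughly equivalent to WHERE + GROUP BY + ORDER BY ... DESC in SQL.
df[df['from'].str.contains('iphone 6s plus')].groupby(['from','to'])['uid'].agg({'uv':'count'}).sort...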
2017-03-28 15:06:00
612
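A sketch of the filter, group and sort pipeline from the entry above on made-up data. Two things here are assumptions: the original is cut off at `.sort...`, so the `sort_values` call is inferred from the description, and the dict form `agg({'uv':'count'})` is replaced with named aggregation, `agg(uv='count')`, because recent pandas versions no longer accept a renaming dict on a single column.

```python
import pandas as pd

# Hypothetical rows with the column names used in the post.
df = pd.DataFrame({
    "uid": [1, 2, 3, 4, 5],
    "from": ["iphone 6s plus", "iphone 6s plus", "iphone 6s plus",
             "iphone 6s", "mi 5"],
    "to": ["iphone 7", "iphone 7", "mate 9", "iphone 7", "mi 6"],
})

uv = (
    df[df["from"].str.contains("iphone 6s plus", regex=False)]  # WHERE
    .groupby(["from", "to"])["uid"]                             # GROUP BY
    .agg(uv="count")                                            # COUNT(uid) AS uv
    .sort_values(by="uv", ascending=False)                      # ORDER BY uv DESC
    .reset_index()
)

print(uv)
```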
Repost: Getting started with pandas (1)
1. Load the data, using CSV as an example
import pandas as pd
import numpy as np
df = pd.read_csv('/Users/cici/Documents/huanji_2017_ori.csv', header=0, sep='\t', names=['uid','month','h_date','from','to'], index_col='uid')
2. Inspect the ...
2017-03-28 14:58:00
76
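The `read_csv` options in the entry above can be tried without the original file by reading from an in-memory string. The sample rows below are made up. `header=0` together with `names=` discards the file's own header row and applies the given names, and `index_col='uid'` is used to make `uid` the index (`read_csv` has no `index` parameter).

```python
import io

import pandas as pd

# Hypothetical tab-separated content standing in for huanji_2017_ori.csv.
raw = (
    "id\tm\td\tf\tt\n"
    "1\t201701\t2017-01-03\tiphone 6s\tiphone 7\n"
    "2\t201702\t2017-02-11\tmi 5\tmate 9\n"
)

df = pd.read_csv(
    io.StringIO(raw),
    header=0,   # the first line is a header row and gets replaced by `names`
    sep="\t",
    names=["uid", "month", "h_date", "from", "to"],
    index_col="uid",
)

print(df.head())
```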