
机器学习实战
语亦情非
进步是留给时间最好的礼物
展开
-
等距分箱
python 自带的等宽分箱函数pd.cut() import numpy as np import pandas as pd from pandas import Series,DataFrame score_list=np.random.randint(30,100,size=20) print(score_list) bins=[0,59,70,80,100] score_cat=pd....原创 2019-08-04 10:44:05 · 2458 阅读 · 1 评论 -
机器学习处理离散值方法之 95分位数盖帽法
def train_add_hat(x,features): import numpy as np import pandas as pd df=x.copy() q95_dict={} for col in features: q95=np.percentile(df[col],95) q95_dict[col]=q95 ...原创 2019-08-04 11:23:17 · 5304 阅读 · 0 评论