A Guide to Optimizing Evaluation Metrics

shanesu

Article link: https://blog.youkuaiyun.com/qq_36080693/article/details/79836603


Everything in this article is compiled from the Coursera course Advanced Machine Learning - How to Win a Data Science Competition: Learn from Top Kagglers

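I. Defining evaluation metrics

1. Basic concepts

In general, the evaluation metric is defined by the organizers according to business needs.
In some cases the real objective is hard to quantify, and intuition is needed to translate it into a different metric that can be optimized.
Watch whether the metric converges well during optimization; if it does not, switch to a different metric.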

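2. Basic metrics

Regression

MSE: mean squared error
RMSE: the square root of MSE, which puts the error on the same scale as the target instead of the squared scale
R-squared
MAE: mean absolute error; it does not over-penalize very large errors, so it is more tolerant of outliers
MSPE / MAPE: relative (percentage) errors; the error is measured relative to the size of the target (predicting 999 for 1000 is not the same kind of error as predicting 1 for 2)
RMSLE: root mean squared logarithmic error, i.e. RMSE computed on log-transformed values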
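
The regression metrics listed above can be sketched in plain Python. This is a minimal illustration under my own assumptions, not code from the course: the function names are made up for this example, the percentage metrics assume non-zero targets, and RMSLE is written with log(1 + y), which assumes non-negative values.

```python
import math

def mse(y_true, y_pred):
    # Mean of squared differences
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    # Square root of MSE: same units as the target
    return math.sqrt(mse(y_true, y_pred))

def mae(y_true, y_pred):
    # Mean of absolute differences
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def mspe(y_true, y_pred):
    # Mean squared percentage error (assumes no target is zero)
    return 100.0 / len(y_true) * sum(((t - p) / t) ** 2 for t, p in zip(y_true, y_pred))

def mape(y_true, y_pred):
    # Mean absolute percentage error (assumes no target is zero)
    return 100.0 / len(y_true) * sum(abs((t - p) / t) for t, p in zip(y_true, y_pred))

def rmsle(y_true, y_pred):
    # RMSE in log(1 + y) space (assumes non-negative values)
    sq = [(math.log1p(t) - math.log1p(p)) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(sq) / len(sq))

def r_squared(y_true, y_pred):
    # 1 - residual sum of squares / total sum of squares
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# The same absolute error of 1 on two very different scales
big_true, big_pred = [1000.0], [999.0]
small_true, small_pred = [2.0], [1.0]
print(mae(big_true, big_pred), mae(small_true, small_pred))    # identical absolute error
print(mape(big_true, big_pred), mape(small_true, small_pred))  # very different relative error
```

MAE treats the two cases identically, while MAPE reports the second as a far larger error, which is the "999 vs. 1000 and 1 vs. 2" point made above.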

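Classification

accuracy: the fraction of correct predictions
logloss: tolerates small errors but heavily penalizes confident, clearly wrong predictions
AUC: applies to binary classification. It measures how well the predictions order the two classes; AUC is pair-wise, i.e. it depends only on the ordering of (positive, negative) pairs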

from sklearn import metrics

def aucfun(act, pred):
    # Build the ROC curve, then integrate it to get the area under the curve
    fpr, tpr, thresholds = metrics.roc_curve(act, pred, pos_label=1)
    return metrics.auc(fpr, tpr)
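
Two of the claims above can be checked with a small stdlib-only sketch: that logloss punishes confident mistakes much more than mild ones, and that AUC is pair-wise. The helper names `logloss` and `pairwise_auc`, the clipping constant `eps`, and the toy data are assumptions made for this illustration; `pairwise_auc` is a quadratic-time definition meant for reading, not a replacement for `aucfun`.

```python
import math

def logloss(y_true, y_prob, eps=1e-15):
    # Binary log loss; probabilities are clipped to avoid log(0)
    total = 0.0
    for y, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1.0 - eps)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(y_true)

def pairwise_auc(y_true, y_score):
    # Fraction of (positive, negative) pairs in which the positive
    # example gets the higher score; ties count as half
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    correct = 0.0
    for sp in pos:
        for sn in neg:
            if sp > sn:
                correct += 1.0
            elif sp == sn:
                correct += 0.5
    return correct / (len(pos) * len(neg))

# A mildly wrong prediction vs. a confidently wrong one for a positive example
mild = logloss([1], [0.6])
confident_wrong = logloss([1], [0.01])
print(mild, confident_wrong)

# AUC depends only on the ordering, so rescaling the scores does not change it
y = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
print(pairwise_auc(y, scores), pairwise_auc(y, [10 * s for s in scores]))
```

In the toy data three of the four (positive, negative) pairs are ordered correctly, so both AUC values are 0.75.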

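Cohen's Kappa: measures accuracy relative to a baseline, so that a meaninglessly high accuracy (for example, from always predicting the majority class) does not look good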

import numpy as np

# Compute the loss
def soft_kappa(preds, dtrain):
    '''
        Having predictions `preds` and targets `dtrain.get_label()` this function computes soft kappa loss.
        NOTE, that it assumes `mean(target) = 0`.

    '''
    target = dtrain.get_label()
    return 'kappa' ,  -2 * target.dot(preds) / (target.dot(target) + preds.dot(preds))

# Compute the first and second derivatives
def soft_kappa_grad_hess(y, p):
    '''
        Returns first and second derivatives of the objective with respect to predictions `p`. 
        `y` is a vector of corresponding target labels.  
    '''
    norm = p.dot(p) + y.dot(y)

    grad = -2 * y / norm + 4 * p * np.dot(y, p) / (norm ** 2)
    hess = 8 * p * y / (norm ** 2) + 4 * np.dot(y, p) / (norm ** 2)  - (16 * p ** 2 * np.dot(y, p)) / (norm ** 3)
    return grad, hess
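
The derivatives returned by `soft_kappa_grad_hess` can be recovered by differentiating the soft kappa loss directly. The short derivation below is a sketch; the symbols $s$ and $n$ are notation introduced here ($n$ corresponds to `norm` in the code), and only the diagonal of the Hessian is derived, which is what the code returns.

```latex
L(p) = -\frac{2\, y^\top p}{y^\top y + p^\top p},
\qquad s = y^\top p, \qquad n = y^\top y + p^\top p,
\qquad \frac{\partial s}{\partial p_i} = y_i, \qquad \frac{\partial n}{\partial p_i} = 2 p_i

\frac{\partial L}{\partial p_i}
  = -\frac{2 y_i}{n} + \frac{4 p_i s}{n^2}

\frac{\partial^2 L}{\partial p_i^2}
  = \frac{4 p_i y_i}{n^2}
  + \left( \frac{4 s}{n^2} + \frac{4 p_i y_i}{n^2} - \frac{16 p_i^2 s}{n^3} \right)
  = \frac{8 p_i y_i}{n^2} + \frac{4 s}{n^2} - \frac{16 p_i^2 s}{n^3}
```

These match the `grad` and `hess` expressions in the code term by term.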

Quadratic weighted kappa: the weighted variant of kappa
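
To make the baseline idea concrete, here is a minimal stdlib-only sketch of Cohen's kappa and its quadratic weighted variant computed from a confusion matrix. The function names, the integer label encoding 0..n_classes-1, and the toy data are assumptions for this example; the sketch does not handle the degenerate case where the expected disagreement is zero.

```python
def confusion(y_true, y_pred, n_classes):
    # m[i][j] = number of samples with true class i predicted as class j
    m = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        m[t][p] += 1
    return m

def kappa(y_true, y_pred, n_classes, weights=None):
    # kappa = 1 - (weighted observed disagreement) / (weighted expected disagreement)
    n = len(y_true)
    obs = confusion(y_true, y_pred, n_classes)
    row = [sum(r) for r in obs]
    col = [sum(obs[i][j] for i in range(n_classes)) for j in range(n_classes)]
    num = den = 0.0
    for i in range(n_classes):
        for j in range(n_classes):
            if weights == "quadratic":
                w = (i - j) ** 2               # far-off mistakes cost more
            else:
                w = 0.0 if i == j else 1.0     # every mistake costs the same
            num += w * obs[i][j]
            den += w * row[i] * col[j] / n     # expected count if predictions were independent of labels
    return 1.0 - num / den

# Always predicting the majority class: 90% accuracy, yet kappa is 0
y_true = [0] * 9 + [1]
y_major = [0] * 10
print(kappa(y_true, y_major, 2))

# Ordinal labels: one mistake off by one class vs. one mistake off by two
truth = [0, 1, 2, 2]
near = [0, 1, 2, 1]
far = [0, 1, 2, 0]
print(kappa(truth, near, 3, weights="quadratic"),
      kappa(truth, far, 3, weights="quadratic"))
```

Both ordinal predictions have the same accuracy, but the quadratic weights score the off-by-two mistake lower than the off-by-one mistake.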

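3. Optimizing metrics during model training

Can be optimized directly: MSE, Logloss
Cannot be optimized directly; train the model on a different objective first, then check the result against the target metric: MSPE, MAPE, RMSLE
Optimize a different objective, then post-process the predictions to get the final result: Accuracy, Kappa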
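
The last two strategies can be sketched on toy data. This is a hedged illustration under my own assumptions, not the course's code: the "model" is just the best constant prediction under MSE, RMSLE is targeted by fitting in log(1 + y) space and transforming back, and accuracy is targeted by tuning the decision threshold on predicted probabilities. All names and numbers are made up for the example.

```python
import math

def rmsle(y_true, y_pred):
    sq = [(math.log1p(t) - math.log1p(p)) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(sq) / len(sq))

# Strategy for RMSLE: optimize MSE on log1p(target), then invert with expm1.
y = [1.0, 10.0, 100.0, 1000.0]
raw_const = sum(y) / len(y)                        # best constant for MSE on raw targets
log_mean = sum(math.log1p(v) for v in y) / len(y)  # best constant for MSE in log space
log_const = math.expm1(log_mean)                   # transformed back to the original scale
print(rmsle(y, [raw_const] * len(y)), rmsle(y, [log_const] * len(y)))

# Strategy for accuracy: optimize something else (e.g. logloss) to get
# probabilities, then post-process by choosing the decision threshold.
def accuracy(y_true, y_prob, threshold):
    hits = sum((p >= threshold) == bool(t) for t, p in zip(y_true, y_prob))
    return hits / len(y_true)

def best_threshold(y_true, y_prob):
    # Try each distinct predicted probability as a candidate threshold
    candidates = sorted(set(y_prob))
    return max(candidates, key=lambda th: accuracy(y_true, y_prob, th))

labels = [0, 0, 0, 1, 1]
probs = [0.1, 0.2, 0.3, 0.35, 0.45]
th = best_threshold(labels, probs)
print(accuracy(labels, probs, 0.5), th, accuracy(labels, probs, th))
```

In practice the threshold would be tuned on validation data rather than on the training data, which this toy example skips for brevity.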

II. Evaluation criteria inside models

1. Models and their evaluation criteria

[Figure: models and their evaluation criteria; image not preserved]

2. Decision trees

Information entropy
Gini index: the more mixed the classes within a group, the larger the Gini index
Pros and cons:

1. Gini is intended for continuous attributes, and Entropy for attributes that occur in classes (e.g. colors)
2. “Gini” will tend to find the largest class, and “entropy” tends to find groups of classes that make up ~50% of the data (http://paginas.fe.up.pt/~ec/files_1011/week%2008%20-%20Decision%20Trees.pdf)
3. “Gini” to minimize misclassification
4. “Entropy” for exploratory analysis
5. Some studies show this doesn’t matter – these differ less than 2% of the time
6. Entropy may be a little slower to compute
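
Both impurity measures discussed above are easy to compute from class proportions. The sketch below is a minimal stdlib-only illustration; the function names and toy label lists are assumptions for this example. It shows that a pure group scores zero on both measures and that both grow as the classes become more mixed.

```python
import math
from collections import Counter

def class_proportions(labels):
    # Share of each class among the labels
    n = len(labels)
    return [count / n for count in Counter(labels).values()]

def entropy(labels):
    # Shannon entropy in bits: -sum(p * log2(p))
    return -sum(p * math.log2(p) for p in class_proportions(labels))

def gini(labels):
    # Gini impurity: 1 - sum(p^2)
    return 1.0 - sum(p * p for p in class_proportions(labels))

pure = ["a", "a", "a", "a"]
two_way = ["a", "a", "b", "b"]
three_way = ["a", "a", "b", "b", "c", "c"]

for name, group in [("pure", pure), ("two_way", two_way), ("three_way", three_way)]:
    print(name, round(entropy(group), 4), round(gini(group), 4))
```

Entropy needs a logarithm per class while Gini only needs a square, which is consistent with the remark that entropy may be slightly slower to compute.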