Performance Measurement
Confusion Matrix
Reference:
https://towardsdatascience.com/understanding-confusion-matrix-a9ad42dcfd62
It is extremely useful for measuring Recall, Precision, Specificity, Accuracy and, most importantly, AUC-ROC curves.
Just remember: we describe predicted values as Positive and Negative, and actual values as True and False.
TP (True Positive) represents the number of samples that the model correctly predicts as positive.
FN (False Negative) represents the number of samples that the model incorrectly predicts as negative (they are actually positive). Likewise, FP (False Positive) counts samples incorrectly predicted as positive, and TN (True Negative) counts samples correctly predicted as negative.
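As a minimal sketch (assuming scikit-learn is available; the labels below are invented for illustration), these four counts can be read straight off a confusion matrix:

```python
from sklearn.metrics import confusion_matrix

# Invented binary labels: 1 = positive, 0 = negative
y_true = [1, 1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 1, 0]

# For binary labels, confusion_matrix returns [[TN, FP], [FN, TP]]
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print(tn, fp, fn, tp)  # 2 1 1 3
```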
Recall
Recall = \frac{TP}{TP+FN}
In other words: of all the actually positive samples, how many did we predict correctly?
Recall should be as high as possible.
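For example, with the invented labels from the confusion-matrix sketch above (TP = 3, FN = 1):

```python
from sklearn.metrics import recall_score

tp, fn = 3, 1                 # counts from the confusion-matrix sketch above
recall = tp / (tp + fn)       # 3 / 4 = 0.75
# equivalently, straight from the labels:
print(recall_score([1, 1, 1, 1, 0, 0, 0], [1, 1, 0, 1, 0, 1, 0]))  # 0.75
```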
Precision
Precision = \frac{TP}{TP+FP}
In other words: of all the samples we predicted as positive, how many are actually positive?
Precision should be as high as possible.
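Using the same invented labels (TP = 3, FP = 1):

```python
from sklearn.metrics import precision_score

tp, fp = 3, 1                   # counts from the confusion-matrix sketch above
precision = tp / (tp + fp)      # 3 / 4 = 0.75
print(precision_score([1, 1, 1, 1, 0, 0, 0], [1, 1, 0, 1, 0, 1, 0]))  # 0.75
```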
Accuracy
Of all the samples (positive and negative), how many did we predict correctly:
Accuracy = \frac{TP+TN}{TP+TN+FP+FN}
In the example from the referenced article, this works out to 4/7.
Accuracy should be as high as possible.
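With the same invented labels from the sketch above (5 of 7 samples predicted correctly):

```python
from sklearn.metrics import accuracy_score

tp, tn, fp, fn = 3, 2, 1, 1
accuracy = (tp + tn) / (tp + tn + fp + fn)   # 5 / 7 ≈ 0.714
print(accuracy_score([1, 1, 1, 1, 0, 0, 0], [1, 1, 0, 1, 0, 1, 0]))  # 0.714...
```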
F-score
F-measure=\frac{2\times{Recall}\times{Precision}}{Recall+Precision}
It is difficult to compare two models when one has low precision and high recall and the other the reverse. To make them comparable, we use the F-score, which measures Recall and Precision at the same time. It uses the harmonic mean instead of the arithmetic mean, so it punishes extreme values more.
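A quick check that the harmonic-mean formula agrees with scikit-learn's f1_score on the same invented labels:

```python
from sklearn.metrics import f1_score

precision, recall = 0.75, 0.75
f1 = 2 * precision * recall / (precision + recall)   # harmonic mean = 0.75
print(f1_score([1, 1, 1, 1, 0, 0, 0], [1, 1, 0, 1, 0, 1, 0]))  # 0.75
```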
Matthews correlation coefficient (MCC)
MCC = \frac{TN\times{TP}-FN\times{FP}}{\sqrt{(TP+FP)(TP+FN)(TN+FP)(TN+FN)}}
It is considered a reliable measure: MCC produces a high score only if the model does well on all four confusion-matrix categories (TP, TN, FP, FN).
Reference: https://www.voxco.com/blog/matthewss-correlation-coefficient-definition-formula-and-advantages/
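A sketch computing MCC both by the formula and with scikit-learn's matthews_corrcoef (same invented labels as above):

```python
from math import sqrt
from sklearn.metrics import matthews_corrcoef

tp, tn, fp, fn = 3, 2, 1, 1
mcc = (tn * tp - fn * fp) / sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
print(mcc)  # 5 / 12 ≈ 0.417
print(matthews_corrcoef([1, 1, 1, 1, 0, 0, 0], [1, 1, 0, 1, 0, 1, 0]))  # same value
```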
AUC
Reference:
https://towardsdatascience.com/understanding-auc-roc-curve-68b2303cc9c5
- ROC is a probability curve.
- AUC represents the degree or measure of separability.
The higher the AUC, the better the model is at predicting class 0 samples as 0 and class 1 samples as 1.
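A minimal sketch, assuming scikit-learn and invented probability scores; note that roc_auc_score expects scores or probabilities for the positive class, not hard 0/1 predictions:

```python
from sklearn.metrics import roc_auc_score, roc_curve

y_true  = [1, 1, 1, 1, 0, 0, 0]
y_score = [0.9, 0.8, 0.35, 0.7, 0.2, 0.6, 0.1]     # invented predicted probabilities for class 1

print(roc_auc_score(y_true, y_score))              # ≈ 0.917 on these scores; 1.0 = perfect separation
fpr, tpr, thresholds = roc_curve(y_true, y_score)  # points of the ROC curve
```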
TPR (True Positive Rate) / Recall / Sensitivity
TPR = Recall = \frac{TP}{TP+FN}
Specificity
Specificity = \frac{TN}{TN+FP}
FPR
FPR = 1 - Specificity = \frac{FP}{FP+TN}
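scikit-learn has no dedicated specificity function, so both Specificity and FPR are easiest to compute from the confusion-matrix counts (same invented labels as above):

```python
from sklearn.metrics import confusion_matrix

y_true = [1, 1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 1, 0]

tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
specificity = tn / (tn + fp)   # 2 / 3 ≈ 0.667
fpr = fp / (fp + tn)           # 1 / 3 ≈ 0.333, i.e. 1 - specificity
```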