使用MMDetection3.x训练自定义数据集和类别

Arrow

已于 2022-12-16 17:33:04 修改

阅读量2.2k

点赞数 2

分类专栏：训练模型文章标签： caffe 深度学习 python

于 2022-12-15 17:25:07 首次发布

本文链接：https://blog.youkuaiyun.com/MyArrow/article/details/128332717

版权

训练模型专栏收录该内容

9 篇文章

订阅专栏

本文介绍使用MMDetection3.x进行自定义数据集和类别的训练流程，包括数据准备、配置、训练、测试及评估指标详解。涵盖目标检测评估如Precision、Recall计算及COCO数据集上的性能指标。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

1. 安装

2. 训练

2.1 准备数据和配置文件

参考

2.2 训练

训练方式一

python tools/train.py configs/balloon/mask-rcnn_r50-caffe_fpn_ms-poly-1x_balloon.py

会出现如下错误, 且生成的.pth文件不能检测到balloon, 原因是lr太大
训练方式二

python tools/train.py configs/balloon/mask-rcnn_r50-caffe_fpn_ms-poly-1x_balloon.py --auto-scale-lr

2.3 测试

测试图片集并对比显示

python tools/test.py configs/balloon/mask-rcnn_r50-caffe_fpn_ms-poly-1x_balloon.py work_dirs/mask-rcnn_r50-caffe_fpn_ms-poly-1x_balloon/epoch_12.pth --show

测试图片并保存结果

python demo/image_demo.py data/balloon/val/test.jpg configs/balloon/mask-rcnn_r50-caffe_fpn_ms-poly-1x_balloon.py work_dirs/mask-rcnn_r50-caffe_fpn_ms-poly-1x_balloon/epoch_12.pth --out-file 1.jpg

3. 评价指标 (Evaluation Metrics)

3.1 目标检测评估（ Detection Evaluation）

在这里插入图片描述

基于2分类的指标如下：
- Recall 召回率（查全率）：所有真实目标中，模型预测正确的目标比例，其公式为： $R e c a l l = t p r = T P / (T P + F N)$
- Precision 精确率（查准率）：模型预测的所有目标中，预测正确的比例，其公式为： $P r e c i s i o n = T P / (T P + F P)$
- Accuracy：准确率。正确分类（正例分为正例，负例分为负例）的样本数除以所有的样本数，正确率越高，分类器越好。其公式为： $A c c u r a c y = （ T P + T N ） / (T P + T N + F P + F N)$
- FP (False Positive)：IoU<=0.5时的检测框（或者是检测到同一个GT的多余检测框的数量）
  - 误报，即负例中识别为正例的样本。其中在计算roc曲线的时候需要fpr = FP / (FP + TN)
- FN (False Negative)：没有检测到的GT的数量 (漏检)
- TP (True Positive)：IoU>0.5的检测框数量（同一Ground Truth只计算一次）
- TN (True Negative)
- IOU (Intersection-Over-Union)：交并比（交集与并集的比值）
- AP：P-R曲线下面积
- P-R曲线：Precision-Recall曲线
- mAP：mean Average Precision，即各类别AP的平均值 (每个类别有一个AP)

3.1.1 计算Precision和Recall

计算Precision和Recall, Rank为1表示Confidence不小于0.98的才视为正确检测出来，即Confidence取不同的阈值所得到的Precision和Recall，参考目标检测mAP计算以及coco评价标准
计算一类物体的AP

3.1.2 通过P-R曲线计算AP

在这里插入图片描述

3.2 COCO上物体检测器的性能指标 (12个）

指标	描述
Average Precision (AP):
AP	% AP at IoU=0.50:0.05:0.95 (primary challenge metric) IoU从0.5到0.95，间隔为0.05，总共10个IoU(0.5, 0.55, 0.60, …, 0.90, 0.95)，对于每个IoU计算一个AP，再取这10个AP的均值
$AP^{IoU=.50}$	% AP at IoU=0.50 (PASCAL VOC metric)
$AP^{IoU=.75}$	% AP at IoU=0.75 (strict metric)
AP Across Scales:	对不同尺度目标的检测效果
$AP^{small}$	% AP for small objects: $area < 32^2$
$AP^{medium}$	% AP for medium objects: $32^2 < area < 96^2$
$AP^{large}$	% AP for large objects: $area > 96^2$
Average Recall (AR):
$AR^{max=1}$	% AR given 1 detection per image
$AR^{max=10}$	% AR given 10 detections per image
$AR^{max=100}$	% AR given 100 detections per image 通过NMS之后，每个图像最多预测100个目标
AR Across Scales:
$AP^{small}$	% AR for small objects: $area < 32^2$
$AP^{medium}$	% AR for medium objects: $32^2 < area < 96^2$
$AP^{large}$	% AR for large objects: $area > 96^2$

area：测量的面积（area）是分割掩码（segmentation mask）中的像素数量

3.3 MMDetection训练输出评价值

训练过程中的输出值

 Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.561
 Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=1000 ] = 0.757
 Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=1000 ] = 0.681
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=1000 ] = 0.000
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=1000 ] = 0.475
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=1000 ] = 0.629
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.614
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=300 ] = 0.614
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=1000 ] = 0.614
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=1000 ] = 0.000
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=1000 ] = 0.508
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=1000 ] = 0.683