[caffe]深度学习之CNN检测object detection方法摘要介绍

最新推荐文章于 2025-08-26 22:12:17 发布

转载最新推荐文章于 2025-08-26 22:12:17 发布 · 3.9k 阅读

DeepLearning 同时被 3 个专栏收录

181 篇文章

订阅专栏

ADAS_辅助驾驶_自动驾驶

72 篇文章

订阅专栏

Caffe

41 篇文章

订阅专栏

本文回顾了CNN在目标检测领域的快速发展，介绍了RCNN、SPP-NET、FAST-RCNN及FASTER-RCNN等模型。重点阐述了这些模型在检测速度与准确率方面的改进。

[caffe]深度学习之CNN检测object detection方法摘要介绍

2015-08-17 17:44 3276人阅读评论(1) 收藏举报

一两年cnn在检测这块的发展突飞猛进，下面详细review下整个cnn检测领域模型的发展，以及在时间性能上的发展。

一、RCNN

流程：

Extract region(off model) + extract features(on model) + classifyregions according feature (svm or softmax)

性能：

精度：

二、SPP-NET

流程：

先做conv，再根据window提取特征。为什么rcnn不能也这么做呢？原因在于spp对不同尺度进行了max pool处理能更好的满足不同尺度window的特征表达。

性能：

核心思想在全图只做一次conv，这个和overfeat的思想一致

精度：

三、FAST-RCNN

流程：

引入了ROI层pooling，以及multi-task同时训练分类和检测框。

性能：

Compared to SPPnet, Fast R-CNN trains VGG163× faster, tests 10× faster, and is more accurate.

另外还额外提出了fc层SVD的思想

Vgg时间性能分析

精度：

The improvement of Fast R-CNN over SPPnetillustrates that even though Fast R-CNN uses single-scale training and testing,fine-tuning the conv layers provides a large improvement in mAP (from 63.1% to66.9%). Traditional R-CNN achieves a mAP of 66.0%. These results arepragmatically valuable given how much faster and easier Fast R-CNN is to trainand test, which we discuss next.

四、FASTER-RCNN

流程：

在fast-rcnn的基础上，借鉴了FCN的思路，将proposal阶段转化成一个layer加进了网络一起学习。

性能：

cost-free for proposal

精度：

our detection system has a frame rate of5fps (including all steps) on a GPU, while achieving state-of-the-art objectdetection accuracy on PASCAL VOC 2007 (73.2% mAP) and 2012 (70.4% mAP) using300 proposals per image

from： http://blog.youkuaiyun.com/sunbaigui/article/details/47728251