人群密度估计--CNN-based Cascaded Multi-task Learning of High-level Prior and Density Estimation for Crowd

最新推荐文章于 2022-07-13 10:14:55 发布

原创最新推荐文章于 2022-07-13 10:14:55 发布 · 2.9k 阅读

4 ·

CC 4.0 BY-SA版权

人群分析同时被 2 个专栏收录

38 篇文章

订阅专栏

人群分析

37 篇文章

订阅专栏

提出一种基于CNN的级联多任务学习方法，通过引入高层先验知识解决人群密度估计中的视角畸变问题。该方法将图像按人数分类，并结合密度图估计，有效提升了跨尺度变化下的人群计数精度。

CNN-based Cascaded Multi-task Learning of High-level Prior and Density Estimation for Crowd Counting
International Conference on Advanced Video and Signal Based Surveillance (AVSS) 2017
Torch: https://github.com/svishwa/crowdcount-cascaded-mtl

本文主要解决人群密度估计问题中的人群场景变化大的问题，人在场景中的尺度和外观变化范围大
the issue of large variations in scale and appearance of the objects that occurs due to severe perspective distortion of the scene

这里写图片描述

本文提出的解决思路是使用 CNN网络，并在网络中嵌入 high-level prior 先验知识
The aim of this work is to learn models that cater to a wide variety of density levels present in the data set by incorporating a high-level prior into the network.

所谓的 high-level prior 就是根据图像中的大致总人数将图像分类不同的若干类，本文将图像根据总人数分为10类
The high-level prior learns to classify the count into various groups whose class labels are based on the number of people present in the image.

这个 high-level prior 可以不受 scale variations 的影响让我们能够对图像中总人数有一个大致的估计
By exploiting count labels, the high-level prior is able to estimate coarse count of people in the entire image irrespective of scale variations thereby enabling the network to learn more discriminative global features.

3 Proposed method
这里写图片描述
我们的CNN网络前两个卷积用于提取公用特征，接着网络一分为二，一个分支是用于 High-level prior stage，这个分支主要干什么了？Classifying the crowd into several groups， quantize the crowd count into ten groups and learn a crowd count group classifier which also performs the task of incorporating high-level prior into the network