READING NOTE: Spatially Supervised Recurrent Convolutional Neural Networks for Visual Object Trackin

本文提出了一种用于视觉目标跟踪的空间监督循环卷积神经网络。该方法利用YOLO收集丰富的视觉特征并进行初步位置推断,使用LSTM回归视频帧中目标的位置。通过考虑历史位置和外观信息,并采用端到端训练方式实现统一的跟踪系统。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

TITLE: Spatially Supervised Recurrent Convolutional Neural Networks for Visual Object Tracking

AUTHER: Guanghan Ning, Zhi Zhang, Chen Huang, Zhihai He, Xiaobo Ren, Haohong Wang

ASSOCIATION: University of Missouri, University of Missouri

FROM: arXiv:1607.05781

CONTRIBUTIONS

  1. LSTM’s interpretation and regression capabilities of high-level visual features is explored.
  2. Neural network analysis is extended into the spatiotemporal domain for efficient visual object tracking.

METHOD

The main steps of the method is as follows:

  1. YOLO is used to collect rich and robust visual features, as well as preliminary location inferences.
  2. LSTM is used to regress the location of the object in video frames.

Some Details

There are two streams of data flowing into the LSTMs. One stream includes

  1. the feature representations from the convolutional layers Xt , for example the 4096-d feature from the fully-connected layer of VGG.
  2. the detection information Bt,i from the fully connected layers. Thus, at each time-step t, we extract a feature vector of length 4096. We refer to these vectors as Xt .

Another stream includes

  1. the output of states from the last time-step St1 .

ADVANTAGES

  1. The processing speed is fast because of the YOLO algorithm.
  2. History of both location and appearance are considered.
  3. End-to-end training is used in tracking, which means that a unified system is introduced.

DISADVANTAGES

  1. YOLO may be not the best choice for detection.
  2. Only single object is processed.

OTHERS

  1. 4096-d feature is a representation for whole image, how about local representation.
  2. Is there a method of temporal-full convolutional operation on 3D video, similar with fully convolutional operation on 2D image?
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值