Introduction to Visual Tracking

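Visual tracking aims to find targets across consecutive frames. It faces challenges such as illumination changes, occlusion, and viewpoint changes, and is approached with two kinds of methods: bottom-up and top-down. It involves four elements: target representation, observation representation, hypotheses measurement, and hypotheses generating.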

 
    Definition of Visual Tracking

  1. Detection of a new moving object
  2. Locations of the previous objects in the current frame

    --- from paper : Multi-kernel Object Tracking

    The goal of object tracking is to find the targets between the consecutive frames in image sequences.

    --- from paper : Efficient Mean-Shift Tracking via a New Similarity Measure

     

    Challenges in Visual Tracking

    Much effort has been made to solve the problem of real-time object tracking over the years. However, tracking algorithms still suffer from fundamental problems including drifts away from targets (partially due to change of viewpoint), inability to adapt to changes of object appearance, dependence on the first frame for template matching, instability to track objects under deformations (e.g. deformed contours), the inefficiency of Monte Carlo simulations for temporal tracking, and reliance on gradients by active contours, i.e. problems with similar intensities on the background and the object, or high gradient edges on the object itself. These problems are due to the complexity of the object dynamics. We also have to deal with difficult tracking conditions which include illumination changes, occlusions, changes of viewpoint, moving cameras and non-translational object motions like zooming and rotation.

    --- from paper : Mean-Shift Tracking with Random Sampling

     

    Categories of Visual Tracking Algorithms

    Many tracking algorithms have been proposed and implemented to overcome difficulties that arise from noise, occlusion, clutter, and changes in the foreground objects or in the background environment. Gradient-based methods align tracked regions between successive frames by minimizing a cost function using various gradient descent techniques. Feature-based approaches extract features (such as intensity, colors, edges, contours) and use them to establish correspondence between model images and target images. Knowledge-based tracking algorithms incorporate a priori information about the tracked objects to obtain representations such as projected shape, skin complexion, body blobs, kinematic skeletons and silhouettes. Learning-based approaches apply pattern recognition algorithms to learn the objects either in the eigenspace or in the kernel space, and then search for targets in image sequences.

    --- from paper : Efficient Mean-Shift Tracking via a New Similarity Measure
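
To make the feature-based category above concrete, here is a minimal sketch in Python. It is not taken from any of the cited papers. It assumes grayscale frames stored as lists of rows with values in 0..255, uses an intensity histogram as the feature, and scores candidates with the Bhattacharyya coefficient, a similarity measure chosen here only for illustration. The function names `intensity_histogram`, `bhattacharyya`, and `best_match` are hypothetical, and the exhaustive window search stands in for the iterative search a real tracker would use.

```python
import math


def intensity_histogram(image, top, left, height, width, bins=8):
    """Normalized intensity histogram of a rectangular window (values 0..255)."""
    hist = [0.0] * bins
    for r in range(top, top + height):
        for c in range(left, left + width):
            hist[image[r][c] * bins // 256] += 1.0
    total = height * width
    return [h / total for h in hist]


def bhattacharyya(p, q):
    """Bhattacharyya coefficient: 1.0 for identical histograms, 0.0 for disjoint ones."""
    return sum(math.sqrt(a * b) for a, b in zip(p, q))


def best_match(image, model_hist, height, width, bins=8):
    """Exhaustive search (an assumption made for simplicity) for the window
    whose histogram is most similar to the model histogram."""
    best_score, best_pos = -1.0, None
    for top in range(len(image) - height + 1):
        for left in range(len(image[0]) - width + 1):
            cand = intensity_histogram(image, top, left, height, width, bins)
            score = bhattacharyya(model_hist, cand)
            if score > best_score:
                best_score, best_pos = score, (top, left)
    return best_pos, best_score


if __name__ == "__main__":
    # Model image: a bright 2x2 target at (1, 1) on a dark background.
    prev = [[20] * 8 for _ in range(8)]
    prev[1][1], prev[1][2], prev[2][1], prev[2][2] = 200, 220, 240, 200
    # Target image: the same target has moved to (3, 4).
    cur = [[20] * 8 for _ in range(8)]
    cur[3][4], cur[3][5], cur[4][4], cur[4][5] = 220, 200, 200, 240
    model = intensity_histogram(prev, 1, 1, 2, 2)
    print(best_match(cur, model, 2, 2)[0])  # (3, 4)
```

Because a histogram discards the spatial layout of pixels inside the window, the match still succeeds in the demo even though the target's pixels are rearranged. That is one reason histogram features tolerate moderate deformation.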

     

    Framework Classification of Visual Tracking

    Bottom-up and Top-down approaches are two kinds of methodologies to approach the visual tracking problem. Bottom-up approaches generally tend to construct object states by analyzing the content of images. Basically, many segmentation-based methods can be categorized as Bottom-up approaches. For example, blob tracking techniques group similar image pixels into blobs to estimate the positions and shapes of the target. On the contrary, Top-down approaches generate candidate hypotheses from the previous time frame based on a parametric representation of the target. Tracking is achieved by measuring and verifying these hypotheses against image observations. Many model-based and template-matching methods can be categorized as Top-down approaches. Bottom-up methods could be efficient, yet the robustness is largely limited by the ability of image analysis. On the other hand, Top-down approaches depend less on image analysis, but their performances are largely determined by hypotheses generating and verification.

    --- from paper : A Co-inference Approach to Robust Visual Tracking
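
The blob tracking example in the passage above can be sketched as follows. This is only an illustration of the bottom-up idea, not the method of the cited paper. It assumes "similar pixels" means pixels brighter than a fixed threshold, groups them by 4-connectivity, and reports each blob's centroid (position) and bounding box (shape). The name `find_blobs` and the dictionary keys are hypothetical.

```python
from collections import deque


def find_blobs(image, threshold):
    """Group 4-connected pixels brighter than `threshold` into blobs.

    Returns one dict per blob with its centroid (row, col), its bounding box
    (min_row, min_col, max_row, max_col), and its area in pixels.
    """
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    blobs = []
    for r0 in range(rows):
        for c0 in range(cols):
            if seen[r0][c0] or image[r0][c0] <= threshold:
                continue
            # Breadth-first flood fill starting from an unvisited bright pixel.
            queue = deque([(r0, c0)])
            seen[r0][c0] = True
            pixels = []
            while queue:
                r, c = queue.popleft()
                pixels.append((r, c))
                for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
                    if (0 <= nr < rows and 0 <= nc < cols
                            and not seen[nr][nc] and image[nr][nc] > threshold):
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            rs = [p[0] for p in pixels]
            cs = [p[1] for p in pixels]
            blobs.append({
                "centroid": (sum(rs) / len(rs), sum(cs) / len(cs)),
                "bbox": (min(rs), min(cs), max(rs), max(cs)),
                "area": len(pixels),
            })
    return blobs


if __name__ == "__main__":
    frame = [[0] * 6 for _ in range(5)]
    for r in (1, 2):
        for c in (1, 2):
            frame[r][c] = 255  # a 2x2 bright blob
    frame[4][5] = 255          # an isolated bright pixel
    for blob in find_blobs(frame, threshold=128):
        print(blob)
```

Here the object state is built entirely from the image content, with no prediction from the previous frame. This matches the passage's remark that the robustness of such methods depends on how well the image analysis (in this sketch, a plain threshold) separates the target from the background.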

     

    The Four Elements of Visual Tracking

  1. Target representation: To discriminate the target from other objects, target representation, including the target's geometry, motion, appearance, etc., characterizes the target in a state space either explicitly or implicitly. It is a fundamental problem in computer vision.
  2. Observation representation: Closely related to target representation, observation representation defines the image evidence of the object representation.
  3. Hypotheses measurement: Hypotheses measurement evaluates the matching between hypotheses and image observations.
  4. Hypotheses generating: Hypotheses generating is to produce new hypotheses based on the old estimation of the target's representation and the old observation. The target's dynamics could be embedded in such a predicting process. Intuitively, hypotheses generating characterizes the search range and confidence level of the tracking.

    --- from paper : A Co-inference Approach to Robust Visual Tracking
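
The four elements above can be sketched as one step of a simple top-down, template-matching tracker. The sketch rests on several assumptions and is not the co-inference method of the cited paper. The target is represented by a top-left position plus a fixed template patch. The observation is the frame patch at a hypothesized position. Hypotheses are measured with the sum of squared differences (SSD). Hypotheses are generated on a deterministic grid within `radius` pixels of the previous estimate, with no motion model. All function names are hypothetical.

```python
def extract_patch(frame, state, size):
    """Observation representation: the image patch seen at a hypothesized state."""
    top, left = state
    height, width = size
    return [row[left:left + width] for row in frame[top:top + height]]


def measure(template, patch):
    """Hypotheses measurement: sum of squared differences (lower is better)."""
    return sum((t - p) ** 2
               for trow, prow in zip(template, patch)
               for t, p in zip(trow, prow))


def generate_hypotheses(prev_state, radius, frame_shape, size):
    """Hypotheses generating: every in-bounds state within `radius` pixels of
    the previous estimate. `radius` sets the search range; target dynamics
    are deliberately left out of this sketch."""
    top, left = prev_state
    rows, cols = frame_shape
    height, width = size
    return [(r, c)
            for r in range(max(0, top - radius), min(rows - height, top + radius) + 1)
            for c in range(max(0, left - radius), min(cols - width, left + radius) + 1)]


def track_step(frame, template, prev_state, radius=2):
    """One tracking step: generate hypotheses, measure each, keep the best."""
    size = (len(template), len(template[0]))          # target representation (appearance)
    shape = (len(frame), len(frame[0]))
    candidates = generate_hypotheses(prev_state, radius, shape, size)
    return min(candidates,
               key=lambda s: measure(template, extract_patch(frame, s, size)))


if __name__ == "__main__":
    template = [[200, 150], [100, 250]]
    frame = [[10] * 8 for _ in range(8)]
    frame[4][5], frame[4][6] = 200, 150
    frame[5][5], frame[5][6] = 100, 250
    print(track_step(frame, template, prev_state=(3, 4), radius=2))  # (4, 5)
```

If the target moves farther than `radius` between frames, no generated hypothesis covers it and the tracker returns a wrong position. This is the sense in which hypotheses generating determines the search range of the tracking.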

 