卡尔曼预测在视觉跟踪中的运用

最新推荐文章于 2025-04-09 23:23:54 发布

小兮风

最新推荐文章于 2025-04-09 23:23:54 发布

阅读量2.3k

点赞数 4

文章标签：计算机视觉目标跟踪人工智能

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.youkuaiyun.com/qq_43190806/article/details/124933212

版权

卡尔曼预测在视觉跟踪中的运用

本文以byteTrack为例进行分析

byteTrack：论文：code

卡尔曼的五个公式
- 其中A 为状态转移矩阵
- P为协方差矩阵
- K为卡尔曼增益
- H为观测矩阵
在byteTrack中的代码实现过程：
1. 其中卡尔曼的实现在项目的文件中
2. 在视觉的目标跟踪一般是状态变量X采用（x, y, a, h, vx, vy, va, vh）观测变量Z采用(x, y, a, h)
  - 状态变量X分别代表检测框的中心点：x，y 检测框的长宽比率a，以及检测框的高h，剩下的4个表示变换速率
  - 观测变量Z分别代表检测框的中心点：x，y 检测框的长宽比率a，以及检测框的高h
  - （在不太的跟踪器下可能后面的a，h 选择使用S面积，以及高度h）
  - （注意：只要状态变量能够完整描述整个系统即可）
3. 主要是有一个卡尔曼类：
4. 解析其中的代码
  - 在__init__函数中:
  - ```
  def __init__(self):
      ndim, dt = 4, 1.
  
      # Create Kalman filter model matrices.
      self._motion_mat = np.eye(2 * ndim, 2 * ndim)
      for i in range(ndim):
          self._motion_mat[i, ndim + i] = dt
          # _motion_mat = [[1, 0, 0, 0, dt, 0, 0, 0],
          #                [0, 1, 0, 0, 0, dt, 0, 0],
          #                [0, 0, 1, 0, 0, 0, dt, 0],
          #                [0, 0, 0, 1, 0, 0, 0, dt],
          #                [0, 0, 0, 0, 1, 0, 0, 0],
          #                [0, 0, 0, 0, 0, 1, 0, 0],
          #                [0, 0, 0, 0, 0, 0, 1, 0],
          #                [0, 0, 0, 0, 0, 0, 0, 1],
          #                ]   A矩阵   卡尔曼中的运动转移矩阵
          self._update_mat = np.eye(ndim, 2 * ndim)
          # _update_mat = [[1, 0, 0, 0, 0, 0, 0, 0],
          #                [0, 1, 0, 0, 0, 0, 0, 0],
          #                [0, 0, 1, 0, 0, 0, 0, 0],
          #                [0, 0, 0, 1, 0, 0, 0, 0],
          #                ]    H 矩阵   卡尔曼中的 观测矩阵
  
          # Motion and observation uncertainty are chosen relative to the current
          # state estimate. These weights control the amount of uncertainty in
          # the model. This is a bit hacky.
          self._std_weight_position = 1. / 20
          self._std_weight_velocity = 1. / 160
```
- 其中self._motion_mat中存的为卡尔曼中的A矩阵，其具体值为上文现实的样子（为什么是这样，是因为建模就是这样建立的，建立了一个恒定速度的模型）
- 其中self._update_mat是卡尔曼中的观测矩阵，这个很好理解 Z=HX（这是矩阵乘法，不可以随意调换顺序）
- 其中，self._std_weight_position，self._std_weight_velocity是Q、R调整参数我们可以在后面看到
5. def project(self, mean, covariance):函数
  - ```
  def project(self, mean, covariance):
      """Project state distribution to measurement space.
  
      Parameters
      ----------
      mean : ndarray
          The state's mean vector (8 dimensional array).
      covariance : ndarray
          The state's covariance matrix (8x8 dimensional).
  
      Returns
      -------
      (ndarray, ndarray)
          Returns the projected mean and covariance matrix of the given state
          estimate.
  
      """
      std = [
          self._std_weight_position * mean[3],
          self._std_weight_position * mean[3],
          1e-1,
          self._std_weight_position * mean[3]]
      innovation_cov = np.diag(np.square(std))
  
      mean = np.dot(self._update_mat, mean)
      covariance = np.linalg.multi_dot((
          self._update_mat, covariance, self._update_mat.T))
      return mean, covariance + innovation_cov
```
  - 其中mean = HX
  - 其中 innovation_cov为R 测量噪声协方差。滤波器实际实现时，测量噪声协方差 R一般可以观测得到，是滤波器的已知条件。
  - 其中 covariance = $HPH^T$ 为卡尔曼中[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-jBS6e54W-1653305102843)(https://raw.githubusercontent.com/xifen523/img_typora_lib/main/img202205231652554.png)]
6. def initiate(self, measurement)函数：
  - 初始化均值以及方差（就是卡尔曼中的X，以及P）
  - ```
  def initiate(self, measurement):
      """Create track from unassociated measurement.
  
      Parameters
      ----------
      measurement : ndarray
          Bounding box coordinates (x, y, a, h) with center position (x, y),
          aspect ratio a, and height h.
  
      Returns
      -------
      (ndarray, ndarray)
          Returns the mean vector (8 dimensional) and covariance matrix (8x8
          dimensional) of the new track. Unobserved velocities are initialized
          to 0 mean.
  
      """
      mean_pos = measurement
      mean_vel = np.zeros_like(mean_pos)
      mean = np.r_[mean_pos, mean_vel]  # 按照列叠加
      # mean = [ measurement[0],
      #          measurement[1],
      #          measurement[2],
      #          measurement[3],
      #                       0,
      #                       0,
      #                       0,
      #                       0,
      #
      #         ]  X 状态向量
  
      std = [
          2 * self._std_weight_position * measurement[3],
          2 * self._std_weight_position * measurement[3],
          1e-2,
          2 * self._std_weight_position * measurement[3],
          10 * self._std_weight_velocity * measurement[3],
          10 * self._std_weight_velocity * measurement[3],
          1e-5,
          10 * self._std_weight_velocity * measurement[3]]
      covariance = np.diag(np.square(std))
  
      # covariance = [[(2 * self._std_weight_position * measurement[3]) ** 2, 0, 0, 0, 0, 0, 0, 0],
      #               [0, (2 * self._std_weight_position * measurement[3]) ** 2, 0, 0, 0, 0, 0, 0],
      #               [0, 0, 1e-4, 0, 0, 0, 0, 0],
      #               [0, 0, 0, (2 * self._std_weight_position * measurement[3]) ** 2, 0, 0, 0, 0],
      #               [0, 0, 0, 0, (10 * self._std_weight_velocity * measurement[3]) ** 2, 0, 0, 0],
      #               [0, 0, 0, 0, 0, (10 * self._std_weight_velocity * measurement[3]) ** 2, 0, 0],
      #               [0, 0, 0, 0, 0, 0, 1e-5, 0],
      #               [0, 0, 0, 0, 0, 0, 0, (10 * self._std_weight_velocity * measurement[3]) ** 2],
      #               ]   初始化P矩阵   只要不是0矩阵就可以
  
      return mean, covariance
```
- 返回一个X，一个P
- （其实只要维度对上就行了，X值随意，P只要不是0就行）
7. def predict(self, mean, covariance):函数
  - ```
  def predict(self, mean, covariance):
      """Run Kalman filter prediction step.
  
      Parameters
      ----------
      mean : ndarray
      The 8 dimensional mean vector of the object state at the previous
      time step.
      covariance : ndarray
      The 8x8 dimensional covariance matrix of the object state at the
      previous time step.
  
      Returns
      -------
      (ndarray, ndarray)
      Returns the mean vector and covariance matrix of the predicted
      state. Unobserved velocities are initialized to 0 mean.
  
      """
      std_pos = [
      self._std_weight_position * mean[3],
      self._std_weight_position * mean[3],
      1e-2,
      self._std_weight_position * mean[3]]
      std_vel = [
      self._std_weight_velocity * mean[3],
      self._std_weight_velocity * mean[3],
      1e-5,
      self._std_weight_velocity * mean[3]]
      motion_cov = np.diag(np.square(np.r_[std_pos, std_vel]))
  
      # mean = np.dot(self._motion_mat, mean)
      mean = np.dot(mean, self._motion_mat.T)
      covariance = np.linalg.multi_dot((
      self._motion_mat, covariance, self._motion_mat.T)) + motion_cov
  
      return mean, covariance
```
- 其中mean = np.dot(mean, self._motion_mat.T)对应卡尔曼中预测
- covariance = np.linalg.multi_dot((self._motion_mat, covariance, self._motion_mat.T)) + motion_cov对应
- 所以motion_cov是卡尔曼中的Q矩阵，该参数被用来表示状态转换矩阵与实际过程之间的误差。因为我们无法直接观测到过程信号，所以 Q 的取值是很难确定的。是卡尔曼滤波器用于估计离散时间过程的状态变量，也叫预测模型本身带来的噪声。
8. def update(self, mean, covariance, measurement):函数
  - ```
  def update(self, mean, covariance, measurement):
      """Run Kalman filter correction step.
  
      Parameters
      ----------
      mean : ndarray
          The predicted state's mean vector (8 dimensional).
      covariance : ndarray
          The state's covariance matrix (8x8 dimensional).
      measurement : ndarray
          The 4 dimensional measurement vector (x, y, a, h), where (x, y)
          is the center position, a the aspect ratio, and h the height of the
          bounding box.
  
      Returns
      -------
      (ndarray, ndarray)
          Returns the measurement-corrected state distribution.
  
      """
      projected_mean, projected_cov = self.project(mean, covariance)
  
      chol_factor, lower = scipy.linalg.cho_factor(
          projected_cov, lower=True, check_finite=False)
      kalman_gain = scipy.linalg.cho_solve(
          (chol_factor, lower), np.dot(covariance, self._update_mat.T).T,
          check_finite=False).T
      
      innovation = measurement - projected_mean # z-HX
  
      new_mean = mean + np.dot(innovation, kalman_gain.T) # x+K(z-HX)
      new_covariance = covariance - np.linalg.multi_dot((
          kalman_gain, projected_cov, kalman_gain.T))
      return new_mean, new_covariance
```
- 其中 projected_cov= $HPH^T+R$ 为卡尔曼中
- projected_mean = HZ
- - 该部分为卡尔曼增益求解
  - $K_k(HP^-_kH^T+R)=P^-_KH^T$ 其中 $HP^-_kH^T+R)$ 的值就是projected_cov （定义为S的话）
  - 将projected_cov 用scipy.linalg.cho_factor（）分解
  - 再用scipy.linalg.cho_solve（）求解出 $K_k$
- new_covariance = covariance - np.linalg.multi_dot(( kalman_gain, projected_cov, kalman_gain.T)) 为卡尔曼中的：
- 这个地方好像有点不一样最后的P更新：
  
  $P_k=P^-_k-(K_k(HP^-_kH^T+R)K^T_k)$ 这个怎么化简呢? 由前面可知： $K_k(HP^-_kH^T+R)=P^-_KH^T$
  
  则： $P_k=P^-_k-(P^-_KH^T)K^T_k$
- 这个地方化简不下去了但是计算的维度是对的 H是4X8 P是8X8 K是8X4

参考

评论 1

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

打赏作者

小兮风 你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20

扫码支付：¥1

获取中

扫码支付

您的余额不足，请更换扫码支付或充值

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。