faster-rcnn 之 bbox_transform_inv(boxes, deltas)

最新推荐文章于 2020-08-15 01:02:59 发布

原创最新推荐文章于 2020-08-15 01:02:59 发布 · 3.2k 阅读

2 ·

CC 4.0 BY-SA版权

Python 同时被 2 个专栏收录

23 篇文章

订阅专栏

深度学习

14 篇文章

订阅专栏

本文介绍了一个名为 bbox_transform_inv 的函数，该函数用于将边界框的偏移量转换回原始坐标。通过输入的边界框坐标 (boxes) 和偏移量 (deltas)，函数能够计算并返回新的边界框坐标。文章详细解释了函数内部实现过程，包括中心点计算、宽度高度计算等关键步骤。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

def bbox_transform_inv(boxes, deltas):
  if boxes.shape[0] == 0:
    return np.zeros((0, deltas.shape[1]), dtype=deltas.dtype)

  boxes = boxes.astype(deltas.dtype, copy=False)
  widths = boxes[:, 2] - boxes[:, 0] + 1.0
  heights = boxes[:, 3] - boxes[:, 1] + 1.0
  ctr_x = boxes[:, 0] + 0.5 * widths
  ctr_y = boxes[:, 1] + 0.5 * heights

  dx = deltas[:, 0::4]
  dy = deltas[:, 1::4]
  dw = deltas[:, 2::4]
  dh = deltas[:, 3::4]

  pred_ctr_x = dx * widths[:, np.newaxis] + ctr_x[:, np.newaxis]
  pred_ctr_y = dy * heights[:, np.newaxis] + ctr_y[:, np.newaxis]
  pred_w = np.exp(dw) * widths[:, np.newaxis]
  pred_h = np.exp(dh) * heights[:, np.newaxis]

  pred_boxes = np.zeros(deltas.shape, dtype=deltas.dtype)
  # x1
  pred_boxes[:, 0::4] = pred_ctr_x - 0.5 * pred_w
  # y1
  pred_boxes[:, 1::4] = pred_ctr_y - 0.5 * pred_h
  # x2
  pred_boxes[:, 2::4] = pred_ctr_x + 0.5 * pred_w
  # y2
  pred_boxes[:, 3::4] = pred_ctr_y + 0.5 * pred_h

  return pred_boxes