torchvision.ops.nms实现NMS

最新推荐文章于 2025-06-14 23:06:52 发布

原创

最新推荐文章于 2025-06-14 23:06:52 发布 · 6.4k 阅读

30 ·

CC 4.0 BY-SA版权

文章标签：

#计算机视觉 #深度学习 #人工智能

NMS（非极大值抑制）用于目标检测模型中去除重复的检测框，主要步骤包括概率筛选、按概率排序和IOU计算。torchvision.ops.nms函数执行NMS操作，输入包含框坐标和概率，返回保留的框索引。在Yolov8的例子中，通过batchedNMS处理多类别检测，避免不同类别间的误删。

nms原理：
当目标检测模型对一个目标有多个检测框时，需要滤掉多余的框，留下最接近真实目标的框。

在这里插入图片描述

步骤是这样的：
1.先把目标框初筛一波，比如设阈值为0.25, 把预测概率 < 0.25的目标框滤掉。
2.把每个类别的目标框按预测概率从大到小排序。
3.每个类别的目标框两两计算 IOU，当IOU > 阈值时，说明这两个目标框高度重合，没必要都留着，留概率较大的那个。注意这里是同类别的框计算IOU，不同类别的即使高度重合也不滤掉。

这样每个类别就滤掉了多余的框。因为已经按照预测概率从大到小排序，所以两两计算IOU时优先计算的是概率较大的框。

下面看下torchvision.ops.nms的用法，
这里假设有N个目标框。
传入参数boxes为Tensor，shape为[N,4], 每个box坐标格式是(x1,y1,x2,y2),即左上角右下角坐标。
如果你的目标框为(x,y,w,h), 需要做格式转换。
score: Tensor, shape为[N], 每个目标框检测的概率，如果是COCO，预测了80个类别的概率，就把最大的概率取出来。
iou阈值：float型，每个类别内两两目标框IOU > 这个阈值时，扔掉概率小的那个。

用它之前先用概率阈值初筛一波box, 把预测概率很低的box去掉。
需要把box的坐标转为左上角右下角坐标格式。
把box按概率从大到小排序。
提取每个box的预测概率。

torchvision.ops.nms的注释：
```python
def nms(boxes: Tensor, scores: Tensor, iou_threshold: float) -> Tensor:
    """
    Performs non-maximum suppression (NMS) on the boxes according
    to their intersection-over-union (IoU).

    NMS iteratively removes lower scoring boxes which have an
    IoU greater than iou_threshold with another (higher scoring)
    box.

    If multiple boxes have the exact same score and satisfy the IoU
    criterion with respect to a reference box, the selected box is
    not guaranteed to be the same between CPU and GPU. This is similar
    to the behavior of argsort in PyTorch when repeated values are present.

    Args:
        boxes (Tensor[N, 4])): boxes to perform NMS on. They
            are expected to be in ``(x1, y1, x2, y2)`` format with ``0 <= x1 < x2`` and
            ``0 <= y1 < y2``.
        scores (Tensor[N]): scores for each one of the boxes
        iou_threshold (float): discards all overlapping boxes with IoU > iou_threshold

    Returns:
        Tensor: int64 tensor with the indices of the elements that have been kept
        by NMS, sorted in decreasing order of scores
    """