BBOX_cv

最新推荐文章于 2024-07-24 19:07:58 发布

归一码字

最新推荐文章于 2024-07-24 19:07:58 发布

阅读量368

点赞数 2

CC 4.0 BY-SA版权

文章标签： python 前端开发语言

本文链接：https://blog.youkuaiyun.com/m0_46221545/article/details/138766145

在深度学习中，特别是在物体检测领域，BBOX（Bounding Box）是包含物体的最小矩形，该物体应在最小矩形内部。物体检测中关于物体位置的信息输出通常是一组(x,y,w,h)数据，其中x,y代表着BBOX的左上角（或者其他固定点，可自定义），而w,h分别表示BBOX的宽和高。这一组(x,y,w,h)数据可以唯一地确定一个定位框。

在深度学习的物体检测任务中，BBOX的预测是一个关键步骤。通过训练深度学习模型，如Faster R-CNN、YOLO、SSD等，模型可以学习如何从输入图像中预测出物体的BBOX，从而实现对物体的定位和识别。

class BBox:
    def __init__(self, x, y, r, b, score=0):
        self.x, self.y, self.r, self.b, self.score = x, y, r, b, score

    def __xor__(self, other):
        '''
        计算box和other的IoU
        '''
        cross = self & other
        union = self | other
        return cross / (union + 1e-6)

    def __or__(self, other):
        '''
        计算box和other的并集
        '''
        cross = self & other
        union = self.area + other.area - cross
        return union

    def __and__(self, other):
        '''
        计算box和other的交集
        '''
        xmax = min(self.r, other.r)
        ymax = min(self.b, other.b)
        xmin = max(self.x, other.x)
        ymin = max(self.y, other.y)
        cross_box = BBox(xmin, ymin, xmax, ymax)
        if cross_box.width <= 0 or cross_box.height <= 0:
            return 0

        return cross_box.area

    def locations(self):
        return self.x, self.y, self.r, self.b

    @property
    def center(self):
        return (self.x + self.r) / 2, (self.y + self.b) / 2

    @property
    def area(self):
        return self.width * self.height

    @property
    def width(self):
        return self.r - self.x + 1

    @property
    def height(self):
        return self.b - self.y + 1

    def __repr__(self):
        return f"{{{self.x:.2f}, {self.y:.2f}, {self.r:.2f}, {self.b:.2f}, {self.score:.2f}}}"