[Object Detection: Reproducing the DETR Source Code]

Latest recommended article published 2024-10-31 10:32:58

m0_57114626

Tags: object detection, artificial intelligence, computer vision

Article link: https://blog.youkuaiyun.com/m0_57114626/article/details/142998846

Contents

I. Introduction to DETR
II. Usage Steps

I. Introduction to DETR

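DEtection TRansformer (DETR) is a framework from researchers at Facebook AI that applies the Transformer to vision, covering object detection and panoptic segmentation. It is the first object detection framework to successfully integrate the Transformer as the central building block of the detection pipeline.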

1. Code (GitHub): https://github.com/facebookresearch/detr
2. Paper (Papers with Code):
End-to-End Object Detection with Transformers

II. Usage Steps

1. Download the code and open it in PyCharm. Create a virtual environment, activate it, then open the Terminal and run:

pip install -r requirements.txt

To install the CUDA builds of torch and torchvision, you can look up the matching command on the PyTorch website. The one I used is:

pip install torch==2.2.2 torchvision==0.17.2 torchaudio==2.2.2 --index-url https://download.pytorch.org/whl/cu118

With that, the basic environment should be ready.
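Before moving on, it can help to confirm which PyTorch build actually ended up in the virtual environment. The following is a minimal sketch, not part of the DETR repository: the helper name `describe_torch_env` is made up for illustration, and it only reports what it finds.

```python
import importlib.util


def describe_torch_env():
    """Return a short, human-readable summary of the installed PyTorch setup."""
    # find_spec avoids an ImportError when torch is not installed yet.
    if importlib.util.find_spec("torch") is None:
        return "torch is not installed in this environment"

    import torch

    lines = [f"torch {torch.__version__}"]
    if importlib.util.find_spec("torchvision") is not None:
        import torchvision
        lines.append(f"torchvision {torchvision.__version__}")

    # torch.version.cuda is None for CPU-only builds.
    lines.append(f"built with CUDA: {torch.version.cuda}")
    lines.append(f"CUDA available at runtime: {torch.cuda.is_available()}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(describe_torch_env())
```

If the last line of the output reports `False` even though a cu118 wheel was installed, the usual suspects are the GPU driver or a CPU-only wheel that was pulled in by `requirements.txt` first.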

2. Dataset: COCO
Layout:

path/to/coco/
  annotations/  # annotation json files
  train2017/    # train images
  val2017/      # val images
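A quick check of the directory layout before training can save a failed run. This is a small sketch under the assumption that the dataset root contains exactly the three sub-directories listed above; `missing_coco_dirs` is a hypothetical helper, not something the DETR repository provides.

```python
from pathlib import Path

# Sub-directories taken from the layout listed above.
EXPECTED_DIRS = ("annotations", "train2017", "val2017")


def missing_coco_dirs(coco_root):
    """Return the expected sub-directories that are absent under coco_root."""
    root = Path(coco_root)
    return [name for name in EXPECTED_DIRS if not (root / name).is_dir()]


if __name__ == "__main__":
    # "coco" is the dataset root; replace it with your own path.
    missing = missing_coco_dirs("coco")
    if missing:
        print("Missing directories:", ", ".join(missing))
    else:
        print("COCO layout looks complete.")
```

The check only looks at directory names. It does not verify that the annotation JSON files or the images themselves are present.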

3. Model
[Image not available]
4. Train the model

python -m torch.distributed.launch --nproc_per_node=1 --use_env main.py --coco_path coco
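If you prefer to launch training from a script rather than typing the command, the argument list can be assembled in Python. The sketch below reproduces the command above. The optional `--epochs` and `--output_dir` flags are my assumption about what `main.py` accepts (worth confirming with `python main.py --help`), and `outputs` is just a placeholder directory name.

```python
import shlex
import sys


def build_train_command(coco_path, nproc_per_node=1, epochs=None, output_dir=None):
    """Assemble the DETR training command as an argument list."""
    cmd = [
        sys.executable, "-m", "torch.distributed.launch",
        f"--nproc_per_node={nproc_per_node}", "--use_env",
        "main.py", "--coco_path", str(coco_path),
    ]
    # Optional flags: assumed to match main.py's argument parser.
    if epochs is not None:
        cmd += ["--epochs", str(epochs)]
    if output_dir is not None:
        cmd += ["--output_dir", str(output_dir)]
    return cmd


if __name__ == "__main__":
    cmd = build_train_command("coco", epochs=10, output_dir="outputs")
    print(shlex.join(cmd))
    # To actually launch training from the repository root:
    # import subprocess; subprocess.run(cmd, check=True)
```

Keeping the command as a list (rather than one string) avoids quoting problems when the dataset path contains spaces.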

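Training output:
[Image not available]
Training ran for ten epochs. After each epoch the statistics are printed and saved to the log.txt file:

[Image not available]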
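To inspect the per-epoch statistics without scrolling through the file, the log can be loaded in a few lines of Python. This sketch assumes that log.txt holds one JSON object per line and that each object has keys such as `epoch` and `train_loss`; check a line of your own file first, since the key names here are an assumption and the sample values are made up.

```python
import json


def read_training_log(lines):
    """Parse log lines (one JSON object per line) into a list of dicts."""
    records = []
    for line in lines:
        line = line.strip()
        if line:  # skip blank lines
            records.append(json.loads(line))
    return records


def summarize(records, key="train_loss"):
    """Return (epoch, value) pairs for records that contain the given key."""
    return [(r.get("epoch"), r[key]) for r in records if key in r]


if __name__ == "__main__":
    # Made-up sample lines standing in for a real log.txt.
    sample = [
        '{"epoch": 0, "train_loss": 2.5}',
        '{"epoch": 1, "train_loss": 2.1}',
    ]
    for epoch, loss in summarize(read_training_log(sample)):
        print(f"epoch {epoch}: train_loss={loss}")
    # With a real file: read_training_log(open("path/to/log.txt", encoding="utf-8"))
```

Passing a different `key` to `summarize` pulls out any other statistic that appears in the log records.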
5. Prediction

The prediction script predict.py is as follows:

from PIL import Image
import matplotlib.pyplot as plt
import torch
import torchvision.transforms as T

torch.set_grad_enabled(False)

# COCO classes
CLASSES = [
    'N/A', 'person', 'bicycle', 'car', 'motorcycle', 'airplane', 'bus',
    'train', 'truck', 'boat', 'traffic light', 'fire hydrant', 'N/A',
    'stop sign', 'parking meter', 'bench', 'bird', 'cat', 'dog', 'horse'
