RoboMaster 目标检测训练（官方数据集）附完整Demo代码

最新推荐文章于 2025-11-01 19:18:56 发布

原创

最新推荐文章于 2025-11-01 19:18:56 发布 · 1.5w 阅读

211 ·

CC 4.0 BY-SA版权

本指南介绍如何使用PyTorch和zisan包在个人电脑上轻松训练YoloV3模型，即使设备配置较低也能高效运行。文章详细展示了从数据准备到模型训练的全过程，包括代码示例和参数调整建议。

参考: Yolov3 训练自己的数据集 Pytorch 最简单最少代码最易调参

名言：大部分人都是死在了调参和配置的路上。。。。。然后归咎于设备不行。好了，现在你不需要再担心这些问题了，因为下面这个教程一般来说都不会怎么报错。
首先声明，以下训练是在我的笔记本上完成的，我的是那种很菜鸡的轻薄本，市面上任何一款游戏本都可以吊打它，显卡型号：MX150 2048MB。读者完全可以迁移到更高级设备上进行训练，请留意本文关于迁移到其他设备的说明。

下载数据集

Robomaster2019 数据集上官网就有的下载，文件名称：DJCOCO.zip
数据集官网下载地址：https://terra-1-g.djicdn.com/b2a076471c6c4b72b574a977334d3e05/resources/DJI%20ROCO.zip
数据集百度云下载地址：链接：https://pan.baidu.com/s/1Ezh1ip8ZOLJeVzhBD9JuOQ 提取码：ytls

安装第三方快速训练包：芷山

使用这个zisan包之前确保你已经配置好PyTorch+CUDA+CUDNN环境，如果你还没配置环境，移步到这里：Pytorch+CUDA+CUDNN配置教程
然后安装如下包：

pip install numpy pillow matplotlib opencv-python

然后安装zisan：

pip install zisan

一般来说不会有任何报错，如果报错那就是之前的环境配置出现问题，请仔细检查。因为我们使用的深度学习框架是Pytorch，不像tensorflow会有那么多api接口版本迭代问题，pytorch非常简洁优美。
这里提供 ‘zisan芷山’ 的官网：http://jintupersonal.com/zisan/
相关文档资料：http://jintupersonal.com/zisan/doc/1.html
这个第三方包是开源的，可以到Github自取（求点一个star）：https://github.com/JintuZheng/zisan
目前迭代版本1.0.11，更新频率比较快，我也来不及写文档，有的接口用法请读者自行浏览源码理解使用。

训练示例

（1）数据集和本次训练目标

dataset
数据集非常大，一共9.8GB，作为本次演示我不会训练完所有数据集，设备也不允许嘛，作为演示，我先实现识别annotations标签里面的car识别，就是把车子抠出来。
如下图，只把黄色car标签训练识别出来。
在这里插入图片描述
另外说一句，zisan里面底层使用的是Yolov3的Pytorch版本复现代码，是优化过的版本。

（2）打造一个训练目录

首先，我们下载预训练权重文件和zisan的目标检测配置文件，下载地址：
百度云：https://pan.baidu.com/s/1qj-Lpe4OKV0L-w9uKO8EFw
提取码：x9wl
它有图像语义分割权重和目标检测权重，我们只需要完成训练数据集的目标检测任务，只需要Yolov3的权重，找到 runBox.zip （475 MB）下载：

把runBox解压到一个没有中文路径的地方，里面的文件如下：
runbox inside
此时，cfgs和weights文件夹是有权重和网络配置文件的，我们不要也不需要取改动它。
然后，我们跑回到我们之前下载的那个Robomaster2019数据集里面，任意选一个比赛的文件夹，在image文件选246张图片
注意，我这里为了演示才选那么少，不过迭代30次已经有一定效果了，很明显的，设备好一点的可以选择1000-2000张左右一次训练，下次训练我们使用冷却的权重再训练其他更多的图片。
我们把image的图片选246张复制到刚才runBox/data/images/文件夹下面，然后把相应的246个xml标签文件复制到runBox/data/Annotations/文件夹下面，睁大你的卡斯兰大眼睛！！别复制错了，只是两个文件夹，其余的文件夹不要管也千万不要删除。
在这里插入图片描述
至此，我们的训练目录打造完成，接下来我们只需要很少量的代码就能完成训练。

（3）编写train.py

我们在runBox文件夹新建一个py文件
newtrain
train.py:

from zisan.ObjDetect.Interface import ObjDetect_train, ObjDetect_Preprocess
import os

if __name__ == "__main__":  

    pr=ObjDetect_Preprocess(classnames=['car'],currentpath='D:/xxx/runBox') # cuurentpath is needed, it is your runBox path, car是我们需要训练的类别名称
    #pr.clear_data() #clear all data  
    trainModel=ObjDetect_train(currentpath='D:/xxx/runBox')
    trainModel.Run(cfg='yolov3-tiny.cfg',epochs=35,img_size=(1920/3,1080/3)) # 开始训练，训练好的权重文件保存在 weights文件夹里面

解释：
（1）classnames=[‘class1’,‘class2’]，这里是描述你annotation里面xml文件标签的类别名字，我这里只训练一类，就识别所有的车，所以classnames=['car']，必须和xml文件里面的类名字保持一致。

（2）Yolov3版本选择：
zisan一共提供三种版本：Yolov3-tiny，Yolov3-spp，Yolov3，其中要求最低配置的就是tiny了。我也说过了我这个破机器就只能跑tiny了。如果你有更好的机器，你可以把cfg参数cfg='yolov3-tiny.cfg'换成：

‘yolov3-spp.cfg’
‘yolov3.cfg’

（小心引号别漏了）

（4）其他参数说明：
如果你不熟悉调参，请不要随意调参数。使用默认参数即可（我已经调过了）

epochs: The times you loop training.
batch_size: The sum of once you
put into training. cfg: You can choose ‘yolov3-tiny.cfg’,
‘yolov3-spp.cfg’ and ‘yolov3.cfg’, you must sure the weights folder
has the corresponding weight.
img_size: You can set as (height,width),
also like above 416 means (416,416)
resume: Due to the limitation of device resources, you may not be able to train too much data at a time. At this time, you can use resume to continue training for the weight of last cooling
num_workers: Multithreading, you must use main to use this nosave: if save each epoch weight

（5）关于图片放缩参数问题：

img_size=(1920/3,1080/3)

最低0.47元/天解锁文章

58 条评论

Keyd 2023.11.05
请问为什么会有这个报错阿 FileNotFoundError: [Errno 2] No such file or directory: 'home/keyd/Downloads/runBox/data/classes.names' 不是本来data文简夹里就没有这个.names吗

酩语593 2023.08.19
ValueError Traceback (most recent call last) e:\zisan\ObjectDetect_data_weights\runBox\train.py in line 6 4 pr=ObjDetect_Preprocess(classnames=['car'],currentpath='./') # cuurentpath is needed, current path parameter is your runBox path 5 trainModel=ObjDetect_train(currentpath='./') ----> 6 trainModel.Run(cfg='yolov3-tiny.cfg',epochs=10) File c:\Users\Administrator\AppData\Local\Programs\Python\Python39\lib\site-packages\zisan\ObjDetect\Interface.py:492, in ObjDetect_train.Run(self, epochs, batch_size, accumulate, cfg, multi_scale, img_size, resume, transfer, num_workers, backend, nosave, notest, evolve, var) 489 self.global_nosave = True # do not save checkpoints 491 # Train --> 492 results = self.rawtrain( 493 self.global_cfg, ValueError: Number of rows must be a positive integer, not 2.0 还在吗，想问一下为什么会报这个错误，是我的版本问题吗

北斗星影 2021.05.02
作者您好，想打扰一下，我在训练的时候报这个错误：“cv2.error: OpenCV(4.4.0) /tmp/pip-install-bericugh/opencv-python/opencv/modules/core/src/copy.cpp:1415: error: (-215:Assertion failed) top >= 0 && bottom >= 0 && left >= 0 && right >= 0 && _src.dims() <= 2 in function 'copyMakeBorder' ”主要是什么原因呢？
- 夜何其回复北斗星影 2021.09.04
  你试试把img_size=(1920/3,1080/3)这句话给去掉

weixin_48341307 2021.04.06
Traceback (most recent call last): File "f:\runbox\detect.py", line 10, in <module> re,im0=detectModel.detect_from_RGBimg(img,is_showPreview=True) File "F:\python\lib\site-packages\zisan\ObjDetect\Interface.py", line 606, in detect_from_RGBimg det = non_max_suppression(pred, conf_thres, nms_thres)[0] File "F:\python\lib\site-packages\zisan\ObjDetect\utils\utils.py", line 389, in non_max_suppression pred[:, 4] *= class_conf RuntimeError: Output 0 of SelectBackward is a view and is being modified inplace. This view is the output of a function that returns multiple views. Such functions do not allow the output views to be modified inplace. You should replace the inplace operation by an out-of-place one. 本人小白，在运行detect文件的时候报错了，为什么啊

下次一定毕业 2020.11.15
请问博主这是啥问题，感谢感谢！ Traceback (most recent call last): File "E:\runBox\detect.py", line 10, in <module> re,im0=detectModel.detect_from_RGBimg(img,is_showPreview=True) File "C:\Users\86131\anaconda3\lib\site-packages\zisan\ObjDetect\Interface.py", line 606, in detect_from_RGBimg det = non_max_suppression(pred, conf_thres, nms_thres)[0] File "C:\Users\86131\anaconda3\lib\site-packages\zisan\ObjDetect\utils\utils.py", line 389, in non_max_suppression pred[:, 4] *= class_conf RuntimeError: Output 0 of UnbindBackward is a view and its base or another view of its base has been modified inplace. This view is the output of a function that returns multiple views. Such functions do not allow the output views to be modified inplace. You should replace the inplace operation by an out-of-place one.
- JintuZheng回复亲近、亲近 2021.05.21
  这句没必要：img = np.array(img)
- 亲近、亲近回复JintuZheng 2021.05.20
  我也是会报这个错，[code=python] img=io.imread('0train_batch56.jpg') img=cv2.resize(img,(480,640)) # Here rechange for your train images set Height and width img = np.array(img) print(img.shape) [/code] (640, 480, 3)，这个图片的shape,可以指点一下吗？谢谢老哥
- 下次一定毕业回复JintuZheng 2020.11.15
  okk，感谢感谢
- JintuZheng回复下次一定毕业 2020.11.15
  模型输出判别失败了，检查一下你的图片数据格式是否是(RGB)的numpy数组，或者是否图片路径是否正确？

Baron3 2020.08.18
小白不懂就问：这个是对比赛没大用处么？
- 白告.回复Baron3 2021.02.18
  不清楚，现在的视觉还是采用的传统识别，没有用到深度学习

子金水 2020.07.29
pr=ObjDetect_Preprocess(classnames=['car','armor','red','blue'], 作者你好，请问如果我想添加多个类，这样改动之后训练发现也只有car，请问是哪里出了问题呢
- JintuZheng回复子金水 2020.07.30
  [reply]weixin_42194874[/reply]我没记错的话，几乎每张图都有car这个标签，那肯定这个类的样本很多的，但是blue和red这两个标签并不经常出现，你可以写一个筛选脚本，对xml文件的内容进行筛选，找出带有red和blue的样本，这样训练的话或许能解决你这个问题
- 子金水回复JintuZheng 2020.07.30
  [reply]rizero[/reply]前两项，我检查后发现没有问题，但是，各类样本的数据的数量，我需要怎么检查呢，初次实验时，按照作者步骤来的，添加了图片文件和xml文件，后来，训练多个类的时候，并没有改动这里，请问是这里出了问题嘛，如有问题，请问需要改动哪里呢
- JintuZheng回复子金水 2020.07.29
  [reply]weixin_42194874[/reply]这个标签的名称你需要保证几件事情，第一，有一个classnames的文件，你去那里看一下是否和你设置的一致，第二，标签的名字是否和xml的名字一致（注意大小写），第三，各类样本的数据的数量

Crankish. 2020.07.05
运行detect.py 报错OSError: [WinError 126] 找不到指定的模块。 windows环境下怎么回事？卡在这最后一步
- JintuZheng回复Crankish. 2020.07.18
  [reply]weixin_44731182[/reply]显卡驱动问题