Strip-R-CNN: Troubleshooting Training on the FAIR and DIOR Datasets


Strip-R-CNN: official implementation of "Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection". Project repository: https://gitcode.com/gh_mirrors/st/Strip-R-CNN

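Why dataset format conversion matters

Strip-R-CNN is an object detection framework built on rotated bounding boxes, and it places strict requirements on dataset format. In practice, many users see training fail on the FAIR dataset, mainly because the dataset's original annotation format does not match what the project expects.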

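The FAIR dataset must be converted to DOTA format before it can be used. The project provides a dedicated conversion script, fair_to_dota.py, in the tools/data/fair/ directory. The conversion involves:

Reorganizing the annotation file structure
Adjusting the coordinate representation
Unifying the file naming convention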
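
As a rough illustration of the target format: a DOTA-style annotation file is plain text with one object per line, made up of eight polygon coordinates followed by a category name and a difficulty flag. The sketch below is not the project's fair_to_dota.py. The helper name to_dota_line, the sample coordinates, and the choice to replace spaces in category names with hyphens are all assumptions made for this example.

```python
from typing import Sequence, Tuple


def to_dota_line(
    points: Sequence[Tuple[float, float]],
    category: str,
    difficulty: int = 0,
) -> str:
    """Format one oriented box as a DOTA-style annotation line.

    Hypothetical helper for illustration only.
    """
    if len(points) != 4:
        raise ValueError("a DOTA box needs exactly four corner points")
    # Fields are space-separated, so a category name must not contain spaces.
    # Replacing them with hyphens is an assumption of this sketch.
    name = category.strip().replace(" ", "-")
    coords = " ".join(f"{x:.1f} {y:.1f}" for x, y in points)
    return f"{coords} {name} {difficulty}"


# Sample values are made up for the example.
line = to_dota_line([(10, 20), (110, 20), (110, 60), (10, 60)], "Cargo Truck")
print(line)  # 10.0 20.0 110.0 20.0 110.0 60.0 10.0 60.0 Cargo-Truck 0
```

Every converted line should split into exactly ten whitespace-separated fields, which is a quick sanity check to run on a few output files.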

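Common errors and how to fix them

1. Distributed sampler error

The "ZeroDivisionError: division by zero" error is usually caused by a misconfigured dataset path. When the data loader cannot find any valid samples, the dataset is empty and the distributed sampler raises this exception. To fix it:

Check that the dataset path is correct
Confirm that the annotation file suffix matches
Verify that the dataset was successfully converted to DOTA format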
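
The checks above can be run before training with a small script. This is a minimal sketch, assuming images and annotations share the same file stem. The function name count_matched_samples, the default suffixes, and the example paths are placeholders rather than values taken from the project.

```python
from pathlib import Path


def count_matched_samples(img_dir, ann_dir, img_suffix=".png", ann_suffix=".txt"):
    """Count images that have an annotation file with the same stem."""
    img_path, ann_path = Path(img_dir), Path(ann_dir)
    if not img_path.is_dir() or not ann_path.is_dir():
        return 0  # a wrong path behaves like an empty dataset
    ann_stems = {p.stem for p in ann_path.glob(f"*{ann_suffix}")}
    return sum(1 for p in img_path.glob(f"*{img_suffix}") if p.stem in ann_stems)


if __name__ == "__main__":
    # Placeholder paths; replace them with the directories from your config.
    n = count_matched_samples("data/FAIR/trainval/images",
                              "data/FAIR/trainval/annfiles")
    print(f"matched samples: {n}")
```

A result of 0 points to one of the three causes listed above: a wrong path, a suffix mismatch, or a conversion step that produced no files.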

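2. Duplicate configuration keys

The "Duplicate key is not allowed among bases" error seen when training on the DIOR dataset means that the same top-level key is defined more than once across the inherited configuration files. Here the evaluation parameter is defined both in a base config and in the config being edited. To fix it:

Delete the evaluation = dict(metric='mAP') line from the current config file
Make sure the inherited base configs do not define the same parameter twice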
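
To make the rule concrete, here is a simplified imitation of the duplicate-key check, written with plain dictionaries. It is not the config loader's actual code; the function name duplicate_base_keys and the contents of the two toy base configs are made up for the example.

```python
from typing import Any, Dict, List, Set


def duplicate_base_keys(base_cfgs: List[Dict[str, Any]]) -> Set[str]:
    """Return top-level keys that appear in more than one base config."""
    seen: Set[str] = set()
    duplicates: Set[str] = set()
    for cfg in base_cfgs:
        duplicates.update(seen.intersection(cfg))
        seen.update(cfg)
    return duplicates


# Toy stand-ins for two base files (contents are invented for illustration).
dataset_base = {"dataset_type": "ExampleDataset",
                "evaluation": {"metric": "mAP"}}
schedule_base = {"evaluation": {"interval": 1, "metric": "mAP"},
                 "optimizer": {"type": "SGD"}}

print(duplicate_base_keys([dataset_base, schedule_base]))  # {'evaluation'}
```

Removing the key from one of the two files leaves the returned set empty, which mirrors the fix described above.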

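3. Unexpected argument error

The "TypeError: __init__() got an unexpected keyword argument 'imgset'" error means the dataset constructor was passed an argument it does not support. The DIOR dataset configuration should follow the format the project specifies exactly: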

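# dataset_type, data_root, train_pipeline and test_pipeline are assumed to be
# defined earlier in the same config file. Note that no imgset key is passed.
data = dict(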
    samples_per_gpu=1,
    workers_per_gpu=1,
    train=dict(
        type=dataset_type,
        ann_file=data_root + 'trainval/annfiles/',
        img_prefix=data_root + 'trainval/images/',
        pipeline=train_pipeline),
    val=dict(
        type=dataset_type,
        ann_file=data_root + 'trainval/annfiles/',
        img_prefix=data_root + 'trainval/images/',
        pipeline=test_pipeline),
    test=dict(
        type=dataset_type,
        ann_file=data_root + 'test/images/',
        img_prefix=data_root + 'test/images/',
        pipeline=test_pipeline))
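
To find which key triggers a TypeError like the one above, a config dictionary can be compared with the constructor's signature before the dataset is built. This is a minimal sketch: ToyDataset is a hypothetical stand-in, not the project's dataset class, and the sample values are made up. The type key is skipped on the assumption that it only selects the class and is not forwarded to the constructor.

```python
import inspect
from typing import Any, Dict, List


class ToyDataset:
    """Hypothetical stand-in for a dataset class; not the project's code."""

    def __init__(self, ann_file: str, img_prefix: str, pipeline: List[Any]):
        self.ann_file = ann_file
        self.img_prefix = img_prefix
        self.pipeline = pipeline


def unexpected_kwargs(cls: type, cfg: Dict[str, Any]) -> List[str]:
    """List config keys that cls.__init__ would reject as unexpected."""
    params = inspect.signature(cls.__init__).parameters
    # If __init__ takes **kwargs, any key is accepted.
    if any(p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values()):
        return []
    # "type" is assumed to select the class rather than being passed on.
    return [k for k in cfg if k != "type" and k not in params]


cfg = dict(type="ToyDataset", ann_file="annfiles/", img_prefix="images/",
           pipeline=[], imgset="trainval.txt")
print(unexpected_kwargs(ToyDataset, cfg))  # ['imgset']
```

Any key the function reports should be removed from the corresponding train, val, or test dictionary.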