PyTorch-Faster R-CNN模型训练好了后进行预测出现size mismatch for head.cls_loc.weight/cls_loc.bias/weight/bias

原创

已于 2022-05-12 19:15:50 修改 · 3.4k 阅读

7 ·

CC 4.0 BY-SA版权

文章标签：

#pytorch #cnn #深度学习

于 2022-05-12 19:15:24 首次发布

博主在尝试使用Faster R-CNN模型进行预测时遇到RuntimeError，问题在于模型参数与加载的检查点参数尺寸不一致。经过分析，博主发现num_classes的值可能不正确。通过检查classes.txt文件确定类别数量，并调整num_classes的值，问题得到解决。关键在于正确设置num_classes以匹配模型和检查点的类别数。

一、问题描述

在大牛的一个讲解训练Faster R-CNN的B站视频上，我依他的步骤训练完了模型。

然后进行预测的时候，出现了以下错误：

RuntimeError: Error(s) in loading state_dict for FasterRCNN:
	size mismatch for head.cls_loc.weight: copying a param with shape torch.Size([40, 2048]) from checkpoint, the shape in current model is torch.Size([36, 2048]).
	size mismatch for head.cls_loc.bias: copying a param with shape torch.Size([40]) from checkpoint, the shape in current model is torch.Size([36]).
	size mismatch for head.score.weight: copying a param with shape torch.Size([10, 2048]) from checkpoint, the shape in current model is torch.Size([9, 2048]).
	size mismatch for head.score.bias: copying a param with shape torch.Size([10]) from checkpoint, the shape in current model is torch.Size([9]).

二、解决思路

显然这是模型参数和输入参数之间不匹配的问题，但是我不知道问题出在哪个参数

最低0.47元/天解锁文章

9 条评论

huifeideyu12123 2024.09.18
还是没有解决!害！

IS大威天龙 2022.12.23
这个get_classes函数在哪个文件里啊
- CherishC1回复IS大威天龙 2024.03.04
  utils\utils.py

wk0216 2022.10.23
有大佬解决这个问题了吗？求教

LW0020 2022.10.03
我觉得这可能是加载进来的权重不匹配的问题
- 努力看代码回复LW0020 2022.11.24
  我自己是把原来代码的5个类别改成1000就对了，不过我不知道这样对不对[face]emoji:051.png[/face]
- 努力看代码回复LW0020 2022.11.24
  我也是这个问题，我用的imagenet的权重，有1000个类别，我那个数据集只有5个，训练的时候也是这个问题，请问您解决了吗？