pytorch的预训练模型的使用

最新推荐文章于 2025-05-08 16:12:26 发布

冬日and暖阳

最新推荐文章于 2025-05-08 16:12:26 发布

阅读量3.6k

点赞数

分类专栏： deeplearning pytorch 文章标签： pytorch 深度学习 python

本文链接：https://blog.youkuaiyun.com/qq_29007291/article/details/123855457

版权

pytorch 同时被 2 个专栏收录

32 篇文章

订阅专栏

deeplearning

4 篇文章

订阅专栏

本文介绍了PyTorch中预训练模型的基本使用方法，强调了为了获得最佳效果及快速收敛，在进行迁移学习时需遵循特定的图像预处理步骤。包括调整图像通道顺序为RGB，确保张量尺寸为C×H×W，并使用特定均值和标准差进行归一化。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

背景

pytorch中有很多在ImageNet上训练得到的预训练模型，可以拿来做迁移学习（如下图）。但是使用的时候需要注意，如果想得到最佳的效果以及最快的收敛速度，那么迁移学习的时候，预处理部分需要和这些模型在ImageNet上训练的时候保持一直。

在这里插入图片描述

对应的预处理

以上的模型在ImageNet上训练时，对图像使用的预处理是

channe的顺序是 RGB（不是BGR）
tensor的维度顺序是 C ✖️ H ✖️ W
像素值归一化到0~1
减去均值：[0.485, 0.456, 0.406]
除以标准差：[0.229, 0.224, 0.225]

原文

All pre-trained models expect input images normalized in the same way, i.e. mini-batches of 3-channel RGB images of shape (3 x H x W), where H and W are expected to be at least 224. The images have to be loaded in to a range of [0, 1] and then normalized using mean = [0.485, 0.456, 0.406] and std = [0.229, 0.224, 0.225]. You can use the following transform to normalize:

参考：https://pytorch.org/vision/stable/models.html#classification