Hugging 的 Trainer 训练多输入网络遇到的问题 The batch received was empty, your model won‘t be able to train on it

最新推荐文章于 2025-04-02 11:01:07 发布

悄悄地努力

最新推荐文章于 2025-04-02 11:01:07 发布

阅读量1.7k

点赞数 3

CC 4.0 BY-SA版权

分类专栏： hugging bug 解决文章标签： batch 开发语言

本文链接：https://blog.youkuaiyun.com/weixin_46034990/article/details/137646098

文章讲述了使用HuggingFace的Trainer训练自定义网络时遇到的问题，即由于自动移除未使用的列导致空批次。通过设置`remove_unused_columns=False`解决了这个问题，提醒用户在训练参数中注意保留所有列以保证训练过程顺利。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

背景描述

使用 Hugging 的 Trainer 训练自定义网络，自定义网络包括多个输入，如下图所示：
在这里插入图片描述

代码详见 HuggingFace 自定义数据集，使用 Trainer 训练，多个输入，定义多流网络

错误描述

root@55esp9ftlj3h2-0:/codes/ssm-gaze-estimation/swin_v2_headpose# python3 train.py 
/usr/local/lib/python3.8/dist-packages/accelerate/accelerator.py:436: FutureWarning: Passing the following arguments to `Accelerator` is deprecated and will be removed in version 1.0 of Accelerate: dict_keys(['dispatch_batches']). Please pass an `accelerate.DataLoaderConfiguration` instead: 
dataloader_config = DataLoaderConfiguration(dispatch_batches=None)
  warnings.warn(
Detected kernel version 3.10.0, which is below the recommended minimum of 5.5.0; this can cause the process to hang. It is recommende