Summary:
Size mismatch for embed_out.weight: copying a param with shape torch.Size([0]) from checkpoint - Huggingface PyTorch
The error "Size mismatch for embed_out.weight: copying a param with shape torch.Size([0]) from checkpoint" typically occurs when loading a pretrained model with Hugging Face's Transformers library: the shape of one or more parameters in the model being constructed does not match the shape of the corresponding parameter stored in the pretrained checkpoint.
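For illustration only, here is a minimal sketch of two common workarounds, assuming a standard Transformers causal LM (the model name below is a placeholder, not taken from the question): pass ignore_mismatched_sizes=True so from_pretrained skips parameters whose saved shapes differ from the freshly built model, and resize the embeddings if tokens were added to the tokenizer before fine-tuning.

from transformers import AutoTokenizer, AutoModelForCausalLM

# Placeholder checkpoint; "embed_out" is a GPT-NeoX-style parameter name,
# so a Pythia model is used here purely as an example
model_name = "EleutherAI/pythia-70m"

tokenizer = AutoTokenizer.from_pretrained(model_name)

# Skip loading any parameter whose checkpoint shape does not match the model
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    ignore_mismatched_sizes=True,
)

# If new tokens were added before fine-tuning, bring the input embedding
# (and any tied output head) back in line with the tokenizer's vocabulary size
model.resize_token_embeddings(len(tokenizer))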
Problem background:
I want to fine-tune an LLM. I am able to fine-tune it successfully, but when I reload the model after saving it, I get an error. Below is the code:
import argparse
import numpy as np
import torch
from datasets import load_dataset
from transformers import AutoTokenizer, AutoModelForCausalLM
from trl import DPOTrainer