【大模型】return _sentencepiece.SentencePieceProcessor_LoadFromFile(self, arg) TypeError: not a string

【大模型】return _sentencepiece.SentencePieceProcessor_LoadFromFileself, arg TypeError: not a string

错误信息

运行大模型 Qwen1.5-14B-Chat 出现如下错误:

Traceback (most recent call last):
  File "/abc/llm/./transformers_low_bit_pipeline.py", line 44, in <module>
    tokenizer = LlamaTokenizer.from_pretrained(model_path, trust_remote_code=True)
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/abc/.conda/envs/llm/lib/python3.11/site-packages/transformers/tokenization_utils_base.py", line 2048, in from_pretrained
    return cls._from_pretrained(
           ^^^^^^^^^^^^^^^^^^^^^
  File "/abc/.conda/envs/llm/lib/python3.11/site-packages/transformers/tokenization_utils_base.py", line 2287, in _from_pretrained
    tokenizer = cls(*init_inputs, **init_kwargs)
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/abc/.conda/envs/llm/lib/python3.11/site-packages/transformers/models/llama/tokenization_llama.py", line 182, in __init__
    self.sp_model = self.get_spm_processor(kwargs.pop("from_slow", False))
                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/abc/.conda/envs/llm/lib/python3.11/site-packages/transformers/models/llama/tokenization_llama.py", line 209, in get_spm_processor
    tokenizer.Load(self.vocab_file)
  File "/abc/.conda/envs/llm/lib/python3.11/site-packages/sentencepiece/__init__.py", line 961, in Load
    return self.LoadFromFile(model_file)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/abc/.conda/envs/llm/lib/python3.11/site-packages/sentencepiece/__init__.py", line 316, in LoadFromFile
    return _sentencepiece.SentencePieceProcessor_LoadFromFile(self, arg)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: not a string

重点信息

这行代码:

    tokenizer = LlamaTokenizer.from_pretrained(model_path, trust_remote_code=True)

报错:

  File "/abc/.conda/envs/llm/lib/python3.11/site-packages/sentencepiece/__init__.py", line 316, in LoadFromFile
    return _sentencepiece.SentencePieceProcessor_LoadFromFile(self, arg)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: not a string

解决方法

代码:

    from transformers import LlamaTokenizer, TextGenerationPipeline

    ...
    
    tokenizer = LlamaTokenizer.from_pretrained(model_path, trust_remote_code=True)
    

修改为:

    from transformers import LlamaTokenizer, TextGenerationPipeline, AutoTokenizer

    ...
    
    tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

验证

再次运行模型,正常!!

Loading remote code failed: model, No module named 'model' Traceback (most recent call last): File "/root/ModelZoo-PyTorch/ACL_PyTorch/built-in/audio/SenseVoice/TorchAir/FunASR/infer.py", line 28, in <module> model = AutoModel(model=args.model_path, trust_remote_code=True, device="npu", fp16=True, disable_update=True) File "/root/ModelZoo-PyTorch/ACL_PyTorch/built-in/audio/SenseVoice/TorchAir/FunASR/funasr/auto/auto_model.py", line 125, in __init__ model, kwargs = self.build_model(**kwargs) File "/root/ModelZoo-PyTorch/ACL_PyTorch/built-in/audio/SenseVoice/TorchAir/FunASR/funasr/auto/auto_model.py", line 225, in build_model tokenizer = tokenizer_class(**tokenizer_conf) File "/root/ModelZoo-PyTorch/ACL_PyTorch/built-in/audio/SenseVoice/TorchAir/FunASR/funasr/tokenizer/sentencepiece_tokenizer.py", line 23, in __init__ self._build_sentence_piece_processor() File "/root/ModelZoo-PyTorch/ACL_PyTorch/built-in/audio/SenseVoice/TorchAir/FunASR/funasr/tokenizer/sentencepiece_tokenizer.py", line 32, in _build_sentence_piece_processor self.sp.load(self.bpemodel) File "/usr/local/lib/python3.10/dist-packages/sentencepiece/__init__.py", line 961, in Load return self.LoadFromFile(model_file) File "/usr/local/lib/python3.10/dist-packages/sentencepiece/__init__.py", line 316, in LoadFromFile return _sentencepiece.SentencePieceProcessor_LoadFromFile(self, arg) RuntimeError: Internal: could not parse ModelProto from ./SenseVoiceSmall/chn_jpn_yue_eng_ko_spectok.bpe.model [ERROR] 2025-12-03-10:35:37 (PID:1316002, Device:-1, RankID:-1) ERR99999 UNKNOWN application exception
最新发布
12-04
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

szZack

你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值