DeepSeek-R1 Output Contains </think> but No <think> [Solution]

Original article · last modified 2025-07-21 16:23:53

License: CC 4.0 BY-SA

First published 2025-07-21 16:21:34

To address the missing <think> tag in DeepSeek-R1 model output, the steps below show how to edit the tokenizer_config.json file and explain why the change works:

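Path notes:
A model directory usually contains the model weights (e.g. .bin or .safetensors), the tokenizer files (e.g. tokenizer.json), and the configuration files (e.g. config.json and tokenizer_config.json).
For example: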
```
/path/to/model/deepseek-r1/
├── config.json
├── tokenizer.json
├── tokenizer_config.json
└── pytorch_model.bin
```
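
Before editing, it can help to confirm that the directory really contains the file. The following is a minimal sketch in Python; the helper name `find_tokenizer_config` is hypothetical and not part of any library, and the directory you pass in is whatever your own model path is.

```python
from pathlib import Path


def find_tokenizer_config(model_dir: str) -> Path:
    """Return the path to tokenizer_config.json inside model_dir.

    Raises FileNotFoundError if the directory or the file is missing.
    """
    config_path = Path(model_dir) / "tokenizer_config.json"
    if not config_path.is_file():
        raise FileNotFoundError(f"tokenizer_config.json not found in {model_dir}")
    return config_path
```

Calling `find_tokenizer_config("/path/to/model/deepseek-r1")` returns the file path when the layout matches the tree above, and fails early otherwise.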

Example command (back up the file before editing):

cp /path/to/model/deepseek-r1/tokenizer_config.json /path/to/model/deepseek-r1/tokenizer_config.json.bak
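
If you prefer to script the backup, for example on a system without `cp`, the same step can be sketched in Python. The helper name `backup_file` is hypothetical. Unlike plain `cp`, this sketch refuses to overwrite an existing backup; that is a design choice of the example, not something the original command does.

```python
import shutil
from pathlib import Path


def backup_file(path: str, suffix: str = ".bak") -> Path:
    """Copy path to path + suffix and return the backup path.

    Refuses to overwrite an existing backup, so the first
    unmodified copy is never lost by running the script twice.
    """
    source = Path(path)
    backup = source.with_name(source.name + suffix)
    if backup.exists():
        raise FileExistsError(f"backup already exists: {backup}")
    # copy2 also preserves timestamps and permission bits where possible.
    shutil.copy2(source, backup)
    return backup
```

Restoring is the reverse copy: `shutil.copy2(backup, source)`.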

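Recommended tools:
Open the file with a text editor (e.g. VS Code, Notepad++) or a command-line editor (e.g. vim, nano).
Locating the key fields:
Look for fields that contain the <think> tag. They usually appear in the following places: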
```
{
  "additional_special_tokens": ["<think>\\n", "<other_token>"],
  "chat_template": "<|begin_of_text|>{% for message in messages %}...{% endfor %}"
}
```
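- Check in particular:
  - additional_special_tokens: the list of additional special tokens, which may contain "<think>\\n".
  - chat_template: the conversation template, which may force a specific format into the prompt.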
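
The two checks above can also be done programmatically. This is a rough sketch using only the standard `json` module; the function names `load_config` and `find_think_fields` are hypothetical, and it assumes the entries of `additional_special_tokens` are plain strings as in the example above (other entry shapes are skipped).

```python
import json


def load_config(path: str) -> dict:
    """Read tokenizer_config.json into a dict."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def find_think_fields(config: dict, needle: str = "<think>") -> dict:
    """Report where needle appears in the two fields of interest."""
    tokens = config.get("additional_special_tokens") or []
    # Assumption: entries are plain strings; anything else is ignored.
    matching_tokens = [t for t in tokens if isinstance(t, str) and needle in t]
    template = config.get("chat_template")
    in_template = isinstance(template, str) and needle in template
    return {
        "additional_special_tokens": matching_tokens,
        "chat_template": in_template,
    }
```

`find_think_fields(load_config(".../tokenizer_config.json"))` returns the matching special-token entries and a boolean telling you whether the chat template mentions the tag, which shows which of the two fields needs attention.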
Modification:
- Remove the newline from <think>\\n:
  Change "<think>\\n" to "<think>" so that the tokenizer does not force a newline after the tag during generation.
```
{
  "additional_special_tokens": ["<think>", "<other_token>"]
}
```
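- Remove it entirely (this is what I did):
  If the model does not explicitly depend on this field, you can simply delete the entry that contains <think>\\n.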
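
Both options can be applied with a short script instead of a manual edit. The sketch below is an illustration under stated assumptions, not a definitive tool: the names `strip_think_newline` and `patch_file` are hypothetical, entries are assumed to be plain strings, and only `additional_special_tokens` is touched (the `chat_template` field is left unchanged). It treats both a real newline and a literal backslash-n after the tag as the target, since either can result depending on how the file was escaped.

```python
import json
from pathlib import Path

# Handle both spellings: a real newline after the tag, and the literal
# backslash + "n" that results when the file contains "<think>\\n".
SUFFIXES = ("\n", "\\n")


def strip_think_newline(config: dict, drop: bool = False) -> dict:
    """Return a copy of config with the '<think>' + newline entry fixed.

    drop=False rewrites the entry to '<think>' (first option above);
    drop=True removes the entry entirely (second option above).
    """
    targets = {"<think>" + suffix for suffix in SUFFIXES}
    fixed = dict(config)
    tokens = config.get("additional_special_tokens")
    if isinstance(tokens, list):
        new_tokens = []
        for token in tokens:
            if isinstance(token, str) and token in targets:
                if not drop:
                    new_tokens.append("<think>")
            else:
                new_tokens.append(token)
        fixed["additional_special_tokens"] = new_tokens
    return fixed


def patch_file(path: str, drop: bool = False) -> None:
    """Apply strip_think_newline to the JSON file at path, in place."""
    config_path = Path(path)
    config = json.loads(config_path.read_text(encoding="utf-8"))
    fixed = strip_think_newline(config, drop=drop)
    config_path.write_text(
        json.dumps(fixed, ensure_ascii=False, indent=2) + "\n",
        encoding="utf-8",
    )
```

Run `patch_file(".../tokenizer_config.json", drop=True)` to mirror the deletion approach, and make the backup first so the change can be reverted. Note that the script rewrites the file with two-space indentation, so unrelated lines may show up in a diff.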