Chinese-Llama-2-7b Open-Source Project Tutorial

Latest recommended article published on 2024-12-26 12:11:41

张俊领Tilda

License: CC 4.0 BY-SA

Article link: https://blog.youkuaiyun.com/gitblog_01194/article/details/141046074

Chinese-Llama-2-7b: the open-source community's first downloadable, runnable Chinese LLaMA2 model. Project repository: https://gitcode.com/gh_mirrors/ch/Chinese-Llama-2-7b

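Project Introduction

Chinese-Llama-2-7b is a fully open-source, commercially usable Chinese version of the Llama2 model. The project builds on Meta's Llama-2: it extends and optimizes the Chinese vocabulary and runs incremental pretraining on large-scale Chinese data, which improves the model's basic semantic understanding of Chinese. Besides the base model, the project provides Chinese and English SFT datasets. The input format strictly follows the llama-2-chat format, so the model is compatible with all optimizations built for the original llama-2-chat model.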
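
The llama-2-chat input format mentioned above can be sketched as a small helper. This is a minimal sketch, assuming the single-turn layout used by the original llama-2-chat models; the function name `build_llama2_prompt` and its parameters are hypothetical and not part of the project.

```python
def build_llama2_prompt(user_message: str, system_prompt: str = "") -> str:
    """Wrap one user turn in llama-2-chat markers (hypothetical helper)."""
    if system_prompt:
        # The system message sits inside <<SYS>> ... <</SYS>> at the start of the turn.
        return f"[INST] <<SYS>>\n{system_prompt}\n<</SYS>>\n\n{user_message} [/INST]"
    # Without a system message, only the [INST] ... [/INST] markers are needed.
    return f"[INST] {user_message} [/INST]"


# Example usage with an assumed system message.
print(build_llama2_prompt("你好！", system_prompt="You are a helpful assistant."))
```

Keeping the format in one function means the same markers are applied every time a prompt is built, rather than being retyped by hand.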

Quick Start

Environment Setup

Make sure the following dependencies are installed:

Python 3.7 or later
PyTorch 1.10 or later
The transformers library
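
The dependency list above can be checked with a short script before installing anything else. This is a minimal sketch: the helper name `check_environment` is hypothetical, and it only verifies the Python version and whether the packages can be found. It does not compare the installed PyTorch version against 1.10.

```python
import importlib.util
import sys


def check_environment(min_python=(3, 7), packages=("torch", "transformers")):
    """Report interpreter version and package availability (hypothetical helper)."""
    report = {"python_ok": sys.version_info >= min_python}
    for name in packages:
        # find_spec returns None when a top-level package cannot be located.
        report[name] = importlib.util.find_spec(name) is not None
    return report


# Prints a dict such as {"python_ok": ..., "torch": ..., "transformers": ...}.
print(check_environment())
```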

Installation Steps

Clone the project repository:
```
git clone https://github.com/LinkSoul-AI/Chinese-Llama-2-7b.git
cd Chinese-Llama-2-7b
```

Install the required Python packages:
```
pip install -r requirements.txt
```

Quick Test

The following simple example shows how to load the model and generate text:

```python
from transformers import AutoTokenizer, AutoModelForCausalLM, TextStreamer

model_path = "LinkSoul/Chinese-Llama-2-7b"
# Request the slow (non-fast) tokenizer explicitly.
tokenizer = AutoTokenizer.from_pretrained(model_path, use_fast=False)
# Load the weights, cast them to fp16, and move the model to the GPU.
model = AutoModelForCausalLM.from_pretrained(model_path).half().cuda()

# Stream tokens to stdout as they are generated, without echoing the prompt.
streamer = TextStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)

# llama-2-chat prompt: system message inside <<SYS>>, user turn inside [INST] ... [/INST].
# The user turn asks, in Chinese, for a translation of "Hello, how are you?".
instruction = """[INST] <<SYS>>\nYou are a helpful, respectful and honest assistant. Always answer as helpfully as possible while being safe. Your answers should not include any harmful, unethical, racist, sexist, toxic, dangerous, or illegal content. Please ensure that your responses are socially unbiased and positive in nature.

If a question does not make any sense, or is not factually coherent, explain why instead of answering something not correct. If you don't know the answer to a question, please don't share false information.
<</SYS>>

你好，你能帮我翻译一下“Hello, how are you?”这句话吗？ [/INST]"""

inputs = tokenizer(instruction, return_tensors="pt").to("cuda")
# The streamer already prints the reply, so a second decode-and-print would duplicate it.
output = model.generate(**inputs, max_new_tokens=512, streamer=streamer)
```

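Use Cases and Best Practices

Text Generation

Chinese-Llama-2-7b can be applied to a range of text generation scenarios, such as chatbots, content creation, and translation. Here is a simple example: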

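```python
from transformers import AutoTokenizer, pipeline

# Defined here so the snippet runs on its own.
model_path = "LinkSoul/Chinese-Llama-2-7b"
tokenizer = AutoTokenizer.from_pretrained(model_path, use_fast=False)
generator = pipeline("text-generation", model=model_path, tokenizer=tokenizer)
# llama-2-chat markers around the question; max_new_tokens limits only the reply.
prompt = "[INST] 你好，你能帮我翻译一下“Hello, how are you?”这句话吗？ [/INST]"
result = generator(prompt, max_new_tokens=50)
print(result[0]["generated_text"])
```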
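
By default, the `text-generation` pipeline returns the prompt followed by the completion in `generated_text`. The sketch below shows one way to keep only the reply. The helper name `extract_reply` is hypothetical, and the strings are stand-ins for illustration, not real model output.

```python
def extract_reply(generated_text: str, prompt: str) -> str:
    """Return only the continuation after the prompt (hypothetical helper)."""
    if generated_text.startswith(prompt):
        # Drop the echoed prompt from the front of the pipeline output.
        generated_text = generated_text[len(prompt):]
    return generated_text.strip()


# Stand-in strings for illustration only; this is not real model output.
prompt = "[INST] Say hi [/INST]"
fake_output = prompt + " Hi there!"
print(extract_reply(fake_output, prompt))  # prints "Hi there!"
```

Passing `return_full_text=False` when calling the pipeline should have a similar effect without any post-processing.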