Smol Models Tutorial

smollm: Everything about the SmolLM2 and SmolVLM family of models. Project address: https://gitcode.com/gh_mirrors/smo/smollm

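1. Project Introduction

Smol Models is a family of efficient, lightweight AI models from Hugging Face for text and vision tasks. The project's goal is to build compact models that run efficiently on-device while still delivering strong performance.

2. Quick Start

The basic steps for getting started with the SmolLM2 and SmolVLM models are shown below.

SmolLM2 Quick Start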

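from transformers import AutoModelForCausalLM, AutoTokenizer

# Choose the model checkpoint
checkpoint = "HuggingFaceTB/SmolLM2-1.7B-Instruct"

# Load the tokenizer and the model (weights are downloaded on first use)
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint)

# Build the conversation
messages = [{
    "role": "user",
    "content": "Write a 100-word article on 'Benefits of Open-Source in AI research'"
}]

# Apply the chat template to get the prompt text;
# add_generation_prompt=True opens the assistant turn for the model to complete
input_text = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

# Tokenize the prompt, generate a reply, and decode it
# (max_new_tokens=200 is an illustrative value, not one from the project)
inputs = tokenizer.encode(input_text, return_tensors="pt")
outputs = model.generate(inputs, max_new_tokens=200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))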
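
To make the apply_chat_template step less opaque, the following is a minimal pure-Python sketch of what rendering a message list into a single prompt string can look like. It assumes ChatML-style `<|im_start|>` / `<|im_end|>` markers. The real template is stored with the checkpoint's tokenizer and may differ (for example, it may insert a default system prompt), so `render_chatml` is a hypothetical helper for illustration only, not the library's implementation.

```python
from typing import Dict, List

# Assumed ChatML-style markers; the real template ships with the tokenizer.
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def render_chatml(messages: List[Dict[str, str]],
                  add_generation_prompt: bool = False) -> str:
    """Hypothetical helper: flatten chat messages into one prompt string."""
    parts = []
    for message in messages:
        # Each turn is wrapped as: <start>role\ncontent<end>\n
        parts.append(f"{IM_START}{message['role']}\n{message['content']}{IM_END}\n")
    if add_generation_prompt:
        # Open an assistant turn so the model continues as the assistant.
        parts.append(f"{IM_START}assistant\n")
    return "".join(parts)


messages = [{"role": "user", "content": "Hello"}]
prompt = render_chatml(messages, add_generation_prompt=True)
print(prompt)
```

The sketch also shows why the generation prompt matters: without the trailing assistant header, the prompt ends on a closed user turn and the model has no explicit cue to answer as the assistant.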

SmolVLM Quick Start

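from PIL import Image
from transformers import AutoProcessor, AutoModelForVision2Seq

# Load the processor and the model
processor = AutoProcessor.from_pretrained("HuggingFaceTB/SmolVLM-Instruct")
model = AutoModelForVision2Seq.from_pretrained("HuggingFaceTB/SmolVLM-Instruct")

# Load the image to describe ("example.jpg" is a placeholder path)
image = Image.open("example.jpg")

# Build the conversation; {"type": "image"} marks where the image goes
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image"},
            {"type": "text", "text": "What is in this image?"}
        ]
    }
]

# Render the messages into a prompt, then let the processor combine text and image.
# Passing `messages` directly to processor(...) does not work: it expects text and images.
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=prompt, images=[image], return_tensors="pt")

# Generate and decode the answer (max_new_tokens=200 is an illustrative value)
generated_ids = model.generate(**inputs, max_new_tokens=200)
print(processor.batch_decode(generated_ids, skip_special_tokens=True)[0])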
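
In the message layout above, the {"type": "image"} entry is only a placeholder: the image itself is supplied separately, so the number of placeholders should match the number of images handed to the processor. The sketch below checks this in plain Python before any model call. `count_image_slots` is a hypothetical helper introduced here for illustration, and the `images` list holds a stand-in string where a real call would pass PIL images.

```python
from typing import Any, Dict, List


def count_image_slots(messages: List[Dict[str, Any]]) -> int:
    """Hypothetical helper: count {"type": "image"} entries across all messages."""
    total = 0
    for message in messages:
        content = message.get("content", [])
        if isinstance(content, str):
            continue  # plain-text message, no image placeholders
        total += sum(1 for item in content if item.get("type") == "image")
    return total


messages = [
    {
        "role": "user",
        "content": [
            {"type": "image"},
            {"type": "text", "text": "What is in this image?"},
        ],
    }
]

# Stand-in for the list of images; a real call would pass PIL images here.
images = ["placeholder-for-a-PIL-image"]

n_slots = count_image_slots(messages)
if n_slots != len(images):
    raise ValueError(
        f"{n_slots} image placeholder(s) but {len(images)} image(s) supplied"
    )
print(n_slots)
```

Running a check like this up front tends to give a clearer error than a mismatch surfacing later inside the processor.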