Infini-Megrez 开源项目教程-优快云博客

本文链接：https://blog.youkuaiyun.com/gitblog_00624/article/details/147379643

Infini-Megrez 开源项目教程

Infini-Megrez 项目地址: https://gitcode.com/gh_mirrors/in/Infini-Megrez

1. 项目介绍

Infini-Megrez 是由无问芯穹（Infinigence AI）研发的开源项目，旨在通过软硬协同理念，打造一款极速推理、小巧精悍、极易上手的端侧智能解决方案。项目包含了 Megrez-3B、Megrez-3B-Instruct 和 Megrez-3B-Omni 等模型，这些模型在图像理解、语言理解和语音理解等方面具有出色的性能。

2. 项目快速启动

环境准备

在开始之前，请确保您的环境中安装了以下依赖：

Python 3.6 或更高版本
PyTorch
Transformers

您可以使用以下命令安装 PyTorch 和 Transformers：

pip install torch transformers

模型加载与推理

以下是使用 Megrez-3B-Omni 进行图文交互的一个简单示例：

import torch
from transformers import AutoModelForCausalLM

# 模型路径，请替换为实际路径
path = "{{PATH_TO_PRETRAINED_MODEL}}"

# 加载模型
model = AutoModelForCausalLM.from_pretrained(
    path,
    trust_remote_code=True,
    torch_dtype=torch.bfloat16,
    attn_implementation="flash_attention_2",
).eval().cuda()

# 定义消息格式
messages = [
    {
        "role": "user",
        "content": {
            "text": "Please describe the content of the image.",
            "image": "./data/sample_image.jpg",
        },
    }
]

# 推理
MAX_NEW_TOKENS = 100
response = model.chat(messages, sampling=False, max_new_tokens=MAX_NEW_TOKENS, temperature=0)

# 输出结果
print(response)

请确保将 {{PATH_TO_PRETRAINED_MODEL}} 替换为实际的模型路径，并将 ./data/sample_image.jpg 替换为实际图片路径。