Tutorial: Using YuE, an Open-Source Music Generation Project


郜垒富Maddox


Article link: https://blog.youkuaiyun.com/gitblog_00648/article/details/146584091


YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open. Project repository: https://gitcode.com/gh_mirrors/yue/YuE

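1. Project Overview

YuE (乐) is an open-source foundation model for music generation that turns lyrics into complete songs. It supports multiple languages and musical styles, and it can generate full songs several minutes long that include both vocals and accompaniment. The model is notable for the diversity, flexibility, and quality of its output, which makes it suitable for a wide range of music creation scenarios.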

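2. Quick Start

The steps below get YuE up and running.

First, make sure your environment has the following dependencies:

Python 3.8 or later
CUDA 11.8 or later
PyTorch, torchvision, and torchaudio
FlashAttention 2

Set up the environment: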

conda create -n yue python=3.8
conda activate yue
conda install pytorch torchvision torchaudio cudatoolkit=11.8 -c pytorch -c nvidia
pip install -r <(curl -sSL https://raw.githubusercontent.com/multimodal-art-projection/YuE/main/requirements.txt)
pip install flash-attn --no-build-isolation
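
Once the commands above have finished, a quick preflight check can confirm that the listed dependencies are visible to the active interpreter. The following is a minimal sketch, not part of YuE itself: the helper names `python_ok` and `missing_modules` are hypothetical, and the import name `flash_attn` is assumed to be the module provided by the `flash-attn` package. It only checks that the modules can be found; it does not verify the CUDA version.

```python
import importlib.util
import sys

# Assumption: these are the import names of the packages listed in this tutorial.
REQUIRED_MODULES = ("torch", "torchvision", "torchaudio", "flash_attn")
MIN_PYTHON = (3, 8)


def python_ok(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version meets the minimum."""
    return tuple(version_info[:2]) >= minimum


def missing_modules(names=REQUIRED_MODULES):
    """Return the module names that cannot be located in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    print("Python >= 3.8:", python_ok())
    print("Missing modules:", missing_modules() or "none")
```

Run it inside the activated `yue` environment; any name reported as missing points to an installation step that needs to be repeated.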

Download the code and models:

git lfs install
git clone https://github.com/multimodal-art-projection/YuE.git
cd YuE/inference/
git clone https://huggingface.co/m-a-p/xcodec_mini_infer

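Run inference (start from the directory that contains the YuE checkout; if you are still in YuE/inference/ from the previous step, skip the cd line):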

cd YuE/inference/
python infer.py \
--cuda_idx 0 \
--stage1_model m-a-p/YuE-s1-7B-anneal-en-cot \
--stage2_model m-a-p/YuE-s2-1B-general \
--genre_txt ../prompt_egs/genre.txt \
--lyrics_txt ..

If needed, you can adjust --stage2_batch_size to speed up inference, but watch out for out-of-memory errors.
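
One way to handle the trade-off between batch size and memory is to start with a larger --stage2_batch_size and halve it whenever a run fails. The sketch below illustrates that idea under stated assumptions: the helpers `build_infer_command`, `batch_size_schedule`, and `run_with_fallback` are hypothetical and not part of YuE, only the flags shown in this tutorial are used, and any nonzero exit status is treated as an out-of-memory failure, which is a simplification. The lyrics path is left as a parameter because it is not given in full above.

```python
import subprocess


def build_infer_command(genre_txt, lyrics_txt, stage2_batch_size, cuda_idx=0):
    """Assemble the infer.py argument list from the flags shown in this tutorial."""
    return [
        "python", "infer.py",
        "--cuda_idx", str(cuda_idx),
        "--stage1_model", "m-a-p/YuE-s1-7B-anneal-en-cot",
        "--stage2_model", "m-a-p/YuE-s2-1B-general",
        "--genre_txt", genre_txt,
        "--lyrics_txt", lyrics_txt,
        "--stage2_batch_size", str(stage2_batch_size),
    ]


def batch_size_schedule(start):
    """Yield start, start // 2, ... down to 1."""
    size = max(1, int(start))
    while True:
        yield size
        if size == 1:
            return
        size //= 2


def run_with_fallback(genre_txt, lyrics_txt, start_batch_size, runner=subprocess.run):
    """Retry with smaller batch sizes until a run exits with status 0.

    Simplification: every nonzero exit status is assumed to be an
    out-of-memory failure. Returns the batch size that worked, or None.
    """
    for size in batch_size_schedule(start_batch_size):
        result = runner(build_infer_command(genre_txt, lyrics_txt, size))
        if result.returncode == 0:
            return size
    return None
```

Call `run_with_fallback` from inside YuE/inference/ so that `infer.py` and the relative prompt paths resolve. In practice you would also inspect the error output before retrying, since a failure unrelated to memory will not be fixed by a smaller batch.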