ImportError: This modeling file requires the following packages that were not found in your environment: flash_attn. — flash-attn installed successfully, yet importing it throws an error
Problem description:
Running Llama2 requires flash-attn. Find the matching whl file on https://github.com/Dao-AILab/flash-attention/releases and install it.
Using nvidia-smi, pip show torch, and pip debug, the environment was identified as CUDA 12.2, torch 2.1.2, and cp310-cp310-manylinux_2_35_x86_64.
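For reference, a quick sketch of these checks (the exact output differs per machine; pip debug needs --verbose to list the compatible wheel tags):
nvidia-smi                # driver and CUDA version (12.2 here)
pip show torch            # installed torch version (2.1.2 here)
pip debug --verbose       # compatible tags, e.g. cp310-cp310-manylinux_2_35_x86_64
python -c "import torch; print(torch.__version__, torch.version.cuda)"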
Therefore, the matching whl file parameters are cu12, torch2.1, and cp310-cp310-linux_x86_64.
Run the following install command:
pip install https://github.com/Dao-AILab/flash-attention/releases/download/v2.7.1.post4/flash_attn-2.7.1.post4+cu12torch2.1cxx11abiTRUE-cp310-cp310-linux_x86_64.whl --no-build-isolation
pip list confirms that flash-attn is installed.

So flash-attn installed successfully, yet importing it fails with:
ImportError: This modeling file requires the following packages that were not found in your environment: flash_attn.
undefined symbol: _ZN3c105ErrorC2ENS_14SourceLocationENSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE
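The underlying failure can be reproduced directly, without going through the model code (a minimal check; the exact traceback will vary):
python -c "import flash_attn"
# ImportError: ... undefined symbol: _ZN3c105ErrorC2ENS_14SourceLocation...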
Solution:
Originally downloaded file: flash_attn-2.7.1.post4+cu12torch2.1cxx11abiTRUE-cp310-cp310-linux_x86_64.whl
If the whl filename contains abiTRUE, this error occurs. The undefined symbol points to a C++11 ABI mismatch: the abiTRUE wheel was built with the new C++11 ABI, while the PyTorch in this environment (the official pip wheel of torch 2.1) was built with the old ABI, so the torch symbol the extension links against does not exist. Install the whl file containing abiFALSE instead.
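Which variant matches can be checked from the installed torch itself: torch.compiled_with_cxx11_abi() reports the ABI torch was built with (True → pick cxx11abiTRUE, False → pick cxx11abiFALSE):
python -c "import torch; print(torch.compiled_with_cxx11_abi())"
# prints False in this environment, so the abiFALSE wheel is the right one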
Uninstall the existing flash-attn:
pip uninstall flash-attn
Replacement whl file to download: flash_attn-2.7.1.post4+cu12torch2.1cxx11abiFALSE-cp310-cp310-linux_x86_64.whl
pip install https://github.com/Dao-AILab/flash-attention/releases/download/v2.7.1.post4/flash_attn-2.7.1.post4+cu12torch2.1cxx11abiFALSE-cp310-cp310-linux_x86_64.whl --no-build-isolation
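After reinstalling, a quick sanity check confirms the import now works (a minimal sketch; flash_attn_func is part of the package's public API):
python -c "import flash_attn; from flash_attn import flash_attn_func; print(flash_attn.__version__)"
# should print 2.7.1.post4 with no ImportError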