FlashAttention2 安装；报错 RuntimeError: FlashAttention only supports Ampere GPUs or newer.

作者：小蓝xlanll | 2024-03-20 03:14:28

踩

runtimeerror: flashattention only supports ampere gpus or newer.

cuda12.0环境；pytorch 2.1.2+cu118；transformers 4.38.0

pip install flash-attn --no-build-isolation --use-pep517 
1

在这里插入图片描述

参考：
https://github.com/Dao-AILab/flash-attention
FlashAttention2暂时不支持 T卡，后续支持，如果要使用先用1.X版本

声明：本文内容由网友自发贡献，不代表【wpsshop博客】立场，版权归原作者所有，本站不承担相应法律责任。如您发现有侵权的内容，请联系我们。转载请注明出处：https://www.wpsshop.cn/w/小蓝xlanll/article/detail/270264