How to enable experimental memory efficient attention on ROCm RDNA3.

This commit is contained in:
comfyanonymous 2024-11-29 06:18:56 -05:00
parent 82c5308561
commit 20a560eb97
1 changed files with 6 additions and 0 deletions

View File

@ -213,6 +213,12 @@ For 6700, 6600 and maybe other RDNA2 or older: ```HSA_OVERRIDE_GFX_VERSION=10.3.
For AMD 7600 and maybe other RDNA3 cards: ```HSA_OVERRIDE_GFX_VERSION=11.0.0 python main.py```
### AMD ROCm Tips
You can enable experimental memory efficient attention on pytorch 2.5 in ComfyUI on RDNA3 and potentially other AMD GPUs using this command:
```TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1 python main.py --use-pytorch-cross-attention```
# Notes
Only parts of the graph that have an output with all the correct inputs will be executed.