05676942b7
Tomesd now uses q instead of x to decide which tokens to merge because it seems to give better results. |
||
---|---|---|
.. | ||
diffusionmodules | ||
distributions | ||
encoders | ||
attention.py | ||
ema.py | ||
sub_quadratic_attention.py |