Would love to see a faster, more memory efficient attention implemented like Flash Attention. :)
Would love to see a faster, more memory efficient attention implemented like Flash Attention. :)