Skip to content

[Kernels]Updated Triton kernels into 2.1.0 and adding flash-decoding for llama token attention #4965

Merged
CjhHa1 merged 17 commits intohpcaitech:mainfrom
tiandiao123:lcq_decoding_flash
Oct 30, 2023
Merged

[Kernels]Updated Triton kernels into 2.1.0 and adding flash-decoding for llama token attention #4965
CjhHa1 merged 17 commits intohpcaitech:mainfrom
tiandiao123:lcq_decoding_flash

Commits

Commits on Oct 24, 2023

Commits on Oct 25, 2023

Commits on Oct 27, 2023

Commits on Oct 29, 2023

Commits on Oct 30, 2023