Skip to content

[Inference] Refactor modeling attention layer by abstracting attention backends#5771

Merged
char-1ee merged 6 commits intohpcaitech:mainfrom
char-1ee:refactor/modeling
Jun 10, 2024
Merged

[Inference] Refactor modeling attention layer by abstracting attention backends#5771
char-1ee merged 6 commits intohpcaitech:mainfrom
char-1ee:refactor/modeling

Commits

Commits on Jun 7, 2024

Commits on Jun 10, 2024