[Inference] Refactor modeling attention layer by abstracting attention backends#5771
Merged
char-1ee merged 6 commits intohpcaitech:mainfrom Jun 10, 2024
Merged
[Inference] Refactor modeling attention layer by abstracting attention backends#5771char-1ee merged 6 commits intohpcaitech:mainfrom
char-1ee merged 6 commits intohpcaitech:mainfrom
Commits
Commits on Jun 7, 2024
Commits on Jun 10, 2024
- committed