
cuda: Bring DMMV_F16 and KQUANTS_ITER Makefile flags over from llama.

f28decf
Merged

Fix hordeconfig max context setting, and add Makefile flags for cuda F16/KQuants per iter. #252

