Skip to content

Q2_K_HIFI updates#38

Merged
geoffmunn merged 9 commits intomasterfrom
Q2_K_HIFI_v2
Mar 2, 2026
Merged

Q2_K_HIFI updates#38
geoffmunn merged 9 commits intomasterfrom
Q2_K_HIFI_v2

Conversation

@geoffmunn
Copy link
Owner

Q2_K_HIFI working but it sucks

…on and matrix multiplication kernels. Define necessary constants and update type handling for Q2_K_HIFI in ggml-metal files.
…instead of INT8 residual corrections. Update related structures and functions for improved outlier handling and precision recovery during quantization and dequantization processes.
…er-first and residual modes. Update related functions and structures for improved handling of outlier corrections and precision recovery during quantization and dequantization processes.
… Update comments to clarify model size thresholds and the rationale for excluding FFN projections from high-fidelity upgrades, ensuring better performance without compromising model quality.
@geoffmunn geoffmunn merged commit 1a6936d into master Mar 2, 2026
18 of 75 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant