Q5: Slightly faster AVX2 implementation #1197

Merged
ggerganov merged 1 commit into ggml-org:master from sw:q5-avx
Apr 26, 2023

@sw (Contributor) commented Apr 26, 2023

Addendum to #1187: use _mm256_shuffle_epi8 and AND/OR operations instead of a 256-entry lookup table.
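
To illustrate the idea, here is a minimal sketch of how 32 packed high bits could be expanded and merged into 32 low nibbles with one byte shuffle plus AND/OR, with no lookup table. It is not the code merged in this PR: the helper name `q5_merge_high_bits`, its signature, and the mapping of bit i to value i are assumptions made for the example. A scalar fallback is included so the file also builds when AVX2 is not enabled.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>
#ifdef __AVX2__
#include <immintrin.h>
#endif

// Hypothetical helper, not taken from the PR: combine 32 low nibbles
// (lo[i] in 0..15) with 32 packed high bits (bit i of the 4 bytes at qh)
// into 5-bit values out[i] = lo[i] | (bit_i << 4).
// The bit-to-value mapping is an assumption made for illustration.
static void q5_merge_high_bits(const uint8_t *qh, const uint8_t *lo, uint8_t *out) {
    assert(qh != NULL && lo != NULL && out != NULL);
#ifdef __AVX2__
    uint32_t bits;
    memcpy(&bits, qh, sizeof bits);  // unaligned-safe 32-bit load

    // Broadcast the 4 source bytes, then replicate source byte k into output
    // bytes 8k..8k+7. _mm256_shuffle_epi8 indexes within each 128-bit lane,
    // which is fine here because every lane holds a full copy of `bits`.
    const __m256i shuf = _mm256_set_epi64x(
        0x0303030303030303LL, 0x0202020202020202LL,
        0x0101010101010101LL, 0x0000000000000000LL);
    __m256i v = _mm256_shuffle_epi8(_mm256_set1_epi32((int)bits), shuf);

    // AND keeps only bit (i % 8) in byte i; the compare turns that into
    // 0xFF (bit set) or 0x00 (bit clear).
    const __m256i sel = _mm256_set1_epi64x((long long)0x8040201008040201ULL);
    v = _mm256_cmpeq_epi8(_mm256_and_si256(v, sel), sel);

    // AND down to the fifth-bit position, then OR onto the low nibbles.
    v = _mm256_and_si256(v, _mm256_set1_epi8(0x10));
    v = _mm256_or_si256(v, _mm256_loadu_si256((const __m256i *)lo));
    _mm256_storeu_si256((__m256i *)out, v);
#else
    // Scalar fallback with the same result, used when AVX2 is not enabled.
    for (int i = 0; i < 32; i++) {
        uint8_t bit = (uint8_t)((qh[i / 8] >> (i % 8)) & 1u);
        out[i] = (uint8_t)(lo[i] | (bit << 4));
    }
#endif
}
```

The vector path is only compiled when the compiler defines `__AVX2__` (for example with `-mavx2` on GCC or Clang). Compared with a 256-entry table, this approach needs only a few constant vectors, at the cost of a handful of extra logic instructions.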

@ggerganov merged commit 0b2da20 into ggml-org:master on Apr 26, 2023
@sw deleted the q5-avx branch on April 27, 2023 16:21
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026