Skip to content

(merge) bounds checking for input prefix#492

Merged
anzz1 merged 1 commit intomasterfrom
patch-prefix-arg-bounds
Mar 25, 2023
Merged

(merge) bounds checking for input prefix#492
anzz1 merged 1 commit intomasterfrom
patch-prefix-arg-bounds

Conversation

@anzz1
Copy link
Copy Markdown
Contributor

@anzz1 anzz1 commented Mar 25, 2023

@anzz1 anzz1 merged commit e899bf5 into master Mar 25, 2023
@anzz1 anzz1 deleted the patch-prefix-arg-bounds branch March 25, 2023 12:42
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026
* iq1_s_r4: CUDA dequantize

* iq1_s_r4: CUDA GEMV

* iq1_s_r4: MMQ on CUDA

Requires Turing or better (will fall back to dequantize+cuBLAS on older cards).

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants