Skip to content

ggml : fix handling of zero blocks in IQ quants#7955

Merged
ggerganov merged 1 commit intomasterfrom
gg/ggml-fix-zero-blocks
Jun 16, 2024
Merged

ggml : fix handling of zero blocks in IQ quants#7955
ggerganov merged 1 commit intomasterfrom
gg/ggml-fix-zero-blocks

Conversation

@ggerganov
Copy link
Copy Markdown
Member

Allow for blocks full of 0s

@github-actions github-actions Bot added the ggml changes relating to the ggml tensor library for machine learning label Jun 16, 2024
@mofosyne mofosyne added the Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix label Jun 16, 2024
Comment thread ggml-quants.c
int num_neighbors = neighbours[0];
GGML_ASSERT(num_neighbors > 0);
float best_score = 0;
float best_score = -FLT_MAX;
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This function isn't used any more, but why negative FLT_MAX?

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Any negative value works, but used -FLT_MAX for consistency with usage of FLT_MAX elsewhere in the source

@ggerganov ggerganov merged commit cddaf02 into master Jun 16, 2024
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ggml changes relating to the ggml tensor library for machine learning Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants