Skip to content

quantize : fix precedence of cli args#6541

Merged
ggerganov merged 1 commit intomasterfrom
gg/quantize-args
Apr 8, 2024
Merged

quantize : fix precedence of cli args#6541
ggerganov merged 1 commit intomasterfrom
gg/quantize-args

Conversation

@ggerganov
Copy link
Copy Markdown
Member

Increase precedence of --token-embedding-type and --output-tensor-type.
Allows for example to use --token-embedding-type without specifying --pure.

@ggerganov ggerganov merged commit b73e564 into master Apr 8, 2024
@ggerganov ggerganov deleted the gg/quantize-args branch April 8, 2024 13:23
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants