Fix autoawq gemma convert by dengzheng-cloud · Pull Request #6704 · ggml-org/llama.cpp

dengzheng-cloud · 2024-04-16T13:07:49Z

when using autoawq to quantize gemma-2b-it model, output model-00001-of-00002.safetensors will include lm_head.weight, this pr could prevent convert error happening in

python convert-hf-to-gguf.py ${quantized_path} --outfile model.gguf

fix #6633

using autoawq to quantize gemma model will include a lm_head.weight tensor in model-00001-of-00002.safetensors. it result in this situation that convert-hf-to-gguf.py can't map lm_head.weight. skip loading this tensor could prevent this error.

ggerganov · 2024-04-16T17:12:38Z

+            if "lm_head.weight" in name:
+                    continue


Let's change this to full string match and also print a short message that the tensor is being skipped

already done in a new commit

change code to full string match and print a short message to inform users that lm_head.weight has been skipped.

* fix autoawq quantized gemma model convert error using autoawq to quantize gemma model will include a lm_head.weight tensor in model-00001-of-00002.safetensors. it result in this situation that convert-hf-to-gguf.py can't map lm_head.weight. skip loading this tensor could prevent this error. * change code to full string match and print necessary message change code to full string match and print a short message to inform users that lm_head.weight has been skipped. --------- Co-authored-by: Zheng.Deng <32841220+CUGfred@users.noreply.github.com>

dengzheng-cloud and others added 2 commits April 15, 2024 23:44

Merge branch 'ggerganov:master' into fix-awq-gemma-convert

0060ccd

ggerganov approved these changes Apr 16, 2024

View reviewed changes

change code to full string match and print necessary message

6d84a42

change code to full string match and print a short message to inform users that lm_head.weight has been skipped.

ggerganov merged commit facb8b5 into ggml-org:master Apr 16, 2024

dengzheng-cloud deleted the fix-awq-gemma-convert branch April 17, 2024 02:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix autoawq gemma convert#6704

Fix autoawq gemma convert#6704
ggerganov merged 3 commits intoggml-org:masterfrom
dengzheng-cloud:fix-awq-gemma-convert

dengzheng-cloud commented Apr 16, 2024

Uh oh!

ggerganov Apr 16, 2024

Uh oh!

dengzheng-cloud Apr 16, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dengzheng-cloud commented Apr 16, 2024

Uh oh!

ggerganov Apr 16, 2024

Choose a reason for hiding this comment

Uh oh!

dengzheng-cloud Apr 16, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants