Support conversion of Qwen3-Embedding models#15023
Conversation
ggerganov
left a comment
There was a problem hiding this comment.
Thanks for looking into this - it would be great to get this model supported.
|
Ooops, was a bit trigger happy, forgot @iamlemec care to add the model there in a new PR? Edit: It has to go in |
|
@iamlemec Did you ever speak to Qwen about the official GGUFs? Are they okay to use in their current form or would they have to be updated? |
|
@Mushoz looks like they uploaded new GGUFs to HF! Just tested the 0.6B version and it works as expected with correct pooling. |
|
@iamlemec which ones are you talking about? I am looking at this one: https://huggingface.co/Qwen/Qwen3-Embedding-0.6B-GGUF/tree/main Same holds true for the 8b version: https://huggingface.co/Qwen/Qwen3-Embedding-8B-GGUF/tree/main |
|
@Mushoz Huh, you're right. Maybe I missed that update before and was still using the old ones? Anyway, the ones you linked to have the pooling type properly specified and seem to work as expected! |
This adds GGUF conversion support for the Qwen3-Embedding class of models and makes pooling work properly by default. I'm not sure how the official Qwen GGUFs were produced, but I'll try to get them updated if this gets merged. More details:
qwen2as usual, the HF tokenizer adds an EOT token that makes the checksum different from the usual text generation modelsmodel.like most other models in this class