Add support for multilingual-e5-small: Fixes #123 #190
Ya-shh wants to merge 8 commits into qdrant:main from Ya-shh:e5_small
Conversation
|
@Anush008 could you please review this? |
|
@Anush008 The test in fastembed/tests/test_text_onnx_embeddings.py fails and appears to need different canonical vector values. I think we need to adjust that test, as we hit a similar issue previously with the e5-large-instruct model (#181); when I updated those values, all tests passed. Here's the logged error from that instance: This model is already converted to ONNX on HF (https://huggingface.co/intfloat/multilingual-e5-small/tree/main/onnx), so the canonical vector values for this model can't be incorrect. |
|
The CI logs a different issue though, due to fastembed/fastembed/text/e5_onnx_embedding.py, lines 45 to 50 in f340a73. |
|
Yes, I agree with you. Interestingly, after updating the
|
|
I think the issue might be due to the |
canonical vector values: https://colab.research.google.com/drive/1GEGTzXB7ccakTOTQcRYL3JmInf-E3Q54#scrollTo=spJiCcNWs6-O.
onnx model: https://huggingface.co/intfloat/multilingual-e5-small/tree/main/onnx
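
For anyone following along, here is a minimal sketch of the kind of check a canonical-vector test performs: compare the leading dimensions of a computed embedding against stored reference values within an absolute tolerance. The helper name `matches_canonical`, the tolerance of 1e-3, and all numbers below are assumptions for illustration; they are not taken from fastembed's test file or from the model's real output.

```python
import math
from typing import Sequence


def matches_canonical(
    embedding: Sequence[float],
    canonical_prefix: Sequence[float],
    atol: float = 1e-3,  # assumed tolerance, not fastembed's actual setting
) -> bool:
    """Return True if the first len(canonical_prefix) dimensions of
    `embedding` equal `canonical_prefix` within absolute tolerance `atol`."""
    if len(embedding) < len(canonical_prefix):
        # An embedding shorter than the reference prefix cannot match.
        return False
    return all(
        math.isclose(value, expected, rel_tol=0.0, abs_tol=atol)
        for value, expected in zip(embedding, canonical_prefix)
    )


# Placeholder numbers for illustration only, not real model output.
canonical_prefix = [0.0312, -0.0147, 0.0521]
embedding = [0.0314, -0.0149, 0.0519, 0.0800, -0.0200]

print(matches_canonical(embedding, canonical_prefix))
```

If a check like this fails only after the pooling or normalization step changes, that would point at the embedding post-processing rather than at the stored reference values.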