Support latest (breaking) changes to GGML file format #61
setzer22 merged 5 commits into rustformers:main
Hi! Thanks a lot for the PR 😄
That's really strange. IIRC the only changes to the format are that now vocab scores are embedded in the model. Weights should be exactly the same. Can you make a test with
setzer22 left a comment:
Looks good! Thanks again for the changes. Can you sort out the merge conflict and the formatting / clippy issues from CI?
Once that's taken care of, we're ready to merge 😄
Yup, will double-check output vs. main branch on an older model to ensure all is okay.
Breaking changes in ggml:
- Scores added in vocabulary
- Format version added
- Magic updated
- Generalize u32, i32 and f32 reading without a boilerplate-y trait
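One way to read `u32`, `i32` and `f32` values without a dedicated trait is a single const-generic byte reader plus thin wrappers around the standard `from_le_bytes` constructors. The following is only a sketch of that idea, not necessarily the code in this PR: the function names are hypothetical, and little-endian encoding is assumed.

```rust
use std::io::{self, Read};

/// Read exactly `N` bytes from `reader` into a fixed-size array.
/// (Hypothetical helper; the name is illustrative.)
fn read_bytes<const N: usize, R: Read>(reader: &mut R) -> io::Result<[u8; N]> {
    let mut bytes = [0u8; N];
    reader.read_exact(&mut bytes)?;
    Ok(bytes)
}

/// Read a little-endian `u32` (endianness is an assumption of this sketch).
fn read_u32<R: Read>(reader: &mut R) -> io::Result<u32> {
    Ok(u32::from_le_bytes(read_bytes::<4, _>(reader)?))
}

/// Read a little-endian `i32`.
fn read_i32<R: Read>(reader: &mut R) -> io::Result<i32> {
    Ok(i32::from_le_bytes(read_bytes::<4, _>(reader)?))
}

/// Read a little-endian `f32`.
fn read_f32<R: Read>(reader: &mut R) -> io::Result<f32> {
    Ok(f32::from_le_bytes(read_bytes::<4, _>(reader)?))
}
```

Each wrapper is a one-liner, so adding another fixed-width type only needs one more function rather than a trait implementation.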
Yeah, looks good to me. @setzer22 can you approve the workflow?
Merged! Took care of some minor clippy lints myself. Thanks again :)
Latest ggml models have:
- Scores in the vocabulary
- A format version
- An updated magic
This PR maintains support for older ('legacy') models
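As a rough illustration of how a loader could accept both layouts, the sketch below checks the magic, reads a version field only for the newer format, and reads a per-token score only when the file carries one. Everything here is an assumption made for illustration: the magic constants, the type and function names, the length-prefixed token layout, and the `0.0` placeholder score for legacy files should all be checked against the ggml and llama-rs sources.

```rust
use std::io::{self, Read};

// Assumed magic values; verify against the ggml / llama.cpp sources.
const MAGIC_LEGACY: u32 = 0x6767_6d6c; // unversioned ("legacy") files
const MAGIC_VERSIONED: u32 = 0x6767_6d66; // followed by a u32 format version

/// Hypothetical description of which on-disk layout was detected.
#[derive(Debug, PartialEq)]
enum ModelFormat {
    Legacy,
    Versioned { version: u32 },
}

/// Read a little-endian `u32` (endianness is an assumption of this sketch).
fn read_u32<R: Read>(reader: &mut R) -> io::Result<u32> {
    let mut buf = [0u8; 4];
    reader.read_exact(&mut buf)?;
    Ok(u32::from_le_bytes(buf))
}

/// Inspect the magic and, for versioned files, the version that follows it.
fn read_format<R: Read>(reader: &mut R) -> io::Result<ModelFormat> {
    match read_u32(reader)? {
        MAGIC_LEGACY => Ok(ModelFormat::Legacy),
        MAGIC_VERSIONED => Ok(ModelFormat::Versioned {
            version: read_u32(reader)?,
        }),
        other => Err(io::Error::new(
            io::ErrorKind::InvalidData,
            format!("unknown magic {other:#x}"),
        )),
    }
}

/// Read one vocabulary entry: a length-prefixed byte string, plus an `f32`
/// score when the format embeds scores. Legacy files get a placeholder score.
fn read_token<R: Read>(reader: &mut R, format: &ModelFormat) -> io::Result<(Vec<u8>, f32)> {
    let len = read_u32(reader)? as usize;
    let mut bytes = vec![0u8; len];
    reader.read_exact(&mut bytes)?;
    let score = match format {
        ModelFormat::Legacy => 0.0, // placeholder: legacy files carry no scores
        ModelFormat::Versioned { .. } => {
            let mut buf = [0u8; 4];
            reader.read_exact(&mut buf)?;
            f32::from_le_bytes(buf)
        }
    };
    Ok((bytes, score))
}
```

Keeping the format decision in one enum means the rest of the loader can branch on it once per field instead of re-checking the magic.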
A model downloaded and converted on latest master in `llama.cpp` now works with `llama-rs`, but the output seems qualitatively worse. I haven't played around much with `llama-rs` though, so it's hard to say whether this is a regression.