Documentation fix: Quantum -> Quantized. #8666
Conversation
Adding Quantum to your LLM model name increases t/s by 20%.
Isn't "quantum" actually more appropriate than "quantized"? That's why we say "quantum physics" instead of "quantized physics".
Hmm, I'm not sure, but I think "quantized" here refers to quantization in signal processing, not in physics (or did I miss the joke? Let me know if I should remove my comment).
Heh, I wasn't joking - I genuinely think that "quantum model" is correct. Yes, the process of converting continuous values into discrete values is "quantization". In a similar way, in quantum mechanics, bound states are quantized to discrete quantities [0]. I'm drawing the analogy that quantum mechanics is modelling nature, similar to how LLMs are modelling language, and hence the same terminology should apply.
Hmm, OK, that makes sense. Thanks for the explanation. But mixing the terms quantum/quantized may be a bit confusing, so I think this PR is not redundant (just my opinion).
Interesting! While the ML community in general seems to use the word "quantized" (search "quantized" at: 1, 2, 3), I see the analogy @ggerganov is making. Thanks for the explanation! Maybe a short note about your analogy could be added somewhere in the two docs for the average user, for clarity. What do you think about that?
No, I disagree. In quantum physics, things like discrete energy levels are inherent properties. Models that have been compressed from 16-bit floats to low-precision integers, on the other hand, have been actively changed from their original state. I personally don't like or use the term "quantum model" because, in my opinion, it just sounds like a marketing buzzword, but ultimately I think the choice of term is inconsequential; to me this is on the same level as whether the first letter of a code comment should be capitalized and whether the line should end with punctuation.
ggerganov
left a comment
I agree that being consistent matters more than which specific term we use. So let's go with the established "quantized models".
* docfix: imatrix readme, quantum models -> quantized models.
* docfix: server readme: quantum models -> quantized models.
Description of changes
I just fixed a small spelling error in the imatrix readme and the server readme (specifically: quantum models -> quantized models). While it would be great to have support for Quantum Mechanics-based language models in the future 1, I am sure this refers to quantized models (models compressed by reducing the precision of their parameters).
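To make the terminology concrete, here is a minimal sketch of what "quantized" means in this sense: mapping float weights to discrete low-precision integers via a scale factor. This is illustrative only (a simple symmetric per-tensor int8 scheme), not llama.cpp's actual quantization formats such as the Q4_K family.

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Map float32 weights to discrete int8 levels with one scale per tensor."""
    scale = float(np.abs(weights).max()) / 127.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Reconstruct approximate float32 weights from the integer levels."""
    return q.astype(np.float32) * scale

w = np.array([0.02, -1.5, 0.7, 3.14], dtype=np.float32)
q, s = quantize_int8(w)
w_hat = dequantize(q, s)
# q holds discrete integer levels; w_hat approximates w,
# with per-element error bounded by half the scale step.
```

So a "quantized model" is one whose continuous-valued parameters have been actively converted to discrete levels like this, trading a small accuracy loss for much lower memory use.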
Why did I bother making such a small PR?
I refer to these documents (imatrix/readme, server/readme) very often. At first, I just ignored the misspellings. After a while, it became irritating. So here I am.