Skip to content

support/fix more IQ OPs#6521

Merged
NeoZhangJianyu merged 1 commit intoggml-org:masterfrom
NeoZhangJianyu:support_more_IQ_ops
Apr 7, 2024
Merged

support/fix more IQ OPs#6521
NeoZhangJianyu merged 1 commit intoggml-org:masterfrom
NeoZhangJianyu:support_more_IQ_ops

Conversation

@NeoZhangJianyu
Copy link
Copy Markdown
Contributor

  1. support/fix the more OPs:
  • GGML_TYPE_IQ4_NL
  • GGML_TYPE_IQ4_XS
  • GGML_TYPE_IQ3_XXS
  • GGML_TYPE_IQ3_S
  • GGML_TYPE_IQ2_XXS
  • GGML_TYPE_IQ2_XS
  • GGML_TYPE_IQ2_S
  • GGML_TYPE_IQ1_S
  • GGML_TYPE_IQ1_M.
  1. fix the all fault Unit-test cases.
    except:
    • all MUL_MAT_ID cases, due to the interface is changed.
    • MUL_MAT(type_a=iq4_nl,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1]): [MUL_MAT] NMSE = 0.292746658 > 0.000500000 FAIL
      above fault cases will be fixed later.

…, GGML_TYPE_IQ3_S, GGML_TYPE_IQ2_XXS, GGML_TYPE_IQ2_XS, GGML_TYPE_IQ2_S, GGML_TYPE_IQ1_S, GGML_TYPE_IQ1_M
@NeoZhangJianyu NeoZhangJianyu merged commit d4f220a into ggml-org:master Apr 7, 2024
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
…, GGML_TYPE_IQ3_S, GGML_TYPE_IQ2_XXS, GGML_TYPE_IQ2_XS, GGML_TYPE_IQ2_S, GGML_TYPE_IQ1_S, GGML_TYPE_IQ1_M (ggml-org#6521)
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026
…, GGML_TYPE_IQ3_S, GGML_TYPE_IQ2_XXS, GGML_TYPE_IQ2_XS, GGML_TYPE_IQ2_S, GGML_TYPE_IQ1_S, GGML_TYPE_IQ1_M (ggml-org#6521)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants