@NeoZhangJianyu
Collaborator

  1. Support/fix ops for more types:
  • GGML_TYPE_IQ4_NL
  • GGML_TYPE_IQ4_XS
  • GGML_TYPE_IQ3_XXS
  • GGML_TYPE_IQ3_S
  • GGML_TYPE_IQ2_XXS
  • GGML_TYPE_IQ2_XS
  • GGML_TYPE_IQ2_S
  • GGML_TYPE_IQ1_S
  • GGML_TYPE_IQ1_M
  2. Fix all failing unit-test cases, except:
    • all MUL_MAT_ID cases, because the interface has changed.
    • MUL_MAT(type_a=iq4_nl,type_b=f32,m=16,n=1,k=256,bs=[1,1],nr=[1,1]): [MUL_MAT] NMSE = 0.292746658 > 0.000500000 FAIL
      The failing cases above will be fixed later.
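
For readers unfamiliar with the `NMSE = 0.292746658 > 0.000500000 FAIL` line: it reports a normalized mean squared error between a backend's output and a reference output, compared against a maximum allowed value. Below is a minimal sketch of that kind of check. It assumes NMSE is defined as sum((ref - out)^2) / sum(ref^2); the function names `nmse` and `passes` are hypothetical and are not taken from the test harness. Only the 0.0005 threshold comes from the log line above.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Normalized mean squared error of `out` against the reference `ref`.
// Assumed definition: sum((ref - out)^2) / sum(ref^2).
// The result is undefined (division by zero) if `ref` is all zeros.
double nmse(const std::vector<float> & ref, const std::vector<float> & out) {
    assert(ref.size() == out.size());
    double err  = 0.0;  // accumulated squared error
    double norm = 0.0;  // accumulated squared reference magnitude
    for (std::size_t i = 0; i < ref.size(); ++i) {
        const double r = static_cast<double>(ref[i]);
        const double d = r - static_cast<double>(out[i]);
        err  += d * d;
        norm += r * r;
    }
    return err / norm;
}

// Hypothetical pass/fail helper: the case passes when the error does not
// exceed the threshold. The default 0.0005 mirrors the value in the log line.
bool passes(const std::vector<float> & ref, const std::vector<float> & out,
            double max_nmse = 0.0005) {
    return nmse(ref, out) <= max_nmse;
}
```

Under this definition, the reported value of roughly 0.29 means the squared error is about 29% of the squared magnitude of the reference output, far above the 0.0005 limit, which points to a wrong result rather than ordinary rounding noise.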

…, GGML_TYPE_IQ3_S, GGML_TYPE_IQ2_XXS, GGML_TYPE_IQ2_XS, GGML_TYPE_IQ2_S, GGML_TYPE_IQ1_S, GGML_TYPE_IQ1_M

2 participants