examples : fix benchmark-matmult #1554

ggerganov · 2023-05-21T14:02:25Z

The precision for Q4_0 has degraded since #1508
Use Q4_1 instead

The precision for Q4_0 has degraded since #1508

stsydow

I tested for Q5.1 and it seem to work as well.

cebtenzzre · 2023-09-20T03:57:02Z

Could we merge this? I was going to fix the use of abs instead of std::abs in the delta calculation and add a few missing static qualifiers, but there's no point if the example doesn't even run successfully.

The precision for Q4_0 has degraded since ggml-org#1508

examples : fix benchmark-matmult

b16c085

The precision for Q4_0 has degraded since #1508

stsydow approved these changes May 21, 2023

View reviewed changes

ggerganov merged commit d119c04 into master Sep 20, 2023

ggerganov deleted the fix-benchmark-matmult branch September 20, 2023 07:02

pkrmf pushed a commit to morlockstudios-com/llama.cpp that referenced this pull request Sep 26, 2023

examples : fix benchmark-matmult (ggml-org#1554)

487e22a

The precision for Q4_0 has degraded since ggml-org#1508

Bearsaerker mentioned this pull request Mar 12, 2025

Eval bug: Gemma 3 extremly slow prompt processing when using quantized kv cache. #12352

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

examples : fix benchmark-matmult #1554

examples : fix benchmark-matmult #1554

Uh oh!

ggerganov commented May 21, 2023

Uh oh!

stsydow left a comment

Uh oh!

cebtenzzre commented Sep 20, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

examples : fix benchmark-matmult #1554

examples : fix benchmark-matmult #1554

Uh oh!

Conversation

ggerganov commented May 21, 2023

Uh oh!

stsydow left a comment

Choose a reason for hiding this comment

Uh oh!

cebtenzzre commented Sep 20, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants