Conversation

@NeoZhangJianyu (Collaborator) commented Mar 6, 2024

  1. Add wait() calls to make the code stable.
  2. Use fp32 for the oneMKL gemm_batch calls for better performance (see the sketch below this list).
  3. Add a debug function.
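
For context, here is a minimal, hypothetical sketch (not taken from this PR's diff) of what items 1 and 2 amount to: a batched GEMM issued in fp32 through oneMKL's strided USM gemm_batch API, followed by an explicit queue.wait() so the host does not read results before the device work has finished. The function name, leading dimensions, and strides below are illustrative assumptions, not the backend's actual code.

```cpp
// Hypothetical sketch: fp32 batched GEMM via oneMKL, then wait() for stability.
#include <sycl/sycl.hpp>
#include <oneapi/mkl.hpp>
#include <cstdint>

// Illustrative helper (not from the PR): C_i = A_i * B_i for each batch entry,
// with all matrices stored contiguously in column-major order in USM memory.
static void gemm_batch_fp32_example(sycl::queue &q,
                                    const float *a, const float *b, float *c,
                                    std::int64_t m, std::int64_t n, std::int64_t k,
                                    std::int64_t batch) {
    const std::int64_t lda = m, ldb = k, ldc = m;
    const std::int64_t stride_a = m * k, stride_b = k * n, stride_c = m * n;

    // fp32 batched GEMM: C = 1.0f * A * B + 0.0f * C for each batch entry.
    oneapi::mkl::blas::column_major::gemm_batch(
        q,
        oneapi::mkl::transpose::nontrans, oneapi::mkl::transpose::nontrans,
        m, n, k,
        1.0f, a, lda, stride_a,
              b, ldb, stride_b,
        0.0f, c, ldc, stride_c,
        batch);

    // Wait for the submitted device work to finish before the host touches C.
    q.wait();
}
```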

Current performance reference:

GPU: 1 Arc 770
OS: ubuntu 22.04
Param: -mg 0 -sm none
Model: llama-2-7b.Q4_0.gguf

Avg: 30.66 tokens per second

NeoZhangJianyu requested a review from airMeng on Mar 6, 2024 at 03:33
@airMeng (Contributor) left a comment

Better to leave some performance data for future reference.

@NeoZhangJianyu (Collaborator, Author) commented

> Better to leave some performance data for future reference.

Yes, updated it in the first comment.

@airMeng (Contributor) commented Mar 6, 2024

> Better to leave some performance data for future reference.
>
> Yes, updated it in the first comment.

Do you have comparisons before and after?

NeoZhangJianyu merged commit 8ced9f7 into ggml-org:master on Mar 6, 2024
hazelnutcloud pushed a commit to hazelnutcloud/llama.cpp that referenced this pull request Mar 10, 2024
NeoZhangJianyu added a commit to NeoZhangJianyu/llama.cpp that referenced this pull request Mar 12, 2024
jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Mar 13, 2024