[SYCL] fix multi-gpu issue on sycl by ClarkChin08 · Pull Request #8554 · ggml-org/llama.cpp

ClarkChin08 · 2024-07-18T07:02:07Z

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

Fix the multi-GPU crash by filtering the SYCL platforms.

Signed-off-by: Chen Xi <xi2chen@intel.com>
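
For readers following along, here is a minimal sketch of what "filtering the platforms" can mean. It is not the code from this PR. `DeviceInfo` and `filter_by_default_platform` are hypothetical names, the platform labels are illustrative, and plain strings stand in for real `sycl::platform` / `sycl::device` queries so the example compiles without a SYCL toolchain. The assumed criterion (keep only GPUs on the same platform as the default device) is an illustration, since the thread does not spell out the exact rule.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for a SYCL device; a real implementation would query
// sycl::device / sycl::platform instead of carrying plain strings.
struct DeviceInfo {
    std::string platform;  // illustrative label, e.g. "Level-Zero" or "OpenCL"
    std::string name;
    bool is_gpu;
};

// Keep only the GPUs that live on the same platform as the default device.
// The same physical GPU may be exposed by several platforms, so enumerating
// every platform would list it more than once.
std::vector<DeviceInfo> filter_by_default_platform(
        const std::vector<DeviceInfo>& all, std::size_t default_index) {
    assert(default_index < all.size());
    const std::string& wanted = all[default_index].platform;
    std::vector<DeviceInfo> kept;
    for (const DeviceInfo& d : all) {
        if (d.is_gpu && d.platform == wanted) {
            kept.push_back(d);
        }
    }
    return kept;
}
```

With two GPUs visible through both platforms, passing the index of a Level-Zero device keeps only the two Level-Zero entries, so each physical card appears once in the device list used for layer splitting.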

airMeng · 2024-07-18T07:25:50Z

@luoyu-intel

airMeng · 2024-07-18T07:30:30Z

@ClarkChin08 can you attach the measurement results? For example, llama3-70B on 8 GPUs: memory consumption on each GPU and performance.

ClarkChin08 · 2024-07-18T07:52:07Z

This is the llama2-70b memory consumption data and the performance data:
0. run command: ./build/bin/llama-cli -m ../llama-2-70b-chat.Q4_0.gguf -p "how to build a website?" -n 400 -e -ngl 81 -sm layer

1. memory consumption

2. performance

Signed-off-by: Chen Xi <xi2.chen@intel.com>

ClarkChin08 · 2024-07-24T04:06:55Z

This is the new performance table with input=6 and output=32

OuadiElfarouki

All good, thank you!

---------

Signed-off-by: Chen Xi <xi2chen@intel.com>
Co-authored-by: Meng, Hengyu <hengyu.meng@intel.com>

fix multi-gpu issue on sycl

cd296fe

Signed-off-by: Chen Xi <xi2chen@intel.com>

github-actions bot added the SYCL (https://en.wikipedia.org/wiki/SYCL - GPU programming language) label Jul 18, 2024

ClarkChin08 changed the title ~~fix multi-gpu issue on sycl~~ [SYCL] fix multi-gpu issue on sycl Jul 18, 2024

Chen Xi added 3 commits July 18, 2024 07:13

fix some typo

6b4f7b2

Signed-off-by: Chen Xi <xi2chen@intel.com>

remove unnecessary whitespace

d096de2

Signed-off-by: Chen Xi <xi2chen@intel.com>

file format issue

bd71cda

Signed-off-by: Chen Xi <xi2chen@intel.com>

ClarkChin08 mentioned this pull request Jul 18, 2024

Bug: [SYCL] Inference not working correctly on multiple GPUs #8294

Closed

airMeng requested review from OuadiElfarouki and airMeng July 18, 2024 07:25

sycl::queue can directly use as shared_ptr

6160a76

Signed-off-by: Chen Xi <xi2chen@intel.com>

OuadiElfarouki reviewed Jul 18, 2024

Comment thread ggml/src/ggml-sycl/dpct/helper.hpp Outdated

Comment thread ggml/src/ggml-sycl/dpct/helper.hpp Outdated

OuadiElfarouki reviewed Jul 18, 2024

Comment thread ggml/src/ggml-sycl/dpct/helper.hpp

ClarkChin08 added 2 commits July 24, 2024 03:58

fix the perf issue of multi-device

e4b86a1

Signed-off-by: Chen Xi <xi2.chen@intel.com>

fix intel mkl

22c72c5

Signed-off-by: Chen Xi <xi2.chen@intel.com>

luoyu-intel approved these changes Jul 24, 2024

fix format

f3db6d7

airMeng requested a review from OuadiElfarouki July 24, 2024 06:56

OuadiElfarouki approved these changes Jul 24, 2024

add doc for sycl multi-card

fcce873

Signed-off-by: Chen Xi <xi2.chen@intel.com>

github-actions bot added the documentation (Improvements or additions to documentation) label Jul 25, 2024

ClarkChin08 added 2 commits July 25, 2024 08:42

add linux part change on doc

8fe8086

Signed-off-by: Chen Xi <xi2.chen@intel.com>

fix typo

fc76684

Signed-off-by: Chen Xi <xi2.chen@intel.com>

airMeng approved these changes Jul 25, 2024

airMeng merged commit ed67bcb into ggml-org:master Jul 25, 2024

Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026

[SYCL] fix multi-gpu issue on sycl (ggml-org#8554)

03bb312

---------

Signed-off-by: Chen Xi <xi2chen@intel.com>
Co-authored-by: Meng, Hengyu <hengyu.meng@intel.com>

airMeng merged 11 commits into ggml-org:master from ClarkChin08:multi_device

6 participants
