vulkan: disable mmvq on Intel Windows driver #20672
No worries, it's good to get some performance data from Intel Windows, and to see it working decently well.
Some additional info: disabling MMVQ also helps with stability, avoiding driver errors on Windows (ARC-140V).
Hi @zero-one-soft, which driver version do you have?
Hi @savvadesogle, I also noticed that if I use the SYCL binaries, the benchmark on gpt-oss-20b now runs even worse. I get about 4 tokens/s.
@zero-one-soft ps
@zero-one-soft You can try another command in the new CMD
The iGPU will not be used automatically unless no dGPU is available. It is not necessary to hide it.
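The rule described above can be illustrated with a minimal sketch. This is not the actual llama.cpp device-selection code: the `Gpu` struct, the `GpuType` enum, and the `default_devices` function are hypothetical names used only to show the idea that integrated GPUs serve as a fallback when no discrete GPU is present.

```cpp
#include <string>
#include <vector>

// Hypothetical types for illustration; not taken from ggml-vulkan.
enum class GpuType { Discrete, Integrated };

struct Gpu {
    std::string name;
    GpuType type;
};

// Returns the devices that would be used by default under the stated rule:
// all discrete GPUs if any exist, otherwise whatever is available
// (which then includes the integrated GPU).
std::vector<Gpu> default_devices(const std::vector<Gpu> & all) {
    std::vector<Gpu> discrete;
    for (const Gpu & gpu : all) {
        if (gpu.type == GpuType::Discrete) {
            discrete.push_back(gpu);
        }
    }
    if (discrete.empty()) {
        return all;  // no dGPU found: fall back to the iGPU
    }
    return discrete;
}
```

Under this assumed rule, a machine with a B580 and an iGPU would use only the B580 by default, so hiding the iGPU has no effect on the default selection.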
Thanks Ruben, I didn't know that. Sorry for the misinformation. @0cc4m
@savvadesogle I have had situations where my iGPU and my CPU were also processing something, so offloading to the GPU is important. I tend to use -mg 0 in all of my commands just to make sure. I will try your advice about using other builds, or I will just use the smaller models. I was just curious why you managed to get such great performance on your A770 while my B580 struggles with that model :)
@zero-one-soft But I managed to get a little more (#17628 (comment)). Come to our OpenArc community Discord (https://github.com/SearchSavior/OpenArc); we have people there who have the B580 and B50/60, and if you need to, you can ask Jianyu (maintainer of the SYCL backend) directly about the SYCL backend. It covers OpenVINO, llama.cpp, vLLM, etc. on Intel GPUs.
I think this kind of crash is called a TDR (Timeout Detection and Recovery). It also happens with my ARC-140 on Windows. It happens less with MMVQ disabled, but I still had some. It seems to stop only if I disable COOPMAT. For sure it's an issue with the Intel driver. I'm not sure if there are other ways to avoid it.
@gustrd
It seems to me that my crashes happen more with the bigger models than with the smaller ones. Link State Power Management is off on my machine, is that correct?
@savvadesogle, thank you! I really believe you pinpointed the issue. After changing the power management setting, I spent a whole day of intensive inference without a single TDR. It should be published somewhere.
* vulkan: disable mmvq on Intel Windows driver
* improve comment

Fixes #17628
@savvadesogle This disables MMVQ entirely on Intel Windows, which should remove the need to use the env var. Please try it.
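The decision described above can be sketched as follows. This is a simplified illustration, not the code from the PR: `Vendor`, `HostOs`, `DeviceInfo`, and `use_mmvq` are hypothetical names, and the `env_disabled` parameter stands in for whatever environment variable was previously used as the workaround.

```cpp
// Hypothetical types for illustration; not the actual ggml-vulkan definitions.
enum class Vendor { Intel, Amd, Nvidia, Other };
enum class HostOs { Windows, Linux, Other };

struct DeviceInfo {
    Vendor vendor;
    HostOs os;
};

// Returns true when the MMVQ path may be used for this device.
// env_disabled models a user-set override (assumed, name not specified here).
bool use_mmvq(const DeviceInfo & dev, bool env_disabled) {
    if (env_disabled) {
        return false;  // manual override still wins
    }
    if (dev.vendor == Vendor::Intel && dev.os == HostOs::Windows) {
        return false;  // disabled unconditionally on the Intel Windows driver
    }
    return true;
}
```

With a check of this shape, Intel Windows users get the workaround automatically, while Intel on Linux and other vendors keep the MMVQ path unless it is explicitly disabled.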