Conversation
|
nice one , i am guessing that after this change |
|
It won't be needed after this change. With nocompute, it disabled the compute queue, so the backend used the graphics queue instead. That is what made it faster (but don't ask me why). This PR changes it to use the graphics queue even if a compute-only queue is available. |
|
I just installed this version and I am using llama-server with immersive translate addon to translate the web article for me. I set the addon to send an API request to the llama-server every 10 seconds just to make some translation. With this feature now my KDE desktop lag when the LLM is processing. I also tried playing youtube video on Firefox and it drop 30% of the frame while playing 1080p video. Does this happen on your side? Sorry I forgot to provide more info. My laptop is running AMD Ryzen AI 360 with 880M iGPU. |
|
I downloaded latest release and my tg/s dropped about 40% compared to release b8333 ! I use 2 Radeon cards 9070XT and 6700 XT, i would suggest removing changes from this update until further tested |
Linux or Windows? |
|
Windows |
* vulkan: use graphics queue on AMD for slightly better performance * disable async transfer queue on AMD
* vulkan: use graphics queue on AMD for slightly better performance * disable async transfer queue on AMD
* vulkan: use graphics queue on AMD for slightly better performance * disable async transfer queue on AMD
I'm not sure why, but the graphics queue is slightly faster in tg on AMD than the compute queue, and this also fixes the partial offload issue I fixed in #19976, so the second queue no longer has to be enabled by default. I got the idea from @zedbytes reporting that tg goes up when running with
RADV_DEBUG=nocompute.AMD RX 9070 XT
AMD Radeon Pro VII