This is a strange case. The access to Because the unused threads are not synchronized at the end of the computation, what happens is that Making It's also worth noting that this is a very common case: when fully offloading a model to the GPU, the CPU will only be used with 1 thread, but the threadpool will still be created with the full number of threads. It would be very desirable for the additional threads to sleep on a mutex rather than constantly spin looking for work. cc @fmz @max-krasnyansky, any suggestions?
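To make the "sleep instead of spin" idea concrete, here is a minimal C++ sketch. It is not ggml's threadpool: `SleepingPool`, `run`, and `worker_loop` are hypothetical names, and it assumes a single caller thread submits work. Workers block on a condition variable; a worker whose index is not below `n_active` simply goes back to sleep, and the caller only returns once every active worker has finished, so no thread is left unsynchronized at the end of the computation.

```cpp
#include <cassert>
#include <condition_variable>
#include <functional>
#include <mutex>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical illustration only; names and structure are assumptions.
class SleepingPool {
public:
    explicit SleepingPool(int n_threads) {
        assert(n_threads > 0);
        for (int i = 0; i < n_threads; ++i) {
            workers_.emplace_back([this, i] { worker_loop(i); });
        }
    }

    ~SleepingPool() {
        {
            std::lock_guard<std::mutex> lock(mutex_);
            stop_ = true;
        }
        work_cv_.notify_all();
        for (auto & t : workers_) {
            t.join();
        }
    }

    // Run job(i) on workers 0..n_active-1 and block until all of them finish.
    // Assumes only one thread calls run() at a time.
    void run(int n_active, std::function<void(int)> job) {
        assert(n_active >= 0 && n_active <= static_cast<int>(workers_.size()));
        std::unique_lock<std::mutex> lock(mutex_);
        job_      = std::move(job);
        n_active_ = n_active;
        pending_  = n_active;
        ++generation_;
        work_cv_.notify_all();
        // Sleep until every active worker has reported completion.
        done_cv_.wait(lock, [this] { return pending_ == 0; });
    }

private:
    void worker_loop(int index) {
        unsigned long seen = 0; // last generation this worker has observed
        std::unique_lock<std::mutex> lock(mutex_);
        for (;;) {
            // Blocks in the OS instead of spinning while there is no new work.
            work_cv_.wait(lock, [&] { return stop_ || generation_ != seen; });
            if (stop_) {
                return;
            }
            seen = generation_;
            if (index >= n_active_) {
                continue; // unused thread for this job: go back to sleep
            }
            std::function<void(int)> job = job_; // copy while holding the lock
            lock.unlock();
            job(index);
            lock.lock();
            if (--pending_ == 0) {
                done_cv_.notify_one();
            }
        }
    }

    std::mutex mutex_;
    std::condition_variable work_cv_;
    std::condition_variable done_cv_;
    std::function<void(int)> job_;
    unsigned long generation_ = 0;
    int n_active_ = 0;
    int pending_ = 0;
    bool stop_ = false;
    std::vector<std::thread> workers_;
};
```

With a pool of 4 threads, `pool.run(1, job)` executes `job(0)` on one worker while the other three stay blocked. The trade-off is wake-up latency: a condition-variable wake is typically slower than a spinning thread noticing new work, which is presumably why a pool might poll in the first place.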
@slaren @ggerganov
Superseded by #9461
On Mac without OpenMP, I get the following thread sanitizer error:
This patch fixes it, though I'm not sure whether there is a better solution.