ggml-cuda: flush legacy pool on OOM and retry #22155
+23
−2
Merged
Loading