See explanation here: https://github.com/ggerganov/llama.cpp/pull/439
See explanation here: #439