Context
At the moment it is only possible to split after convertion or quantization. Mentionned by @Artefact2 in this [comment](https://github.com/ggerganov/llama.cpp/pull/6135#issuecomment-2003942162):
as an alternative, add the splitting logic directly to tools that produce ggufs, like convert.py and quantize.
Proposition
Include split options in convert*.py, support splits in quantize
Context
At the moment it is only possible to split after convertion or quantization. Mentionned by @Artefact2 in this
[comment](https://github.com/ggerganov/llama.cpp/pull/6135#issuecomment-2003942162):Proposition
Include split options in
convert*.py, support splits inquantize