mtmd: qwen3-asr wrong output #22343
Conversation
|
Closing this as you did not disclose AI usage, and this is not the intended way to fix the issue. |
|
The code comments are clearly written by AI |
|
Your fix cannot be accepted as-is because it introduces a hack into the existing preprocessor; it is not clean and could potentially break other models. The cgraph implementation is redundant because (1) you can use the 4th dim for batching and (2) you replaced the old build_vit with what is essentially an over-complicated equivalent of it. But overall, I do not wish to proceed with contributions where AI is used but not properly disclosed. |
|
qwen3a can be implemented as mtmd_audio_preprocessor_qwen3a, but it was merged into the whisper preprocessor. mtmd: qwen3 audio support (qwen3-omni and qwen3-asr) (#19441) tools/mtmd/mtmd.cpp : |
Overview
qwen3-asr output is wrong with llama.cpp