When TTS is enabled, the current streaming behavior will display the text as it comes in, 8 tokens at a time, but will only be passed to TTS when the entire render is finished. This request is for a switch to enable sending the packets to TTS as they become available, as well. I realize for certain very small models this could cause some kind of overflow but the feature is meant to be used with discretion and not meant to be robust.
When TTS is enabled, the current streaming behavior will display the text as it comes in, 8 tokens at a time, but will only be passed to TTS when the entire render is finished. This request is for a switch to enable sending the packets to TTS as they become available, as well. I realize for certain very small models this could cause some kind of overflow but the feature is meant to be used with discretion and not meant to be robust.