TTS reading rhythm improvement #635

zsogitbe · 2024-08-17T07:45:33Z

I am looking for a way to insert silences of varying duration into the TTS audio output, similar to how humans read. When reading aloud, we pause for longer at certain points in the text rather than speaking every word at the same pace. The best approach would be a special token that can be inserted into the text, such as [SILENCE]. A single token could mean, for example, 100 ms of silence, and repeating it twice, [SILENCE][SILENCE], would mean 200 ms, and so on. This would make the audio sound more natural.

Is this already implemented in some way? Thank you.
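
To make the idea concrete, here is a minimal Python sketch of what I have in mind. It is not tied to any existing API: `split_on_silence`, `silence_samples`, `render`, and the `synthesize` callback are all hypothetical names, and the 100 ms per token and the 22050 Hz sample rate are just assumed example values. The text is split at runs of [SILENCE] tokens, each text part is passed to the TTS engine, and zero-valued samples are inserted in between.

```python
import re

SILENCE_TOKEN = "[SILENCE]"
SILENCE_MS = 100  # assumed duration of one token, as in the example above


def split_on_silence(text):
    """Split text into ("speech", str) and ("silence", milliseconds) segments."""
    # The capturing group keeps each run of tokens in the result of re.split.
    pattern = re.compile(r"((?:\[SILENCE\])+)")
    segments = []
    for part in pattern.split(text):
        if not part.strip():
            continue  # skip empty or whitespace-only pieces
        if part.startswith(SILENCE_TOKEN):
            count = part.count(SILENCE_TOKEN)
            segments.append(("silence", count * SILENCE_MS))
        else:
            segments.append(("speech", part.strip()))
    return segments


def silence_samples(duration_ms, sample_rate=22050):
    """Return a list of zero-valued samples lasting duration_ms."""
    return [0.0] * (duration_ms * sample_rate // 1000)


def render(text, synthesize, sample_rate=22050):
    """Concatenate synthesized speech and silence into one sample list.

    `synthesize` is a placeholder for the real TTS call: it takes a string
    and returns a sequence of float samples at `sample_rate`.
    """
    audio = []
    for kind, value in split_on_silence(text):
        if kind == "silence":
            audio.extend(silence_samples(value, sample_rate))
        else:
            audio.extend(synthesize(value))
    return audio
```

For example, `split_on_silence("Hello.[SILENCE][SILENCE]World")` yields a speech segment, a 200 ms silence, and a second speech segment. Doing the split outside the model keeps the pause length exact, at the cost of synthesizing each part separately, which may affect intonation across the pause.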
