Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TTS reading rhythm improvement #635

Open
zsogitbe opened this issue Aug 17, 2024 · 0 comments
Open

TTS reading rhythm improvement #635

zsogitbe opened this issue Aug 17, 2024 · 0 comments

Comments

@zsogitbe
Copy link

I am looking for a way to insert silence with different duration into the output of the TTS audio similar to how humans read. Based on the text we pause sometimes longer and not just read every word. The best way would be if we could insert a special token into the text like [SILENCE], this could, for example, mean 100ms silence, if repeated twice [SILENCE][SILENCE] means 200ms silence, etc. This would make the audio naturally sound.

Is this already implemented in some way? Thank you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant