ysay-hotkey

ysay (2023) - Output (read aloud) selected text with speech synthesis (piper) (in German) as an audio stream using a key combination.

Inspired by:

the use of xsel from the script xsay by Alex (https://www.youtube.com/watch?v=UjBtKRd7c34)
the procedure for voice output with piper from Thorsten-Voice (https://www.thorsten-voice.de/kostenloses-deutsches-text-to-speech-tts)

Requirements:

piper: https://github.com/rhasspy/piper/releases
espeak,xsel: sudo apt-get install espeak-ng xsel
speech files: https://huggingface.co/rhasspy/piper-voices/tree/main/de/de_DE (.onnx und onnx.json of the corresponding voice)

Installation:

copy the directory of the unpacked piper package to ~bin/ and place the language files in the piper directory (~/bin/piper/)
copy ysay and ysaykill to ~bin/
now assign the key combinations in the system settings, i have decided on Win+S
- Super/Meta+S output marked text (~/bin/ysay)
- Super/Meta+Shift+S cancel current audio output (~/bin/ysaykill)

The scripts, language files and piper can of course also be copied to any other path, e.g. under /usr/local/, in which case the path details in the scripts must be adapted.

Tested with Kubuntu 22.04 LTS, 23.10 and 24.04 LTS (should work with any distro) with four German (many more available) language models. Should work on all other display managers (though ot tested) other than KDE.

History:
Over the last few years, i've tried to set up a reasonable voice output under Linux from time to time and have tried practically everything with combinations of espeak/mbrola/speechd/flite/ivona/festival/festvox/jovie/gespeaker/ktts/orca/etc...
None of these had really worked (well) in terms of usability and output quality, especially not in German.
Especially in comparison to Android, where there has been very good speech output for years even without Google/Cloud, I find the situation on the Linux desktop pathetic, even under KDE the languages sound bad via speechd.
I used xsay for a short time, but rarely because of the poor sound quality of the Pico voice.

Why another script then?
This one works ;-), has a better audio quality and omits the need for temporary wav- or mp3-files
During my latest spontaneous attempt, I came across piper (rather by chance) and was amazed at how quick and easy it was.
The voice quality with all the voices (that I tested) is much better than with Pico and is good enough for me to have longer texts read aloud.

Errors/improvements
I tested it on two computers, it was a successful quick fix for me and I have no plans to turn it into a larger development project.
If you have any errors or suggestions for improvement, please feel free to contact me.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
README.md		README.md
ysay		ysay
ysaykill		ysaykill

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ysay-hotkey

About

Releases

Packages

Languages

License

thefoster/ysay-hotkey

Folders and files

Latest commit

History

Repository files navigation

ysay-hotkey

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages