by xanatos » Mon Jun 01, 2020 7:52 am
The delays have become a minimal issue over time. Running "Belle" on my Raspberry Pi 4 (4Gig) with Buster, the delays are trivial at this point. The sentence "Knowledge is knowing a tomato is a fruit, wisdom is knowing to not put it in a fruit salad" takes less than a second from being passed to the speech engine to the time speech begins.
The main issue that remains for me is the fact that the voices need to write the wav file to the disk, then it must be played back. Because the system uses an SD card, this creates many writes to the card over time. I would like to find an app that could take the wav that would normally be written and instead buffer it to ram and speak it without creating a physical wav file on the card. I would guess this to be a relatively simple task by redirecting the output in they way R Pi users can direct output to stdout, or /dev/null, etc., but I haven't researched this particular function in any detail yet, concentrating instead on cognitive architecture and language processing for now... but it is an issue I will need to revisit in the not-too-distant future.
Glad to see people are still using this thread and these voices. These voices still stand out as among the best available anywhere for embedded, offline, standalone applications. I'm hoping that the developers do continue to add functionality over time, especially some vastly more robust ability to script voice tone, tempo, emphasis, etc. I use the in-line markups but they are only minimally helpful.
Dave