Predicting Pronunciation Failure

Postby Yard-ape » Tue Oct 06, 2009 12:26 pm

How can I tell which words swift will have to guess at pronouncing in a given text file? That is, without actually listening to the whole output. If I could do that, then I could preemptively add to my lexicon.txt for any input, without having to listen to all the wrong guesses.

Example: many foreign names end up "spelled out" by Callie, but I can't tell which among a whole book of them without listening to all the output.

Does the swift engine have a built-in lexicon available which I could use as a standard?

Thanks in advance!
Yard-ape
 

Postby AdamW » Tue Oct 13, 2009 10:55 am

A very good question indeed. Every voice has an internal, per-language lexicon, since enumeration is the only way to cover non-standard pronunciations. Words that are not in the lexicon are pronounced using the letter-to-sound rules, and that's where things can get weird.

I am not sure whether we can release the internal lexicon, since its content could well be a "trade secret". It would allow the competition to know our weaknesses.

All I can promise is to look into it and see whether we can remedy this without hurting the company. One solution might be to add the ability to return the words from the input text that are pronounced using the letter-to-sound rules. If that turns out to work, we could consider adding it to the list of requested enhancements for a future release.
AdamW
 

Postby Yard-ape » Tue Oct 13, 2009 5:32 pm

AdamW wrote: "One solution might be to add the ability to return the words from the input text that are pronounced using the letter-to-sound rules."


Oh, that would be invaluable! So far I've been using the difference between the Brown corpus and my text file to get a preemptive lexicon editing list, but I can imagine:

$ echo "# War and Peace Lexicon:" >>/opt/swift/voices/Callie/lexicon.txt
$ swift --print_unpronounceable -f War_and_Peace.txt >>/opt/swift/voices/Callie/lexicon.txt
$ vi /opt/swift/voices/Callie/lexicon.txt
$ swift -f War_and_Peace.txt -o War_and_Peace.wav
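
In the meantime, the corpus-difference workaround mentioned above can be sketched in shell. This is only a rough sketch under stated assumptions: wordlist.txt is a hypothetical plain-text file with one word per line (for example, extracted from the Brown corpus), and unknown_words is a made-up helper name. It does not query swift's internal lexicon, so the result is only a list of candidates that the engine might have to guess at.

```shell
#!/bin/sh
# Sketch: list the words in a text file that are absent from a reference
# word list. The word list is an assumption standing in for the engine's
# internal lexicon, which is not available.

unknown_words() {
    text_file=$1
    wordlist_file=$2
    known=$(mktemp) || return 1

    # Normalise the reference list: lowercase, sorted, unique.
    tr '[:upper:]' '[:lower:]' < "$wordlist_file" | LC_ALL=C sort -u > "$known"

    # Split the text into one word per line (ASCII letters only, so
    # accented names and contractions such as "don't" get split),
    # lowercase it, drop empty lines, then keep only the words that
    # do not appear in the reference list.
    tr -cs '[:alpha:]' '\n' < "$text_file" \
        | tr '[:upper:]' '[:lower:]' \
        | sed '/^$/d' \
        | LC_ALL=C sort -u \
        | LC_ALL=C comm -23 - "$known"

    rm -f "$known"
}
```

Usage would be something like `unknown_words War_and_Peace.txt wordlist.txt > candidates.txt`, after which candidates.txt can be reviewed by hand before adding entries to lexicon.txt. Both inputs are sorted under LC_ALL=C because comm expects its two inputs in the same sort order.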

Thanks very much for your response and interest in the issue, and looking forward to that possibility in future releases.

Cheers!
Yard-ape
 

