Page 1 of 1

How do I made the voice sound excited?

PostPosted: Wed May 22, 2013 10:03 am
by siyanz
Hey,
I have been trying to twig the intonation of Allison's voice so that she sounds excited. For example, when she says "good job!", her voice should not only have a higher pitch but also have stresses to make her sound really happy. I tried pitch contour but it changes the voice to a man's voice. Does anyone have any good suggestions? Thanks!

Re: How do I made the voice sound excited?

PostPosted: Thu May 23, 2013 1:06 pm
by NicholeH
That's a great question. Probably the best way to modify your voice is to use SSML tags. Three SSML tags in particular might help you to achieve a stronger, more excited tone in your voice. These are pitch, volume, and emphasis. It sounds like you may have tried pitch, but it's possible that you set the pitch to low rather than high, which might have created a more masculine sounding pitch. Here is an example of these tags in use:

swift -n Allison "Thank you for completing the task. <emphasis level='moderate'><prosody pitch='+40%' volume='loud'>Good</prosody> <prosody pitch ='+25%' volume='x-loud'> job</prosody></emphasis>"

You can read more about using SSML tags on our SSML tutorial, located here: http://www.cepstral.com/en/tutorials/view/ssml

Re: How do I made the voice sound excited?

PostPosted: Thu May 23, 2013 4:09 pm
by siyanz
NicholeH wrote:That's a great question. Probably the best way to modify your voice is to use SSML tags. Three SSML tags in particular might help you to achieve a stronger, more excited tone in your voice. These are pitch, volume, and emphasis. It sounds like you may have tried pitch, but it's possible that you set the pitch to low rather than high, which might have created a more masculine sounding pitch. Here is an example of these tags in use:

swift -n Allison "Thank you for completing the task. <emphasis level='moderate'><prosody pitch='+40%' volume='loud'>Good</prosody> <prosody pitch ='+25%' volume='x-loud'> job</prosody></emphasis>"

You can read more about using SSML tags on our SSML tutorial, located here: http://www.cepstral.com/en/tutorials/view/ssml


Hi Nicole,
That is a good solution.Thank you! But is there a shorter way to do this? Besides, for different sentence, I have to tune it differently and this might be time-consuming as I have a whole script to do. I might just be super picky but please bear with me. :)

Re: How do I made the voice sound excited?

PostPosted: Fri May 24, 2013 11:11 am
by ChrisM
Hi,

Unfortunately there is no quicker way to do this. Unlike humans, text to speech engines (not just ours!) have trouble determining how something should be said based on context. This is why you have to tell it how to say things with SSML. I know that at the moment this is a rather time consuming process, and we are investigating ways to make it faster and easier for people to get our voices to sound exactly like they want. If we were to write a program that helped people sculpt our TTS engine to say things how they would like, would you have any features in particular you would like to see? We would be happy to note them and attempt to implement them in the future if we do decide to build such a tool.

Sorry for the disappointing but be sure that we take any customer feedback under heavy advisement when developing new features and programs.

Re: How do I made the voice sound excited?

PostPosted: Sat May 28, 2016 2:52 pm
by makafre
Using a competitor's software named l.....o :) it's easy to change the intonation using ! and ?. I would have expected Cepstral to do the same..will you guys look into it in the future?

Re: How do I made the voice sound excited?

PostPosted: Mon Jun 06, 2016 10:29 am
by Aerdegath
I've been looking at something similar; trying to make the voices sound a bit more natural in their intonation.

As far as I can tell the contour SSML tag is going to be the best way to do this but as OP said this changes the voice drastically. Does this mean the contour tag is simply unsupported or is there something I'm doing wrong?

BTW just using the example from the SSML spec: <prosody contour="(0%,+20Hz) (10%,+30%) (40%,+10Hz)">good morning</prosody>