Speech Generator
Dave Hein
Posts: 6,347
I would like to convert text to speech on the prop, and I'm trying to find out what's already been done. I do know about the VocalTract object in the OBEX. I have also tried Phil's text-to-speech demo that he did a few years ago. This stuff is amazing, but I find it hard to understand unless I read the text as I hear the audio. Has anymore work been done on this to improve the quality?
Comments
This will not be very usefull for things like reading the daily news articles or an audio Bible or book or anything like that, though. For those type things, you will need to go a different route. Possibly, you can get a usefull phoneme set using the above method and string phonemes along like above. That would also require a text-to-speech routine, and you will always need exceptions for things like personal names, foreign words, etc.
I downloaded this archive but cannot remember the place... don't know if you already have seen it.
I just had time to quickly peek at the sources, and maybe it would be useable with Phil's phoneme-to-vocaltract if the phoneme representation, which is different, could be matched between the two.
Licence is Public Domain, so I think it's ok to re-upload it.
Alessandro
I actually tried this a couple of years ago....the resulting 'voice' was exceptionally
creepy sounding. Everyone kept playing with it because it was so hilarious :-)
It was like a cross between a jibbering chinaman and someone possessed by satan. LoL
AY THIHNGk AY wIHl CHEYnj IHt tUW yUWz DHAX fOWnIYm rEHprIYzEHntEYSHAXn DHAEt IHz yUWzd bAY dIHkSHAXnEHrIYs WHEHn SHOWIHNG EY fOWnEHtIHk spEHlIHNG
and I'll probably change it to generate
ahy thingk ahy wil cheynj it too yooz thuh foh-neem rep-ri-zen-tay-shuhn that iz yoozd bahy dik-shuh-ner-eez hwen shoh-ing uh fo-ne-tik spel-ing
It's a bit more readable to me, and easier to remember.
@Holly, your comment makes more interested in trying the recorded-phoneme method. It sounds like it could be quite entertaining. There is a tendency for synthesized stuff to come across as creepy if it is close to being natural, but not quite there. This is evident in the CGI material that is used in movies, where it almost looks real, and ends up looking odd instead.
audio editing software. Then I made a simple app for windows that could
play sequences of them to make words...it really was funny :-)
I recommend working with a PC app to get the phonemes right...it would be
rough doing it on a uC.
As the C64 had only 64kB I guess the RAM needed was quite low. Does anybody rember how many kB?
here's a video of the demo file http://www.youtube.com/watch?v=Rm4ZCGgzeeU
Just found this files for downloading http://the-cbm-files.tripod.com/speak/
sam64.zip is 28kB big when etxracted.
So there might be a chance to re-use the SAM phonemic data with SID-Cog. But I have no idea how much work this will be
best regards
Stefan
http://www.omniboard.be/projects/kit7/kit7.htm
Quality is about the same as from the speech synthesis chips from 25 years ago.
Something like it could perhaps be implemented on the Propeller.
-Phil
ftp://ftp.propeller-chip.com/PropC3/Games/Jay_T_Cook/jtc_prop-talker_c3_v001/