Speech Synthesis on the Propeller
cgracey
Posts: 14,256
Here is the data I used for the speech synthesis presentation·last night·at the UPEW.
This represents a lot of hard-won knowledge that's taken me years to accumulate. It's been·one of the most fascinating endeavors·of my life.·I hope a few of you out there will get some enjoyment from this. If you are interested enough to spend a few hours delving into the .zip below, reading the documentation, running the examples, and doing some experimentation of your own, you may get·the bug, yourself.
I wish you fun and learning.
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
Chip Gracey
Parallax, Inc.
This represents a lot of hard-won knowledge that's taken me years to accumulate. It's been·one of the most fascinating endeavors·of my life.·I hope a few of you out there will get some enjoyment from this. If you are interested enough to spend a few hours delving into the .zip below, reading the documentation, running the examples, and doing some experimentation of your own, you may get·the bug, yourself.
I wish you fun and learning.
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
Chip Gracey
Parallax, Inc.
Comments
It definitely would have been helpful last year when I was trying to do speech on an old video game system (in the end, I basically just made recordings, then converted those to a spectrograph and output the 4 strongest waves, frequently updating the output). I'll have to combine this with random parameters and get a modern day Q*Bert speech synthesis [noparse]:)[/noparse]. Ha... I love this quote from your doc: "Once mastered, though, you will have gained untold insight into the mechanics of speech that you’ll be eager to share with others who will be less interested"... so true, for just about everything I do [noparse];)[/noparse].
DogP
I've just been playing with it and I was wondering if you had built up a speech object and tables so that you could pass byte/word parameters in a buffered output stream to it plus extend that table with macros. Obviously the examples are hard coded and I did just look at modifying some parts of it to hear the result, it's quite interesting. Probably what I'd like to do is build up some word/syllable tables so that I can tie this into a voice prompt application and pass actual word/character/syllable/phoneme parameters direct from the application.
If I don't get too bogged down with actual work I might have some fun and proceed along these lines and see where I can build on this excellent vocal-tract micro-engine.
*Peter*
Good Stuff.
I have wanted an instruction set for the synthesis object ever since I first played with it. This is great. It combines super technical mumbo jumbo with simple easy to follow explanations of how to make that object really "sing".
I have been wanting to do more than make the Singing Sevens say dirty words for·some time, now I can.
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
BH
Post Edited (BADHABIT) : 6/29/2009 3:44:18 AM GMT
It's really fascinating. If you've ever done some speech-to-text "writing", it seems simple enough, but boy you would be wrong. Speech recognition and speech synthesis is an absolutely fascinating field.
Thank you, Chip, for your hardwork in producing this gem. [noparse]:)[/noparse]
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
Chip Gracey
Parallax, Inc.
"Mommyyyy, Dad's scaring me again" ...... "Chip, you keep that "talking prop head" away from our daughter!"
Who says you can't combine work and play. Obviously they haven't worked with the Propeller chip.
*Peter*
Yup, and what a Great Expo it was ! Thanks.
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
http://www.propgfx.co.uk/forum/·home of the PropGFX Lite
·
Hmm, you could examine the formants, nasal params, etc and get a pretty good idea of when the mouth should be doing I think, anyone for a prop head with moving lips?
*Peter*
I learned much about speech and how the ear functions.
Between that and some interesting discussion with Beau Monday
during lunch, I have an interesting idea for a related project.
I am starting to wonder if Chip reads biology and life science
encyclopedias in his "spare" time for entertainment purposes. [noparse]:)[/noparse] [noparse]:)[/noparse]
OBC
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
New to the Propeller?
Visit the: The Propeller Pages @ Warranty Void.
Interesting. I have a talking head (2d) project driven by a prop that talks in synch to a wav file (played by the same prop) but it requires external data (stored in a data file) to determine the mouth movement timing associated with the phonemes. It would be cool to be able to drive the mouth movements directly from the talk track.
Here's a link to a video of it talking:
http://sidnancy.posterous.com/
Post Edited (sssidney) : 6/30/2009 3:45:16 PM GMT