Singing monk squad

Graham Stabler · 2006-10-29 12:45

I think it must be my dirty mind but with a bit more vibrato I was sure they were singing effing effing effing seven [noparse]:)[/noparse]

This is totally awesome, I don't understand it but I really want to. I'll be loading that site on my Sony reader when it comes next week.

Graham

p.s. Yes I know I just dropped that note about the Sony reader in without provocation but I'm excited!!

ALIBE · 2006-10-29 13:30

Chip,
this is awesome!!

I listen to a lot of music - But, "Studying Music" is one of those things that is beyond my reach. Despite that handicap, I can tell this is great work - keep up the voyage!

BTW, did not know that you are music savvy also

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
"any small object, accidentally dropped, goes and hides behind a larger object."

ALIBE - Artificial LIfe BEing. In search of building autonoumous land robot
http://ALIBE.crosscity.com/
·

Harley · 2006-10-29 16:46

Paul said...
Chip, so you didn't have a chorus of monks sing "seven"?
"Seven" was Chip's test vector for improving the quality of the vocal tract, so our side of the building has been filled with the sounds of "seven" for months now.

Excuse my 'ignorance', but what is this "Seven", some sort of 'music'?___ Used Wikipedia, but didn't notice anything to do with music that registered. Oh, there were lots of references to numbers, astronomy, religion, etc.

Were the 'monks' singing Latin or gibberish?___ I'd sure like to hear some spoken English in .mp3 format. I suppose one could also fairly easily make it speak nearly any language?

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
Harley Shanko
h.a.s. designn

Paul Baker · 2006-10-29 20:53

Have you listened to the mp3 I posted? Seven is 7 the number, it was his test word for the synth that we heard tens to hundreds of times daily as he worked on the project, so I was joking about it. Spoken and sung is identical except the way the vocal tract is excited. Definition of the speech cookbooks will likely fall upon those Parallax customers who choose to pick up the ball on this. But Chip and/or I will explain the method he used to pick out the required formants for those that want additional words.

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
Paul Baker
Propeller Applications Engineer

Parallax, Inc.

Harley · 2006-10-29 22:45

Paul said...
Have you listened to the mp3 I posted? Seven is 7 the number, it was his test word for the synth that we heard tens to hundreds of times daily as he worked on the project, so I was joking about it. Spoken and sung is identical except the way the vocal tract is excited. Definition of the speech cookbooks will likely fall upon those Parallax customers who choose to pick up the ball on this. But Chip and/or I will explain the method he used to pick out the required formants for those that want additional words.

Yes, I did listen to the mp3 file. I'm going to have to add the components to have the 'audio' on my PropSTICK.

"Seven" 'spoken' hundreds of times in the lab or office can be stressful. But that's what is involved in trying to make something work right. Thanks for clearing up my misunderstanding. Great work going on at Parallax. Love the Propeller.

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
Harley Shanko
h.a.s. designn

Paul Baker · 2006-10-29 22:57

It wasn't stressfull, just a running joke at the office. I just slip on the headphones if I dont want to hear it.

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
Paul Baker
Propeller Applications Engineer

Parallax, Inc.

cbmeeks · 2006-11-01 12:41

Absolutely tremendous work. Totally awesome.

I don't have my Prop totally finished yet but I can't wait to get this going. I haven't been able to read through all the source (still learning Spin) but it sounds like you are getting 3-4 voices (not speech but channels) going at once with ONE cog?

So two cogs running 4 channels each could easily get me 8 voices? Man, I think this single chip is goint to handle everything for my homebrew computer. I can't wait until that 1 Mhz 6502 is driving this Propeller....hahaha

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
Coders' Hangout
A place for programmers to hangout!
http://www.codershangout.com

METROID?
Metroid Classic

cgracey · 2006-11-01 13:06

cbmeeks said...

I haven't been able to read through all the source (still learning Spin) but it sounds like you are getting 3-4 voices (not speech but channels) going at once with ONE cog?

So two cogs running 4 channels each could easily get me 8 voices?

It actually uses a whole·COG·for each of the four·voices.·Another COG is running the stereo spatializer (which processes the·four "channels"). A final COG·is running Spin code to control everything.

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔

Chip Gracey
Parallax, Inc.

cbmeeks · 2006-11-01 13:14

So that would be 6 cogs in all? So, I could add one more cog, get 5 channels (voices) which would take me to 7 cogs and then the 8th cog could be used to communicate with the rest of the homebrew computer? That's still not too bad for an 8 bit computer.

It looks like my homebrew computer is going to have three propellers! (audio/video/IO) hehehe

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
Coders' Hangout
A place for programmers to hangout!
http://www.codershangout.com

METROID?
Metroid Classic

kelvin james · 2006-11-03 19:20

Through some playing around with the vocal tract, it seems there is a need to provide some type of variable setting to adjust the amplitude. To make it simple, just an attack and release. Letters like P are short and abrupt, where as H is soft and sustained. It seems just a timing operation on the rise and fall of the amplitude could do this.
If there is something there already, i do not see it.

kelvin

cgracey · 2006-11-03 19:55

kelvin james said...
Through some playing around with the vocal tract, it seems there is a need to provide some type of variable setting to adjust the amplitude. To make it simple, just an attack and release. Letters like P are short and abrupt, where as H is soft and sustained. It seems just a timing operation on the rise and fall of the amplitude could do this.
If there is something there already, i do not see it.

kelvin

Yeah, you just change whatever parameter(s) need changing and then tell it the amount of time to make the transition(s) in. It will linearly glide from the last set of parameters to the new set of parameters over the time you specified. If you don't change any parameters and just do a go(time), it will sustain the last settings for the specified amount of time.

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔

Chip Gracey
Parallax, Inc.

Graham Stabler · 2006-11-04 20:12

I'm feeling like a bit of a dunce, can someone suggest some parameters to get a basic tone out of the vocal tract, I was trying to just enter them in one after another and then run .go. I seem to get white noise or a mess. A super simple example would be great, I don't know what I am doing wrong.

Thanks

Graham

Ym2413a · 2006-11-04 20:24

"seven" sounds pretty good. ^^ (lol)
A whole song about the number 7.

cgracey · 2006-11-04 20:42

Graham Stabler said...
I'm feeling like a bit of a dunce, can someone suggest some parameters to get a basic tone out of the vocal tract, I was trying to just enter them in one after another and then run .go. I seem to get white noise or a mess. A super simple example would be great, I don't know what I am doing wrong.

Graham, about the f1-f4 parameters, you need to keep 'em separated, lest overflow occurs and you wind up with piercing noise. Here is a short example taken out of the spatializer demo:

CON _clkmode = xtal1 + pll16x
·· ·_xinfreq = 5_000_000

OBJ v : "VocalTract"

VAR byte· aa,ga,gp,vp,vr,f1,f2,f3,f4,na,nf,fa,ff

PUB start

· v.start(@aa, 10, 11, -1)··· 'start vocal tract,·output to pins 10 and 11

· gp := 88··················· 'set pitch to·F#
· f1 := constant(670 / 19)····'set "uh" sound···
· f2 := constant(1160 / 19)···'(This is the exact·noise I make when I'm
· f3 := constant(2600 / 19)···'trying to answer a question.)
· f4 := constant(3100 / 19)
· vp := 20··················· 'add some vibrato for dramatic flare
· vr := 10
· v.go(0)···················· 'transition to settings as fast as possible

· ga := 50····················'set·glottal amplitude
· v.go(5000)················· 'ramp up glottal amplitude slowly

· vr := 50····················'ready increased vibrato rate
· gp := 20··················· 'ready decreased pitch
· v.go(10000)················ 'slowly transition
·

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔

Chip Gracey
Parallax, Inc.

Post Edited (Chip Gracey (Parallax)) : 11/4/2006 9:11:28 PM GMT

BTX · 2006-11-04 20:48

Chip.
Is it possible to change nf, na to get better sound in Spanish words ?? could be correct changing those two parameters ??
I'm begining, and it so difficult for me, to locate, where do you post their initial values..

Alberto.

cgracey · 2006-11-04 21:07

BTX said...
Chip.
Is it possible to change nf, na to get better sound in Spanish words ?? could be correct changing those two parameters ??
I'm begining, and it so difficult for me, to locate, where do you post their initial values..

Alberto.

I don't think·the nf/na parameters would be especially unique in Spanish, but I do think that rapidly repeating short sequences could generate sounds like rolled r's.

Did you read that Computalker link? It's a bit of investment, but worth it. If you digest that, and then play with that spectograph program, you will start to see how all this ties together.

You can always experiment, and just see what you get, but I think that understanding the spectral nature of speech·is critical. On this thread you should see links I posted to other resources, including that spectograph program.

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔

Chip Gracey
Parallax, Inc.

BTX · 2006-11-04 21:15

Chip said...
Did you read that Computalker link?

No I didn't Chip, but I will, and try to understand it.
I thought that computalker only works on English phonetics.......sorry, like I said I wil read it first.

Thanks so much.

cgracey · 2006-11-04 21:25

BTX said...

Chip said...
Did you read that Computalker link?

No I didn't Chip, but I will, and try to understand it.
I thought that computalker only works on English phonetics.......sorry, like I said I wil read it first.

Thanks so much.

Though the Computalker was probably designed with English in mind, it should be capable of making sounds from any language, as it models the vocal tract. I would imagine that to do some dialects especially well, some optimization to a simple vocal tract model might be in order, but you could get well in the ball-park with a simple model.

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔

Chip Gracey
Parallax, Inc.

Graham Stabler · 2006-11-04 23:05

Thanks Chip, I think I must have had overlapping formants. I also hadn't checked out the spatializer because all my mice are at work (don't ask). I'm hoping to analyse my own voice a bit and get Phil's speech object sounding more like me.

Cheers,

Graham

Graham Stabler · 2006-11-05 01:21

I'm not sure how useful this will prove but I've made a program for TV and keyboard so you can fiddle with the parameters and see the effects. Might be handy for tweaking.

Find attached.

Graham

cgracey · 2006-11-06 08:29

Graham,

I looked at your code, but didn't run it because I didn't have a tv and keyboard hooked up. It looks great, though.

Your program is·a good idea for allowing people to get a quick handle on what the various parameters do. I noticed that you were stepping the frequencies by 100Hz (~5 unit steps). What about stepping by $10 and using shift-key to step by $01. That way, they could try the entire 8 bits of range out without too much typing. This would be good for finding the 'ga' limit for a set of formants, for example. You would have to show the parameter value, and maybe its translated unit value.

Anyway, thanks for posting this code. I will try it later.

Graham Stabler said...
I'm not sure how useful this will prove but I've made a program for TV and keyboard so you can fiddle with the parameters and see the effects. Might be handy for tweaking.

Find attached.

Graham

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔

Chip Gracey
Parallax, Inc.

KenLem · 2006-11-06 14:37

Well done Chip!

Thanks for the link too.· I really enjoy digging into something the texts from the early days of computing.· There are some real gems buried in there.· I just bought the origial Dr Drobbs with Li-Chen Wang's Tiny Basic listing in it.· I don't have any plans for it but it's just wonderful to read.

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
www.speechchips.com

Speech & Video IC's for BasicStamps

Graham Stabler · 2006-11-06 22:07

Thanks Chip, I'll definately make those changes, I've just found an old PS2 keyboard, I think I'm going to mark it out with the controls.

I've now produced spectrums for all of the sounds of my voice needed in Phil's program, I used a program called Spectrogram that I downloaded (10 day free trial) so I'll be trying to build up the sounds. One problem I have found so far is that the first Formant for a Yorkshireman seems rather lower than for the average American (though my voice is not all that deep) I always seem to have to lower ga a lot so it doesn't complain.

I got some interesting stuff from the library (took my camera) which I'll share as time allows.

Cheers,

Graham

Heather Dewey-Hagborg · 2007-08-06 16:37

using the education kit what is the easiest way to get the singing monk voices out of the propeller and into a pair of speakers?

-heather

parts-man73 · 2007-08-06 17:08

Take a look at OldBitCollectors "Propeller Cookbook" - I have it hosted on my website, or he has it mirrored elsewhere as well.

Look at it here - ucontroller.com/

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
Brian

uController.com - home of SpinStudio

Heather Dewey-Hagborg · 2007-08-06 17:45

The cookbook circuit works perfectly for the tone generating examples in the counters lab, but I still can't hear a peep from the singing monks...

-heather

Oldbitcollector (Jeff) · 2007-08-06 18:54

Strange,

The monks are singing sweetly here...

The audio circuit has been an established standard here, wonder what changed?

Oldbitcollector

▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
The comments and code above are proof that a million monkeys with a million propeller chips *could* write Shakespeare!

Heather Dewey-Hagborg · 2007-08-06 21:22

got it working! looks like i was having brown out reset issues and wasn't actually executing the code. what a relief!

Javalin · 2007-08-07 09:52

With the MP3's posted here - Chip could release a "Propellor Monks" album (on-line only of course). Perhaps a free copy for all purchases over $50?

Cool stuff Chip.

Javalin

mcstar · 2007-08-08 21:23

Does anyone know why the VocalTract library squeaks/pops/hisses and does other nasty things when you attemp certain frequency combinations?· For instance, I'm attempting to create· the "oo" (F1=425, F2=2000, F3=2400,F4=3000) vowel sound and I have to lower the gutteral (gp) to around 60 or 65hz.· This means female voices are out of the question.· Is there any way to rectify this to get more natural higher pitched sounds??

Singing monk squad

Comments