just how much data can you get from a mic?
Muncher
Posts: 38
Can you retrieve enough info from a plain old mic to do voice recognition and how? I do not need any code, although code for the BS2 would be helpful.
Thanks in advance,
Muncher
Thanks in advance,
Muncher
Comments
http://www.cs.cmu.edu/~awb/papers/ICASSP2006/0100185.pdf#search='embedded speech recognition'
Also, check out www.sensoryinc.com. They make both a speech recognition chip and speech synthesis chips.
A Stamp would be too slow to do the signal processing (usually recording the timing of zero crossings and average
energy in the sounds), but a SX or PIC microprocessor (or the Propellor) could do the job.
I am still open to any advice (of course).
-Muncher
On the other hand, if you have an application and are just looking for a part to get a solution, then try the Voice Direct 364. I haven't used it myself, but it specs at fifteen words max. I'm sure you can google that name and get enough hits to find a source for the part.
You run the audio signal to this chip. I don't remember what conditioning is required. The way it looks to your system is one pin that goes high when it hears any word it's configured to listen for. Then one of eight pins goes high to indicate which word. You can drive other circuits from those outputs, like an on/off circuit, or make them inputs to a Stamp and do anything you like. (And no, I don't remember how they use eight pins to distinguish which of fifteen words were heard. Presumably a pin with a high order bit that means which bank of words caught the 'hit'.) I looked at this part for an application a couple of years ago but never got around to sampling it.
It's about fifty bucks apiece in small quantities I think.
between "YES" and "NO" from my voice. Prior to making the distinction, I made a program that would display the average amplitude of sound over time. After studying this I was able to discern
a pattern from this that I could use to determine a Yes or No response. From what I remember, graphically the word "YES" ramps up slowly and drops off quickly, where as the word "NO" does
virtually the opposite. It ramps up quickly, and then tapers off slowly.
Depending on the actual words or sounds you want to detect, it could be doable but it will most likely be a meticulous re-iterative process looking for the specific patterns. ...And keep in mind
what works for your voice may or may not work with someone else's.
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
Beau Schwabe
IC Layout Engineer
Parallax, Inc.
http://forums.parallax.com/showthread.php?p=579853
It's around page 2.
I hope this helps.
▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔▔
There is always an answer.
There is always a way.
There is always a reason.··· -SciTech02.
Post Edited (Harbor) : 6/8/2006 2:42:43 AM GMT