Voice command recognition (also referred to as "automated voice command recognition", "computer voice recognition", or in a slightly altered form "voice recognition") is the process of converting audio signals specific to the human voice into a sequence of words, using an algorithm implemented as a computer program. Next we will use the abbreviation VCR to refer to the current title.

In June 2006, I've developed such a computer software, for my master dissertation thesis.

In the first step it receives the human voice from a microphone. Next it applies a Fast Fourier Transformation on the data set recorded with the microphone. The result is that for every sound it records, it processes a sequence of numbers based on the distribution of the frequencies.
You can see this as a number based descriptor, for various audio signals.
These sequences can then be applied to a neural network for the purpose of recognizing them.

This is only the main idea, more work has been actually done to cut representative voice signals out of the continuous stream coming from the microphone. The processor cost is quite high in this phase and it needs improvement, but overall the algorithm works well.

Here is a short video with me showing this software:

Hope you like it

Radu Motisan

(Visited 87 times, 1 visits today)

11 thoughts on “Voice command recognition

  • September 12, 2008 at 3:13 pm

    Altought not quite understanding all that is going on, it’s allways nice to put a face (and a voice), on a name! ๐Ÿ˜€

  • September 13, 2008 at 12:12 pm

    Thanks Vasco ๐Ÿ™‚

    Regarding the content, I might try to port it on Windows Mobile, I still need to see how the hardware can handle it, and to get some time for this, of course.

    Also there are some additional improvements required to transform a didactic application into a commercial one.

  • March 13, 2009 at 8:04 am

    by seiing the mobile its a latest content involved but it s still need to see the ore ppts and web based relating these projects

  • March 13, 2009 at 8:08 am

    Thanks for giving such information but we need more information because as we are doing a project in this so we require extra ppts an some well preferred matter considering 30to 40 pages

  • Pingback: PocketMagic » Windows Mobile Tech Seminar in Lisbon

  • December 15, 2009 at 3:10 am

    Good ideas, thanks for putting this on your blog.

  • December 15, 2009 at 11:41 am


  • March 3, 2010 at 1:23 am

    Elsewhere you have mentioned the posibility of porting your software to ATMEL microcontrollers.
    I was wondering if you have had any success in this area?

    Great work by the way,

  • March 3, 2010 at 10:56 am

    Thanks Jim, I had some success but I’ve put the project on pause before completing it, because of missing time.

    I hope to be able to make a nice post about that soon.

  • February 26, 2012 at 8:25 pm

    Nice work. But wouldn’t it be better to port your software on a DSP since there is a lot of calculating required (Fourier Transform). With this solution you would be able to put the main intelligence on a ยตC board.

  • February 26, 2012 at 11:49 pm

    Thank you. Yes, that would work.

Leave a Reply

Your email address will not be published. Required fields are marked *