New version page


Upgrade to remove ads
Upgrade to remove ads
Unformatted text preview:

Voice RecognitionWhat is Voice RecognitionHow does it work?Practical ApplicationsDownside/ DifficultySlide 6Math 5Professor BarnettTimothy G. McManusAnthony P. PastoorsVoice recognition is "the technology by which sounds, words or phrases spoken by humans are converted into electrical signals, and these signals are transformed into coding patterns to which meaning has been assigned" Concept could more generally be called "sound recognition", we focus here on the human voice because we most often and most naturally use our voices to communicate our ideas to others in our immediate surroundings.““Template matching"Template matching" simplest technique and has the highest accuracy when used properly, but it also suffers from the most limitations. The electrical signal from the microphone is digitized by an "analog-to-digital (A/D) converter", and is stored in memory. To determine the "meaning" of this voice input, the computer attempts to match the input with a digitized voice sample, or template, that has a known meaning. The program contains the input template, and attempts to match this template with the actual input using a simple conditional statement.““Feature analysisFeature analysis"“Speaker-independent" voice recognition. Does not need to find an exact or near-exact match between the actual voice input and a previously stored voice templateProcesses the voice input using "Fourier transforms" or "linear predictive coding (LPC)"Attempts to find characteristic similarities between the expected inputs and the actual digitized voice input. These similarities will be present for a wide range of speakers, and so the system need not be trained by each new user.Military PolicePeople with DisabilitiesHealth CareVehicle useSecurityThe difficulty in using voice as an input to a computer simulation lies in the fundamental differences between human speech and the more traditional forms of computer input. While computer programs are commonly designed to produce a precise and well-defined response upon receiving the proper input, the human voice and spoken words are anything but precise. Each human voice is different, and identical words can have different meanings if spoken with different inflections or in different

View Full Document
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...

Join to view VOICE RECOGNITION and access 3M+ class-specific study document.

We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view VOICE RECOGNITION 2 2 and access 3M+ class-specific study document.


By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?