CMU CS 15492 - Speech Recognition

This preview shows page 1-2-3-25-26-27 out of 27 pages.



Speech Processing 15-492/18-492
Speech Recognition
Intro
Acoustic modelling
HMMs

Speech Recognition
• From acoustics to text
• Acoustic modeling
– Recognizing all forms of all phonemes
• Language modeling
– Expectation of what might be said
• We need both to do recognition

Acoustics are not enough
Last Saturday in Hawaii, numerous Waipouli vacationers were shocked to find their beach cordoned off for a UC Berkeley Drama enactment of "Personal office space". The play features exclusively topless men and women in an everyday office environment. Richard Carlson, one of the annoyed tourists and a regular swimmer at Waipouli beach, complained that they really knew how to wreck a nice beach with the nudist play. Many of the tourists appeared ruffled by the content and fled the scene to avoid compromising photos.

In yesterday's press release, AT&T unveiled SpeechKit, its new speech recognition toolkit. According to Michael Armstrong, the COO of the company, the most innovative feature of the system is its revolutionary three-dimensional interface, which opens a new universe of possibilities for the speech recognition community. During the official software release, Jonathan Blues, a senior researcher at AT&T Labs, explained how to recognize speech with the new display, and how the toolkit has already played a crucial role in his research.

Acoustics are not enough
The same two passages, with the near-homophone phrases highlighted:
• "how to wreck a nice beach with this nudist play"
• "how to recognize speech with this new display"

Split the task
• Build Acoustic models
– Probability of phones given acoustics
• Build Language models
– Probability of word string

Acoustic models
• Represent all ways to say each phoneme
• Like "templates" for each phoneme
• Averages over multiple examples
– Different phonetic contexts: "sow" vs "see" etc
– Different people speaking
– Different acoustic environment
– Different channels (assume channel is similar)

Better Acoustic Models
• DTW Template
– Could be averages over multiple examples
– Need to be time normalized
– Linear interpolate or try to match
• Matching probabilistically
– What is the probability that example matches
– Test each frame

Hidden Markov Models
• Markov Process
– Future can be predicted from the past
• Hidden Markov Models:
– When the state is unknown
– A probability is given for each state

Hidden Markov Model
Key Requirements
• Find Probability of Observation
– Given observation O and model M
– Efficiently find P(O|M)
– Called decoding
– Find sum of all path probabilities
– Each path

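The DTW template matching listed under "Better Acoustic Models" can be sketched in the same spirit. This is a minimal illustration, assuming each frame is a single number and the local cost is an absolute difference; real systems compare feature vectors per frame. The function name `dtw_distance` and the sample sequences are hypothetical.

```python
def dtw_distance(template, example):
    """Return the dynamic time warping cost between two frame sequences."""
    n, m = len(template), len(example)
    inf = float("inf")
    # cost[i][j] = best cost of aligning template[:i] with example[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(template[i - 1] - example[j - 1])
            # A frame may match, or either sequence may be stretched in time.
            cost[i][j] = local + min(
                cost[i - 1][j],      # stretch the example frame
                cost[i][j - 1],      # stretch the template frame
                cost[i - 1][j - 1],  # advance both
            )
    return cost[n][m]


if __name__ == "__main__":
    # The same shape spoken more slowly aligns at zero cost.
    print(dtw_distance([1, 2, 3], [1, 1, 2, 2, 3, 3]))
```

The warping path handles the time normalization the slide calls for: a slower or faster rendition of the same pattern still aligns cheaply, whereas a frame-by-frame comparison would require the sequences to be the same length.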
