Slide 1
Outline
Problem: Dialect Recognition
Motivation: Why Study Dialect Recognition?
Motivation: Cues that May Distinguish Dialects/Accents
Motivation: Cues that May Distinguish Dialects/Accents
Outline
Case Study: Arabic Dialects
Corpora – Four Dialects – DATA I
Outline
Probabilistic Framework for Language ID
Outline
Phonotactic Approach
Phonotactic Approach – Identification
Applying Parallel PRLM (Zissman, 1996)
Our Parallel PRLM Results – 10-Fold Cross Validation
Outline
Prosodic Differences Across Dialects
New Approach: Prosodic Modeling
New Approach for Prosodic Modeling
Prosodic Modeling – Results (2m test utterances)
Outline
Baseline: Acoustic Modeling
Corpora – Four Dialects – DATA II
NIST LREC Evaluation Framework
Results (DET curves of PRLM and GMM-UBM) – 30s Cuts (Data II)
Our GMM-UBM Improved with fMLLR
Results – GMM-UBM-fMLLR – 30s Utterances
Outline
Discriminative Phonotactics
Obtaining CD-Phones
CD-Phone Universal Background Acoustic Model
Obtaining CD-Phones + Frame Alignment
MAP Adaptation of each CD-Phone Instance
MAP Adaptation of each CD-Phone Instance
Slide 36
Discriminative Phonotactics – CD-Phone Classification
CD-Phone Classifier Results
Extraction of Linguistic Knowledge
Labeling Phone Sequences with Dialect Hypotheses
Textual Feature Extraction for Discriminative Phonotactics
Experiments – Training Two Models
Discriminative Phonotactics – Dialect Recognition
Results – Discriminative Phonotactics
Results per Dialect
Comparison to the State-of-the-Art
Research Plan
Acknowledgments
Prosodic Differences Across Dialects
Frame Alignment

Automatic Dialect/Accent Recognition
Fadi Biadsy
April 12th, 2010
PhD Proposal – Fadi Biadsy

Outline
  Problem
  Motivation
  Corpora
  Framework for Language Recognition
  Experiments in Dialect Recognition
    Phonotactic Modeling
    Prosodic Modeling
    Acoustic Modeling
    Discriminative Phonotactics

Problem: Dialect Recognition
  Given a speech segment of a predetermined language
    Dialect = {D1, D2, …, DN}
  Great deal of work on language recognition
  Dialect and accent recognition have more recently begun to receive attention
  Dialect recognition more difficult problem than language recognition

Motivation: Why Study Dialect Recognition?
  Discover differences between dialects
  To improve Automatic Speech Recognition (ASR)
    Model adaptation: Pronunciation, Acoustic, Morphological, Language models
  To infer speaker's regional origin for
    Speech to speech translation
    Annotations for Broadcast News Monitoring
    Spoken dialogue systems – adapt TTS systems
    Charismatic speech
    Call centers – crucial in emergency situations

Motivation: Cues that May Distinguish Dialects/Accents
  Phonetic cues:
    Differences in phonemic inventory
    Phonemic differences
    Allophonic differences (context-dependent phones)
  Phonotactics: Rules/Distribution that govern phonemes and their sequences in a dialect
  Example: /r/
    Approximant in American English [ɹ] – modifies preceding vowels
    Trilled in Scottish English in [Consonant] – /r/ – [Vowel] and in other contexts
  Example: "She will meet him"
    MSA: /s/ /a/ /t/ /u/ /q/ /A/ /b/ /i/ /l/ /u/ /h/ /u/
    Egy: /H/ /a/ /t/ /?/ /a/ /b/ /l/ /u/
    Lev: /r/ /a/ /H/ /t/ /g/ /A/ /b/ /l/ /u/
    Differences in Morphology
    Differences in phonetic inventory and vowel usage
  (Al-Tamimi & Ferragne, 2005)

Motivation: Cues that May Distinguish Dialects/Accents
  Prosodic differences
    Intonational patterns
    Timing and rhythm
  Spectral distribution (Acoustic frame-based features)
  Morphological, lexical, and syntactic differences
  Subjects rely on intonational cues to distinguish two German dialects (Hamburg urban dialects vs. Northern Standard German) (Peters et al., 2002)

Outline
  Problem
  Motivation
  Corpora
  Framework for Language Recognition
  Experiments in Dialect Recognition
    Phonotactic Modeling
    Prosodic Modeling
    Acoustic Modeling
    Discriminative Phonotactics
  Contributions
  Future Work
  Research Plan

Case Study: Arabic Dialects
  Iraqi Arabic: Baghdadi, Northern, and Southern
  Gulf Arabic: Omani, UAE, and Saudi Arabic
  Levantine Arabic: Jordanian, Lebanese, Palestinian, and Syrian Arabic
  Egyptian Arabic: primarily Cairene Arabic

Corpora – Four Dialects – DATA I
  Recordings of spontaneous telephone conversation produced by native speakers of the four dialects, available from LDC

  Dialect    # Speakers  Total Duration  Test Speakers  Corpus
  Gulf       965         41h             150            Gulf Arabic conversational telephone Speech database (Appen Pty Ltd, 2006a)
  Iraqi      475         26h             150            Iraqi Arabic conversational telephone Speech database (Appen Pty Ltd, 2006b)
  Egyptian   398         76h             150            CallHome Egyptian and its Supplement (Canavan et al., 1997); CallFriend Egyptian (Canavan and Zipperlen, 1996)
  Levantine  1258        79h             150            Arabic CTS Levantine Fisher Training Data Set 1-3 (Maamouri, 2006)

Probabilistic Framework for Language ID
  Task:
  Hazen and Zue's (1993) contribution:
    Acoustic model
    Prosodic model
    Phonotactic
    Prior

Phonotactic Approach
  dh uw z hh ih n d uw ey ... f uw v ow z l iy g
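
The phonotactic idea can be illustrated with a small sketch: train one phone n-gram model per dialect and pick the dialect whose model gives a phone sequence the highest likelihood. This is only a toy illustration, not the implementation behind the slides. It assumes the phone sequence has already been produced by some phone recognizer, uses a bigram model with add-one smoothing (an assumption), and trains on the three "She will meet him" phone strings from the cues slide purely as placeholder data. The function names `train_bigram_model`, `score`, and `classify` are hypothetical.

```python
import math
from collections import Counter


def train_bigram_model(sequences, vocab):
    """Return a function giving add-one smoothed bigram log-probabilities."""
    bigrams = Counter()
    contexts = Counter()
    for seq in sequences:
        padded = ["<s>"] + seq + ["</s>"]
        for prev, cur in zip(padded, padded[1:]):
            bigrams[(prev, cur)] += 1
            contexts[prev] += 1
    # Possible next symbols: every phone in the vocabulary plus the end marker.
    num_outcomes = len(vocab) + 1

    def logprob(prev, cur):
        return math.log((bigrams[(prev, cur)] + 1) / (contexts[prev] + num_outcomes))

    return logprob


def score(logprob, seq):
    """Log-likelihood of a phone sequence under one dialect's bigram model."""
    padded = ["<s>"] + seq + ["</s>"]
    return sum(logprob(prev, cur) for prev, cur in zip(padded, padded[1:]))


def classify(models, seq):
    """Pick the dialect whose phonotactic model scores the sequence highest."""
    return max(models, key=lambda dialect: score(models[dialect], seq))


# Placeholder training data: one phone string per dialect, taken from the
# "She will meet him" example. Real systems train on hours of decoded speech.
training = {
    "MSA": ["s a t u q A b i l u h u".split()],
    "Egy": ["H a t ? a b l u".split()],
    "Lev": ["r a H t g A b l u".split()],
}
vocab = {phone for seqs in training.values() for seq in seqs for phone in seq}
models = {d: train_bigram_model(seqs, vocab) for d, seqs in training.items()}

print(classify(models, "H a t ? a b l u".split()))
```

In a parallel PRLM setup (Zissman, 1996), several phone recognizers each feed their own set of per-dialect n-gram models and the resulting scores are combined; the sketch above covers only a single tokenizer and a single set of models.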