DOC PREVIEW
Columbia COMS W4706 - Accenting and Information Status

This preview shows page 1-2-3-19-20-39-40-41 out of 41 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 41 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 41 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 41 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 41 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 41 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 41 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 41 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 41 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 41 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

Accenting and Information StatusInformation StatusToday: Acent and Given/NewA Simple DefinitionRole in Speech TechnologiesSlide 6Prince ’81: A More Complex ModelSlide 8Prince ’92: A Still More Complex ModelSlide 10Given/New and Pitch AccentBrown ‘83: Accent Status and Subclasses of Given/NewSlide 13Boston Directions Corpus (Hirschberg & Nakatani ’96)Hearer and Discourse Given/New LabelingSlide 18Does Given/New Status Predict Deaccenting?And….Bard’99: Givenness, deaccenting and intelligibilitySlide 21What else is going on?Experimental DesignSlide 24Slide 25Slide 26Slide 27Slide 28Slide 29Experimental ConditionsProsodic AnalysisGrammatical Role/Surface Position Accenting ‘Score’FindingsGiven/New Isn’t Just About Discourse EntitiesHow can we determine automatically whether a discourse entity is given or new?What else can we do?How important is it to accent given/new items appropriately?Intonational cues in on-line processingDahan et al. (2002)Slide 41Accent, Given/New, and Grammatical FunctionSlide 43Next Class01/13/19 1Accenting and Information StatusJulia HirschbergCS 470601/13/19 2Information Status•Topic/comment, theme/rhemeThe orangutan we wanted to buy escaped from the pet store.•Focus of attentionI only bought candy for that orangutan.•Given/newI only bought candy for that orangutan. I would never buy an ape drugs!•All commonly signaled in human speech by intonation01/13/19 3Today: Acent and Given/New•Motivation in speech technology•Models of Given/New•Experiments on Given/New and pitch accent•Possible models of intonation wrt given/new entities•How might we identify given/new information automatically?•How should we produce given/new information appropriately?•Why is this important?01/13/19 4A Simple Definition•Given: Recoverable from some form of context or, what a Speaker believes to be in a Hearer’s consciousness•New: Not recoverable from context or, what a Speaker believes is not in a Hearer’s consciousness01/13/19 5Role in Speech Technologies•TTS: Natural production–Given information is often deaccented –New information is usually accented•ASR: Improved recognition–Given information may already have been recognized earlier–New information may be important cue to topic shift•Summarization: Improved precision–Given information less likely to be included in a summary; new information more likely01/13/19 6•Spoken Dialogue Systems: Grounding–Critical for system to convey what is given and what is new to facilitate Hearer comprehension01/13/19 7Prince ’81: A More Complex Model•Speaker (S) and Hearer (H), in a discourse, construct a discourse model–Includes discourse entities, attributes, and links between entities–Discourse entities: individuals, classes, exemplars, substances, concepts (NPs)•Entities when first introduced are new–Brand-new (H must create a new entity)My dog bit a rhinoceros this morning.01/13/19 8–Unused (H already knows of this entity)The sun came out this morning.•Evoked entities are old, or ‘given’ -- already in the discourse–Explicitly evoked (in text or speech)The rhinoceros was wearing suspenders. Rather unusual for a rhino.–Situationally evokedWatch out for the snake!•Inferables are also old, or ‘given’I bought a new car. The gear shift is a bit tricky.01/13/19 9Prince ’92: A Still More Complex Model•Hearer-centric information status:–Given: what S believes H has in his/her consciousness–New: what S believes H does not have in his/her consciousness•But discourse entities may also be given and new wrt the current discourse–Discourse-old: already evoked in the discourse–Discourse-new: not evoked01/13/19 10 The stars are very bright tonight (Hearer-given; Discourse-new)When I see stars this bright, I think of my vacations in the mountains. (Hearer-given; Discourse-given)My friend Buddy and I would sneak out late at night. (Hearer-new; Discourse-new)I said, “My friend BUDDY…” (Hearer-new; Discourse-given)01/13/19 11Given/New and Pitch Accent•New information is often accented and given information is often deaccented (Halliday ‘67, Brown ‘83, Terken ‘84) –But there are many exceptions: a simple TTS rule: accent ‘new’ and deaccent ‘given’ will make 25-30% errors–How can we reduce these errors, to produce human-like intonation?01/13/19 12Brown ‘83: Accent Status and Subclasses of Given/New•Speech elicitation in laboratory–12 Scottish-English undergrads–A describes a diagram for B to draw, which B cannot seeDraw a black triangle.Draw a circle in the middle.Draw a blue triangle next to the black one with a line from the top angle to the bottom.•Analysis: based on Prince ‘81 categories with modifications01/13/19 13–Brand-new (a triangle), given:inferrable (middle, angle), given:contextually evoked (the page), given:‘textually’ evoked (divided into current topic vs. earlier mention)–Accent status of all entity-referring NPs•Results:–Brand-new information accented (87%)•Note: new entity/old expression issue–Given: contextually evoked information deaccented (98%)–Given: ’textually’ evoked deaccented (current topic 100%; earlier: 96%)–Given: inferable information accented (79%)01/13/19 14Boston Directions Corpus (Hirschberg & Nakatani ’96)•Experimental Design•12 speakers: 4 used•Spontaneous and read versions of 9 direction-giving tasks (monologues)•Corpus: 50m read; 67m spon•Labeling–Prosodic: ToBI intonational labeling–Given/new (Prince ’92), grammatical function, p.o.s.,…01/13/19 17Hearer and Discourse Given/New Labelingfirstenter the Harvard Square T stopand buy a tokenthenproceed to get on theinboundumRed Lineuh subwayandtake the subwayfrom Harvard Squareto Central Squareand then to Kendall Squarethen get off the T01/13/19 18Hearer and Discourse Given/New Labelingfirstenter <HG/DN the Harvard Square T stop>and buy <HI/DN a token>thenproceed to get on <HI/DN theinboundumRed Lineuh subway>andtake <HG/DG the subway>from <HG/DG Harvard Square>to <HG/DN Central Square>and then to <HG/DN Kendall Square>then get off <HG/DG the T>01/13/19 19Does Given/New Status Predict Deaccenting?NPa HG HI HN DG DNDeaccented 37.1% 53.9% 26.2% 43.3% 38.8%Total 1009 406 130 596 950HG: Hearer Given HI: Hearer Inferable HN: Hearer New DG: Discourse Given DN: Discourse New39.4% of (H or D) Given items deaccented…36.9% of (H or D) New Items are deaccented…01/13/19


View Full Document

Columbia COMS W4706 - Accenting and Information Status

Download Accenting and Information Status
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Accenting and Information Status and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Accenting and Information Status 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?