“Effect of Genre, Speaker, and Word Class on the Realization of Given and New Information” Julia Agustín Gravano & Julia Hirschberg {agus,

Slides:



Advertisements
Similar presentations
Mini Presentations: How To
Advertisements

Atomatic summarization of voic messages using lexical and prosodic features Koumpis and Renals Presented by Daniel Vassilev.
Specialized models and ranking for coreference resolution Pascal Denis ALPAGE Project Team INRIA Rocquencourt F Le Chesnay, France Jason Baldridge.
Language Use and Understanding BCS 261 LIN 241 PSY 261 CLASS 6: EFFECTS OF DISFLUENCY ON REFERENCE COMPREHENSION.
Prosody Modeling (in Speech) by Julia Hirschberg Presented by Elaine Chew QMUL: ELE021/ELED021/ELEM March 2012.
“Downstepped contours in the given/new distinction” Agustín Gravano Spoken Language Processing Group Columbia University, New York On the Role of Prosody.
Spoken Vs Written Language. Introduction Languages are first spoken, then written, and then an understanding.
Stress.
CS 4705 Discourse Structure and Text Coherence What makes a text/dialogue coherent? Incoherent? “Consider, for example, the difference between passages.
® Towards Using Structural Events To Assess Non-Native Speech Lei Chen, Joel Tetreault, Xiaoming Xi Educational Testing Service (ETS) The 5th Workshop.
English Pronunciation Hilton1 Lecture 5 Lecture 5 (last, but not least) English "Prosody" or Phrasing (Putting It All Together)
Accent and reference resolution (Dahan, Tanenhaus & Chambers, in press) Experiment 1: –Tested hypothesis that de-accented noun is interpreted as referring.
1 Spoken Dialogue Systems Dialogue and Conversational Agents (Part IV) Chapter 19: Draft of May 18, 2005 Speech and Language Processing: An Introduction.
Agustín Gravano 1 · Stefan Benus 2 · Julia Hirschberg 1 Elisa Sneed German 3 · Gregory Ward 3 1 Columbia University 2 Univerzity Konštantína Filozofa.
Comparing American and Palestinian Perceptions of Charisma Using Acoustic-Prosodic and Lexical Analysis Fadi Biadsy, Julia Hirschberg, Andrew Rosenberg,
Accenting, Givenness, and Syntactic Role By E.G. Bard and M.P. Aylett Presented by David Vespe.
Prosodic Cues to Discourse Segment Boundaries in Human-Computer Dialogue SIGDial 2004 Gina-Anne Levow April 30, 2004.
Spoken Language Processing Lab Who we are: Julia Hirschberg, Stefan Benus, Fadi Biadsy, Frank Enos, Agus Gravano, Jackson Liscombe, Sameer Maskey, Andrew.
Corpus 06 Discourse Characteristics. Reasons why discourse studies are not corpus-based: 1. Many discourse features cannot be identified automatically.
Automatic Prosody Labeling Final Presentation Andrew Rosenberg ELEN Speech and Audio Processing and Recognition 4/27/05.
Information Status Varieties of Information Status –Contrast John wanted a poodle but Becky preferred a corgi. –Topic/comment The corgi they bought turned.
1 Noun Homograph Disambiguation Using Local Context in Large Text Corpora Marti A. Hearst Presented by: Heng Ji Mar. 29, 2004.
On the Correlation between Energy and Pitch Accent in Read English Speech Andrew Rosenberg, Julia Hirschberg Columbia University Interspeech /14/06.
On the Correlation between Energy and Pitch Accent in Read English Speech Andrew Rosenberg Weekly Speech Lab Talk 6/27/06.
High Frequency Word Entrainment in Spoken Dialogue ACL, June Columbus, OH Department of Computer and Information Science University of Pennsylvania.
Turn-taking in Mandarin Dialogue: Interactions of Tone and Intonation Gina-Anne Levow University of Chicago October 14, 2005.
Classification of Discourse Functions of Affirmative Words in Spoken Dialogue Julia Agustín Gravano, Stefan Benus, Julia Hirschberg Shira Mitchell, Ilia.
9/5/20051 Acoustic/Prosodic and Lexical Correlates of Charismatic Speech Andrew Rosenberg & Julia Hirschberg Columbia University Interspeech Lisbon.
10/10/20051 Acoustic/Prosodic and Lexical Correlates of Charismatic Speech Andrew Rosenberg & Julia Hirschberg Columbia University 10/10/05 - IBM.
National Curriculum Key Stage 2
Funded by NIH grant RO1 HD-4152 to J. Arnold NSF BCS and NSF BCS to Z. Griffin Why do speakers modulate acoustic prominence? Listener-oriented.
Copyright 2007, Toshiba Corporation. How (not) to Select Your Voice Corpus: Random Selection vs. Phonologically Balanced Tanya Lambert, Norbert Braunschweiler,
PHRASES & CLAUSES AND WHY COMMAS ARE IMPORTANT!. WORD CLASSES Every word in the English language belongs to a “class”. It will be one of the following:
Structural Metadata Annotation of Speech Corpora: Comparing Broadcast News and Broadcast Conversations Jáchym KolářJan Švec University of West Bohemia.
Recognizing Discourse Structure: Speech Discourse & Dialogue CMSC October 11, 2006.
The Games Corpus Design, implementation and annotation Agustín Gravano Spoken Language Processing Group Columbia University.
Imposing native speakers’ prosody on non-native speakers’ utterances: Preliminary studies Kyuchul Yoon Spring 2006 NAELL The Division of English Kyungnam.
What do we mean by Syntax? Unit 6 – Presentation 1 “the order or arrangement of words within a sentence” And what is a ‘sentence’? A group of words that.
Contrast and accent in Dutch and Romanian Marc Swerts Communication & Cognition Tilburg University.
Differences between Spoken and Written Discourse
Lexical, Prosodic, and Syntactics Cues for Dialog Acts.
Sentence Structure By: Amanda Garrett Bailey. What is the function of: Nouns Pronouns Verbs Adjectives Adverbs.
Phrase Definition review. Consists of an appositive and any modifiers the appositive has.
Acoustic Cues to Emotional Speech Julia Hirschberg (joint work with Jennifer Venditti and Jackson Liscombe) Columbia University 26 June 2003.
Cross-Dialectal Data Transferring for Gaussian Mixture Model Training in Arabic Speech Recognition Po-Sen Huang Mark Hasegawa-Johnson University of Illinois.
On the role of context and prosody in the interpretation of ‘okay’ Julia Agustín Gravano, Stefan Benus, Julia Hirschberg Héctor Chávez, and Lauren Wilcox.
SWS PRACTICE Parts of Speech. NOUN The definition of a noun is: A) person, place or thing B) person, place, thing or idea C) person, place, thing, quality.
TRUE or FALSE? Syntax= the order of words in a sentence.
Information Status.
Investigating Pitch Accent Recognition in Non-native Speech
Studying Intonation Julia Hirschberg CS /21/2018.
Meanings of Intonational Contours
Studying Intonation Julia Hirschberg CS /21/2018.
Intonational and Its Meanings
Intonational and Its Meanings
Comparing American and Palestinian Perceptions of Charisma Using Acoustic-Prosodic and Lexical Analysis Fadi Biadsy, Julia Hirschberg, Andrew Rosenberg,
Accenting and Information Status
Accenting and Information Status
Information Structure and Prosody
Meanings of Intonational Contours
Advanced NLP: Speech Research and Technologies
“Downstepped contours in the given/new distinction”
Fadi Biadsy. , Andrew Rosenberg. , Rolf Carlson†, Julia Hirschberg
High Frequency Word Entrainment in Spoken Dialogue
Agustín Gravano & Julia Hirschberg {agus,
Discourse Structure in Generation
Agustín Gravano1 · Stefan Benus2 · Julia Hirschberg1
Parts of Speech II.
Acoustic-Prosodic and Lexical Entrainment in Deceptive Dialogue
Automatic Prosodic Event Detection
Presentation transcript:

“Effect of Genre, Speaker, and Word Class on the Realization of Given and New Information” Julia Agustín Gravano & Julia Hirschberg {agus, Interspeech Pittsburgh, PA Spoken Language Processing Group Columbia University

Agustín Gravano Interspeech Motivation Speakers of American English tend to: accent references to “new” information, and deaccent references to “old” (or “given”) information. (Chafe 1974, Prince 1981 & 1992, inter alia) Variation of prominence in “given” entities is strongly affected by the persistence of: grammatical function (subject, object, etc.) and position in the sentence. (Terken & Hirschberg, 1994)

Agustín Gravano Interspeech Motivation Possible applications: Improve naturalness of TTS systems. Aid ASR. Questions: What are other sources of variation? What is the effect of: speaker? genre? word class?

Agustín Gravano Interspeech Main Results Speakers vary the manner in which they realize differences in information status. Speakers tend to produce “given” verbs with higher intensity than “new” verbs, both in read and spontaneous speech.

Agustín Gravano Interspeech Overview Materials and Methods Corpus Information status Word classes Features Results Nouns Verbs Discussion Conclusions

Agustín Gravano Interspeech Boston Directions Corpus Hirschberg & Nakatani 1996 Spontaneous and read monologues. 9 increasingly complex direction-giving tasks: Describe how to get to MIT from Harvard. Method: 1. Spontaneous speech recorded and transcribed. 2. Speakers returned and read. 4 speakers (3 male, 1 female).

Agustín Gravano Interspeech Boston Directions Corpus Mean length of tasks: Spontaneous:111s Read: 84s Excerpt from the spontaneous part of the corpus: first # enter the Harvard Square T stop # and buy a token # then proceed to get on the # Inbound um Red Line # uh subway [...] Corpus size: Spontaneous:~66m Read:~50m Prosody labeled using the ToBI convention.

Agustín Gravano Interspeech Information Status Prince 1981: Entities are new when first introduced in the discourse. Evoked entities are given. They are already in the discourse. Simple definition: A word w is given if in the current task there is at least one previous occurrence of a word with the same stem. Otherwise, we say that w is new.

Agustín Gravano Interspeech Word Classes Automatically labeled the part-of-speech of all the words in the corpus using the Brill Tagger. Categorized words into: Nouns Verbs Adjectives Adverbs Others Significant results only for Nouns and Verbs.

Agustín Gravano Interspeech Features Word acoustic features, extracted using Praat: Max, mean, min pitch Max, mean, min intensity Pitch and intensity features were also normalized with respect to the mean value of: ± 1 second around the target word, ± 5 words around the target word, the target word’s Intermediate Phrase. Pause before and after the word.

Agustín Gravano Interspeech Results: Nouns READSPON S1S2S3S4S1S2S3S4 Max Pitchg Mean Pitchnng Min Pitchng Max Pitch / Context Mean Pitchnnnn Mean Pitch/ Context Mean Pitchnnnn Min Pitch/ Context Mean Pitchnn Max Intensitynngg Mean Intensitynngg Min Intensityggg Max Int/ Context Mean Intensitynnnn Mean Int/ Context Mean Intensitynnngg Min Int / Context Mean Intensityg Pause Beforennnnn Pause Afterggg n = mean value for the new words is significantly larger than for the given words g = mean value for the given words is significantly larger than for the new words

Agustín Gravano Interspeech Results: Verbs READSPON S1S2S3S4S1S2S3S4 Max Pitchn Mean Pitchggn Min Pitchgn Max Pitch / Context Mean Pitchgn Mean Pitch/ Context Mean Pitchg Min Pitch/ Context Mean Pitchgn Max Intensityggggggg Mean Intensityggggggg Min Intensityggg Max Int/ Context Mean Intensityggggg Mean Int/ Context Mean Intensitygggggggg Min Int / Context Mean Intensitygg PauseBeforegg PauseAfterg n = mean value for the new words is significantly larger than for the given words g = mean value for the given words is significantly larger than for the new words

Agustín Gravano Interspeech Discussion: Variation of intensity in verbs Examples: you get out of the T stop # you cross Massachusetts Avenue [...] you wanna cross Mass Ave opposite that # there's usually a bunch of cabs and and people standing around there # so # then once you've crossed it you're you're in Harvard Yard proper then you're right at the entrance to what is called the Infinite Corridor # and it's called the Infinite Corridor because it's this really long # place you can walk entirely indoors Direct objects of ‘cross’ and ‘call’ are either deaccented or pronominalized in the second and third mentions. With no other salient accented items in their phrases, the given mentions of these verbs are more prominent.

Agustín Gravano Interspeech Discussion: Variation of intensity in verbs Example: so you're going to have to transfer # you transfer by going to Government Center which is inbound The increased intensity of the second mention of ‘transfer’ might be due to the change in its verb form. Similar to Terken & Hirschberg, 1994: Given nouns tend to be accented if they represent a different grammatical function from the first mention.

Agustín Gravano Interspeech Conclusions and Future Work Evidence of: Speaker variation in the way they realize differences in information status. Given verbs tend to be produced with a greater intensity than new verbs. Nouns and verbs behave very differently. Only preliminary results: more work needed. Future Work: Repeat and deepen these analyses on larger corpora of read and spontaneous speech, and in conversation.

“Effect of Genre, Speaker, and Word Class on the Realization of Given and New Information” Julia Agustín Gravano & Julia Hirschberg {agus, Interspeech Pittsburgh, PA Spoken Language Processing Group Columbia University