Control of prosodic features under perturbation in collaboration with Frank Guenther Dept. of Cognitive and Neural Systems, BU Carrie Niziolek [carrien]


Control of prosodic features under perturbation. Carrie Niziolek [carrien], in collaboration with Frank Guenther, Dept. of Cognitive and Neural Systems, BU. 14 Sept 2005

Introduction. Speech prosody: patterns of intonation and vocal stress. In English, prosody conveys non-lexical information, such as emphasis. The pattern of F0, duration, and intensity affects the interpretation of phrases. How are prosodic cues controlled? What are the neural bases of speech segmentation?

Purpose of study: to understand feedback-based control of phrase-level prosody. Are the prosodic features (F0, intensity, duration) integrated, or are they controlled independently?

Motivation. Prosody conveys differences in both linguistic and affective state. It is not subservient to speech segments, but is the “scaffolding” that holds different levels of phonetic description together.

F0 as a stress indicator. Observation: it is possible to characterize stressed syllables in terms of prosodic features. They have longer duration and greater intensity than unstressed syllables, and a (somewhat) predictable pitch contour. [Figure: F0 plotted against time (t) for “BOB caught a dog”, with a threshold line.]

F0 as a stress indicator. [Figure: F0 contours for “BOB caught a dog” and “Bob CAUGHT a dog”; F0 ≈ 160 Hz is marked.]

Methodology. Flatten the F0 curve by shifting down all F0 values above the threshold value, making the syllable sound less stressed. [Figure: F0 plotted against time (t) for “BOB caught a dog” and for “Bob caught a dog”.]
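The flattening step described on this slide can be sketched roughly as follows. This is a minimal illustration, not the study's actual signal-processing code: the function name `flatten_f0`, the `strength` parameter, and the choice to pull each above-threshold value toward the threshold are all assumptions made for the example. The slide only says that F0 values above the threshold are shifted down.

```python
from typing import List, Optional


def flatten_f0(
    contour: List[Optional[float]],
    threshold_hz: float,
    strength: float = 1.0,
) -> List[Optional[float]]:
    """Shift down every F0 value that lies above the threshold.

    contour      -- per-frame F0 estimates in Hz; None marks an unvoiced frame
    threshold_hz -- values at or below this are left untouched
    strength     -- hypothetical knob: 1.0 pulls values all the way down to
                    the threshold, 0.0 leaves the contour unchanged
    """
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must be between 0 and 1")

    flattened: List[Optional[float]] = []
    for f0 in contour:
        if f0 is None or f0 <= threshold_hz:
            # Unvoiced frames and frames below the threshold pass through.
            flattened.append(f0)
        else:
            # Remove a fraction of the excess above the threshold.
            flattened.append(f0 - strength * (f0 - threshold_hz))
    return flattened
```

With `strength=1.0` the stressed peak is clipped flat at the threshold; smaller values give a partial shift, which is one way a gradual ramp of the perturbation could be expressed.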

Experimental question: do subjects respond by compensating for such a perturbation, increasing the F0 on the syllable they want to stress? Are other compensations also evident? The aim is to determine the degree to which prosodic aspects of speech are controlled in an integrated or independent manner.

Who caught a dog? BOB caught a dog. What did Bill do to a kid? Bill BIT a kid.

Method. Present modulated feedback in real time: track and shift pitch. Compare the subject’s output F0 in baseline and perturbed conditions. [Figure: shift amount plotted against trial number across four phases: baseline, ramp, full-pert, post-pert.]
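The four-phase trial structure on this slide (baseline, ramp, full-pert, post-pert) can be sketched as a per-trial schedule of shift amounts. The slide gives no trial counts and does not say how the ramp is shaped, so the function name `shift_schedule`, its parameters, and the linear ramp are assumptions for illustration only.

```python
from typing import List


def shift_schedule(
    n_baseline: int,
    n_ramp: int,
    n_full: int,
    n_post: int,
    max_shift: float = 1.0,
) -> List[float]:
    """Return the shift amount applied on each trial.

    Phases, in order:
      baseline  -- no shift
      ramp      -- shift grows linearly toward max_shift (assumed shape)
      full-pert -- shift held at max_shift
      post-pert -- shift removed again
    """
    baseline = [0.0] * n_baseline
    # Ramp trials are spaced evenly between 0 and max_shift, exclusive,
    # so the first full-pert trial is the first one at full strength.
    ramp = [max_shift * i / (n_ramp + 1) for i in range(1, n_ramp + 1)]
    full = [max_shift] * n_full
    post = [0.0] * n_post
    return baseline + ramp + full + post
```

Each value could then scale the perturbation applied on that trial, with 0.0 meaning unaltered feedback and `max_shift` meaning the full perturbation.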

Normal and pitch-shifted speech “Bill BIT a kid”

Results: pitch. Averaging across all syllable positions, peak F0 is higher during perturbation than during the baseline condition (before and after perturbation), indicating slight compensation. Separating by stress position suggests that the effect may be larger for second-syllable stress.
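The comparison described here, peak F0 during perturbation against peak F0 in the unperturbed trials before and after it, could be computed along these lines. This is a sketch under assumptions: the function name, the phase labels used as dictionary keys, and the use of a simple difference of means are not taken from the study, and no real data is shown.

```python
from statistics import mean
from typing import Dict, List


def peak_f0_compensation(peaks_by_phase: Dict[str, List[float]]) -> float:
    """Mean peak F0 under full perturbation minus mean peak F0 without it.

    peaks_by_phase maps a phase label to the per-trial peak F0 values (Hz).
    The unperturbed reference pools the "baseline" and "post-pert" trials,
    following the slide's "before and after perturbation" wording.
    A positive result means the speaker raised peak F0 under perturbation.
    """
    reference = peaks_by_phase["baseline"] + peaks_by_phase["post-pert"]
    return mean(peaks_by_phase["full-pert"]) - mean(reference)
```

The same function could be applied separately to trials grouped by stress position to look at the per-position effect mentioned on the slide.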

Results: mean-energy intensity. Average mean-energy intensity is slightly higher during the full-pert condition than during baseline. Significant? (This could be a function of F0, not a separate phenomenon.)

Future work. Subjects needed! Continued analysis of F0 and intensity data. Brain imaging: determine what structures and neural circuits are responsible for this prosodic control. Model simulations: after incorporating mechanisms for prosodic control into the model, compare DIVA with human psychophysical data.

Summary. What are the mechanisms responsible for the control of prosody? Auditory perturbation study: prosody was manipulated and presented as feedback. Some degree of compensation in pitch occurs during the perturbation. No evidence for any adaptation effects.

Thanks: Frank Guenther, BU; Kevin Reilly, BU; Rupal Patel, Northeastern.

References: 1. Streeter et al. (1983). Acoustic and perceptual indicators of emotional stress. J. Ac. Soc. Am. 73. 2. Pierrehumbert, J.B. (1980). The Phonology and Phonetics of English Intonation. MIT PhD dissertation, Cambridge, MA.