Spoken Language Analysis Dept. of General & Comparative Linguistics Christian-Albrechts-Universität zu Kiel 20.08.2011 Oliver Niebuhr 1 At the Segment-Prosody.

Presentation transcript:

Spoken Language Analysis, Dept. of General & Comparative Linguistics, Christian-Albrechts-Universität zu Kiel, Oliver Niebuhr
Slide 1
At the Segment-Prosody Divide: The Interplay of Intonation, Sibilant Pitch and Sibilant Assimilation
Oliver Niebuhr, Cassandra Lill & Jessica Neuschulz
17th International Congress of Phonetic Sciences, Hong Kong, China
Oral Session on Sibilant Sounds, August 20th, 2011, Room S223

Slide 2: Introduction
The presented study is part of a line of research on "intonation segments" and "segmental intonations".
The pitch curves of utterances are not created by F0 alone. By changing their sound qualities, sound segments can also create different spectral pitches (sibilant pitch, intrinsic pitch of vowels, ...).
This raises several questions:
- Are the pitch impressions caused by sound segments adjusted to the intonation context?
- What kinds of sounds are concerned?
- Which intonation contexts trigger spectral pitch adjustments?
- Why does the adjustment occur? (E.g., in order to fill voiceless gaps so that the utterance tune is perceived as "subjectively continuous"? cf. Jones 1909:275)
These questions have been widely neglected, even though Daniel Jones (1950) already noted that different "voice pitch contexts" create allophonic variation.

Slide 3: Introduction
So far, studies on German have focused on utterance-final sound segments in L% and H% intonation contexts (Niebuhr 2008, Niebuhr 2009).
Significant findings:
- Different diphthong dynamics in closing diphthongs of German: shorter onset and longer transition in H% than in L% contexts.
- / / and / / are more open, fronted and diphthongized in H% than in L% contexts. The same is true for vocalized (= [ɐ]) endings.
- Fricatives have more high-frequency energy in H% than in L% contexts, e.g. "Fisch" /ʃ/, fish; "Buch" /x/, book.
Question here: Does this also happen utterance-medially? Tested with sibilant sequences.

Slide 4: Method
Acoustic analyses were based on the 'KIESEL' corpus (Kieler Sammlung Expressiver Lesesprache, Kiel Collection of Expressive Read Speech).
2x2 sentence mode and emphasis conditions:
  SN = Neutral Statement    SE = Emphatic Statement
  QN = Neutral Question     QE = Emphatic Question
The same 12 sentences with simple SVO structure were used in each condition; O = target word pair of function word + noun (in singular; with nuclear pitch accent).
This created 6x2 different sibilant sequence conditions across word boundaries. E.g.,
- /sʃ/ assimilation condition: "aus Schweden", from Sweden; "als Spender", as a donor
- /sz/ non-assimilation condition: "als Sänger", as a singer; "bis Sachsen", to Saxony

Slide 5: Method
Sentences of 8 female speakers were analyzed: 12 x 8 x 4 = 384 sentences; 48 /sʃ/ and 48 /sz/ tokens in each of the conditions SN, SE, QN, QE.
Crucial point: the /sʃ/ and /sz/ sequences occurred in very different pitch contexts:
- high pitch (H*) in statements, low pitch (L*) in questions
- under emphasis, the pitch level increases further from SN to SE and decreases further from QN to QE
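The token counts on this slide follow from crossing the design factors. A minimal Python sketch of that arithmetic, assuming that the "6x2" sibilant sequence conditions mean 6 sentences per sequence type; the names `SENTENCES_PER_SEQUENCE`, `SPEAKERS` and `count_tokens` are illustrative and not taken from the study:

```python
from itertools import product

# Assumption: 6 sentences with an /s/ + "sh" sequence and 6 with /sz/ (12 in total).
SENTENCES_PER_SEQUENCE = {"s+sh": 6, "s+z": 6}
SPEAKERS = 8
CONDITIONS = ["SN", "SE", "QN", "QE"]  # statement/question x neutral/emphatic


def count_tokens():
    """Return the number of tokens per (condition, sequence type) cell."""
    counts = {}
    for cond, (seq, n_sentences) in product(CONDITIONS, SENTENCES_PER_SEQUENCE.items()):
        # Every speaker reads every sentence once in every condition.
        counts[(cond, seq)] = n_sentences * SPEAKERS
    return counts


if __name__ == "__main__":
    cells = count_tokens()
    print("tokens per cell:", cells)
    print("total sentences:", sum(cells.values()))
```

Each of the eight cells holds 6 x 8 tokens, and the cells sum to the 12 x 8 x 4 sentences reported above.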

Slide 6: Method
Sentences of 8 female speakers were analyzed: 12 x 8 x 4 = 384 sentences; 48 /sʃ/ and 48 /sz/ tokens in each of the conditions SN, SE, QN, QE.
Crucial questions:
(1) Is sibilant pitch adjusted to these different intonation/pitch contexts? If so, the sibilant pitches of /sʃ/ and /sz/ will decrease in the following order: SE > SN > QN > QE.
(2) In the case of /sʃ/: Is regressive /s/-to-[ʃʷ] (i.e. light-to-dark noise) assimilation involved in this sibilant-pitch adjustment?
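The prediction in question (1) is a strict ordering of the four conditions, which can be checked mechanically once per-condition means are available. A small sketch under that reading; the function name `follows_predicted_order` and the numbers in the usage example are invented for illustration and are not results from the study:

```python
# Predicted ranking of sibilant pitch, from highest to lowest.
PREDICTED_ORDER = ["SE", "SN", "QN", "QE"]


def follows_predicted_order(mean_by_condition):
    """True if the means strictly decrease along SE > SN > QN > QE."""
    values = [mean_by_condition[cond] for cond in PREDICTED_ORDER]
    return all(a > b for a, b in zip(values, values[1:]))


if __name__ == "__main__":
    # Hypothetical mean values in Hz, chosen only to exercise the check.
    hypothetical_means = {"SE": 6200.0, "SN": 5900.0, "QN": 5400.0, "QE": 5100.0}
    print(follows_predicted_order(hypothetical_means))
```

A check of this kind only describes the ranking of the means; whether the differences are reliable is a separate question for the statistical tests.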

Slide 7: Method
Measurements:
- Spectral centre-of-gravity (CoG) values, calculated in 10 ms intervals across each sibilant sequence. Based on these values: one mean CoG and one CoG range (max CoG minus min CoG) for each sibilant sequence. Mean CoG is an acoustic measure, but it is closely related to the perceived sibilant pitch impression. The CoG range was used to estimate the variation of sibilant pitch within each sequence.
- Durations of the sibilant sequences
- F0 values of the H* peaks and L* valleys
Statistical tests: univariate ANOVAs with 3 fixed factors, (1) question vs statement, (2) /sʃ/ vs /sz/, (3) neutral vs emphatic, for the mean CoG, duration, and F0 measurements.
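The CoG measurements described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' analysis script: the slide does not say how the spectrum was weighted, so the sketch assumes a power-weighted centre of gravity (the sum of frequency times power, divided by the sum of power), uses non-overlapping rectangular 10 ms frames, and computes a naive DFT so that it needs only the standard library. The names `frame_cog`, `cog_track` and `summarize` are hypothetical.

```python
import cmath
import math


def frame_cog(frame, sample_rate):
    """Power-weighted spectral centre of gravity (Hz) of one frame."""
    n = len(frame)
    weighted = 0.0
    total = 0.0
    for k in range(n // 2 + 1):  # bins from 0 Hz up to the Nyquist frequency
        # Naive DFT of bin k (slow, but dependency-free and fine for 10 ms frames).
        x_k = sum(s * cmath.exp(-2j * math.pi * k * i / n) for i, s in enumerate(frame))
        power = abs(x_k) ** 2
        weighted += (k * sample_rate / n) * power
        total += power
    return weighted / total if total > 0 else float("nan")


def cog_track(samples, sample_rate, frame_ms=10):
    """CoG values in consecutive, non-overlapping frames of frame_ms milliseconds."""
    frame_len = int(sample_rate * frame_ms / 1000)
    return [
        frame_cog(samples[start:start + frame_len], sample_rate)
        for start in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def summarize(track):
    """Return (mean CoG, CoG range) for one sibilant sequence."""
    return sum(track) / len(track), max(track) - min(track)


if __name__ == "__main__":
    sr = 16000
    # Synthetic stand-in for a sibilant sequence: 30 ms at 6 kHz, then 30 ms at 3 kHz.
    signal = [math.sin(2 * math.pi * 6000 * i / sr) for i in range(480)]
    signal += [math.sin(2 * math.pi * 3000 * i / sr) for i in range(480)]
    print(summarize(cog_track(signal, sr)))
```

With real recordings one would normally use an FFT routine and a window function; the naive DFT here only keeps the example self-contained.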

Slide 8: Results
Intonation, pitch accent:
- H* F0 peaks were higher than L* F0 valleys (p < 0.001)
- Emphasis increased H* peaks and lowered L* valleys (p < 0.001)
- No significant effect of type of sibilant sequence on F0, but...
Mean CoGs were:
- clearly higher in H* than in L* contexts (p < 0.001)
- higher for entirely alveolar /sz/ than for /sʃ/ sequences (p < 0.001)
Significant interactions show that:
- emphasis increases mean CoGs in statements, but decreases mean CoGs in questions: SE > SN > QN > QE
- pitch context effects are stronger for /sʃ/ than for /sz/ sequences

Slide 9: Results
The mean CoG findings of slide 8 are repeated here alongside figures for the H* and L* contexts. Surviving panel labels: /sʃ/ assimilation condition, SE vs QE and SN vs QN; /sz/ non-assimilation condition, SN vs QN.

Slide 10: Results
Sibilant-sequence durations:
- Sibilant sequences became longer under emphasis (p < 0.001)
- Sibilant sequences were shorter in questions than in statements (p < 0.001)
- Most importantly, the /sʃ/ sequences were not shorter than the /sz/ sequences.
CoG ranges became successively smaller across SE, SN, QN, QE (χ² = 3.8; p < 0.05).
The /sʃ/ sequences with low mean CoGs became spectrally as stable/homogeneous as /sz/, and yet remained equally long. This must be due to greater /s/-to-[ʃ] assimilation of /sʃ/, since /s/ elision would have shortened the sequences.

Slide 11: Conclusions
The spectral characteristics of the /sʃ/ and /sz/ sequences varied systematically and in parallel with the F0 contexts provided by the H* and L* pitch accents.
If the mean CoGs are taken as sibilant-pitch estimates, we may conclude that the pitch impressions caused by the sibilant sequences are adjusted to the F0 (i.e. intonation) context.
The adjustment of sibilant pitch to intonation pitch was stronger for /sʃ/ than for /sz/. German /ʃ/ has lip rounding and can be derounded; together with shape and place of articulation, this gives it an inherently greater potential to vary sibilant pitch.
Compared with /sz/, the durational and spectral measurements indicate that regressive /s/-to-[ʃ] assimilation was used as an additional instrument to vary the sibilant pitch created by /sʃ/ sequences.

Slide 12: Conclusions
The spectral pitch of speech sounds is adjusted to the intonation not only in the context of different utterance-final boundary tones (L% vs H%), but also utterance-medially in the context of different H* and L* pitch accents.
Altogether, this means for research on intonation and sound segments

Slide 13: Thank you for your attention