Acoustic and Physiological Phonetics

Slides:



Advertisements
Similar presentations
Advances in Speech Synthesis
Advertisements

Tom Lentz (slides Ivana Brasileiro)
Normal Aspects of Articulation. Definitions Phonetics Phonology Articulatory phonetics Acoustic phonetics Speech perception Phonemic transcription Phonetic.
Sounds that “move” Diphthongs, glides and liquids.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Basic Spectrogram & Clinical Application: Consonants
Acoustic Characteristics of Consonants
Vowel Formants in a Spectogram Nural Akbayir, Kim Brodziak, Sabuha Erdogan.
Glides (/w/, /j/) & Liquids (/l/, /r/) Degree of Constriction Greater than vowels – P oral slightly greater than P atmos Less than fricatives – P oral.
Physical modeling of speech XV Pacific Voice Conference PVSF-PIXAR Brad Story Dept. of Speech, Language and Hearing Sciences University of Arizona.
From Resonance to Vowels March 8, 2013 Friday Frivolity Some project reports to hand back… Mystery spectrogram reading exercise: solved! We need to plan.
“Connecting the dots” How do articulatory processes “map” onto acoustic processes?
Basic Spectrogram & Clinical Application Lab 9. Spectrographic Features of Vowels n 1st formant carries much information about manner of articulation.
JPN494: Japanese Language and Linguistics JPN543: Advanced Japanese Language and Linguistics Phonology & Phonetics (2)
A two dimensional kinematic mapping between speech acoustics and vocal tract configurations : WISP A.Hatzis, P.D.Green1 History of Vowel.
Acoustic Characteristics of Vowels
Vowels and Tubes (again) March 22, 2011 Today’s Plan Perception experiment! Discuss vowel theory #2: tubes! Then: some thoughts on music. First: let’s.
Coarticulation Analysis of Dysarthric Speech Xiaochuan Niu, advised by Jan van Santen.
Nasal Stops.
Hillenbrand: Vowels1 The Acoustics and Perception of American English Vowels.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Vowel Acoustics, part 2 March 12, 2014 The Master Plan Today: How resonance relates to vowels (= formants) On Friday: In-class transcription exercise.
“Speech and the Hearing-Impaired Child: Theory and Practice” Ch. 13 Vowels and Diphthongs –Vowels are formed when sound produced at the glottal source.
The Human Voice. I. Speech production 1. The vocal organs
ACOUSTICAL THEORY OF SPEECH PRODUCTION
The Human Voice Chapters 15 and 17. Main Vocal Organs Lungs Reservoir and energy source Larynx Vocal folds Cavities: pharynx, nasal, oral Air exits through.
PHYSICAL PROPERTIES OF SPEECH SOUNDS
PH 105 Dr. Cecilia Vogel Lecture 14. OUTLINE  consonants  vowels  vocal folds as sound source  formants  speech spectrograms  singing.
Speech Group INRIA Lorraine
Vowel Acoustics, part 2 November 14, 2012 The Master Plan Acoustics Homeworks are due! Today: Source/Filter Theory On Friday: Transcription of Quantity/More.
Unit 4 Articulation I.The Stops II.The Fricatives III.The Affricates IV.The Nasals.
SPPA 6010 Advanced Speech Science 1 The Source-Filter Theory: The Sound Source.
Vowels Vowels: Articulatory Description (Ferrand, 2001) Tongue Position.
Anatomic Aspects Larynx: Sytem of muscles, cartileges and ligaments.
Nasal Stops. Nasals Distinct vocal tract configuration Pharyngeal cavity Oral cavity (closed) Nasal cavity (open)
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Vowel Transcription.
Diphthongs Traditional Classification. Diphthongs  Diphthongs have a prominent shift in resonance, usually in the form of an off-glide EX: boy, how,
Source/Filter Theory and Vowels February 4, 2010.
CSD 5400 REHABILITATION PROCEDURES FOR THE HARD OF HEARING Auditory Perception of Speech and the Consequences of Hearing Loss.
Speech Production1 Articulation and Resonance Vocal tract as resonating body and sound source. Acoustic theory of vowel production.
Resonance, Revisited March 4, 2013 Leading Off… Project report #3 is due! Course Project #4 guidelines to hand out. Today: Resonance Before we get into.
Vowels, part 4 March 19, 2014 Just So You Know Today: Source-Filter Theory For Friday: vowel transcription! Turkish, British English and New Zealand.
Acoustic Phonetics 3/9/00. Acoustic Theory of Speech Production Modeling the vocal tract –Modeling= the construction of some replica of the actual physical.
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
Speech Science Fall 2009 Oct 26, Consonants Resonant Consonants They are produced in a similar way as vowels i.e., filtering the complex wave produced.
Vowel Acoustics November 2, 2012 Some Announcements Mid-terms will be back on Monday… Today: more resonance + the acoustics of vowels Also on Monday:
Speech Science Fall 2009 Oct 28, Outline Acoustical characteristics of Nasal Speech Sounds Stop Consonants Fricatives Affricates.
♥♥♥♥ 1. Intro. 2. VTS Var.. 3. Method 4. Results 5. Concl. ♠♠ ◄◄ ►► 1/181. Intro.2. VTS Var..3. Method4. Results5. Concl ♠♠◄◄►► IIT Bombay NCC 2011 : 17.
Formants, Resonance, and Deriving Schwa March 10, 2009.
Speech Science VI Resonances WS Resonances Reading: Borden, Harris & Raphael, p Kentp Pompino-Marschallp Reetzp
Resonance October 23, 2014 Leading Off… Don’t forget: Korean stops homework is due on Tuesday! Also new: mystery spectrograms! Today: Resonance Before.
From Resonance to Vowels March 13, 2012 Fun Stuff (= tracheotomy) Peter Ladefoged: “To record the pressure of the air associated with stressed as opposed.
Introduction to Language Phonetics 1. Explore the relationship between sound and spelling Become familiar with International Phonetic Alphabet (IPA )
Sonorant Acoustics + Place Transitions
Stop + Approximant Acoustics
SPPA 4030 Speech Science Stephen M. Tasko Ph.D. CCC-SLP Western Michigan University.
The Speech Chain (Denes & Pinson, 1993)
Vowels 2 LIN Some reminders about vowels… 1. Vowels are described in three basic parameters:  Tongue height: [i] vs. [a] (high vs. low /close.
Acoustic Phonetics 3/14/00.
Phonetics: A lecture Raung-fu Chung Southern Taiwan University
Introduction to English Pronunciation
The Human Voice. 1. The vocal organs
Vowels and Consonant Serikova Aigerim.
The Human Voice. 1. The vocal organs
English Phonetics and Phonology
The Acoustics and Perception of American English Vowels
Phonetics.
The Production of Speech
Evolution of human vocal production
Presentation transcript:

Acoustic and Physiological Phonetics Vowel Production and Perception Stephen M. Tasko

Learning Objectives Review source-filter theory and how it relates to vowel production Distinguish between source spectrum, transfer function and output spectrum. Calculate formant/resonant frequencies of a uniform tube based on its physical dimensions. Describe how the area function of an acoustic resonator is determined. Distinguish between and describe relation between area function and transfer function. Stephen M. Tasko

Source Filter Theory Speech (What We Hear) Source (Phonation) Filter (Resonator) Frequency Response Curve (Transfer Function) Input Spectrum Output Spectrum Stephen M. Tasko 3

Same Source, Different Filter Stephen M. Tasko 4

Frequency response curve/Transfer Function FRC peaks – resonant or formant frequency Tube resonators have an infinite number of formants F1, F2, F3 … denotes formants from low to high frequency F1 F2 F3 F4 Stephen M. Tasko

Vocal tract as a tube Tubes have physical characteristics (shapes) Tubes act as acoustic resonators Acoustic resonators have frequency response curves (FRC), also known as ‘transfer functions’ Tube shape dictates the frequency response curve. Stephen M. Tasko

The vocal tract shape during vowel production Can be (roughly) uniform in shape The vocal tract is fairly uniform in its cross-sectional diameter for neutral or central vowel (schwa) Can also be take on non-uniform shapes Are observed for non-neutral vowels Have a more complex geometry Does not allow simple calculations of formants Formant values are derived from the vocal tract area function Stephen M. Tasko

Vocal tract as a tube Straight tube, closed at one end, with a uniform cross-sectional diameter Straight tube, closed at one end, of differing cross-sectional diameter Vocal tract: bent tube, closed at one end, with differing Cross-sectional diameter. Stephen M. Tasko

What is an area function? … Area (cm2) Length along tube (cm) Stephen M. Tasko

Area function of a uniform tube Area function dictates the frequency response curve for that tube Area (cm2) Length along tube (cm) Stephen M. Tasko

Vocal Tract Area Function Stephen M. Tasko

Vocal Tract Area Function Stephen M. Tasko

Relationship between vocal tract shape, the area function and the frequency response curve FRC Stephen M. Tasko

Key points Vocal Tract has a variable shape, therefore It is a variable resonator Can have a variety of area functions Can generate a variety of frequency response curves A given area function can lead to one (and only one) frequency response curve A given frequency response curve and arise due to a variety of different area functions Stephen M. Tasko

Learning Objectives Describe the basic shape of the area function for the four corner vowels. Describe F1-F2 relations for English vowels with specific emphasis of the corner vowels Draw and recognize (1) wide band spectrograms, (2) spectrum envelopes, and (3) frequency response curves for the corner vowels Draw and interpret various plots that relate formants values for English vowels. Outline our basic tongue and lip rules for predicting formant shifts from the neutral position. Stephen M. Tasko

Vowels: Articulatory Description Stephen M. Tasko

Vowels: Articulatory Description Degree of lip rounding Rounded Unrounded Degree of tension Tense Lax Stephen M. Tasko

“Neutral” Configuration Vocal Tract Area Function Articulatory Configuration/ Vocal Tract Shape Frequency Response Curve Stephen M. Tasko

Low back vowel Stephen M. Tasko Vocal Tract Area Function Articulatory Configuration/ Vocal Tract Shape Frequency Response Curve Stephen M. Tasko

High back rounded vowel Vocal Tract Area Function Articulatory Configuration/ Vocal Tract Shape Frequency Response Curve Stephen M. Tasko

Low front vowel Stephen M. Tasko Vocal Tract Area Function Articulatory Configuration/ Vocal Tract Shape Frequency Response Curve Stephen M. Tasko

Relationship between vocal tract shape, the area function and the frequency response curve Vocal Tract Area Function Articulatory Configuration/ Vocal Tract Shape Frequency Response Curve Stephen M. Tasko

What distinguishes vowels in production and perception? Resonant (formant) Frequency F1, F2 frequency are particularly important F3 frequency plays a smaller role Landmark study: Peterson and Barney (1952) Stephen M. Tasko

Vowels: Spectrographic Patterns Stephen M. Tasko

Vowels: Frequency Response Curve Patterns Mid Central vowel F1: 500 Hz F2: 1500 Hz /i/ Gain /u/ // // Stephen M. Tasko

Observations /i/ & /u/ have a low F1 // & // have high F1 Tongue height ~ F1 Tongue height  F1  Tongue height  F1  /u/ & // have low F2 /i/ & // have high F2 Tongue advancement ~ F2 Tongue front F2  Tongue back F2  Stephen M. Tasko

Learning Objectives Outline the key assumptions and parameters of the Stevens & House (SH) articulatory model of vowel production. Describe the acoustic consequences of changing SH model parameters. Provide acoustic explanations for how (1) the SH model parameters influence area function and (2) how these area function changes influence acoustic (i.e. formant values) Compare the shape of the vowel quadrilateral and the F1-F2 plot Stephen M. Tasko

How do articulatory processes “map” onto acoustic processes? “Connecting the dots” How do articulatory processes “map” onto acoustic processes? Stephen M. Tasko

3-parameter model (Stevens & House, 1955) Model assumes No coupling with Nasal cavity trachea & pulmonary system Stephen M. Tasko

3-parameter model (Stevens & House, 1955) Model parameters Distance of major constriction from glottis (d0) Radius of major constriction (r0) Area (A) and length (l) of lip constriction A/l conductivity index Stephen M. Tasko

3-parameter model (Stevens & House, 1955) Stephen M. Tasko

Key Goal of Study Evaluate the effect of systematically changing each of these three “vocal tract” parameters on F1-F3 frequency Stephen M. Tasko

General Observations Stephen M. Tasko

General Observations Stephen M. Tasko

General Observations Stephen M. Tasko

Interpretation: Double Helmholtz Resonator Model Back Cavity Volume influences F1 Larger volume = lower F1 Smaller volume=higher F1 Front Cavity Volume influence F2 Larger volume= lower F2 Smaller volume=higher F2 Radius of Conduit (r0) influences F1 Larger radius = higher F1 Smaller radius=smaller F1 Back Cavity Front Cavity Major Constriction (ro) Stephen M. Tasko

Interpretations ∆ d0 = ∆ Vfront & Vback ↑ d0 = ↓ Vfront = ↑ F2 ↑ d0 = ↑ Vback = ↓ F1 Stephen M. Tasko

Interpretations ↓ r0 = ↓ F1 ↑ r0 = ↑ F1 ↑ lip rounding = ↓ A/l When d0 ↑ (anterior) ↓ r0 = ↓ Vfront = ↑ F2 ↑ lip rounding = ↓ A/l = ↓ F1 & F2 Stephen M. Tasko

Another way to look at the data (Minifie, 1974) Stephen M. Tasko

Articulatory Acoustic Comparisons Traditional F1-F2 Plot F1-F2 Plot adjusted to reflect ‘articulatory’ space r0 d0 - + Stephen M. Tasko

Learning Objectives Provide an explanation for why we treat women’s, men’s and children’s vowels as equivalent even though absolute values of formants differ a lot. Stephen M. Tasko

Stephen M. Tasko

“normalizing” formant values Stephen M. Tasko

Clinical Example Stephen M. Tasko

Acoustic variables related to the perception of vowel quality F1 and F2 Other formants (i.e. F3) Fundamental frequency (F0) Duration Spectral dynamics i.e. formant change over time Stephen M. Tasko

How helpful is F1 & F2? Data Source Human Listeners Pattern Classifier Peterson & Barney (1952) 94.4 % 74.9 % Hillenbrand et al. (1995) 95.2 % 68.2 % From Hillenbrand & Gayvert (1993) Stephen M. Tasko

How does adding more variables improve pattern classifier success? F1, F2 + F3 80-85 % F1, F2 + F0 F1, F2 + F3 + F0 89-90 % Stephen M. Tasko

Nearby vowels have different durations How about Duration? Nearby vowels have different durations Stephen M. Tasko

Stephen M. Tasko

What about Duration? Stephen M. Tasko

What about Duration? Some examples Stephen M. Tasko

What about formant variation? Stephen M. Tasko

What about formant variation? Stephen M. Tasko

What about formant variation? Naturally spoken /hAd/ Synthesized, preserving original formant contours Synthesized with flattened formants Stephen M. Tasko

What about formant variation? Conclusion: Spectral change patterns do matter. Stephen M. Tasko

What do we conclude? Stephen M. Tasko

Sinewave Speech Demonstration Sinewave speech examples (from HINT sentence intelligibility test): Stephen M. Tasko

Selected issues that are not resolved What do listener’s use? Specific formants vs. spectrum envelope What is the “planning space” used by speakers? Articulatory Acoustic Auditory Stephen M. Tasko

The important role of movement Articulatory movement = spectral change Spectral change occurs as speakers transition within and between sound sequences Spectral change plays a significant role in Perception of certain speech sounds Overall speech intelligibility Stephen M. Tasko

Diphthongs Slow gliding (~ 350 msec) between two vowel qualities Components Onglide- starting point of articulation Offglide- end point of articulation Articulatory Transition = formant transition Diphthongization: articulatory movement within the vowel Varies by geographic region Stephen M. Tasko

American English Diphthongs // - “bye” // - “bough” // - “boy” // - “bay” // - “bow” Stephen M. Tasko