A formant-trajectory model and its usage in comparing coarticulatory effects in Dysarthric and normal speech Xiaochuan Niu and Jan P. H. van Santen Center.

A formant-trajectory model and its usage in comparing coarticulatory effects in Dysarthric and normal speech Xiaochuan Niu and Jan P. H. van Santen Center for Spoken Language Understanding OGI School of Science and Engineering at Oregon Health & Science University, USA MAVEBA 2003 Florence, Italy December 10-12, 2003

What is Dysarthria? Group of speech disorders –Weakness / incoordination of speech muscles –result of damage to the brain or nerves Results in unintelligible speech MAVEBA 2003 Florence, Italy December 10-12, 2003

Long Term Project Goal Long term goal: Speech transformation –Device that works in real time –Not: Amplifier, spectral filter –But: Correct for dynamic articulatory problems Based on a dynamic model of coarticulation Today’s talk: –Test (very simple) model of vowel dynamics MAVEBA 2003 Florence, Italy December 10-12, 2003

Observation: Vowel Formants Median Formants in Vowel Centers [pics] MAVEBA 2003 Florence, Italy December 10-12, 2003

Framework Formant Trajectories –(linear or non-linear) interpolation –between vowel targets Three mechanisms for vowel triangle data: 1.More coarticulation (interpolation too smooth) 2.More random variability 3.Incorrect targets MAVEBA 2003 Florence, Italy December 10-12, 2003

Mechanism 1: Coarticulation Average formants of any given vowel … –… more strongly dependent on … –… the average of the virtual formants … –… of the surrounding consonants MAVEBA 2003 Florence, Italy December 10-12, 2003

Mechanism 2: Random Variability Average formants of any given vowel … –… result of broad distributions that are … –… skewed by the boundaries of vowel space MAVEBA 2003 Florence, Italy December 10-12, 2003

Mechanism 3: Incorrect Targets Average formants of any given vowel … –… result of a tendency to … –… to move articulators in the wrong direction MAVEBA 2003 Florence, Italy December 10-12, 2003

Linear Coarticulation Model MAVEBA 2003 Florence, Italy December 10-12, 2003 3x13x33x13x33x13x3 F (t|p v n) = A pt FpFp + B nt FnFn + (I - A pt - B nt )FvFv

Linear Coarticulation Model MAVEBA 2003 Florence, Italy December 10-12, 2003 3x13x33x13x33x13x3 F (t|p v n) = A pt FpFp + B nt FnFn + (I - A pt - B nt )FvFv Observed formant vector t: Time p: Preceding consonant v: Vowel n: Next consonant

Linear Coarticulation Model MAVEBA 2003 Florence, Italy December 10-12, 2003 3x13x33x13x33x13x3 F (t|p v n) = A pt FpFp + B nt FnFn + (I - A pt - B nt )FvFv Observed formant vector t: Time p: Preceding consonant v: Vowel n: Next consonant Weight Matrices

Linear Coarticulation Model MAVEBA 2003 Florence, Italy December 10-12, 2003 3x13x33x13x33x13x3 F (t|p v n) = A pt FpFp + B nt FnFn + (I - A pt - B nt )FvFv Observed formant vector t: Time p: Preceding consonant v: Vowel n: Next consonant Target Formants Weight Matrices

Linear Coarticulation Model MAVEBA 2003 Florence, Italy December 10-12, 2003 3x13x33x13x33x13x3 F (t|p v n) = A pt FpFp + B nt FnFn + (I - A pt - B nt )FvFv Based on earlier work by Broad, Oehman, Lindblom, Schouten, Pols, Stevens, …

How use for transformation? MAVEBA 2003 Florence, Italy December 10-12, 2003 3x13x33x13x33x13x3 F (t|p v n) = A pt FpFp + B nt FnFn + (I - A pt - B nt )FvFv

How use for transformation? MAVEBA 2003 Florence, Italy December 10-12, 2003 3x13x33x13x33x13x3 F (t|p v n) = A pt FpFp + B nt FnFn + (I - A pt - B nt )FvFv F v = est (I - A pt - B nt ) -1 (F (t|p v n) - A pt F p - B nt F n ) implies

How use for transformation? MAVEBA 2003 Florence, Italy December 10-12, 2003 3x13x33x13x33x13x3 F (t|p v n) = A pt FpFp + B nt FnFn + (I - A pt - B nt )FvFv F v = est (I - A pt - B nt ) -1 (F (t|p v n) - A pt F p - B nt F n ) implies Partial consonant recognition observed

Application I MAVEBA 2003 Florence, Italy December 10-12, 2003 3x13x33x13x33x13x3 F (t|p v n) = A pt FpFp + B nt FnFn + (I - A pt - B nt )FvFv a nt 00 0 0 00 A pt = [] b nt 00 0 0 00 B pt = [] Model F (t|p v n) at vowel midpoints Each token may have different values of A pt and B nt  No assumptions about dependency of weights on time. But: assume synchronicity for formant changes:

Application I: Targets [jj/ll] MAVEBA 2003 Florence, Italy December 10-12, 2003

Application I: Targets [00/09] MAVEBA 2003 Florence, Italy December 10-12, 2003

Application I: Weights [jj/ll] MAVEBA 2003 Florence, Italy December 10-12, 2003

Application I: Weights [00/09] MAVEBA 2003 Florence, Italy December 10-12, 2003

Application II MAVEBA 2003 Florence, Italy December 10-12, 2003 3x13x33x13x33x13x3 F (t|p v n) = A pt FpFp + B nt FnFn + (I - A pt - B nt )FvFv a nt 00 0a’ nt 0 00a” nt A pt = [] b nt 00 0b’ nt 0 00b” nt B pt = [] Model F (t|p v n) at vowel midpoints A pt and B nt same for all tokens.  Assumptions are made about dependency of weights on time. But: no synchronicity for formant changes:

Application I: Targets [00/09] MAVEBA 2003 Florence, Italy December 10-12, 2003

Application II: Weights [00/09] MAVEBA 2003 Florence, Italy December 10-12, 2003

Conclusions Proposed linear model of vowel dynamics –To be used for formant “correction” When used as analytic instrument –Gave meaningful results Strikingly “normal” target values –Without any normalizing bias in the estimation process Clear evidence for enhanced coarticulation MAVEBA 2003 Florence, Italy December 10-12, 2003

A formant-trajectory model and its usage in comparing coarticulatory effects in Dysarthric and normal speech Xiaochuan Niu and Jan P. H. van Santen Center.

Similar presentations

Presentation on theme: "A formant-trajectory model and its usage in comparing coarticulatory effects in Dysarthric and normal speech Xiaochuan Niu and Jan P. H. van Santen Center."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

A formant-trajectory model and its usage in comparing coarticulatory effects in Dysarthric and normal speech Xiaochuan Niu and Jan P. H. van Santen Center.

Similar presentations

Presentation on theme: "A formant-trajectory model and its usage in comparing coarticulatory effects in Dysarthric and normal speech Xiaochuan Niu and Jan P. H. van Santen Center."— Presentation transcript:

Similar presentations

About project

Feedback