Structure of Human Speech Chris Darwin Vocal Tract.

Slides:



Advertisements
Similar presentations
Tom Lentz (slides Ivana Brasileiro)
Advertisements

Chapter 12: Speech and Music Perception
CS 551/651: Structure of Spoken Language Spectrogram Reading: Approximants John-Paul Hosom Fall 2010.
Sounds that “move” Diphthongs, glides and liquids.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Basic Spectrogram & Clinical Application: Consonants
Acoustic Characteristics of Consonants
Speech Perception Dynamics of Speech
Vowel Formants in a Spectogram Nural Akbayir, Kim Brodziak, Sabuha Erdogan.
Glides (/w/, /j/) & Liquids (/l/, /r/) Degree of Constriction Greater than vowels – P oral slightly greater than P atmos Less than fricatives – P oral.
Human Speech Recognition Julia Hirschberg CS4706 (thanks to John-Paul Hosum for some slides)
From Resonance to Vowels March 8, 2013 Friday Frivolity Some project reports to hand back… Mystery spectrogram reading exercise: solved! We need to plan.
JPN494: Japanese Language and Linguistics JPN543: Advanced Japanese Language and Linguistics Phonology & Phonetics (2)
Speech Science XII Speech Perception (acoustic cues) Version
“Speech and the Hearing-Impaired Child: Theory and Practice” Ch. 13 Vowels and Diphthongs –Vowels are formed when sound produced at the glottal source.
Basic Spectrogram Lab 8. Spectrograms §Spectrograph: Produces visible patterns of acoustic energy called spectrograms §Spectrographic Analysis: l Acoustic.
The Human Voice. I. Speech production 1. The vocal organs
Speech Perception Overview of Questions Can computers perceive speech as well as humans? Does each word that we hear have a unique pattern associated.
The Human Voice Chapters 15 and 17. Main Vocal Organs Lungs Reservoir and energy source Larynx Vocal folds Cavities: pharynx, nasal, oral Air exits through.
Introduction to Acoustics Words contain sequences of sounds Each sound (phone) is produced by sending signals from the brain to the vocal articulators.
PH 105 Dr. Cecilia Vogel Lecture 14. OUTLINE  consonants  vowels  vocal folds as sound source  formants  speech spectrograms  singing.
PHYS 103 lecture 29 voice acoustics. Vocal anatomy Air flow through vocal folds produces “buzzing” (like lips) Frequency is determined by thickness (mass)
Vowel Acoustics, part 2 November 14, 2012 The Master Plan Acoustics Homeworks are due! Today: Source/Filter Theory On Friday: Transcription of Quantity/More.
Unit 4 Articulation I.The Stops II.The Fricatives III.The Affricates IV.The Nasals.
Chapter 6 Features PHONOLOGY (Lane 335).
PSY 369: Psycholinguistics
SPEECH PERCEPTION The Speech Stimulus Perceiving Phonemes Top-Down Processing Is Speech Special?
Hearing & Deafness (5) Timbre, Music & Speech Vocal Tract.
Introduction to Speech Production Lecture 1. Phonetics and Phonology Phonetics: The physical manifestation of language in sound waves. –How sounds are.
What is Phonetics? Short answer: The study of speech sounds in all their aspects. Phonetics is about describing speech. (Note: phonetics ¹ phonics) Phonetic.
Hearing & Deafness (5) Timbre, Music & Speech.
Linguistics I Chapter 4 The Sounds of Language.
Source/Filter Theory and Vowels February 4, 2010.
Speech Perception. Phoneme - a basic unit of a speech sound that distinguishes one word from another Phonemes do not have meaning on their own but they.
Phonetics: the generation of speech Phonemes “The shortest segment of speech that, if changed, would change the meaning of a word.” hog fog log *Phonemes.
Speech Production1 Articulation and Resonance Vocal tract as resonating body and sound source. Acoustic theory of vowel production.
Vowels, part 4 March 19, 2014 Just So You Know Today: Source-Filter Theory For Friday: vowel transcription! Turkish, British English and New Zealand.
Acoustic Phonetics 3/9/00. Acoustic Theory of Speech Production Modeling the vocal tract –Modeling= the construction of some replica of the actual physical.
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
Speech Science Fall 2009 Oct 26, Consonants Resonant Consonants They are produced in a similar way as vowels i.e., filtering the complex wave produced.
Speech Science VII Acoustic Structure of Speech Sounds WS
Vowel Acoustics November 2, 2012 Some Announcements Mid-terms will be back on Monday… Today: more resonance + the acoustics of vowels Also on Monday:
ECE 598: The Speech Chain Lecture 7: Fourier Transform; Speech Sources and Filters.
LING 001 Introduction to Linguistics Fall 2010 Sound Structure I: Phonetics Acoustic phonetics Jan. 27.
Speech Or can you hear me now?. Linguistic Parts of Speech Phone Phone Basic unit of speech sound Basic unit of speech sound Phoneme Phoneme Phone to.
Speech Science Fall 2009 Oct 28, Outline Acoustical characteristics of Nasal Speech Sounds Stop Consonants Fricatives Affricates.
Voice Quality + Stop Acoustics
Say “blink” For each segment (phoneme) write a script using terms of the basic articulators that will say “blink.” Consider breathing, voicing, and controlling.
Glides, Place and Perception March 18, 2010 News The hard drive on the computer has been fixed! A couple of new readings have been posted to the course.
Speech Science VI Resonances WS Resonances Reading: Borden, Harris & Raphael, p Kentp Pompino-Marschallp Reetzp
Sounds and speech perception Productivity of language Speech sounds Speech perception Integration of information.
Stops Stops include / p, b, t, d, k, g/ (and glottal stop)
Introduction to Language Phonetics 1. Explore the relationship between sound and spelling Become familiar with International Phonetic Alphabet (IPA )
Chapter 13: Speech Perception. The Acoustic Signal Produced by air that is pushed up from the lungs through the vocal cords and into the vocal tract Vowels.
Introduction to Digital Speech Processing Presented by Dr. Allam Mousa 1 An Najah National University SP_1_intro.
Stop Acoustics and Glides December 2, 2013 Where Do We Go From Here? The Final Exam has been scheduled! Wednesday, December 18 th 8-10 am (!) Kinesiology.
Language Perception.
Stop + Approximant Acoustics
Vowels, part 4 November 16, 2015 Just So You Know Today: Vowel remnants + Source-Filter Theory For Wednesday: vowel transcription! Turkish and British.
Transitions + Perception March 25, 2010 Tidbits Mystery spectrogram #3 is now up and ready for review! Final project ideas.
Suprasegmental Properties of Speech Robert A. Prosek, Ph.D. CSD 301 Robert A. Prosek, Ph.D. CSD 301.
Acoustic Phonetics 3/14/00.
PHONETICS AND PHONOLOGY
L 17 The Human Voice.
Structure of Spoken Language
What is Phonetics? Short answer: The study of speech sounds in all their aspects. Phonetics is about describing speech. (Note: phonetics ¹ phonics) Phonetic.
Remember me? The number of times this happens in 1 second determines the frequency of the sound wave.
Chapter 2 Phonology.
Speech Perception (acoustic cues)
Motor theory.
Presentation transcript:

Structure of Human Speech Chris Darwin

Vocal Tract

Pitch and Formants 1. Harmonics (giving pitch) produced by vocal cord vibration 2. Formant frequencies: resonances of the vocal tract 3. Formant frequencies change as you change the shape of your vocal tract

Source & Filter LarynxVocal tractOutput sound

Sex change Me (m) Higher pitch Shorter vocal-tract (higher formants) Both (-> f)

Prosody vs Segments Segmental: consonants / vowels -> words Prosodic: pitch contour, stress. Emphasis, pragmatics. “I thought she was married?” & "….!" “I thought she was married.” “I thought she was married!” “I thought she was married!” NB in tone languages pitch used segmentally. Are bird/mammal animal systems like human prosody? generally use different pitch contours yes!

Vowel production

narrow-band spectrogram sine-wave speech

Sine-wave speech

Orchestra in your throat

Tuvan throat music

Mynah bird speech Klatt &Stefanski (1974) How does a mynah bird imitate human speech? J Acoust Soc Amer, 55,

Mynah / Grey parrot Mynah produces "formants" but probably through changing syrinx resonances, not through changing vocal tract shape. (Klatt & Stefanski, 1974, J Acoust Soc Amer) Grey parrot has a longer vocal tract and may use changes in its shape to produce formant variation (more like human speech). (Warren, Patterson, Pepperburg, 1996, Auk)

Characteristics of speech No gaps between words Smoothly changing sound from one speech sound to the next So you can’t just shuffle the acoustic “words” Only silence is /g/ of “ago” Narrow-band spectrogram

from Clive Frankish

Speech is more like semaphore than like music Music: discrete targets giving discrete acoustic events Semaphore: discrete targets with transitions between targets Speech: articulatory transitions between targets

Semaphore

Formants in a wide-band spectrogram “w e g o” <-- F1 <-- F 2 <-- F 3 Burst -->

time "bag" Where are the segments? F1 F2 F3 /bæg/ Where are the segments?

Speech is more like speech than like semaphore Speech does not have invariant acoustic targets: consonants change with the vowel. Compare /s/ in /si/ with /s/ in /su/ This is due to co-articulation.

Different transition - same consonant dee da 1400 Hz Liberman et al. (1967) Perception of the speech code. Psych Rev 74,

Co-articulation Arises because (mainly) consonant gestures don’t involve all the articulators: eg /b/ is lips only, tongue free to take up position for next vowel. /d/ and /s/ just involve the tongue tip, touching the alveolar ridge, tongue body and lips free to take up position for next vowel - viz. /si/ /su/.

Same noise - different consonant pea ka 1400 Hz F 2 F 1 Burst --> Liberman et al. (1967) Perception of the speech code. Psych Rev 74,

Two articulatory systems Öhman suggested that articulation can be decomposed into two semi-independent systems: Slow movement from one vowel target to next eg /i/ -> /u/ Rapid consonantal movement superimposed eg /b/ /d/ So the /b/ in /ibu/ is not the same as in /ibi/

Co-articulation Advantages 1. information about different segments is spread across time (Hockett’s squashed eggs). You know that a /u/ is coming because of the type of /s/ you have heard.

Co-articulation 2. Liberman thought that this spreading across time makes it easier to transmit information at a fast rate. Liberman et al (1967) Psych Rev 74,

The disadvantage of co-articulation for perception is that there are no constant acoustic targets in speech. The same phoneme can be represented as different sounds in different contexts (/s/ before /u/ or /i/. Conversely, the same sound, can be heard as different consonants in different contexts (eg as /p/ before /i/ and /a/ but as /k/ before /u/). Co-articulation - 2

Speech Code Articulatory movement Co-articulation Rapid speech /djewonega?at/ Different vocal-tract sizes: men 15% longer than women Different dialects ---> Factors that make it hard (for machines) to recognise speech