Spectrogram & its reading

Slides:



Advertisements
Similar presentations
Phonetics COMD Taylor Transcription Practice & Consonants in Review.
Advertisements

Normal Aspects of Articulation. Definitions Phonetics Phonology Articulatory phonetics Acoustic phonetics Speech perception Phonemic transcription Phonetic.
CS 551/651: Structure of Spoken Language Spectrogram Reading: Approximants John-Paul Hosom Fall 2010.
Sounds that “move” Diphthongs, glides and liquids.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Basic Spectrogram & Clinical Application: Consonants
Acoustic Characteristics of Consonants
Speech Perception Dynamics of Speech
Articulation and the Vocal Tract Chapter 3 Perry C. Hanavan, Au.D. Audiologist.
Glides (/w/, /j/) & Liquids (/l/, /r/) Degree of Constriction Greater than vowels – P oral slightly greater than P atmos Less than fricatives – P oral.
1 CS 551/651: Structure of Spoken Language Lecture 4: Characteristics of Manner of Articulation John-Paul Hosom Fall 2008.
1 CS 551/651: Structure of Spoken Language Spectrogram Reading: Stops John-Paul Hosom Fall 2010.
Phonetics.
Phonetics Articulatory Phonetics; *Manner of Articulation:
Nasal Stops.
Speech Science XII Speech Perception (acoustic cues) Version
Digital Systems: Hardware Organization and Design
Basic Spectrogram Lab 8. Spectrograms §Spectrograph: Produces visible patterns of acoustic energy called spectrograms §Spectrographic Analysis: l Acoustic.
Today  Parts of vocal tract used in producing consonants  Articulatory Description of consonants Readings: it’s all about air!
Phonetics (Part 1) Dr. Ansa Hameed.
Unit 4 Articulation I.The Stops II.The Fricatives III.The Affricates IV.The Nasals.
Acoustic Phonetics Investigating physical properties of speech sounds.
Chapter 6 Features PHONOLOGY (Lane 335).
The following spectrogram is: A. burst B. spectrogram C. speech D. formant.
1 Sounds: the building blocks of language CA461 Speech Processing 1 Lecture 2.
Investigating physical properties of speech sounds
Consonants and vowel January Review where we’ve been We’ve listened to the sounds of “our” English, and assigned a set of symbols to them. We.
Natural classes and distinctive features
Speech Sounds of American English and Some Iranian Languages
1 CS 551/651: Structure of Spoken Language Lecture 4: Characteristics of Manner of Articulation John-Paul Hosom Fall 2010.
English Pronunciation Practice A Practical Course for Students of English By Wang Guizhen Faculty of English Language & Culture Guangdong University of.
Chapter 7: Consonantal Gestures
Laterals + Nasals December 2, 2009 Administrative Stuff Friday: some notes on audition (hearing) Also: in-class USRIs Production Exercise #4 is due on.
Speech Perception. Phoneme - a basic unit of a speech sound that distinguishes one word from another Phonemes do not have meaning on their own but they.
Phonetics Phonetics: It is the science of speech sounds. It is the study of the production and reception of speech sounds. It is concerned with the sounds.
Phonetics: the generation of speech Phonemes “The shortest segment of speech that, if changed, would change the meaning of a word.” hog fog log *Phonemes.
1 4. Consonants  Consonants are produced ‘ by a closure in the vocal tract, or by a narrowing which is so marked that air cannot escape without producing.
Speech Production1 Articulation and Resonance Vocal tract as resonating body and sound source. Acoustic theory of vowel production.
Speech Science Fall 2009 Oct 26, Consonants Resonant Consonants They are produced in a similar way as vowels i.e., filtering the complex wave produced.
Speech Science VII Acoustic Structure of Speech Sounds WS
Introduction to Linguistics Ms. Suha Jawabreh Lecture # 7.
Phonetics Class # 2 Chapter 6. Homework (Ex. 1 – page 268)  Judge [d ] or [ ǰ ]  Thomas [t]  Though [ ð ]  Easy [i]  Pneumonia [n]  Thought [ θ.
Speech Science Fall 2009 Oct 28, Outline Acoustical characteristics of Nasal Speech Sounds Stop Consonants Fricatives Affricates.
Voice Quality + Stop Acoustics
Say “blink” For each segment (phoneme) write a script using terms of the basic articulators that will say “blink.” Consider breathing, voicing, and controlling.
Glides, Place and Perception March 18, 2010 News The hard drive on the computer has been fixed! A couple of new readings have been posted to the course.
The end of vowels + The beginning of fricatives November 19, 2012.
Transitions + Perception March 27, 2012 Tidbits First: Guidelines for the final project report So far, I have two people who want to present their projects.
Stops Stops include / p, b, t, d, k, g/ (and glottal stop)
2. Fricatives Air is passed through a narrow channel, creating turbulence. Turbulence = noise. English fricatives: Voiceless: [f] [θ] (“theory”) [s] [
Statistical NLP Spring 2011
Introduction to Language Phonetics 1. Explore the relationship between sound and spelling Become familiar with International Phonetic Alphabet (IPA )
Sonorant Acoustics + Place Transitions
Stop Acoustics and Glides December 2, 2013 Where Do We Go From Here? The Final Exam has been scheduled! Wednesday, December 18 th 8-10 am (!) Kinesiology.
Stop + Approximant Acoustics
Ch4 – Features Features are partly acoustic partly articulatory aspects of sounds but they are used for phonology so sometimes they are created to distinguish.
Phonetics: consonants
Vowels, part 4 November 16, 2015 Just So You Know Today: Vowel remnants + Source-Filter Theory For Wednesday: vowel transcription! Turkish and British.
Nasals + Liquids + Everything Else
1 Acoustic Phonetics 3/28/00. 2 Nasal Consonants Produced with nasal radiation of acoustic energy Sound energy is transmitted through the nasal cavity.
Acoustic Phonetics 3/14/00.
Stop/Plosives.
Stop Acoustics + Glides December 2, 2015 Down The Stretch They Come Today: Stop and Glide Acoustics Friday: Sonorant Acoustics + USRI evaluations We’ll.
Consonant articulation
Consonant Exercises.
Structure of Spoken Language
Structure of Spoken Language
Phonetics: The Sounds of Language
Speech Perception (acoustic cues)
Eugeniusz Cyran KUL, Lublin
Presentation transcript:

Spectrogram & its reading by Tae-Yeoub Jang

What is spectrogram? Begin to be used since 1940s Another representation of frequency domain analysis The most popular way of representing spectral information 3 dimensional representation X-axis: Time Y-axis: Frequency Darkness (or color): Energy Reviving Sonus

Spectrogram example (color resolution of word “compute”) Reviving Sonus

Spectrogram example (grayscale of word “compute”) Reviving Sonus

Wideband vs. Narrowband spectrograms of the question "Is Pat sad, or mad?" The 5th, 10th and 15th harmonics have been marked by white squares in two of the vowels Reviving Sonus

Types of spectrogram Wideband spectrogram Narrowband spectrogram better time resolution eg) 15 msec window, 1 msec shift, 125 Hz bandwidth Narrowband spectrogram better frequency resolution eg) 50 msec window, 1 msec shift, 40 Hz bandwidth Reviving Sonus

Advantages & Disadvantages Time alignment Disadvantages Less reliable than waveform Reviving Sonus

Vowel Spectrogram Formant frequencies are critical cues for vowel distinction F1: Height high vowels: low F1 F2: Backness back vowels: low F2 Reviving Sonus

Example formant frequencies of English monophthongs          F3 2900 2550 2490 2640 2380 2300 2500 2390 F2 2250 1900 1770 1660 1100 1030 870 1500 1190 F1 280 400 550 690 710 450 310 900 640 Reviving Sonus

"heed, hid, head, had, hod, hawed, hood, who'd" (a male speaker, American English) Reviving Sonus

Consonant Spectrogram General Acoustic structure more complicated than vowels Adjacent sounds (especially vowels) convey important information  locus High frequency characteristics  especially for fricatives and affricates Reviving Sonus

What is LOCUS Information of formant transition from vowels into obstruents or from obstruents into vowels The target frequency that each formant transition is heading toward as an obstruction is made, or the frequency the transition comes as the obstruction is released The characteristic of the consonantal place and manner  roughly the same in different vowel contexts Reviving Sonus

Stops General Fairly distinct locus for each place Burst Silence during the closure (only at syllable onset position) Virtually no difference during the closure Reviving Sonus

Stops (cntd.) Voicing distinction voiced: vertical striations for voiced sounds, less abrupt burst, frequently weakened to be like fricatives or approximants voiceless: generally abrupt burst at higher frequency area Reviving Sonus

Stops (cntd.) Place distinction bilabial alveolar velar relatively low F2, F3 locus  rising into and falling out of vowel weak and spread vertical lines alveolar F2 locus about 1800 Hz Strong vertical lines velar Velar pinch: vowels F2, F3 merging often double burst long formant transitions Reviving Sonus

Stops (cntd.) Manner distinction Silence duration, VOT, vowel F0 aspirated short long high tense lax med low Reviving Sonus

Examples -- “a bab, a dad, a gag” Reviving Sonus

Place dependent loci Reviving Sonus

Fricatives General Random noise pattern especially in high frequency regions Place distinction Labiodental [f, v]: rising locus into the following vowel Dental [, ð]: major energy above 6000Hz Alveolar [s, z]: major energy above 4000Hz Alveopalatal [š, ž ]: major energy above 6000Hz Glottal [h]: the trace of formant frequencies of neighbouring vowels Reviving Sonus

Fricatives (cntd.) Weak vs. strong Strong [s, z, š, ž ]: darker bands Weak [f, v, , ð ]: spread and fainter Voiced [v, ð ]: often so weak and confused with nasals or approximants Cues to tell [] from [f]: higher formants of [] fall into adjacent vowels Reviving Sonus

Example – “fie, thigh, sigh, shy” Reviving Sonus

Example – “ever, weather, fizzer, pleasure” Reviving Sonus

Nasals General Place distinction Formants similar to vowels but fainter Very low F1 (about 250Hz), F2 (about 2500Hz), and F3 (about 3250Hz) Place distinction bilabial [m]: downward F2, F3 locus alveolar [n]: less amount of F2 transition velar [ŋ ]: velar pinch Reviving Sonus

Examples -- “a Pam, a tan, a kang” Reviving Sonus

Liquies & Approximants General Formants similar to vowels but fainter (especially at high frequency regions) Approximately F1(250Hz), F2(1200Hz), F3(2400Hz) Change in formant structure Reviving Sonus

Liquids & Approximants (cntd.) Phone specific properties Labial glide [w]: very low F1, F2 (600-1000Hz|) and gets too close to each relatively low F3 rapid falloff of spectral amplitude Palatal glide [y]: extremely low F1 extremely high F2, F3 Reviving Sonus

Liquids & Approximants (cntd.) Phone specific properties (cntd.) Flap [Ր]: soft burst, short duration Retroflex [r]: F3 dipping down close to F2 General lowering of F3, F4 Lateral [l]: Low F1, F2 (approx. F1 250Hz, F2 1200Hz) usually substantial energy in the high F region Reviving Sonus

Example – “led, red, wed, yell” Reviving Sonus

Final remarks Spectrogram is not the only cue for acoustic distinction of speech sounds Very often, the waveform is more reliable Reviving Sonus

References & Links http://cslu.cse.ogi.edu/tutordemos/SpectrogramReading/spectrogram_reading.html http://hctv.humnet.ucla.edu/departments/linguistics/VowelsandConsonants/course http://www.cs.indiana.edu/~port/teach/306/speech.acoustics.html http://www.phon.ucl.ac.uk/courses/spsci/b203/week2-5.pdf Reviving Sonus