Spectral Analysis Feburary 23, 2010 Sorting Things Out 1.On Thursday: back in the computer lab. Craigie Hall D 428 Analysis of Korean stops. 2.Remember:

Slides:



Advertisements
Similar presentations
Acoustic/Prosodic Features
Advertisements

Harmonics October 29, 2012 Where Were We? Were halfway through grading the mid-terms. For the next two weeks: more acoustics Its going to get worse before.
Perturbation Theory, part 2 November 4, 2014 Before I forget Course project report #3 is due! I have course project report #4 guidelines to hand out.
Perturbation Theory March 11, 2013 Just So You Know The Fourier Analysis/Vocal Tract exercise is due on Wednesday. Please note: don’t make too much out.
Spectral Analysis Feburary 24, 2009 Sorting Things Out 1.TOBI transcription homework rehash. And some structural reminders. 2.On Thursday: back in the.
Phonology, part 5: Features and Phonotactics
Voice Quality October 14, 2014 Practicalities Course Project report #2 is due! Also: I have new guidelines to hand out. The mid-term is on Tuesday after.
Fourier Analysis. Joseph Fourier ( ) “Yesterday was my 21st birthday, and at that age, Newton and Pascal had already acquired many claims to immortality.”
1 Chapter 16 Fourier Analysis with MATLAB Fourier analysis is the process of representing a function in terms of sinusoidal components. It is widely employed.
Physics 1251 The Science and Technology of Musical Sound Unit 1 Session 8 Harmonic Series Unit 1 Session 8 Harmonic Series.
DFT/FFT and Wavelets ● Additive Synthesis demonstration (wave addition) ● Standard Definitions ● Computing the DFT and FFT ● Sine and cosine wave multiplication.
Intro to Spectral Analysis and Matlab. Time domain Seismogram - particle position over time Time Amplitude.
Synthesis. What is synthesis? Broad definition: the combining of separate elements or substances to form a coherent whole. (
The Source January 26, 2010 Voice Quality Review ATLTMCFlow Modalmoderatevariesmoderatemed. Creakyhighlowhighlow Breathylowvarieslowhigh.
SIMS-201 Characteristics of Audio Signals Sampling of Audio Signals Introduction to Audio Information.
Physics of Sounds Overview Properties of vibrating systems Free and forced vibrations Resonance and frequency response Sound waves in air Frequency, wavelength,
Basic Spectrogram Lab 8. Spectrograms §Spectrograph: Produces visible patterns of acoustic energy called spectrograms §Spectrographic Analysis: l Acoustic.
PH 105 Dr. Cecilia Vogel Lecture 14. OUTLINE  consonants  vowels  vocal folds as sound source  formants  speech spectrograms  singing.
Vowel Acoustics, part 2 November 14, 2012 The Master Plan Acoustics Homeworks are due! Today: Source/Filter Theory On Friday: Transcription of Quantity/More.
Signals Processing Second Meeting. Fourier's theorem: Analysis Fourier analysis is the process of analyzing periodic non-sinusoidal waveforms in order.
1 Lab Preparation Initial focus on Speaker Verification –Tools –Expertise –Good example “Biometric technologies are automated methods of verifying or recognising.
Basic Acoustics + Digital Signal Processing September 11, 2014.
Basics of Signal Processing. frequency = 1/T  speed of sound × T, where T is a period sine wave period (frequency) amplitude phase.
Source/Filter Theory and Vowels February 4, 2010.
LE 460 L Acoustics and Experimental Phonetics L-13
Where we’re going Speed, Storage Issues Frequency Space.
Basics of Signal Processing. SIGNALSOURCE RECEIVER describe waves in terms of their significant features understand the way the waves originate effect.
Chapter-4 Synthesis and Analysis of Complex Waves Fourier Synthesis: The process of combining harmonics to form a complex wave. Fourier Analysis: Determining.
Motivation Music as a combination of sounds at different frequencies
Voice Quality Feburary 11, 2013 Practicalities Course project reports to hand in! And the next set of guidelines to hand out… Also: the mid-term is on.
Automatic Pitch Tracking September 18, 2014 The Digitization of Pitch The blue line represents the fundamental frequency (F0) of the speaker’s voice.
Resonance, Revisited March 4, 2013 Leading Off… Project report #3 is due! Course Project #4 guidelines to hand out. Today: Resonance Before we get into.
MUSIC 318 MINI-COURSE ON SPEECH AND SINGING
Automatic Pitch Tracking January 16, 2013 The Plan for Today One announcement: Starting on Monday of next week, we’ll meet in Craigie Hall D 428 We’ll.
Vowel Acoustics November 2, 2012 Some Announcements Mid-terms will be back on Monday… Today: more resonance + the acoustics of vowels Also on Monday:
ECE 598: The Speech Chain Lecture 7: Fourier Transform; Speech Sources and Filters.
Wireless and Mobile Computing Transmission Fundamentals Lecture 2.
Harmonics November 1, 2010 What’s next? We’re halfway through grading the mid-terms. For the next two weeks: more acoustics It’s going to get worse before.
Speech Science V Akustische Grundlagen WS 2007/8.
Voice Quality + Stop Acoustics
Voice Quality + Spectral Analysis Feburary 15, 2011.
The end of vowels + The beginning of fricatives November 19, 2012.
Signals CY2G2/SE2A2 Information Theory and Signals Aims: To discuss further concepts in information theory and to introduce signal theory. Outcomes:
Seismic Reflection Data Processing and Interpretation A Workshop in Cairo 28 Oct. – 9 Nov Cairo University, Egypt Dr. Sherif Mohamed Hanafy Lecturer.
Formants, Resonance, and Deriving Schwa March 10, 2009.
Voice Onset Time + Voice Quality
Sonorant Acoustics March 24, 2009 Announcements and Such Collect course reports Give back homeworks Hand out new course project guidelines.
Resonance October 23, 2014 Leading Off… Don’t forget: Korean stops homework is due on Tuesday! Also new: mystery spectrograms! Today: Resonance Before.
Complex Auditory Stimuli
Voice Quality + Korean Stops October 16, 2014 Don’t Forget! The mid-term is on Tuesday! So I have a review sheet for you. For the mid-term, we will just.
Vowel Acoustics March 10, 2014 Some Announcements Today and Wednesday: more resonance + the acoustics of vowels On Friday: identifying vowels from spectrograms.
Frequency, Pitch, Tone and Length February 12, 2014 Thanks to Chilin Shih for making some of these lecture materials available.
Resonance January 28, 2010 Last Time We discussed the difference between sine waves and complex waves. Complex waves can always be understood as combinations.
Phonation + Voice Quality Feburary 11, 2014 Weekday Update Course project report #2 is due right now! I have guidelines for course project report #3,
Resonance March 7, 2014 Looking Ahead I’m still behind on grading the mid-term and Production Exercise #1… They should be back to you by Friday.
Voicing + Basic Acoustics October 14, 2015 Agenda Production Exercise #2 is due on Friday! No transcription exercise this Friday! Today, we’ll begin.
Stop + Approximant Acoustics
Voice Quality January 19, 2010 Vocal Tract Anatomy Our vocal tracts are shaped in a way that makes it easier to speak… But more dangerous to eat!
Vowels, part 4 November 16, 2015 Just So You Know Today: Vowel remnants + Source-Filter Theory For Wednesday: vowel transcription! Turkish and British.
Voice Quality Feburary 13, 2014 Practicalities The mid-term is on the Thursday after the break! So I have a review sheet for you. For the mid-term, we.
The Frequency Domain Digital Image Processing – Chapter 8.
Basic Acoustics + Digital Signal Processing January 11, 2013.
Spectral Analysis March 3, 2016 Mini-Rant I have succeeded in grading your course project reports. Things to keep in mind: A table of stop phonemes is.
Resonance October 29, 2015 Looking Ahead I’m still behind on grading the mid-term and Production Exercise #1… They should be back to you by Monday. Today:
Harmonics October 28, Where Were We? Mid-terms: our goal is to get them back to you by Friday. Production Exercise #2 results should be sent to.
Fundamental Frequency Change
MECH 373 Instrumentation and Measurements
For a periodic complex sound
The Production of Speech
Sound shadow effect Depends on the size of the obstructing object and the wavelength of the sound. If comparable: Then sound shadow occurs. I:\users\mnshriv\3032.
Presentation transcript:

Spectral Analysis Feburary 23, 2010

Sorting Things Out 1.On Thursday: back in the computer lab. Craigie Hall D 428 Analysis of Korean stops. 2.Remember: mid-term next Tuesday Check out the review sheet… 3.Course project reports to hand back Also note: the last TOBI homework 4.Note: Spectrogram reading challenge! For fun and extra credit points

Today Today: Wrap up voice quality discussion Begin examination of spectral analysis Some leftovers: 1.Ventricular folds video 2.Death Metal

Harsh Voice A “raucous voice quality” (Holmes, 1932) Acoustically: fundamental frequency is aperiodic = lots of jitter (variability in time) Articulatorily: harsh voice does not add anything new to the voice quality parameters; it just increases the intensity of those already in operation. Harsh voice  “excessive approximation of the vocal folds” = high medial compression and high adductive tension

Harsh, continued “Harshness results from overtensions in the throat and neck; it is often if not usually accompanied by hypertensions of the whole body.” (Gray and Wise, 1959) Harsh F0 is usually > 100 Hz Creaky F0 is usually < 100 Hz

Voice Quality Summary So far, we’ve talked about: ATLTMCFlow Modalmoderatevariesmoderatemed. Tensehighvarieshighhigh Creakyhighlowhighlow Breathylowvarieslowhigh WhisperlowN/Ahighmed.

4. Whispery Voice When we whisper: The cartilaginous glottis remains open, but the ligamental glottis is closed. Air flow through opening with a “hiss” The laryngeal settings: 1.Little or no adductive tension 2.Moderate to high medial compression 3.Moderate airflow 4.Longitudinal tension is irrelevant…

Nodules One of the more common voice disorders is the development of nodules on either or both of the vocal folds. nodule = callous-like bump What effect might this have on voice quality?

Last but not least What’s going on here? At some point, my voice changes from modal to falsetto.

5. Falsetto The laryngeal specifications for falsetto: 1.High longitudinal tension 2.High adductive tension 3.High medial compression Contraction of thyroarytenoids 4.Lower airflow than in modal voicing The results: Very high F0. Very thin area of contact between vocal folds. Air often escapes through the vocal folds.

Falsetto EGG The falsetto voice waveform is considerably more sinusoidal than modal voice.

Some Real EGGs Modal voice (F0 = 140 Hz) Falsetto voice (F0 = 372 Hz)

Voice Quality Summary ATLTMCFlow Modalmoderatevariesmoderatemed. Tensehighvarieshighhigh Creakyhighlowhighlow Breathylowvarieslowhigh WhisperlowN/Ahighmed. Falsettohighhighhighlow

Last but not least, Korean makes an interesting distinction between “emphatic” (or fortis) obstruents and unaspirated and aspirated (lenis) obstruents.

What’s going on here? A variety of things occur during the articulation of fortis consonants in Korean. 1.Glottis is not open as wide (during closure) as in lenis stops.  Voicing begins more quickly after stop release 2.Increased airflow in fortis stops.  Higher F0 after stop release. 3.Vocal folds are “more tense” than in lenis stops. = greater medial compression = “squarer” glottal waveform

An Actual EGG Waveform Modal voicing (by me): Note: completely closed and completely open phases are both actually quite short. Also: closure slope is greater than opening slope. Q: Why might there be differences in slope?

A Different Kind of Voicing The basic voice quality in khoomei is called xorekteer. Notice any differences in the EGG waveforms? This voice quality requires greater medial compression of the vocal folds....and also greater airflow

Why Should You Care? Remember that the most basic kind of sound wave is a sinewave. time pressure Sinewaves can be defined by three basic properties: Frequency, (peak) amplitude, phase

Complex Waves It is possible to combine more than one sinewave together into a complex wave. At any given time, each wave will have some amplitude value. A 1 (t 1 ) := Amplitude value of sinewave 1 at time 1 A 2 (t 1 ) := Amplitude value of sinewave 2 at time 1 The amplitude value of the complex wave is the sum of these values. A c (t 1 ) = A 1 (t 1 ) + A 2 (t 1 ) Note: a harmonic is simply a component sinewave of a complex wave.

Complex Wave Example Take waveform 1: high amplitude low frequency Add waveform 2: low amplitude high frequency The sum is this complex waveform: + =

Another Perspective Sinewaves can also be represented by their power spectra. Frequency on the x-axis Intensity on the y-axis (related to peak amplitude) WaveformPower Spectrum

Putting the two together WaveformPower Spectrum + + = = harmonics

More Combinations What happens if we keep adding more and more high frequency components to the sum? += +=

A Spectral Comparison WaveformPower Spectrum

What’s the Point? Remember our EGG waveforms for the different kinds of voice qualities: The glottal waveform for tense voice resembles a square wave.  lots of high frequency components (harmonics)

What’s the point, part 2 A modal voicing EGG looks like: It is less square and therefore has less high frequency components. Although it is far from sinusoidal...

What’s the point, part 3 Breathy and falsetto voice are more sinusoidal... And therefore the high frequency harmonics have less power, compared to the fundamental frequency.

Let’s Check ‘em out Head over to Praat and check out the power spectra of: a sinewave a square wave a sawtooth wave tense voice modal voice creaky voice breathy voice

Spectral Tilt Spectral tilt = drop-off in intensity of higher harmonics, compared to the intensity of the fundamental.

The Source The complex wave emitted from the glottis during voicing= The source of all voiced speech sounds. In speech (particularly in vowels), humans can shape this spectrum to make distinctive sounds. Some harmonics may be emphasized... Others may be diminished (damped) Different spectral shapes may be formed by particular articulatory configurations....but the process of spectral shaping requires the raw stuff of the source to work with.

Spectral Shaping Examples Certain spectral shapes seem to have particular vowel qualities.

Spectrograms A spectrogram represents: Time on the x-axis Frequency on the y-axis Intensity on the z-axis

Real Vowels

Ch-ch-ch-ch-changes Check out some spectrograms of sinewaves which change frequency over time:

The Whole Thing What happens when we put all three together? This is an example of sinewave speech.

The Real Thing Spectral change over time is the defining characteristic of speech sounds.  It is crucial to understand spectrographic representations for the acoustic analysis of speech.

Life’s Persistent Questions How do we get from here: To here? Answer: Fourier Analysis

Fourier’s Theorem Joseph Fourier ( ) French mathematician Studied heat and periodic motion His idea: any complex periodic wave can be constructed out of a combination of different sinewaves. The sinusoidal (sinewave) components of a complex periodic wave = harmonics

Fourier Analysis Building up a complex wave from sinewave components is straightforward… Breaking down a complex wave into its spectral shape is a little more complicated. In our particular case, we will look at: Discrete Fourier Transform (DFT) Also: Fast Fourier Transform (FFT) is used often in speech analysis Basically a more efficient, less accurate method of DFT for computers.

The Basic Idea For the complex wave extracted from each window... Fourier Analysis determines the frequency and intensity of the sinewave components of that wave. Do this about 1000 times a second, turn the spectra on their sides, and you get a spectrogram.

Possible Problems What would happen if a waveform chunk was windowed like this? Remember, the goal is to determine the frequency and intensity of the sinewave components which make up that slice of the complex wave.

The Usual Solution The amplitude of the waveform at the edges of the window is normally reduced... by transforming the complex wave with a smoothing function before spectral analysis. Each function defines a particular window type. For example: the “Hanning” Window

There are lots of different window types... each with its own characteristic shape Hamming BartlettGaussian HanningWelchRectangular

Window Type Ramifications Play around with the different window types in Praat.

Ideas Once the waveform has been windowed, it can be boiled down into its component frequencies. Basic strategy: Determine whether the complex wave correlates with sine (and cosine!) waves of particular frequencies. Correlation measure: “dot product” = sum of the point-by-point products between waves. Interesting fact: Non-zero correlations only emerge between the complex wave and its harmonics! (This is Fourier’s great insight.)

A Not-So-Complex Example Let’s build up a complex wave from 8 samples of a 1 Hz sine wave and a 4 Hz cosine wave. Note: our sample rate is 8 Hz Hz Hz Sum: Check out a visualization.

Correlations, part 1 Let’s check the correlation between that wave and the 1 Hz sinewave component Sum: Hz Dot: The sum of the products of each sample is 4. This also happens to be the dot product of the 1 Hz wave with itself. = its “power”