Auditory Perception Hillenbrand SPPA 2060.

1 Auditory Perception Hillenbrand SPPA 2060

2 Auditory perception is one branch of a larger science called psychophysics. Psychophysics studies the relationships between perceptual dimensions (also called psychological, subjective, or mental dimensions) and the physical properties of stimuli. The distinction between perceptual dimensions and physical dimensions is all-important.

3 Physical dimensions: Any aspect of a physical stimulus that could be measured in a straightforward way with an instrument (e.g., a light meter, a sound level meter, a spectrum analyzer, or a fundamental frequency meter).

4 Perceptual dimensions: These are the mental experiences that occur inside the mind of the observer. These experiences are actively created by the sensory system and brain based on an analysis of the physical properties of the stimulus. Perceptual dimensions can be measured, but not with a meter. Measuring perceptual dimensions requires an observer (e.g., a listener, a “looker”, a smeller, a taster …).

5 For example, in vision: The percept of hue is created by the eye and brain based (in part) on the visual system’s analysis of the wavelength composition of the stimulus. But: hue ≠ wavelength.
Wavelength: physical dimension (can be measured with a meter).
Hue: psychological dimension (can be measured, but that requires an observer).

6 Visual Psychophysics
Perceptual dimension: physical property of light
Hue: wavelength
Brightness: luminance
Shape: contour/contrast
Both dimensions can be measured: the physical dimensions can be measured with the right instrument; measuring psychological dimensions requires an observer. 9/20/2018

7 Brief Digression: How Many Senses Do We Have?
There is a branch of psychophysics devoted to each sense; i.e., vision, hearing, taste, smell, … Q: How many senses are there? A: Not the five that we all learned about. Q: So how many? A: No one knows for sure, but way more than five.

8 The ‘basic’ five: Vision, hearing, touch, taste, smell. What else?
pain; thirst; hunger; nausea, “butterflies,” and other stomach sensations; balance/equilibrium; proprioceptive sensations such as stretch and other body-position sensations; itch; thermoception (hot and cold); orgasm; many, many others

9 Auditory Psychophysics (aka psychoacoustics or auditory perception)
Perceptual dimension: physical property of sound
Pitch: fundamental frequency (f0)
Loudness: intensity
Timbre (sound quality): spectrum envelope / amplitude envelope

10 Perceptual Experiences are Actively Created, Not Passively Received
Subjective contour: The triangles, circles and squares are “seen” not so much because they are “there” in the physical sense, but because they are inferred. Unconscious inference lies at the heart of perception. In some sense, “I’ll see it when I believe it.” is more true than “I’ll believe it when I see it.”

11 Reversible Figures
Reversible figures reveal the active organization of percepts: the drawing on the left is organized by your brain into a bird, then reorganized into a rabbit, then back to a bird, … Same with the old lady/young lady.

12 Another duck-rabbit, just for yucks.
Which is bigger? Bottom one, eh? Nah. They’re the same. (This is the Jastrow Illusion.)

13 The Müller-Lyer Illusion
Which horizontal line is longer?

14 The Müller-Lyer Illusion
Surprise, surprise. They’re the same. Duh - everything in this field is always the same. It gets on your nerves.

15 The corridor illusion: Which cylinder is larger?
The cylinder to the right appears larger because the visual system infers that it is farther away. The inference is unconscious, automatic and obligatory (i.e., you can’t help yourself, even when you know the trick).

16 The McGurk Effect (McGurk & MacDonald, 1976*)
*McGurk, H., and MacDonald, J. (1976). Hearing lips and seeing voices. Nature, 264,

17 Some History on the McGurk Illusion
The most striking demonstration of the combined (bimodal) nature of speech understanding appeared by accident. Harry McGurk, a senior developmental psychologist at the University of Surrey in England, and his research assistant John MacDonald were studying how infants perceive speech during different periods of development. For example, they placed a videotape of a mother talking in one location while the sound of her voice played in another. For some reason, they asked their recording technician to create a videotape with the audio syllable "ba" dubbed onto a visual "ga." When they played the tape, McGurk and MacDonald perceived "da." Confusion reigned until they realized that "da" resulted from a quirk in human perception, not an error on the technician's part. After testing children and adults with the dubbed tape, the psychologists reported this phenomenon in a 1976 paper humorously titled "Hearing Lips and Seeing Voices," a landmark in the field of human sensory integration. This audio-visual illusion has become known as the McGurk effect or McGurk illusion.
Further reading: Dominic W. Massaro & David G. Stork, "Speech Recognition and Sensory Integration", American Scientist, 1998, vol. 86, p
The McGurk effect has played an important role in audio-visual speech integration and speech reading. McGurk links on the web include the following:

18 The Three Main Perceptual Attributes of Sound
Pitch (not fundamental frequency); loudness (not intensity); timbre (not spectrum envelope or amplitude envelope). The terms pitch, loudness, and timbre refer not to the physical characteristics of sound, but to the mental experiences that occur in the minds of listeners.

19 Pitch and Fundamental Frequency
Rule 1: All else being equal, the higher the f0, the higher the perceived pitch. Lower f0, lower pitch; higher f0, higher pitch.

20 Rule 2: The ear is more sensitive to f0 differences in the low frequencies than in the higher frequencies. This means that: 300 vs. 350 ≠ 3000 vs. 3050. That is, the difference in perceived pitch (not f0) between 300 and 350 Hz is NOT the same as the difference in pitch between 3000 and 3050 Hz, even though the physical differences in f0 are the same.
Which f0 difference is larger? (A: They’re the same.)
Which pitch difference is larger? (A: 300 vs. 350, by a lot.)
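One rough way to put numbers on Rule 2 is to compare the two pairs as frequency ratios instead of frequency differences. The sketch below uses equal-tempered semitones (12 per octave) as a stand-in for pitch distance. That choice is an assumption made here for illustration; the slides do not name a pitch scale, and `semitones` is a hypothetical helper.

```python
import math

def semitones(f_low, f_high):
    """Musical interval between two frequencies, in equal-tempered semitones.

    Assumption: a log-frequency (ratio) scale is used as a rough proxy for
    perceived pitch distance. It is not a measurement made with listeners.
    """
    return 12 * math.log2(f_high / f_low)

low_pair = semitones(300, 350)      # 50 Hz apart, low in the frequency range
high_pair = semitones(3000, 3050)   # also 50 Hz apart, ten times higher up

print(f"300 vs. 350 Hz:   {low_pair:.2f} semitones")
print(f"3000 vs. 3050 Hz: {high_pair:.2f} semitones")
```

On this scale the 300 vs. 350 Hz pair is a bit under three semitones, while the 3000 vs. 3050 Hz pair is well under half a semitone, even though both pairs are 50 Hz apart.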

21 Three ways to measure f0
1. Frequency domain: Measure H1 (i.e., the lowest frequency harmonic).
2. Frequency domain: Measure the harmonic spacing.
3. Time domain: Measure the fundamental period.

22 The “Problem” of the Missing Fundamental
[Figure: a signal with normal f0 and a signal with f0 removed]

23 Conclusion: The fundamental does not need to be physically present in the signal for a listener to hear a pitch corresponding to where f0 ought to be. What Explains This? Even with the 1st harmonic removed, a signal remains periodic at the original f0.

24 The “Pitch Shift” Effect
If the auditory system evaluated pitch by measuring the harmonic spacing, these 2 signals (1200, 1400, 1600 … Hz and 1240, 1440, 1640 … Hz) would have the same pitch, because the spacing is 200 Hz in both. They do not have the same pitch, so we can rule out harmonic spacing. Which theory is left? Measuring the fundamental period.

25 What does all this mean? Rule 3: The sensation of pitch is probably based on a measurement of the fundamental period. It is definitely not based on a measurement of either (a) the lowest frequency harmonic in a harmonic spectrum (because of the “missing fundamental” effect), or (b) harmonic spacing (because of the “pitch shift” effect).
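The reasoning behind Rule 3 can be checked numerically. The sketch below applies the three candidate measurements to a full harmonic complex, the same complex with the fundamental removed, and the two “pitch shift” signals (truncated to three components each). Several things are assumptions made for illustration and do not come from the slides: the 8000 Hz sampling rate, equal-amplitude cosine components, the use of the autocorrelation peak as the stand-in for “measuring the fundamental period”, and the 150 to 800 Hz search range. The search range matters because the strict mathematical period of the shifted signal is much longer (its components share only a 40 Hz common divisor). All function names are hypothetical.

```python
import math

FS = 8000      # sampling rate in Hz (assumed for this sketch)
WINDOW = 800   # samples summed per lag (0.1 s at 8000 Hz)

def complex_tone(freqs, n_samples):
    """Sum of equal-amplitude cosines at the given frequencies (Hz)."""
    return [sum(math.cos(2 * math.pi * f * n / FS) for f in freqs)
            for n in range(n_samples)]

def lowest_harmonic(freqs):
    """Candidate 1: frequency of the lowest component present."""
    return min(freqs)

def harmonic_spacing(freqs):
    """Candidate 2: spacing between adjacent components."""
    ordered = sorted(freqs)
    return ordered[1] - ordered[0]

def period_pitch(freqs, min_f0=150, max_f0=800):
    """Candidate 3: pitch from the lag of the largest autocorrelation peak."""
    min_lag = FS // max_f0
    max_lag = FS // min_f0
    x = complex_tone(freqs, WINDOW + max_lag)

    def autocorr(lag):
        return sum(x[n] * x[n + lag] for n in range(WINDOW))

    best_lag = max(range(min_lag, max_lag + 1), key=autocorr)
    return FS / best_lag

signals = {
    "full (200-1000 Hz)": [200, 400, 600, 800, 1000],
    "f0 removed": [400, 600, 800, 1000],
    "1200, 1400, 1600": [1200, 1400, 1600],
    "1240, 1440, 1640": [1240, 1440, 1640],
}

results = {name: (lowest_harmonic(f), harmonic_spacing(f), period_pitch(f))
           for name, f in signals.items()}

for name, (h1, spacing, period_based) in results.items():
    print(f"{name:20s} lowest={h1:5d}  spacing={spacing:4d}  "
          f"period-based={period_based:6.1f}")
```

Under these assumptions the lowest-harmonic measure jumps from 200 to 400 Hz when the fundamental is removed, although the pitch is heard as unchanged. The spacing measure reports 200 Hz for both pitch-shift signals, although they are heard as different. Only the period-based estimate stays at 200 Hz for the missing-fundamental signal and moves (to roughly 205 Hz, limited by the sample grid) for the shifted one.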

26 Loudness and Intensity
Rule 1: All else being equal, the higher the intensity, the greater the loudness. Higher intensity, higher loudness; lower intensity, lower loudness.

27 Rule 2: The relationship between intensity and loudness is seriously nonlinear. Doubling intensity does not double loudness. In order to double loudness, intensity must be increased by a factor of 10, or by 10 dB [10 × log10(10) = 10 × 1 = 10 dB]. This is called the 10 dB rule.
Two signals differing by 10 dB (500 Hz sinusoids): Note that the more intense sound is NOT 10 times louder, even though it is 10 times more intense.

28 The 10 dB rule means that a 70 dB signal will be twice as loud as a 60 dB signal, four times as loud as a 50 dB signal, eight times as loud as a 40 dB signal, etc. A 30 dB hearing loss is considered mild, just outside the range of normal hearing. Based on the 10 dB rule, how much is loudness affected by a 30 dB hearing loss? (Answer: Loudness drops to 1/8th. But note that this does not mean that someone with a 30 dB loss will have 8 times more difficulty with speech understanding than someone with normal hearing.)
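The arithmetic of the 10 dB rule can be written out as two small functions. This is a minimal sketch: the function names are hypothetical, and applying the rule to steps that are not multiples of 10 dB (as in the last line) is an extrapolation assumed here, not something the slides state.

```python
import math

def intensity_ratio_to_db(ratio):
    """Level difference in dB for a given intensity ratio: 10 x log10(ratio)."""
    return 10 * math.log10(ratio)

def loudness_ratio(db_difference):
    """Loudness ratio under the 10 dB rule: every +10 dB doubles loudness."""
    return 2 ** (db_difference / 10)

print(intensity_ratio_to_db(10))   # 10x the intensity is a 10 dB step
print(loudness_ratio(10))          # 70 dB vs. 60 dB: twice as loud
print(loudness_ratio(30))          # 70 dB vs. 40 dB: eight times as loud
print(loudness_ratio(-30))         # a 30 dB loss: 1/8th the loudness

# Doubling intensity is only about a 3 dB step, so loudness rises far less than 2x.
print(loudness_ratio(intensity_ratio_to_db(2)))
```

The last line gives a loudness ratio of roughly 1.23 for a doubling of intensity, which is the nonlinearity that Rule 2 describes.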

29 Rule 3: Loudness is strongly affected by the frequency of the signal. If intensity is held constant, a mid-frequency signal (in the range from ~ Hz) will be louder than lower or higher frequency signals.
250 Hz, 3000 Hz, 8000 Hz: The 3000 Hz signal should appear louder than the 125 or the 8000 Hz signal, despite the fact that their intensities are (about) equal. (Remember that this is the reason for the dBHL scale.)

30 Timbre (also sound quality or tone color)
Timbre, also known as sound quality or tone color, is oddly defined in terms of what it is not: When two sounds are heard that match for pitch, loudness, and duration, and a difference can still be heard between the sounds, that difference is called timbre.

31 Example: a clarinet, a saxophone, and a piano all play a middle C at the same loudness and same duration. Each of these instruments has a unique sound quality. This difference is called timbre, tone color, or sound quality. There are also many examples of timbre difference in speech. For example, two vowels (e.g., [ɑ] and [i]) spoken at the same loudness and same pitch differ from one another in timbre.

32 There are two physical correlates of timbre:
1. Spectrum envelope: Smooth line drawn to enclose an amplitude spectrum.
2. Amplitude envelope: Smooth line drawn to enclose a sound wave (time domain representation).

33 Timbre and Spectrum Envelope
Timbre differences between one musical instrument and another are partly related to differences in spectrum envelope, that is, differences in the relative amplitudes of the individual harmonics. In the examples above, we would expect all of these sounds to have the same pitch because the harmonic spacing is the same in all cases. The timbre differences that you would hear are controlled in part by the differences in the shape of the spectrum envelope.
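A small synthesis sketch shows what “same f0, different spectrum envelope” means for the waveform. The sampling rate, the 200 Hz fundamental, and the two sets of harmonic amplitudes are all arbitrary values assumed for illustration, and `harmonic_tone` is a hypothetical helper.

```python
import math

FS = 8000   # sampling rate in Hz (assumed)
F0 = 200    # fundamental frequency in Hz (assumed)

def harmonic_tone(amplitudes, n_samples):
    """Sum harmonics of F0; amplitudes[k] scales harmonic number k + 1."""
    return [sum(a * math.sin(2 * math.pi * F0 * (k + 1) * n / FS)
                for k, a in enumerate(amplitudes))
            for n in range(n_samples)]

period = FS // F0   # samples per cycle of the fundamental

# Same four harmonics (200, 400, 600, 800 Hz), opposite envelope slopes.
falling = harmonic_tone([1.0, 0.5, 0.25, 0.125], 2 * period)
rising = harmonic_tone([0.125, 0.25, 0.5, 1.0], 2 * period)

# Both waveforms repeat after one fundamental period ...
same_period = all(math.isclose(tone[n], tone[n + period], abs_tol=1e-9)
                  for tone in (falling, rising)
                  for n in range(period))

# ... but the wave shapes within each period are different.
different_shape = max(abs(a - b) for a, b in zip(falling, rising)) > 0.1

print("same period:", same_period)
print("different wave shape:", different_shape)
```

Both signals repeat every 1/200 s, which is why the same pitch would be expected, while the differing harmonic amplitudes change the wave shape, which is the physical side of the timbre difference.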

34 Six Synthesized Sounds Differing in Spectrum Envelope
Note the similarities in pitch (due to constant f0/harmonic spacing) and the differences in timbre or sound quality.

35 Vowels Also Differ in Spectrum Envelope
Shown here are the smoothed envelopes only (i.e., the harmonic fine structure is not shown) of 10 American-English vowels.* Note that each vowel has a unique shape to its spectrum envelope. Perceptually, these sounds differ from one another in timbre. Purely as a matter of convention, the term timbre is seldom used by phoneticians, although it applies just as well here as it does in music. In phonetics, timbre differences among vowels are typically referred to as differences in vowel quality or vowel color.
* From Hillenbrand and Houde (2003). “A narrow band pattern-matching model of vowel perception,” Journal of the Acoustical Society of America, 113,

36 Aperiodic sounds can also differ in spectrum envelope, and the perceptual differences are properly described as timbre differences.

37 Amplitude Envelope
Timbre is also affected by amplitude envelope. Amplitude envelope is a smooth line drawn to enclose a sound wave. It is also sometimes called the amplitude contour of the sound wave. These are both good terms since the amplitude envelope shows how overall signal amplitude varies over time. Amplitude envelope refers mainly to the characteristics of the way sounds are turned on and turned off. The four signals below are sinusoids that differ in their amplitude envelopes. Leading edge = attack; trailing edge = decay. The attack especially has a large effect on timbre.
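To make attack and decay concrete, the sketch below shapes the same 500 Hz sinusoid with two different amplitude envelopes. The linear ramps, the ramp durations, the sampling rate, and the function names are all assumptions chosen for illustration; the slides do not specify how their example signals were built.

```python
import math

FS = 8000   # sampling rate in Hz (assumed)

def envelope(n_samples, attack_s, decay_s):
    """Piecewise-linear amplitude envelope: ramp up, hold at 1, ramp down."""
    attack_n = round(attack_s * FS)
    decay_n = round(decay_s * FS)
    env = []
    for n in range(n_samples):
        remaining = n_samples - 1 - n
        rise = n / attack_n if attack_n and n < attack_n else 1.0
        fall = remaining / decay_n if decay_n and remaining < decay_n else 1.0
        env.append(min(rise, fall))
    return env

def shaped_sinusoid(freq, duration_s, attack_s, decay_s):
    """Sinusoid multiplied sample by sample by the amplitude envelope."""
    n_samples = round(duration_s * FS)
    env = envelope(n_samples, attack_s, decay_s)
    return [e * math.sin(2 * math.pi * freq * n / FS)
            for n, e in enumerate(env)]

# Same frequency and duration; only the leading and trailing edges differ.
abrupt = shaped_sinusoid(500, 0.5, attack_s=0.005, decay_s=0.2)
gradual = shaped_sinusoid(500, 0.5, attack_s=0.2, decay_s=0.005)

first_10_ms = round(0.010 * FS)
print("peak in first 10 ms, abrupt attack: ",
      round(max(abs(v) for v in abrupt[:first_10_ms]), 3))
print("peak in first 10 ms, gradual attack:",
      round(max(abs(v) for v in gradual[:first_10_ms]), 3))
```

The two signals contain the same sinusoid, but the one with the 5 ms attack reaches full amplitude almost immediately, while the one with the 200 ms attack is still very quiet after 10 ms.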

38 Same melody, same spectrum envelope (if sustained), different amplitude envelopes (i.e., different attack and decay characteristics). Note differences in timbre or sound quality as the amplitude envelope varies.

39 Timbre differences related to amplitude envelope also play a role in speech. Note the differences in the shape of the attack for [bɑ] vs. [wɑ] (top) and [ʃɑ] vs. [tʃɑ]. [bɑ]: abrupt attack; [wɑ]: more gradual attack; [ʃɑ]: more gradual attack; [tʃɑ]: abrupt attack.

