Additivity of auditory masking using Gaussian-shaped tones a Laback, B., a Balazs, P., a Toupin, G., b Necciari, T., b Savel, S., b Meunier, S., b Ystad,

Slides:



Advertisements
Similar presentations
Revised estimates of human cochlear tuning from otoacoustic and behavioral measurements Christopher A. Shera, John J. Guinan, Jr., and Andrew J. Oxenham.
Advertisements

Adaptive Methods Research Methods Fall 2010 Tamás Bőhm.
Hearing relative phases for two harmonic components D. Timothy Ives 1, H. Martin Reimann 2, Ralph van Dinther 1 and Roy D. Patterson 1 1. Introduction.
Introduction Relative weights can be estimated by fitting a linear model using responses from individual trials: where g is the linking function. Relative.
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
CS 551/651: Structure of Spoken Language Lecture 11: Overview of Sound Perception, Part II John-Paul Hosom Fall 2010.
Speech Science XII Speech Perception (acoustic cues) Version
Psychoacoustics of Dynamic ‘Center-of-Gravity’ Signals Larry Feth Ashok Krishnamurthy Ohio State University.
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
Masker-First Advantage in Cued Informational Masking Studies Virginia M. Richards a, Rong Huang a, and Gerald Kidd Jr b. (a) Department of Psychology,
PH 105 Dr. Cecilia Vogel Lecture 10. OUTLINE  Subjective loudness  Masking  Pitch  logarithmic  critical bands  Timbre  waveforms.
Temporal processing Representation of the temporal characteristics of sound.
Interrupted speech perception Su-Hyun Jin, Ph.D. University of Texas & Peggy B. Nelson, Ph.D. University of Minnesota.
Evaluating Hypotheses
1 Hearing Sound is created by vibrations from a source and is transmitted through a media (such as the atmosphere) to the ear. Sound has two main attributes:
Two- tone unmasking and suppression in a forward-masking situation Robert V. Shannon 1976 Spring 2009 HST.723 Theme 1: Psychophysics.
Spectral centroid 6 harmonics: f0 = 100Hz E.g. 1: Amplitudes: 6; 5.75; 4; 3.2; 2; 1 [(100*6)+(200*5.75)+(300*4)+(400*3.2)+(500*2 )+(600*1)] / = 265.6Hz.
4aPP17. Effect of signal frequency uncertainty for random multi-burst maskers Rong Huang and Virginia M. Richards Department of Psychology, University.
Cross-Spectral Channel Gap Detection in the Aging CBA Mouse Jason T. Moore, Paul D. Allen, James R. Ison Department of Brain & Cognitive Sciences, University.
Sound source segregation (determination)
Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06.
Lecture 1 Signals in the Time and Frequency Domains
LIGO-G Z Coherent Coincident Analysis of LIGO Burst Candidates Laura Cadonati Massachusetts Institute of Technology LIGO Scientific Collaboration.
Adaptive Design of Speech Sound Systems Randy Diehl In collaboration with Bjőrn Lindblom, Carl Creeger, Lori Holt, and Andrew Lotto.
The Care and Feeding of Loudness Models J. D. (jj) Johnston Chief Scientist Neural Audio Kirkland, Washington, USA.
METHODOLOGY INTRODUCTION ACKNOWLEDGEMENTS LITERATURE Low frequency information via a hearing aid has been shown to increase speech intelligibility in noise.
T – Biomedical Signal Processing Chapters
LIGO-G Z LIGO Scientific Collaboration 1 Upper Limits on the Rate of Gravitational Wave Bursts from the First LIGO Science Run Edward Daw Louisiana.
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
Studies of Information Coding in the Auditory Nerve Laurel H. Carney Syracuse University Institute for Sensory Research Departments of Biomedical & Chemical.
Chapter 7: Loudness and Pitch. Loudness (1) Auditory Sensitivity: Minimum audible pressure (MAP) and Minimum audible field (MAF) Equal loudness contours.
Sounds in a reverberant room can interfere with the direct sound source. The normal hearing (NH) auditory system has a mechanism by which the echoes, or.
Gammachirp Auditory Filter
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
Confidence Intervals Lecture 3. Confidence Intervals for the Population Mean (or percentage) For studies with large samples, “approximately 95% of the.
Hearing Research Center
Analysis of Optimal and Suboptimal Discrete-Time Digital Communications Receivers Clemson University SURE Program 2005 Justin Ingersoll, Prof. Michael.
Pitch perception in auditory scenes 2 Papers on pitch perception… of a single sound source of more than one sound source LOTS - too many? Almost none.
SOUND PRESSURE, POWER AND LOUDNESS MUSICAL ACOUSTICS Science of Sound Chapter 6.
A comparison of methods for characterizing the event-related BOLD timeseries in rapid fMRI John T. Serences.
Spatial Smoothing and Multiple Comparisons Correction for Dummies Alexa Morcom, Matthew Brett Acknowledgements.
1 Hearing Sound is created by vibrations from a source and is transmitted through a media (such as the atmosphere) to the ear. Sound has two main attributes:
Introduction to psycho-acoustics: Some basic auditory attributes For audio demonstrations, click on any loudspeaker icons you see....
Nuclear Accent Shape and the Perception of Syllable Pitch Rachael-Anne Knight LAGB 16 April 2003.
Outline Transmitters (Chapters 3 and 4, Source Coding and Modulation) (week 1 and 2) Receivers (Chapter 5) (week 3 and 4) Received Signal Synchronization.
A. R. Jayan, P. C. Pandey, EE Dept., IIT Bombay 1 Abstract Perception of speech under adverse listening conditions may be improved by processing it to.
A comparative approach for gene network inference using time-series gene expression data Guillaume Bourque* and David Sankoff *Centre de Recherches Mathématiques,
Background Matching Pursuit Signal Decomposition Matching pursuit (MP) gives a representation of a signal as the sum of a number of “atoms” selected from.
Fletcher’s band-widening experiment (1940)
SOUND PRESSURE, POWER AND LOUDNESS
4aPPa32. How Susceptibility To Noise Varies Across Speech Frequencies
Authors: Peter W. Battaglia, Robert A. Jacobs, and Richard N. Aslin
Precedence-based speech segregation in a virtual auditory environment
CS 591 S1 – Computational Audio
Copyright © American Speech-Language-Hearing Association
Ana Alves-Pinto, Joseph Sollini, Toby Wells, and Christian J. Sumner
Spread Spectrum Audio Steganography using Sub-band Phase Shifting
Effects of divided attention and sensorimotor synchronization on detection of timing perturbations in auditory sequences Bruno H. Repp Haskins Laboratories,
The “Flash-Lag” Effect Occurs in Audition and Cross-Modally
Statistical Methods For Engineers
Error rate due to noise In this section, an expression for the probability of error will be derived The analysis technique, will be demonstrated on a binary.
Volume 62, Issue 1, Pages (April 2009)
Gary Margrave and Michael Lamoureux
Volume 61, Issue 2, Pages (January 2009)
Coherent Coincident Analysis of LIGO Burst Candidates
Volume 62, Issue 1, Pages (April 2009)
Speech Perception (acoustic cues)
Govt. Polytechnic Dhangar(Fatehabad)
Integration Trumps Selection in Object Recognition
Encoding of Stimulus Probability in Macaque Inferior Temporal Cortex
Presentation transcript:

Additivity of auditory masking using Gaussian-shaped tones a Laback, B., a Balazs, P., a Toupin, G., b Necciari, T., b Savel, S., b Meunier, S., b Ystad, S., and b Kronland-Martinet, R. a Acoustics Research Institute, Austrian Acad. of Sciences, Austria b Laboratoire de Mécanique et d'Acoustique, CNRS Marseille, France MULAC Meeting, Vienna Sept 24rd, Additivity of auditory masking using Gaussian-shaped tones a Laback, B., a Balazs, P., a Toupin, G., b Necciari, T., b Savel, S., b Meunier, S., b Ystad, S., and b Kronland-Martinet, R. a Acoustics Research Institute, Austrian Acad. of Sciences, Austria b Laboratoire de Mécanique et d'Acoustique, CNRS Marseille, France MULAC Meeting, Vienna Sept 24rd, Acoustics Research Institute Austrian Academy of Sciences

Motivation Both temporal and frequency masking have been studied extensively in the literature Very little is known about their interaction, i.e., masking in the time-frequency domain An accompanying study (Necciari et al., this conference) presents data on time-frequency time- frequency masking caused by a Gaussian-shaped tone pulse (“Gaussian”) Our aim is to study the additivity of masking from multiple Gaussian maskers Taken together, these data may serve as a basis to model time-frequency masking in complex signals

Tim-frequency masking time frequency

time frequency Tim-frequency masking

time frequency Tim-frequency masking

Outline 3 steps: Additivity of temporal masking Additivity of frequency masking Additivity of time-frequency masking (not presented today)

Experiment design Both signal and maskers are Gaussian-windowed tones: with Γ: gamma factor: (Γ = α.f 0 ), where f 0 is the tone frequency and α the shape factor Equivalent rectangular bandwidth (  Γ ) : 600 Hz Equivalent rectangular duration: 1.7 ms Good properties of Gaussian in time-frequency domain: Minimal spread in time-frequency Gaussian shape in both time and frequency A study by van Schijndel et al. (1999) has shown that Gaussian-windowed tones with an appropriate alpha factor may fit the auditory time-frequency window.

Procedure: – 3 interval - 3 AFC (oddity task) – Adaptive procedure: 3 down - 1 up rule (estimates the 79.4% threshold) – 12 turnarounds, the last 8 used to calculate the threshold – Stepsize: 5 dB, halved after 2 turnarounds Repeated measurements to have at least three stable values Presented in blocks of equivalent number of maskers Five subjects, normal hearing according to standard audiometric tests Experiment design

Additivity of temporal masking Design – Frequency (target and maskers): 4000 Hz – Four maskers with time shifts: -24, -16, -8, +8 ms – Maskers nearly equally effective (iterative approach) Amount of masking:  8 dB – Combinations: “M 2 -M 3 ”, “M 3 -M 4 ”, “M 1 -M 2 -M 3 ”, “M 2 -M 3 -M 4 ”, “M 1 -M 2 -M 3 -M 4 ” Δt M3M3 TM2M2 time (ms) M1M1 M4M

Waveform of four maskers at equally effective levels (target at masked threshold for single masker) M1 M2 M3 M4 T

Differences between masker levels and masked thresholds for single maskers Five subjects Forward maskingBackward masking Error bars: 95% confidence intervals

Additivity of temporal masking Average results over five subjects p << 0.05 p >> 0.05 Empty symbols: measured data Filled symbols: linear additivity model Error bars: 95% confidence intervals

Additivity of temporal masking Average results over five subjects Error bars: 95% confidence intervals

Summary of temporal masking data (average) No difference between forward and backward maskers Amount of masking increases with number of maskers: – 2 maskers vs. 1 masker: + 18 dB (p << 0.05) – 3 maskers vs. 2 maskers: + 5 dB (p << 0.05) – 4 maskers vs. 3 maskers: + 11 dB (p << 0.05) Amount of excess masking (nonlinear additivity) increases with number of maskers – 2 maskers: 14 dB – 3 maskers: 17 dB – 4 maskers: 26 dB Results qualitatively consistent with literature data using stimuli with no or little temporal overlap of maskers

Additivity of frequency masking Design – Target frequency: 5611 Hz – Four simultaneous maskers with frequency separations: -7, -5, -3, +3 erbs – Maskers nearly equally effective – Amount of masking: 8 dB – Combinations: as for temporal masking Δf M3M3 TM2M2 Frequency (erb) M1M1 M4M

Additivity of frequency masking Design Cochlear distortions (combination tones) could be detection cues Therefore, lowpass-filtered background noise was added The most critical condition (M3+T) was tested with/without noise on two subjects No difference in threshold: so finally NO masking noise!

Masking towards higher frequencies Masking towards lower frequencies Error bars: 95% CI Differences between masker levels and masked thresholds for single maskers Five subjects

Additivity of frequency masking Average results over five subjects Error bars: 95% CI Empty symbols: measured data Filled symbols: linear additivity model

Summary of frequency masking data (average) Amount of masking depends on maskers involved: – M 2 -M 3 vs. single: 3 dB (p < 0.05) – M 3 -M 4 vs. single: 15 dB (p << 0.05) – M 1 -M 2 -M 3 vs. M 2 -M 3 : 5 dB (p < 0.05) – M 2 -M 3 -M 4 vs. M 3 -M 4 : 0 dB (p > 0.05) – M 2 -M 3 -M 4 vs. M 2 -M 3 : 14 dB (p << 0.05) – M 1 -M 2 -M 3 -M 4 vs. M 1 -M 2 -M 3 : 9 dB (p << 0.05) – M 1 -M 2 -M 3 -M 4 vs. M 2 -M 3 -M 4 : 0 dB (p > 0.05) Excess masking (nonlinear additivity) mainly occurring when higher-frequency masker (M4) included – Pairs: 2-3: 0 dB, 3-4: 15 dB – Triples: 1-2-3: 5 dB, 2-3-4: 13 dB – Quadruple: 14 dB

M1 Maskers M1,M2, and M3 overlap with each other, but not with M4 M2 M3 M4 Waveform of four maskers at equally effective levels (target at masked threshold for single masker) T

Discussion and Conclusions Strong excess masking for Gaussian maskers if they are physically non-overlapping Amount of excess masking increases monotonically with number of non-overlapping maskers Excess masking is thought to be related to the compressivity of BM vibration (e.g. Humes and Jesteadt, 1989) Thus, our Gaussians seem to be subject to BM compression, even though they are rather short (ERD = 1.7 ms) This is consistent with the physiological finding that the BM starts to be highly compressive already 0.5 to 0.7 ms after the onset of a signal (Recio et al., 1998)

Modeling of Results Linear Energy Summation Model Assumption: Masked threshold proportional to masker energy at out put of integrator stage Combining two equally effective maskers A and B should produce X + 3 dB of masking Valid for completely overlapping maskers Nonlinear Model Assumption: Compressive nonlinearity in auditory system is preceding the integrator stage Combining maskers A and B results in more than linear additivity (excess masking) Valid for non-overlapping maskers

Modeling of Results General form: where M A, B : Amount of masking produced by maskers A or B M AB : Amount of masking produced by the combination of maskers A and B J: Compressive nonlinearity in peripheral auditory processing

Modeling of Results Power-law model (Lutfi, 1980): – for p = 1: linear model – for p < 1: compressive model MT X : Masked threshold of masker X Modified Power-law model (Humes et al., 1989): – Threshold in quiet (QT) considered as “internal noise”

Start with Temporal Masking: → perfect masker separation Power Model:  best fit for p = 0.2  Mean error: 1.9 dB Modified power model:  Prediction always too low

Include Correction for Quiet Threshold: -7 dB Power Model:  Mean error: 1.9 dB Modified power model:  Mean error: 1.6 dB Why correction required? → Probably, absolute thresholds for Gaussians are no good approximation for internal noise

Spectral Masking: Using same p-value (0.2) and threshold correction Power Model: Good fit only for M3M4 (non-overlapping) Modified power model: too high predictions Adjustment of parameters required!

p-values optimized for Modified Power model

Some questions Can we derive appropriate p-values from amount of overlap between maskers? Can the (modified) power model be included into the Gabor- Multiplier framework to predict time-frequency masking effects for complex signals?

More experiments to test the model

Acknowledgements We would like to thank – the subjects for their patience – Piotr Majdak for providing support in the development of the software for the experiments Work partly supported by WTZ (project AMADEUS) and WWTF (project MULAC)

End of talk

p-values optimized for Power model

Time-frequency conditions time frequency