Seneff’s Auditory Model Miriam Cordero Ruiz (SONY Advanced Technology Center Stuttgart) Leuven, july 2002
Which is the best speech recognizer?
Introduction Auditory System Seneff’s Model Stage I Stage II Conclusions
Human Auditory System
t t Inner Hair Cells
Structure of the model CRITICAL BAND FILTER BANK HAIR CELL SYNAPSE MODEL ENVELOPE DETECTOR SYNCHRONY DETECTOR Mean rate spectrum synchrony spectrum STAGE I STAGE II STAGE III
Stage I: Auditory Filter Bank 40 channels ( Hz) BW 1channel =0,5 Barks
Design of the Auditory Filter Bank INITIAL COMPLEX ZEROES ZERO OF CASCADE RESONATOR CHANNEL 1CHANNEL 2CHANNEL 40 ……. f(Hz)
Stage II Physiological Data Model < 1kHz
Stages I+II CRITICAL BAND FILTER BANK HALFWAVE RECTIFICATION SHORT-TERM ADAPTATION LOW PASS FILTER RAPID AGC STAGE II STAGE I
Results
Other Peripheral Models Patterson-Meddis Gammatone Filterbank Lyon’s Cochlear Model Gammatone Filterbank Adaptation Stage
Conclusions Based on biological data Front-End for Speech Processing Speech Recognition, Speaker ID, Localization…. Better performance