Seneff’s Auditory Model Miriam Cordero Ruiz (SONY Advanced Technology Center Stuttgart) Leuven, july 2002.

Slides:



Advertisements
Similar presentations
Frequency analysis.
Advertisements

An AER Analog Silicon Cochlea Model using Pseudo Floating Gate Transconductors Master Thesis in Electronics and Computer Science, Microelectronics Programme,
MPEG-1 MUMT-614 Jan.23, 2002 Wes Hatch. Purpose of MPEG encoding To decrease data rate How? –two choices: could decrease sample rate, but this would cause.
The case of the missing pitch templates: How harmonic templates emerge in the early auditory system Shihab Shamma and David Klein, 2000.
Multipitch Tracking for Noisy Speech
Student name Student ID Degree program Area of specialization.
Hearing over time Using the neural representation of the time waveform of sound.
1 Filters Definition: A filter is a frequency selective system that allows energy at certain frequencies and attenuates the rest.
The peripheral auditory system David Meredith Aalborg University.
3/23/2002 Copyright James D. Johnston Permission granted for any educational use. 1 What can we hear? James D. Johnston home.comcast.net/~retired_old_jj.
Vocal Emotion Recognition with Cochlear Implants Xin Luo, Qian-Jie Fu, John J. Galvin III Presentation By Archie Archibong.
Watkins, Raimond & Makin (2011) J Acoust Soc Am –2788 temporal envelopes in auditory filters: [s] vs [st] distinction is most apparent; - at higher.
Speech perception Relating features of hearing to the perception of speech.
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
I.1 ii.2 iii.3 iv.4 1+1=. i.1 ii.2 iii.3 iv.4 1+1=
I.1 ii.2 iii.3 iv.4 1+1=. i.1 ii.2 iii.3 iv.4 1+1=
Relationship between perception of spectral ripple and speech recognition in cochlear implant and vocoder listeners L.M. Litvak, A.J. Spahr, A.A. Saoji,
1 New Technique for Improving Speech Intelligibility for the Hearing Impaired Miriam Furst-Yust School of Electrical Engineering Tel Aviv University.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
COMMUNICATION SYSTEM EECB353 Chapter 2 Part IV AMPLITUDE MODULATION Dept of Electrical Engineering Universiti Tenaga Nasional.
Isabel Gambin, Ivan Grech, Owen Casha, Edward Gatt and Joseph Micallef Department of Microelectronics and Nanoelectronics University of Malta.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Audio and Music Representations (Part 2) 1.
Abstract We report comparisons between a model incorporating a bank of dual-resonance nonlinear (DRNL) filters and one incorporating a bank of linear gammatone.
EE Audio Signals and Systems Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
A physiologically motivated gammachirp auditory filterbank Toshio Irino (NTT Communication Sciences. Lab. Japan) Masashi Unoki (CNBH, Univ. Cambridge/JAIST)
Speech Science XI Speech Perception (auditory physiology) Version WS
Senior Design Fall 06 and Spring 07 Speech Strategy for the Cochlear Implant.
Basics of Neural Networks Neural Network Topologies.
Speaker Recognition by Habib ur Rehman Abdul Basit CENTER FOR ADVANCED STUDIES IN ENGINERING Digital Signal Processing ( Term Project )
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
Recognition of Speech Using Representation in High-Dimensional Spaces University of Washington, Seattle, WA AT&T Labs (Retd), Florham Park, NJ Bishnu Atal.
Gammachirp Auditory Filter
A NEW FEATURE EXTRACTION MOTIVATED BY HUMAN EAR Amin Fazel Sharif University of Technology Hossein Sameti, S. K. Ghiathi February 2005.
Speech Recognition Feature Extraction. Speech recognition simplified block diagram Speech Capture Speech Capture Feature Extraction Feature Extraction.
GUIDED BY T.JAYASANKAR, ASST.PROFESSOR OF ECE, ANNA UNIVERSITY OF TIRUCHIRAPPALLI. PRESENTED BY C.SENTHILKUMAR, REG.NO: , M.E(MBCBS),COM SYSTEM,VI.
P. N. Kulkarni, P. C. Pandey, and D. S. Jangamashetti / DSP 2009, Santorini, 5-7 July DSP 2009 (Santorini, Greece. 5-7 July 2009), Session: S4P,
IIT Bombay {pcpandey,   Intro. Proc. Schemes Evaluation Results Conclusion Intro. Proc. Schemes Evaluation Results Conclusion.
A Model of Binaural Processing Based on Tree-Structure Filter-Bank
Robust Feature Extraction for Automatic Speech Recognition based on Data-driven and Physiologically-motivated Approaches Mark J. Harvilla1, Chanwoo Kim2.
Performance Comparison of Speaker and Emotion Recognition
Computational Modeling of the Auditory Periphery:
The Traveling Wave. Reminder 2 Frequency Amplitude Frequency Phase Frequency domain Time domain (time) waveform Amplitude spectrum.
MEL’S DOG & PONY SHOW RADIO FREQUENCY FILTERS A radio frequency filter is an electronic circuit that will allow or not allow a band of frequencies to.
Feedback Filters n A feedback filter processes past output samples, as well as current input samples: n Feedback filters create peaks (poles or resonances)
RCC-Mean Subtraction Robust Feature and Compare Various Feature based Methods for Robust Speech Recognition in presence of Telephone Noise Amin Fazel Sharif.
Lecture 8 Neuromorphic Hearing 1. Outline The ear and the cochlea Artificial Vs biological vision systems Fundamentals of photo receptors Read-out strategies.
Lecture 8 Neuromorphic Hearing 1. Outline The ear and the cochlea Silicon Cochlea 2.
Piano Music Transcription Wes “Crusher” Hatch MUMT-614 Thurs., Feb.13.
RADIO RECIEVERS.
Fletcher’s band-widening experiment (1940) Present a pure tone in the presence of a broadband noise. Present a pure tone in the presence of a broadband.
Design a New Speech Encoder for Cochlear Implants Using DRNL model. Prepared By: Koka Gopi 3 rd sem, M.tech Communication Systems.
EFC Media Center Training
distance, m (log scale) -25o 0o +25o C50, dB left right L-shaped
Fletcher’s band-widening experiment (1940)
Presenter: Prof.Dr.-Eng. Gheorghe-Daniel Andreescu
Filters: Intuitive Understanding
III. Analysis of Modulation Metrics IV. Modifications
Lab 1: (i) TIMS introduction (ii) Modeling an Equation
Previous Microphone Array System Integrated Microphone Array System
1-channel 2-channel 4-channel 8-channel 16-channel Original
Lab 5 Part II Instructions
External ear functions
Josh H. McDermott, Eero P. Simoncelli  Neuron 
Snaring Otoferlin's Role in Deafness
MPEG-1 Overview of MPEG-1 Standard
Organizing Your Speech
Green Filters Cascade Polyphase M-to-1 Down Sample Filter, Inner Filter, and Polyphase 1-to-M Up Sample Filter fred harris.
Statistical and Signal Processing Approaches for Voicing Detection
Auditory Demonstrations
Auditory Demonstrations
Presentation transcript:

Seneff’s Auditory Model Miriam Cordero Ruiz (SONY Advanced Technology Center Stuttgart) Leuven, july 2002

Which is the best speech recognizer?

Introduction Auditory System Seneff’s Model Stage I Stage II Conclusions

Human Auditory System

t t Inner Hair Cells

Structure of the model CRITICAL BAND FILTER BANK HAIR CELL SYNAPSE MODEL ENVELOPE DETECTOR SYNCHRONY DETECTOR Mean rate spectrum synchrony spectrum STAGE I STAGE II STAGE III

Stage I: Auditory Filter Bank 40 channels ( Hz) BW 1channel =0,5 Barks

Design of the Auditory Filter Bank INITIAL COMPLEX ZEROES ZERO OF CASCADE RESONATOR CHANNEL 1CHANNEL 2CHANNEL 40 ……. f(Hz)

Stage II Physiological Data Model < 1kHz

Stages I+II CRITICAL BAND FILTER BANK HALFWAVE RECTIFICATION SHORT-TERM ADAPTATION LOW PASS FILTER RAPID AGC STAGE II STAGE I

Results

Other Peripheral Models Patterson-Meddis Gammatone Filterbank Lyon’s Cochlear Model Gammatone Filterbank Adaptation Stage

Conclusions Based on biological data Front-End for Speech Processing Speech Recognition, Speaker ID, Localization…. Better performance