1 SPEECH PROCESSING FOR BINAURAL HEARING AIDS Dr P. C. Pandey EE Dept., IIT Bombay Feb’03.

Slides:



Advertisements
Similar presentations
Revised estimates of human cochlear tuning from otoacoustic and behavioral measurements Christopher A. Shera, John J. Guinan, Jr., and Andrew J. Oxenham.
Advertisements

Hearing Aids and Hearing Impairments Part II Meena Ramani 02/23/05.
Hearing and Deafness Outer, middle and inner ear.
MIMICKING THE HUMAN EAR Philipos Loizou (author) Oliver Johnson (me)
More From Music music through a cochlear implant Dr Rachel van Besouw Hearing & Balance Centre, ISVR.
Pitch organisation in Western tonal music. Pitch in two dimensions Pitch perception in music is often thought of in two dimensions, pitch height and pitch.
Speech Science XII Speech Perception (acoustic cues) Version
EE Dept., IIT Bombay Workshop “AICTE Sponsored Faculty Development Programme on Signal Processing and Applications", Dept. of Electrical.
M.Sc. in Medical Engineering
Sensory Systems: Auditory. What do we hear? Sound is a compression wave: When speaker is stationary, the air is uniformly dense Speaker Air Molecules.
Interrupted speech perception Su-Hyun Jin, Ph.D. University of Texas & Peggy B. Nelson, Ph.D. University of Minnesota.
Speech Perception Richard Wright Linguistics 453.
TOPIC 4 BEHAVIORAL ASSESSMENT MEASURES. The Audiometer Types Clinical Screening.
Relationship between perception of spectral ripple and speech recognition in cochlear implant and vocoder listeners L.M. Litvak, A.J. Spahr, A.A. Saoji,
Sound source segregation (determination)
Hearing Loss Hearing loss can be divided into two basic types: Temporary threshold shifts Permanent loss Hearing loss resulting from damage.
Chapter 6: The Human Ear and Voice
Unit 4: Sensation & Perception
Hearing.
Dr. P. C. Pandey EE Dept, IIT Bombay Education B.Tech. (BHU, 1979), M.Tech. (IIT Kanpur,1981), Ph.D. (Toronto, 1987) Employment.
Figures for Chapter 6 Compression
Hearing.
IIT Bombay ICA 2004, Kyoto, Japan, April 4 - 9, 2004   Introdn HNM Methodology Results Conclusions IntrodnHNM MethodologyResults.
CSD 5400 REHABILITATION PROCEDURES FOR THE HARD OF HEARING Auditory Perception of Speech and the Consequences of Hearing Loss.
EE Audio Signals and Systems Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Second International Conference on Intelligent Interactive Technologies and Multimedia (IITM 2013), March 2013, Allahabad, India 09 March 2013 Speech.
IIT Bombay Dr. Prem C. Pandey Dr. Pandey is a Professor in Electrical Engineering at IIT Bombay. He is currently also the Associate.
IIT Bombay AIM , IIT Bombay, 27 June ’03 1 Online Monitoring of Dissipation Factor Dayashankar Dubey (MTech) Suhas P. Solanki,
IIT Bombay AIM 2003, 27th June, 2003 R&D Areas Faculty Academics Labs & Facilities Projects R&D AreasFacultyAcademicsLabs & FacilitiesProjects 1 Electronic.
Prof. Brian L. Evans Dept. of Electrical and Computer Engineering The University of Texas at Austin EE445S Real-Time Digital Signal Processing Lab Fall.
♠♠♠♠ 1Intro 2.Loudness 3.Method. 4.Results 5.Concl. ♦♦ ◄◄ ► ► 1/161Intro 2.Loudness 3.Method. 4.Results 5.Concl. ♦♦ ◄ ► IIT Bombay ICA 2010 : 20th Int.
Page 0 of 23 MELP Vocoders Nima Moghadam SN#: Saeed Nari SN#: Supervisor Dr. Saameti April 2005 Sharif University of Technology.
Hearing Test ng_test/ ng_test/
CAPD: ”Behavioral assessment”
1/18 1.Intro 2. Implementation 3. Results 4. Con.
♠ 1.Intro 2. List. tests 3. Results 4 Concl.♠♠ 1.Intro 2. List. tests 3. Results 4 Concl. ♥♥ ◄◄ ► ► 1/17♥♥◄ ► IIT Bombay ICA 2010 : 20th Int. Congress.
♥♥♥♥ 1. Intro. 2. VTS Var.. 3. Method 4. Results 5. Concl. ♠♠ ◄◄ ►► 1/181. Intro.2. VTS Var..3. Method4. Results5. Concl ♠♠◄◄►► IIT Bombay NCC 2011 : 17.
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
Pure Tone Audiometry most commonly used test for evaluating auditory sensitivity delivered primarily through air conduction and bone conduction displayed.
Hearing & Aging Or age brings wisdom and other bad news.
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
COCHLEAR IMPLANTS Brittany M. Alphonse Biomedical Engineering BME 181.
Gammachirp Auditory Filter
EE Dept., IIT Bombay IEEE Workshop on Intelligent Computing, IIIT Allahabad, Oct Signal processing for improving speech.
P. N. Kulkarni, P. C. Pandey, and D. S. Jangamashetti / DSP 2009, Santorini, 5-7 July DSP 2009 (Santorini, Greece. 5-7 July 2009), Session: S4P,
IIT Bombay 14 th National Conference on Communications, 1-3 Feb. 2008, IIT Bombay, Mumbai, India 1/27 Intro.Intro.
IIT Bombay {pcpandey,   Intro. Proc. Schemes Evaluation Results Conclusion Intro. Proc. Schemes Evaluation Results Conclusion.
Hearing Sound and the limits to hearing Structure of the ear: Outer, middle, inner Outer ear and middle ear functions Inner ear: the cochlea - Frequency.
EE Dept., IIT Bombay P. C. Pandey, "Signal processing for persons with sensorineural hearing loss: Challenges and some solutions,”
Vocal Tract & Lip Shape Estimation By MS Shah & Vikash Sethia Supervisor: Prof. PC Pandey EE Dept, IIT Bombay AIM-2003, EE Dept, IIT Bombay, 27 th June,
EE Dept., IIT Bombay Part B Sliding-band Dynamic Range Compression (N. Tiwari & P. C. Pandey, NCC 2014) P. C. Pandey, "Signal processing.
Introduction to psycho-acoustics: Some basic auditory attributes For audio demonstrations, click on any loudspeaker icons you see....
EE Dept., IIT Bombay Workshop “Radar and Sonar Signal Processing,” NSTL Visakhapatnam, Aug 2015 Coordinator: Ms. M. Vijaya.
IIT Bombay 17 th National Conference on Communications, Jan. 2011, Bangalore, India Sp Pr. 1, P3 1/21 Detection of Burst Onset Landmarks in Speech.
A. R. Jayan, P. C. Pandey, EE Dept., IIT Bombay 1 Abstract Perception of speech under adverse listening conditions may be improved by processing it to.
EE Dept., IIT Bombay CEP-cum-TEQUIP-KITE Course “Digital Signal Processing”, IIT Bombay, 2–6 November 2015, Course Coordinator:
Chapter 4 Sensation and Perception. The Ear Audition = hearing Audition = hearing Sounds = mechanical energy typically caused by vibrating objects Sounds.
HEARING Module 20. Hearing – sound waves  Audition – the sense or act of hearing  Frequency – the number of complete wavelengths that pass a point in.
AP Psychology Unit 4 Module 20
Hearing As with the eye, the ear receives waves, this time of sounds. As with the eye, the ear receives waves, this time of sounds. Length of wave = pitch.
Hearing or audition.
Hearing tests.
PSYCHOACOUSTICS A branch of psychophysics
Automated Detection of Speech Landmarks Using
The influence of hearing loss and age on sensitivity to temporal fine structure Brian C.J. Moore Department of Experimental Psychology, University of Cambridge,
Presentation by Maliha Khan and Kevin Kemelmakher
A Smartphone App-Based
Speech Perception (acoustic cues)
Auditory, Tactical, and Olfactory Displays
Presenter: Shih-Hsiang(士翔)
Presentation transcript:

1 SPEECH PROCESSING FOR BINAURAL HEARING AIDS Dr P. C. Pandey EE Dept., IIT Bombay Feb’03

2 R&D activities in SPI Lab, EE Dept, IIT Bombay Speech & hearing Healthcare instrumentation Impedance cardiography Industrial instrumentation

3 Speech & hearing Speech processing for improving perception by persons with sensori-neural hearing loss: - Consonantal enhancement (with Prof SD Agashe) - Binaural dichotic presentation Vocal tract shape estimation for speech training of deaf children Speech synthesis and study of phonemic features using HNM Cancellation of background noise in alaryngeal speech using spectral subtraction

4 Healthcare instrumentation Low cost diagnostic audiometer Impedance glottograph for voice pitch Impedance cardiograph for sports medicine. Intravenous drip rate indicator Communicator for children with cerebral palsy (with Prof GG Ray) Non-invasive ultrasonic thermometry system (with Prof T Anjaneyulu) Myoelectric hand (with Prof SR Devasahayam & R Lal)

5 Impedance cardiography  Signal processing for improving the estimation of stroke volume from impedance cardiogram Industrial Instrumentation  Noninvasive m/s of single phase fluid flow using ultrasonic crosscorrelation technique (with Prof T Anjaneyulu)  Online measurement of dielectric dissipation factor for condition monitoring of high voltage insulation (with Prof SV Kulkarni)

6

7

8

9

10

11 Speech Processing for Binaural Hearing Aids Hearing system Outer ear  Middle ear  Inner ear  Cochlear nerve  Brain Hearing impairments Conducrtive Sensorineural Central Functional Sensory Aids for the hearing impaired Hearing aids Cochlear prosthesis Visual & tactile aids

12 Causes of sensorineural loss Loss of sensory hair cells in cochlea Degeneration of auditory nerve fibers Characteristics of sensorineural loss Frequency dependent shifts in hearing thresholds Reduced dynamic range, loudness recruitment Poor frequency selectivity & increased spectral masking Reduced temporal resolution & increased temporal masking

13 Effects of increased spectral masking Smearing of spectral peaks and valleys due to broader auditory filters Reduction of internal spectral contrast Reduced discrimination of consonantal place feature Effects of increased temporal masking Forward and backward masking of weak segments by strong ones Reduced ability to discriminate sub-phonemic segments like noise bursts, voice-onset-time, and formant transitions

14 Speech processing for dichotic presentation for binaural hearing aids to reduce the effects of masking  Masking takes place at the peripheral level of the auditory system  Information from the two ears gets integrated at higher levels in the perception process  Binaural dichotic presentation for persons with bilateral residual hearing: - Speech signal split in a complementary form, - Signal components likely to mask each other presented to different ears, - - Information integrated at higher levels, for better speech perception

15 Binaural dichotic presentation schemes  Spectral splitting Filtering by 2 complementary comb filters: better place reception  Temporal splitting Gating by 2 complementary fading functions: better duration reception  Combined splitting Processing by 2 time-varying comb filters  All the sensory cells of the basilar membrane get periodic relaxation from stimulation: better perception of consonantal duration, place, and other features

16 Inter-aural fading with trapezoidal transition and inter-aural overlap Temporal splitting of the signal for dichotic presentation using w 1 ( n ) and w 2 ( n ) TEMPORAL SPLITTING WITH TRAPEZOIDAL FADING Inter-aural switching period = 20 ms, Duty cycles = 70%, Transition durations = 0, 1, 2, 3 ms s1(n)s1(n) s2(n)s2(n) w1(n)w1(n) w2(n)w2(n) s (n)s (n)   n n w1(n)w1(n) L N w2(n)w2(n) N L M M

17 Investigations with spectral splitting Auditory filter bandwidth based comb filters 18 bands over 5 kHz, 256 coefficient linear phase filters, designed using frequency sampling technique Listening tests with hearing impaired subjects: improvement in response time, recognition scores, & reception of place feature Better results with perceptually balanced filters 1 dB ripple, 30 dB attenuation, 4-6 dB crossover Filters with personalized frequency response Overall improvement, but not particularly for place

18 Combined splitting with time-varying filters  m /2   m /2 +1   m /2 +2  mm 11 22 Time varying comb filter 1 Time varying comb filter 2 s(n)s(n) s2(n)s2(n) s1(n)s1(n) set of filter coefficients Magnitude 1 2 m 1 Frequency Sweep cycle duration = 20 ms. With m shiftings, each pair of comb filter processes for 20/ m ms

19 Inten. dB Frequency (kHz) Time in ms Inten. dB Frequency (kHz) Time in ms (a) (b) An idealized representation of magnitude response of the pair of time- varying comb filters using 4 shiftings for the (a) left ear (b) right ear

Normalized frequency Magnitude (dB)

21 Time-varying comb filters Set of linear phase 256-coeff. FIR filters with pre-calculated coefficients (designed using iterative use of frequency sampling technique). Comb filter responses optimized for min. perceived spectral distortion: low passband ripple & high stopband attenuation, inter-band crossover gains adjusted for loudness balance. Pass band ripple 30 dB Gain at inter-band crossovers: -4 to -6 dB Sweep cycle duration : 20 ms Number of shiftings: 2, 4, 8, 16

22 Listening tests for evaluation of the schemes Test material: Closed set of 12 VCV syllables, formed with consonants / p, t, k, b, d, g, m, n, s, z, f, v / and vowel / a / Subjects & listening condition: Normal hearing subjects with loss simulated by Gaussian noise with short-time (~10 ms) SNRs of  6 : -15 dB. MCL( 70–75 dB SPL) Hearing impaired subjects with bilateral sensorineural loss. MCL. Performance measurement Response time statistics Stimulus-response confusion matrix Recognition scores Rel. information trans. of consonantal features

23 Listening test set-up Acoustically Isolated Chamber Lowpass Filter and Audio Amplifier PCL- 208 D/A Ports PC RS232C s1(t)s1(t) s2(t)s2(t)  Subject terminal Subject

24 Conclusions All the three schemes improve response time, recognition scores, & rel. info. tr. for overall and various speech features. Extent of improvement with a scheme related to nature of the loss - Severe high frequency hearing loss : Max. improvement with temporal splitting (17.9%). - Symmetrically low frequency hearing loss and symmetrically sloping high frequency hearing loss: max improvement with spectral splitting (17.5%) & combined splitting with 8 shiftings (20.5%). - Asymmetrical high frequency loss: temporal splitting (7.6%) & combined splitting (7.6%) (contd.)

25 Spectral splitting more effective in reducing perceptual load. Overall max improvement in rec. scores with combined splitting with 8 shiftings. Temporal splitting mainly improved the duration feature perception. Spectral splitting mainly improved the the place feature perception. Combined splitting with 8 improved perception of both duration and place. Reception of the relatively robust consonantal features (voicing, manner, and nasality) not adversally affected by splitting. Personalized filter response gives additional improvement

26 Next Listening tests with a larger number of S’s to establish relationship between processing parameters & nature of loss. Individualized multi-band compression. Implementation of the processing schemes as part of wearable hearing aids, with personalized parameter setting. Effect of binaural dichotic listening on non-speech signals & source localization to be investigated. Investigations with combination of consonant enhancement with dichotic presentation.

27 THANK YOU