Speech Enhancement for ASR by Hans Hwang 8/23/2000 Reference 1. Alan V. Oppenheim,etc., ” Multi-Channel Signal Separation by Decorrelation ”,IEEE Trans.

Slides:



Advertisements
Similar presentations
Adaptive Filters S.B.Rabet In the Name of GOD Class Presentation For The Course : Custom Implementation of DSP Systems University of Tehran 2010 Pages.
Advertisements

2004 COMP.DSP CONFERENCE Survey of Noise Reduction Techniques Maurice Givens.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: The FIR Adaptive Filter The LMS Adaptive Filter Stability and Convergence.
Error detection and concealment for Multimedia Communications Senior Design Fall 06 and Spring 07.
1/44 1. ZAHRA NAGHSH JULY 2009 BEAM-FORMING 2/44 2.
Reduction of Additive Noise in the Digital Processing of Speech Avner Halevy AMSC 664 Final Presentation May 2009 Dr. Radu Balan Department of Mathematics.
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
Advances in WP1 Nancy Meeting – 6-7 July
Communications & Multimedia Signal Processing Meeting 6 Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 6 July,
Single-Channel Speech Enhancement in Both White and Colored Noise Xin Lei Xiao Li Han Yan June 5, 2002.
Speech Enhancement Based on a Combination of Spectral Subtraction and MMSE Log-STSA Estimator in Wavelet Domain LATSI laboratory, Department of Electronic,
3/24/2006Lecture notes for Speech Communications Multi-channel speech enhancement Chunjian Li DICOM, Aalborg University.
HIWIRE MEETING CRETE, SEPTEMBER 23-24, 2004 JOSÉ C. SEGURA LUNA GSTC UGR.
Speech Recognition in Noise
LORIA Irina Illina Dominique Fohr Chania Meeting May 9-10, 2007.
1 Speech Enhancement Wiener Filtering: A linear estimation of clean signal from the noisy signal Using MMSE criterion.
Adaptive Signal Processing
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Adaptive Noise Cancellation ANC W/O External Reference Adaptive Line Enhancement.
Sparsity-Aware Adaptive Algorithms Based on Alternating Optimization and Shrinkage Rodrigo C. de Lamare* + and Raimundo Sampaio-Neto * + Communications.
Normalization of the Speech Modulation Spectra for Robust Speech Recognition Xiong Xiao, Eng Siong Chng, and Haizhou Li Wen-Yi Chu Department of Computer.
HMM-BASED PSEUDO-CLEAN SPEECH SYNTHESIS FOR SPLICE ALGORITHM Jun Du, Yu Hu, Li-Rong Dai, Ren-Hua Wang Wen-Yi Chu Department of Computer Science & Information.
A VOICE ACTIVITY DETECTOR USING THE CHI-SQUARE TEST
Algorithm Taxonomy Thus far we have focused on:
Introduction to estimation theory Seoul Nat’l Univ.
Introduction to Adaptive Digital Filters Algorithms
1 An Efficient Classification Approach Based on Grid Code Transformation and Mask-Matching Method Presenter: Yo-Ping Huang Tatung University.
Rake Reception in UWB Systems Aditya Kawatra 2004EE10313.
Presented by Tienwei Tsai July, 2005
Nico De Clercq Pieter Gijsenbergh Noise reduction in hearing aids: Generalised Sidelobe Canceller.
Heart Sound Background Noise Removal Haim Appleboim Biomedical Seminar February 2007.
REVISED CONTEXTUAL LRT FOR VOICE ACTIVITY DETECTION Javier Ram’ırez, Jos’e C. Segura and J.M. G’orriz Dept. of Signal Theory Networking and Communications.
Reduction of Additive Noise in the Digital Processing of Speech Avner Halevy AMSC 663 Mid Year Progress Report December 2008 Professor Radu Balan 1.
Image Restoration using Iterative Wiener Filter --- ECE533 Project Report Jing Liu, Yan Wu.
Robust Speech Feature Decorrelated and Liftered Filter-Bank Energies (DLFBE) Proposed by K.K. Paliwal, in EuroSpeech 99.
LOG-ENERGY DYNAMIC RANGE NORMALIZATON FOR ROBUST SPEECH RECOGNITION Weizhong Zhu and Douglas O’Shaughnessy INRS-EMT, University of Quebec Montreal, Quebec,
Acoustic Noise Cancellation
Name : Arum Tri Iswari Purwanti NPM :
Signature with Text-Dependent and Text-Independent Speech for Robust Identity Verification B. Ly-Van*, R. Blouet**, S. Renouard** S. Garcia-Salicetti*,
Basics of Neural Networks Neural Network Topologies.
Nico De Clercq Pieter Gijsenbergh.  Problem  Solutions  Single-channel approach  Multichannel approach  Our assignment Overview.
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
Unit-V DSP APPLICATIONS. UNIT V -SYLLABUS DSP APPLICATIONS Multirate signal processing: Decimation Interpolation Sampling rate conversion by a rational.
Independent Component Analysis Algorithm for Adaptive Noise Cancelling 적응 잡음 제거를 위한 독립 성분 분석 알고리즘 Hyung-Min Park, Sang-Hoon Oh, and Soo-Young Lee Brain.
Active Noise Cancellation System
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.
Speech Enhancement Using a Minimum Mean Square Error Short-Time Spectral Amplitude Estimation method.
A Semi-Blind Technique for MIMO Channel Matrix Estimation Aditya Jagannatham and Bhaskar D. Rao The proposed algorithm performs well compared to its training.
Voice Activity Detection based on OptimallyWeighted Combination of Multiple Features Yusuke Kida and Tatsuya Kawahara School of Informatics, Kyoto University,
ECE 8443 – Pattern Recognition ECE 3163 – Signals and Systems Objectives: Stability Response to a Sinusoid Filtering White Noise Autocorrelation Power.
An Introduction To The Kalman Filter By, Santhosh Kumar.
1 An Efficient Classification Approach Based on Grid Code Transformation and Mask-Matching Method Presenter: Yo-Ping Huang.
CHAPTER 10 Widrow-Hoff Learning Ming-Feng Yeh.
Professors: Eng. Diego Barral Eng. Mariano Llamedo Soria Julian Bruno
3.7 Adaptive filtering Joonas Vanninen Antonio Palomino Alarcos.
ICASSP 2007 Robustness Techniques Survey Presenter: Shih-Hsiang Lin.
RCC-Mean Subtraction Robust Feature and Compare Various Feature based Methods for Robust Speech Recognition in presence of Telephone Noise Amin Fazel Sharif.
September 28, 2000 Improved Simultaneous Data Reconciliation, Bias Detection and Identification Using Mixed Integer Optimization Methods Presented by:
PHASE-BASED DUAL-MICROPHONE SPEECH ENHANCEMENT USING A PRIOR SPEECH MODEL Guangji Shi, M.A.Sc. Ph.D. Candidate University of Toronto Research Supervisor:
The Unscented Particle Filter 2000/09/29 이 시은. Introduction Filtering –estimate the states(parameters or hidden variable) as a set of observations becomes.
GROUP MEMBERS ELISHBA KHALID 07-CP-07 TAHIRA SAMEEN 07-CP-31.
Variable Step-Size Adaptive Filters for Acoustic Echo Cancellation Constantin Paleologu Department of Telecommunications
Speech Enhancement Summer 2009
LECTURE 30: SYSTEM ANALYSIS USING THE TRANSFER FUNCTION
Speech Enhancement with Binaural Cues Derived from a Priori Codebook
Two-Stage Mel-Warped Wiener Filter SNR-Dependent Waveform Processing
1-D DISCRETE COSINE TRANSFORM DCT
DCT-based Processing of Dynamic Features for Robust Speech Recognition Wen-Chi LIN, Hao-Teng FAN, Jeih-Weih HUNG Wen-Yi Chu Department of Computer Science.
Wiener Filtering: A linear estimation of clean signal from the noisy signal Using MMSE criterion.
Presenter: Shih-Hsiang(士翔)
Combination of Feature and Channel Compensation (1/2)
Presentation transcript:

Speech Enhancement for ASR by Hans Hwang 8/23/2000 Reference 1. Alan V. Oppenheim,etc., ” Multi-Channel Signal Separation by Decorrelation ”,IEEE Trans. on ASSP, , Yunxin Zhao,etc., ” Adaptive Co-channel Speech Separation and Recognition ”,IEEE Trans. On SAP, , Ing Yang Soon,etc., ” Noisy Speech Enhancement Using Discrete Cosine Transform ”,Speech communication, ,1998

Outline Signal Separation by S-ADF/LMS Speech Enhancement by DCT Residual Signal Reduction Experimental Results

Speech Signal Separation Introduction: -To Recover the desired signal and identify the unknown system from the observation signal -Speech signal recovered from SSS will increase SNR and improve the speech recognition accuracy -Specifically consider the two-channel case

SSS cont ’ d Two-channel model description A and B are cross-coupling effect between channels and we ignore the transfer function of each channel. x i (t) is source signal and y i (t) is acquired signal

SSS (cont ’ d) Source separation system (separate source signals out from acquired signals) and called decoupling filters and modeled as FIR filter

SSS by ADF Calculate the FIR coeff. by adaptive decorre- lation filter(ADF) proposed by A. V. Oppenheim in The objective is to design decoupling filter s.t., the estimated signals are uncorrelated. -The decoupling filtering coeff. ’ s are estimated iteratively based on the previous estimated filter coeff. ’ s and current observations

SSS by ADF (cont ’ d) The closed form of decoupling filters where

SSS by ADF (cont ’ d) Choice of adaptation gain -As time goes to infinite the adaptation gain goes to zero for the system stable consideration. -Optimal choice adaptation gain for the system stability and convergence. -

SSS by ADF (cont ’ d) The experiment of :

Source Signal Detection(SSD) Introduction -If one of the two is inactive then the estimated signals will be poor by ADF and cause the recog- nition errors. -So the ASR and ADF are performed within active region of each target signal.

SSD (cont ’ d)

SSD by coherence function If then

SSD (cont ’ d) - decision variable -Decision Rule:

SSD (cont ’ d) -Implementation using DFT and Result

SSD (cont ’ d)

Improved Filter Estimation Widrow ’ s LMS algorithm proposed in If we don ’ t know A or B in observation(i.e., one of the source signals is inactive) then the estimation of filters will cause much errors compared to the actual filters. -If we know source signal 2 is inactive(using SSD) then we only estimate filter B and remain filter A unchanged.

Improved Filter Estimation LMS algorithm and result

Experimental Results -Evaluate in terms of WRA and SIR

Experimental Result (cont ’ d) *Use 717 TIMIT sentences to train 62 phone units. Front-end feature is PLP and its dynamic. Grammar perplexity is 105. After acoustic normalization

Speech Enhancement using Discrete Cosine Transform Motivation -DCT provides significantly higher compaction as compared to the DFT

SE Using DCT (cont ’ d) -DCT provides higher spectral resolution than DFT -DCT is real transform so it has only binary phases. Its phase won ’ t be changed unless added noise is strong.

Estimating signal by MMSE Intorduction -y(t)=x(t)+n(t) and Y(k)=X(k)+N(k) Assume DCT coeff. ’ s are statistically independent and estimated signal is less diffenent from the original signal. -, by Bayes ’ rule and signal model

MMSE (cont ’ d) Estimating signal source by Decision Directed Estimation(DDE) ( proposed by Ephraim & Malah in ‘ 84 ) = 0.98 in computer simulation

Reduction of Residual Signal Introduction -If the source signal more likely exists then the estimated is more reliable. -two states of inputs H 0 :speech absent H 1 :speech present : modified filter output

Reduction of Residual Signal - where

Experimental Results Measure in Segmental SNR *EMFDETFDETF White noise added Fan noise added

Experimental Results