Variable Step-Size Adaptive Filters for Acoustic Echo Cancellation Constantin Paleologu Department of Telecommunications

Slides:



Advertisements
Similar presentations
Acoustic Echo Cancellation for Low Cost Applications
Advertisements

Speech Enhancement through Noise Reduction By Yating & Kundan.
Adaptive Filters S.B.Rabet In the Name of GOD Class Presentation For The Course : Custom Implementation of DSP Systems University of Tehran 2010 Pages.
Adaptive IIR Filter Terry Lee EE 491D May 13, 2005.
OPTIMUM FILTERING.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: The FIR Adaptive Filter The LMS Adaptive Filter Stability and Convergence.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Newton’s Method Application to LMS Recursive Least Squares Exponentially-Weighted.
Single-Channel Speech Enhancement in Both White and Colored Noise Xin Lei Xiao Li Han Yan June 5, 2002.
3/24/2006Lecture notes for Speech Communications Multi-channel speech enhancement Chunjian Li DICOM, Aalborg University.
Zhengyou Zhang, Qin Cai, Jay Stokes
Nonlinear and Time Variant Signal Processing R.C. Maher ECEN4002/5002 DSP Laboratory Spring 2002.
Frequency-Domain Adaptive Filters Wu, Yihong EE 491D
An Adaptive Predictor for Media Playout Buffering Phillip DeLeon New Mexico State University Cormac J. Sreenan AT&T Labs ICASSP 99’
EE513 Audio Signals and Systems Wiener Inverse Filter Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
ABSTRACT: Noise cancellation systems have been implemented to counter the effects of echoes in communications systems. These systems use algorithms that.
Adaptive Signal Processing
Normalised Least Mean-Square Adaptive Filtering
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Adaptive Noise Cancellation ANC W/O External Reference Adaptive Line Enhancement.
Sparsity-Aware Adaptive Algorithms Based on Alternating Optimization and Shrinkage Rodrigo C. de Lamare* + and Raimundo Sampaio-Neto * + Communications.
RLSELE Adaptive Signal Processing 1 Recursive Least-Squares (RLS) Adaptive Filters.
Digital Audio Signal Processing Lecture-4: Acoustic Echo Cancellation Marc Moonen Dept. E.E./ESAT-STADIUS, KU Leuven homes.esat.kuleuven.be/~moonen/
Yuan Chen Advisor: Professor Paul Cuff. Introduction Goal: Remove reverberation of far-end input from near –end input by forming an estimation of the.
Algorithm Taxonomy Thus far we have focused on:
Effect of Mutual Coupling on the Performance of Uniformly and Non-
Introduction to Adaptive Digital Filters Algorithms
Scheme for Improved Residual Echo Cancellation in Packetized Audio Transmission Jivesh Govil Digital Signal Processing Laboratory Department of Electronics.
Technical Seminar Presented by :- Debabandana Apta (EC ) National Institute of Science and Technology [1] “ECHO CANCELLATION” Presented.
By Asst.Prof.Dr.Thamer M.Jamel Department of Electrical Engineering University of Technology Baghdad – Iraq.
4/5/00 p. 1 Postacademic Course on Telecommunications Module-3 Transmission Marc Moonen Lecture-6 Adaptive Equalization K.U.Leuven/ESAT-SISTA Module-3.
Acoustic impulse response measurement using speech and music signals John Usher Barcelona Media – Innovation Centre | Av. Diagonal, 177, planta 9,
REVISED CONTEXTUAL LRT FOR VOICE ACTIVITY DETECTION Javier Ram’ırez, Jos’e C. Segura and J.M. G’orriz Dept. of Signal Theory Networking and Communications.
1 UTILIZING CHANNEL CODING INFORMATION IN CIVA- BASED BLIND SEQUENCE DETECTORS Xiaohua(Edward) Li Department of Electrical and Computer Engineering State.
Multiuser Detection (MUD) Combined with array signal processing in current wireless communication environments Wed. 박사 3학기 구 정 회.
Comparison and Analysis of Equalization Techniques for the Time-Varying Underwater Acoustic Channel Ballard Blair PhD Candidate MIT/WHOI.
CE Digital Signal Processing Fall 1992 Waveform Coding Hossein Sameti Department of Computer Engineering Sharif University of Technology.
1 Analysis for Adaptive DOA Estimation with Robust Beamforming in Smart Antenna System 指導教授:黃文傑 W.J. Huang 研究生 :蔡漢成 H.C. Tsai.
Study on the Use of Error Term in Parallel- form Narrowband Feedback Active Noise Control Systems Jianjun HE, Woon-Seng Gan, and Yong-Kim Chong 11 th Dec,
Name : Arum Tri Iswari Purwanti NPM :
EE 426 DIGITAL SIGNAL PROCESSING TERM PROJECT Objective: Adaptive Noise Cancellation.
1.Processing of reverberant speech for time delay estimation. Probleme: -> Getting the time Delay of a reverberant speech with severals microphone. ->Getting.
CHANNEL ESTIMATION FOR MIMO- OFDM COMMUNICATION SYSTEM PRESENTER: OYERINDE, OLUTAYO OYEYEMI SUPERVISOR: PROFESSOR S. H. MNENEY AFFILIATION:SCHOOL OF ELECTRICAL,
Jessica Arbona & Christopher Brady Dr. In Soo Ahn & Dr. Yufeng Lu, Advisors.
1 PCM & DPCM & DM. 2 Pulse-Code Modulation (PCM) : In PCM each sample of the signal is quantized to one of the amplitude levels, where B is the number.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Definitions Random Signal Analysis (Review) Discrete Random Signals Random.
Unit-V DSP APPLICATIONS. UNIT V -SYLLABUS DSP APPLICATIONS Multirate signal processing: Decimation Interpolation Sampling rate conversion by a rational.
Comparison and Analysis of Equalization Techniques for the Time-Varying Underwater Acoustic Channel Ballard Blair PhD Candidate MIT/WHOI.
Adv DSP Spring-2015 Lecture#9 Optimum Filters (Ch:7) Wiener Filters.
EE513 Audio Signals and Systems
Learning Using Augmented Error Criterion Yadunandana N. Rao Advisor: Dr. Jose C. Principe.
A Semi-Blind Technique for MIMO Channel Matrix Estimation Aditya Jagannatham and Bhaskar D. Rao The proposed algorithm performs well compared to its training.
Speech Enhancement for ASR by Hans Hwang 8/23/2000 Reference 1. Alan V. Oppenheim,etc., ” Multi-Channel Signal Separation by Decorrelation ”,IEEE Trans.
Dept. E.E./ESAT-STADIUS, KU Leuven
Professors: Eng. Diego Barral Eng. Mariano Llamedo Soria Julian Bruno
1. Adaptive System Identification Configuration[2] The adaptive system identification is primarily responsible for determining a discrete estimation of.
Spectral Observer with Reduced Information Demand György Orosz, László Sujbert, Gábor Péceli Department of Measurement and Information Systems Budapest.
Dept. E.E./ESAT-STADIUS, KU Leuven homes.esat.kuleuven.be/~moonen/
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) A TWO-STAGE DATA-DRIVEN SINGLE MICROPHONE SPEECH ENHANCEMENT WITH.
Impulse Response Measurement and Equalization Digital Signal Processing LPP Erasmus Program Aveiro 2012 Digital Signal Processing LPP Erasmus Program Aveiro.
Reconfigurable acceleration of robust frequency-domain echo cancellation C. H. Ho 1, K.F.C.Yiu 2, J. Huo 3, S. Nordholm 3 and W. Luk 1 1.Department of.
Spatial vs. Blind Approaches for Speaker Separation: Structural Differences and Beyond Julien Bourgeois RIC/AD.
State-Space Recursive Least Squares with Adaptive Memory College of Electrical & Mechanical Engineering National University of Sciences & Technology (NUST)
Speech Enhancement based on
Statistical Signal Processing Application to Speech, Image & Digital Communication Prof. Mohammad Reza Alsharif Department of Information Engineering,
Adv DSP Spring-2015 Lecture#11 Spectrum Estimation Parametric Methods.
Presentations on “ADAPTIVE LMS FILTERING APPROACH FOR SPEECH ENHANCEMENT” Approved By Presented By Mr. Rupesh Dubey Lalit P. Patil HOD (Elec & comm.) (
Adnan Quadri & Dr. Naima Kaabouch Optimization Efficiency
Assoc. Prof. Dr. Peerapol Yuvapoositanon
A SPARSENESS CONTROLLED PROPORTIONATE ALGORITHM FOR ACOUSTIC ECHO CANCELLATION Pradeep Loganathan, Andy W.H. Khong, Patrick A. Naylor
Whitening-Rotation Based MIMO Channel Estimation
Combination of Feature and Channel Compensation (1/2)
Presentation transcript:

Variable Step-Size Adaptive Filters for Acoustic Echo Cancellation Constantin Paleologu Department of Telecommunications

Introduction specific problems in AEC  the echo path can be extremely long  it may rapidly change at any time during the connection  the background noise can be strong and non-stationary important issue in echo cancellation  the behaviour during double-talk  the presence of Double-Talk Detector (DTD) acoustic echo cancellation (AEC)  required in hands-free communication devices, (e.g., for mobile telephony or teleconferencing systems)  ! acoustic coupling between the loudspeaker and microphone  an adaptive filter identifies the acoustic echo path between the terminal’s loudspeaker and microphone

Introduction (cont.) Adaptive algorithm Digital filter x(n)x(n) d(n)d(n) y(n)y(n)e(n)e(n) +  Adaptive filter  Cost function J[e(n)] ↓ minimized

h Acoustic echo path acoustic echo ĥ(n) Adaptive filter e(n)e(n) - d(n) microphone signal Background noise x(n)x(n) Far-end v(n)v(n) Near-end to the Far-end ≈ v(n) DTD “synthetic” echo Performance criteria: - convergence rate vs. misadjustment - tracking vs. robustness Introduction (cont.) AEC configuration

Fig. 1. Acoustic echo path: (a) impulse response; (b) frequency response. Samples Frequency [kHz]

Fig. 2. Acoustic echo path: (a) impulse response; (b) frequency response. Samples Frequency [kHz]

Fig. 3. Acoustic echo path: (a) impulse response; (b) frequency response. Samples Frequency [kHz]

Adaptive algorithms for AEC requirements  fast convergence rate and tracking  low misadjustment  double-talk robustness most common choices  normalized least-mean-square (NLMS) algorithm  affine projection algorithm (APA) step-size parameter (controls the performance of these algorithms)  large values  fast convergence rate and tracking  small values  low misadjustment and double-talk robustness conflicting requirements  variable step-size (VSS) algorithms

classical NLMS algorithm where μ(n) = μ / [x T (n)x(n)] x(n) = [x(n), x(n – 1), …, x(n – L + 1)] T adaptive filter length step-size parameter e(n) = d(n) – x T (n)ĥ(n – 1) ĥ(n) = ĥ(n – 1) + μ(n)x(n)e(n)  ε(n) = e(n)[1 – μ(n)x T (n)x(n)] ε(n) = d(n) – x T (n)ĥ(n)  a posteriori error vector ε(n) = 0 [assuming that e(n) ≠ 0]  μ(n) = 1 / [x T (n)x(n)]  μ = 1 Optimal step-size ?

h x(n) (far-end) v(n) (near-end) (to the far-end) ĥ(n)ĥ(n) - ε(n) = v(n) ε(n) = 0

E{ε 2 (n)} = E{v 2 (n)} E{e 2 (n)}[1 – μ(n)LE{x 2 (n)}] 2 = E{v 2 (n)}  λ = 1 – 1/(KL), with K > 1 ?

1) near-end signal = background noise (single-talk scenario) v(n) = w(n) background noise power estimate Problem: background noise can be time-variant 2) near-end signal = background noise + near-end speech v(n) = w(n) + u(n) (double-talk scenario) near-end speech power estimate Problem: non-stationary character of the speech signal ??? [J. Benesty et al, “A nonparametric VSS NLMS algorithm”, IEEE Signal Process. Lett., 2006]  NPVSS-NLMS algorithm

 simple VSS-NLMS (SVSS-NLMS) algorithm ! The value of γ influences the overall behaviour of the algorithm. regularization parametersmall positive constant 1. using the error signal e(n), with a larger value of the weighting factor: λ = 1 – 1/(KL), with K > 1 γ = 1 – 1/(QL), with Q > K Solutions for evaluating the near-end signal power estimate

 NEW-NPVSS-NLMS algorithm 2. using a normalized cross-correlation based echo path change detector: ! The value of ς influences the overall behaviour of the algorithm. convergence statistic [M. A. Iqbal et al, “Novel Variable Step Size NLMS Algorithms for Echo Cancellation ”, Proc. IEEE ICASSP, 2008]

h x(n) (far-end) v(n) = w(n) + u(n) (near-end) (to the far-end) ĥ(n)ĥ(n) - d(n) = y(n) + v(n) y(n)y(n) E{d 2 (n)} = E{y 2 (n)} + E{v 2 (n)}  assuming that the adaptive filter has converged to a certain degree  available !

 Practical VSS-NLMS (PVSS-NLMS) algorithm ! This algorithm assumes that the adaptive filter has converged to a certain degree. [C. Paleologu, S. Ciochina, and J. Benesty,“Variable Step-Size NLMS Algorithm for Under-Modeling Acoustic Echo Cancellation ”, IEEE Signal Process. Lett., 2008]

AlgorithmsAdditionsMultiplicationsDivisionsSquare-roots NPVSS-NLMS3311 SVSS-NLMS4511 NEW-NPVSS- NLMS 2L + 83L PVSS-NLMS6911 Table I. Computational complexities of the different variable-step sizes.

Simulation results (I) conditions  Acoustic echo cancellation (AEC) context, L =  input signal x(n) – AR(1) signal or speech sequence.  background noise w(n) – independent white Gaussian noise signal (variable SNR)  measure of performance – normalized misalignment (dB) algorithms for comparisons  NLMS  NPVSS-NLMS  SVSS-NLMS  NEW-NPVSS-NLMS  PVSS-NLMS

Fig. 4. Misalignment of the NLMS algorithm with two different step sizes (a) μ = 1 and (b) μ = 0.05, and misalignment of the NPVSS-NLMS algorithm. The input signal is an AR(1) process, L = 1000, λ = 1 − 1/(6L), and SNR = 20 dB.

Fig. 5. Misalignment of the NLMS algorithm with two different step sizes (a) μ = 1 and (b) μ = 0.05, and misalignment of the NPVSS-NLMS algorithm. The input signal is an AR(1) process, L = 1000, λ = 1 − 1/(6L), and SNR = 20 dB. Echo path changes at time 10.

Fig. 6. Misalignment of the SVSS-NLMS algorithm for different values of γ. Impulse response changes at time 10, and SNR decreases from 20 dB to 10 dB at time 20. The input signal is an AR(1) process, L = 1000, and λ = 1 – 1/(6L).

Fig. 7. Misalignment of the NEW-NPVSS-NLMS algorithm for different values of ς. Impulse response changes at time 10, and SNR decreases from 20 dB to 10 dB at time 20. The input signal is an AR(1) process, L = 1000, and λ = 1 – 1/(6L).

Fig. 8. Misalignment of the NPVSS-NLMS, SVSS-NLMS [with γ = 1 – 1/(18L)], NEW-NPVSS-NLMS (with ς = 0.1), and PVSS-NLMS algorithms. The input signal is speech, L = 1000, λ = 1 – 1/(6L), and SNR = 20 dB.

Fig. 9. Misalignment during impulse response change. The impulse response changes at time 20. Algorithms: NPVSS-NLMS, SVSS-NLMS [with γ = 1 – 1/(18L)], NEW- NPVSS-NLMS (with ς = 0.1), and PVSS-NLMS. The input signal is speech, L = 1000, λ = 1 – 1/(6L), and SNR = 20 dB.

Fig. 10. Misalignment during background noise variations. The SNR decreases from 20 dB to 10 dB between time 20 and 30, and to 0 dB between time 40 and 50. Algorithms: NPVSS-NLMS, SVSS-NLMS [with γ = 1 – 1/(18L)], NEW-NPVSS-NLMS (with ς = 0.1), and PVSS-NLMS. The input signal is speech, L = 1000, λ = 1 – 1/(6L), and SNR = 20 dB.

Fig. 11. Misalignment during double-talk, without DTD. Near-end speech appears between time 15 and 25 (with FNR = 5 dB), and between time 35 and 45 (with FNR = 3 dB). Algorithms: SVSS-NLMS [with γ = 1 – 1/(18L)], NEW-NPVSS-NLMS (with ς = 0.1), and PVSS-NLMS. The input signal is speech, L = 1000, λ = 1 – 1/(6L), and SNR = 20 dB.

Affine Projection Algorithm (APA)  superior convergence rate as compared to the NLMS algorithm  lower complexity as compared to the RLS algorithm where d(n) = [d(n), d(n–1), …, d(n – p +1)] T X(n) = [x(n), x(n – 1), …, x(n – p + 1)] x(n – l) = [x(n – l), x(n – l – 1), …, x(n – l – L + 1)] T projection order adaptive filter length step-size parameter Adaptive algorithms for AEC (cont.)

Variable Step-Size APA (VSS-APA)  Classical APA  a posteriori error vector  a priori error vector   Classical APA with μ = 1

h x(n) (far-end) v(n) (near-end) (to the far-end) ĥ(n)ĥ(n) - where v(n) = [v(n), v(n – 1), …, v(n – p +1)] T ε(n) = v(n)

  λ = 1 – 1/(KL), with K > 1 ???

Proposed VSS-APA regularization factor main advantages  non-parametric algorithm  robustness to background noise variations and double-talk [C. Paleologu, J. Benesty, and S. Ciochina,“Variable Step-Size Affine Projection Algorithm Designed for Acoustic Echo Cancellation ”, IEEE Trans. Audio, Speech, Language Process., Nov. 2008]

Simulation results (II) Fig. 12. Misalignment of the APA with three different step-sizes (μ = 1, μ = 0.25, and μ = 0.025), and VSS-APA. The input signal is speech, p = 2, L = 1000, λ = 1−1/(6L), and SNR = 20 dB.

Fig. 13. Misalignment of the VSS-APA with different projection orders, i.e., p = 1 (PVSS-NLMS algorithm), p = 2, and p = 4. Other conditions are the same as in Fig. 12.

Fig. 14. Misalignment during impulse response change. The impulse response changes at time 20. Algorithms: PVSS-NLMS algorithm, APA (with μ = 0.25), and VSS-APA. Other conditions are the same as in Fig. 12.

Fig. 15. Misalignment during background noise variations. The SNR decreases from 20 dB to 10 dB between time 20 and 40. Other conditions are the same as in Fig. 12.

Fig. 16. Misalignment during double-talk, without a DTD. Near-end speech appears between time 20 and 30 (with FNR = 4 dB). Other conditions are the same as in Fig. 12.

Fig. 17. Misalignment during double-talk with the Geigel DTD. Other conditions are the same as in Fig. 16.

Simulation results (III) conditions  AEC context, L = 512  x(n), u(n) = speech sequences  w(n) = independent white Gaussian noise signal (SNR = 20dB) algorithms for comparisons  classical APA, μ = 0.2  variable regularized APA (VR-APA) [H. Rey, L. Rey Vega, S. Tressens, and J. Benesty, IEEE Trans. Signal Process., May 2007]  robust proportionate APA (R-PAPA) [T. Gänsler, S. L. Gay, M. M. Sondhi, and J. Benesty, IEEE Trans. Speech Audio Process., Nov. 2000]  “ideal” VSS-APA (VSS-APA-id) - assuming that v(n) is available Comparisons with other VSS-type APAs

Fig. 18. Misalignments of APA with μ = 0.2, VR-APA, VSS-APA, and VSS-APA-id. Single-talk case, L = 512, p = 2 for all the algorithms, SNR = 20dB.

p = 2 p = 8 Fig. 19. Echo path changes at time 21. Other conditions are the same as in Fig. 18.

Fig. 20. Misalignments of APA, VR-APA, R-PAPA, VSS-APA, and VSS-APA-id. Background noise variation at time 14, for a period of 14 seconds (SNR decreases from 20dB to 10 dB). Other conditions are the same as in Fig. 18.

without DTDwith Geigel DTD Fig. 21. Misalignment of the algorithm during double-talk. Other conditions are the same as in Fig. 18.

Conclusions a family of VSS-type algorithms was developed in the context of AEC. they takes into account the existence of the near-end signal, aiming to recover it from the error of the adaptive filter. the VSS formulas do not require any additional parameters from the acoustic environment (i.e., non-parametric). they are robust to near-end signal variations like the increase of the background noise or double-talk. the experimental results indicate that these algorithms are reliable candidates for real-world applications.