Frequency Domain Coding of Speech 主講人:虞台文. Content Introduction The Short-Time Fourier Transform The Short-Time Discrete Fourier Transform Wide-Band Analysis/Synthesis.

Slides:



Advertisements
Similar presentations
Signal Processing in the Discrete Time Domain Microprocessor Applications (MEE4033) Sogang University Department of Mechanical Engineering.
Advertisements

EE513 Audio Signals and Systems Digital Signal Processing (Synthesis) Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Page 0 of 34 MBE Vocoder. Page 1 of 34 Outline Introduction to vocoders MBE vocoder –MBE Parameters –Parameter estimation –Analysis and synthesis algorithm.
Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.
Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.
Chapter 8: The Discrete Fourier Transform
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
Lecture05 Transform Coding.
FFT-based filtering and the Short-Time Fourier Transform (STFT) R.C. Maher ECEN4002/5002 DSP Laboratory Spring 2003.
Wavelet Transform A very brief look.
Sampling Theorem 主講者:虞台文. Content Periodic Sampling Sampling of Band-Limited Signals Aliasing --- Nyquist rate CFT vs. DFT Reconstruction of Band-limited.
Introduction to Wavelets
COMP 249 :: Spring 2005 Slide: 1 Audio Coding Ketan Mayer-Patel.
Fourier Transforms Revisited
Chapter 4: Sampling of Continuous-Time Signals
Lecture 41 Practical sampling and reconstruction.
Discrete-Time and System (A Review)
DTFT And Fourier Transform
H 0 (z) x(n)x(n) o(n)o(n) M G 0 (z) M + vo(n)vo(n) yo(n)yo(n) H 1 (z) 1(n)1(n) M G 1 (z) M v1(n)v1(n) y1(n)y1(n) fo(n)fo(n) f1(n)f1(n) y(n)y(n) Figure.
Time-Domain Methods for Speech Processing 虞台文. Contents Introduction Time-Dependent Processing of Speech Short-Time Energy and Average Magnitude Short-Time.
The Wavelet Tutorial: Part3 The Discrete Wavelet Transform
Digital Image Processing Chapter # 4 Image Enhancement in Frequency Domain Digital Image Processing Chapter # 4 Image Enhancement in Frequency Domain.
Page 0 of 23 MELP Vocoders Nima Moghadam SN#: Saeed Nari SN#: Supervisor Dr. Saameti April 2005 Sharif University of Technology.
8.1 representation of periodic sequences:the discrete fourier series 8.2 the fourier transform of periodic signals 8.3 properties of the discrete fourier.
SPEECH CODING Maryam Zebarjad Alessandro Chiumento.
EE Audio Signals and Systems Digital Signal Processing (Synthesis) Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
By Sarita Jondhale1 Signal Processing And Analysis Methods For Speech Recognition.
1 Chapter 5 Ideal Filters, Sampling, and Reconstruction Sections Wed. June 26, 2013.
MPEG Audio coders. Motion Pictures Expert Group(MPEG) The coders associated with audio compression part of MPEG standard are called MPEG audio compressor.
README Lecture notes will be animated by clicks. Each click will indicate pause for audience to observe slide. On further click, the lecturer will explain.
Basics of Neural Networks Neural Network Topologies.
The Discrete Fourier Transform 主講人:虞台文. Content Introduction Representation of Periodic Sequences – DFS (Discrete Fourier Series) Properties of DFS The.
Speech Processing Short-Time Fourier Transform Analysis and Synthesis.
Notice  HW problems for Z-transform at available on the course website  due this Friday (9/26/2014) 
Zhongguo Liu_Biomedical Engineering_Shandong Univ. Chapter 8 The Discrete Fourier Transform Zhongguo Liu Biomedical Engineering School of Control.
Unit-V DSP APPLICATIONS. UNIT V -SYLLABUS DSP APPLICATIONS Multirate signal processing: Decimation Interpolation Sampling rate conversion by a rational.
1 Lecture 1: February 20, 2007 Topic: 1. Discrete-Time Signals and Systems.
1 Using Wavelets for Recognition of Cognitive Pattern Primitives Dasu Aravind Feature Group PRISM/ASU 3DK – 3DK – September 21, 2000.
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
EEE 503 Digital Signal Processing Lecture #2 : EEE 503 Digital Signal Processing Lecture #2 : Discrete-Time Signals & Systems Dr. Panuthat Boonpramuk Department.
Technological Educational Institute Of Crete Department Of Applied Informatics and Multimedia Neural Networks Laboratory Slide 1 FOURIER TRANSFORMATION.
Discrete-Time Signals and Systems
Subband Coding Jennie Abraham 07/23/2009. Overview Previously, different compression schemes were looked into – (i)Vector Quantization Scheme (ii)Differential.
Digital Signal Processing
Filtering x y.
Sub-Band Coding Multimedia Systems and Standards S2 IF Telkom University.
Structures for Discrete-Time Systems 主講人:虞台文. Content Introduction Block Diagram Representation Signal Flow Graph Basic Structure for IIR Systems Transposed.
The Story of Wavelets Theory and Engineering Applications
Computer Graphics & Image Processing Chapter # 4 Image Enhancement in Frequency Domain 2/26/20161.
Lecture 09b Finite Impulse Response (FIR) Filters
Decimation & Interpolation (M=4) 0  /4  /4  /2  /4  /4  /2  /4  /4  /2  /4  M=4 M M Bandwidth -  /4 Figure 12 USBLSB.
Sampling Rate Conversion by a Rational Factor, I/D
بسم الله الرحمن الرحيم Digital Signal Processing Lecture 14 FFT-Radix-2 Decimation in Frequency And Radix -4 Algorithm University of Khartoum Department.
Multiresolution Analysis (Chapter 7)
Applications of Multirate Signal Processing
Digital Communications Chapter 13. Source Coding
Vocoders.
FFT-based filtering and the
Lecture Signals with limited frequency range
Sampling rate conversion by a rational factor
Zhongguo Liu Biomedical Engineering
1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.
MMSE Optimal Design: The Least Squares method
Image Transforms for Robust Coding
Govt. Polytechnic Dhangar(Fatehabad)
Sampling Theorem 主講者:虞台文.
Continuous-Time Fourier Transform
INTRODUCTION TO THE SHORT-TIME FOURIER TRANSFORM (STFT)
Chapter 3 Sampling.
Sampling and Aliasing.
Presentation transcript:

Frequency Domain Coding of Speech 主講人:虞台文

Content Introduction The Short-Time Fourier Transform The Short-Time Discrete Fourier Transform Wide-Band Analysis/Synthesis Sub-Band Coding

Frequency Domain Coding of Speech Introduction

Speech Coders Waveform Coders – Attempt to reproducing the original waveform according to some fidelity criteria – Performance: successful at producing good quality, robust speech. Vocoders – Correlated with speech production model. – Performance: more fragile and more model dependent. – Lower bit rate

Frequency-Domain Coders Sub-band coder (SCB). Adaptive Transform Coding (ATC). Multi-band Excited Vocoder (MBEV). Noise Shaping in Speech Coders.

Classification of Speech Coders

Frequency Domain Coding of Speech The Short-Time Fourier Transform

Definition of STFT Interpretations: Filter Bank Interpretation Block Transform Interpretation

Filter Bank Interpretation  is fixed at  0. f ( m ) Analysis Filter

Filter Bank Interpretation h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) x(n)x(n)

Modulation   00

Filter Bank Interpretation   00 Lowpass Filter Modulation

Filter Bank Interpretation h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) x(n)x(n) Modulated Subband signals

Block Transform Interpretation n is fixed at n 0. Windowed Data Analysis Window FT of Windowed Data

Block Transform Interpretation n is fixed at n 0. n1n1 n2n2 n3n nrnr

Analysis/Synthesis Equations Analysis Synthesis In what condition we will have

Analysis/Synthesis Equations Analysis Synthesis Replace r with n+r

Analysis/Synthesis Equations Analysis Synthesis Therefore, if

Analysis/Synthesis Equations More general, Analysis Synthesis Therefore, if

Examples

h (0) x ( n )

Examples

Frequency Domain Coding of Speech The Short-Time Discrete Fourier Transform

Definition of STDFT Analysis: Synthesis: In what condition we will have

Synthesis 1

We need only one period. Therefore, the condition is respecified as:

Implementation Consideration n Frequency k 0 Spectrogram

Sampling n Frequency k 0 Spectrogram R2R2R3R3R4R4R

Sampled STDFT Analysis: Synthesis: In what condition we will have

Sampled STDFT Analysis: Synthesis: In what condition we will have

Frequency Domain Coding of Speech Wide-Band Analysis/Synthesis

Short-Time Synthesis --- Filter Bank Summation STFT h(n)h(n) h(n)h(n) x(n)x(n) Lowpass Filter

Short-Time Synthesis --- Filter Bank Summation STFT

Short-Time Synthesis --- Filter Bank Summation  |H(e j  )|  |H k (e j  )| kk Lowpass filterBandpass filter

Short-Time Synthesis --- Filter Bank Summation hk(n)hk(n) hk(n)hk(n) x(n)x(n) Bandpass Filter h(n)h(n) h(n)h(n) x(n)x(n) Lowpass Filter Lowpass representation of for the signal in a band centered at  k.

Short-Time Synthesis --- Filter Bank Summation hk(n)hk(n) hk(n)hk(n) x(n)x(n) Bandpass Filter h(n)h(n) h(n)h(n) x(n)x(n) Lowpass Filter Encoding one bandDecoding one band

Short-Time Synthesis --- Filter Bank Summation h1(n)h1(n) h1(n)h1(n) x(n)x(n) h0(n)h0(n) h0(n)h0(n) hN1(n)hN1(n) hN1(n)hN1(n) Analysis Synthesis

Short-Time Synthesis --- Filter Bank Summation h1(n)h1(n) h1(n)h1(n) x(n)x(n) h0(n)h0(n) h0(n)h0(n) hN1(n)hN1(n) hN1(n)hN1(n) Analysis Synthesis

Short-Time Synthesis --- Filter Bank Summation h1(n)h1(n) h1(n)h1(n) x(n)x(n) h0(n)h0(n) h0(n)h0(n) hN1(n)hN1(n) hN1(n)hN1(n) Analysis Synthesis

Equal Spaced Ideal Filters 11 22 33 44 55 22  1 0 N = 6

Equal Spaced Ideal Filters h1(n)h1(n) x(n)x(n) h0(n)h0(n) hN1(n)hN1(n) What condition should be satisfied so that y(n)=x(n)?

Equal Spaced Ideal Filters Equal spaced sampling of H ( e j  ) Inverse discrete FT of H ( e j  ) Time-Aliased version of h ( n )

Equal Spaced Ideal Filters Consider FIR, i.e., h(n) is of duration of L samples. 0 L1L1 n h(n)h(n) In case that N  L,

Equal Spaced Ideal Filters

h1(n)h1(n) h1(n)h1(n) x(n)x(n) h0(n)h0(n) h0(n)h0(n) hN1(n)hN1(n) hN1(n)hN1(n) L1L1 n h(n)h(n) x(n) can always be Reconstructed if N  L,

Equal Spaced Ideal Filters h1(n)h1(n) h1(n)h1(n) x(n)x(n) h0(n)h0(n) h0(n)h0(n) hN1(n)hN1(n) hN1(n)hN1(n) L1L1 n h(n)h(n) x(n) can always be Reconstructed if N  L, Does x(n) can still be reconstructed if N<L? If affirmative, what condition should be satisfied?

Equal Spaced Ideal Filters h1(n)h1(n) h1(n)h1(n) x(n)x(n) h0(n)h0(n) h0(n)h0(n) hN1(n)hN1(n) hN1(n)hN1(n) p(n)p(n)

p(n)p(n) Signal can be reconstructed If it equals to  ( n  m ).

Typical Sequences of h (n) Ideal lowpass filter with cutoff at  /N. 0 NN 2N2N N2N2N3N3N4N4N p(n)p(n) N 0 NN 2N2N N2N2N3N3N4N4N h(n)h(n) 1/N

Typical Sequences of h (n) 0 NN 2N2N N2N2N3N3N4N4N p(n)p(n) N 0 NN 2N2N N2N2N3N3N4N4N h(n)h(n) h(0) LL 2L2L L2L2L3L3L4L4L N  LN  L

Typical Sequences of h (n) 0 NN 2N2N N2N2N3N3N4N4N p(n)p(n) N 0 NN 2N2N N2N2N3N3N4N4N h(n)h(n) h(0) 1/N A causal FIR lowpass filter

Typical Sequences of h (n) 0 NN 2N2N N2N2N3N3N4N4N p(n)p(n) N 0 NN 2N2N N2N2N3N3N4N4N h(n)h(n) h(0) 1/N A causal IIR lowpass filter

Filter Back Implementation for a Single Channel hk(n)hk(n) x(n)x(n) h(n)h(n) x(n)x(n) Analysis Synthesis

hk(n)hk(n) x(n)x(n) h(n)h(n) x(n)x(n) Filter Back Implementation for a Single Channel R:1 1:R Analysis Synthesis Decimator Interpolator

hk(n)hk(n) x(n)x(n) h(n)h(n) x(n)x(n) Filter Back Implementation for a Single Channel R:1 1:R Analysis Synthesis Decimator Interpolator Depends on the bandwidth of h(n). R=?

Frequency Domain Coding of Speech Sub-Band Coding

Analysis Synthesis Filter Bank Implementation (Direct Implementation) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) x(n)x(n) R:1 1:R f(n)f(n) f(n)f(n) f(n)f(n) f(n)f(n) f(n)f(n) f(n)f(n) f(n)f(n) f(n)f(n) x(n)x(n) Complex Channels R=2  B Bandwidth B/2

Filter Bank Implementation (Practical Implementation) 0 B kk 0 B  k 0 B/2  B/2 0 B/2  B/2 0 BB 0 B 0 BB B

Filter Bank Implementation (Practical Implementation) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) x(n)x(n)......

h(n)h(n) h(n)h(n) x(n)x(n) h(n)h(n) h(n)h(n)

h(n)h(n) h(n)h(n) x(n)x(n) h(n)h(n) h(n)h(n) D:1 Why?

Filter Bank Implementation (Practical Implementation) h(n)h(n) h(n)h(n) x(n)x(n) h(n)h(n) h(n)h(n) D:1

h(n)h(n) h(n)h(n) x(n)x(n) h(n)h(n) h(n)h(n) Filter Bank Implementation (Practical Implementation)

x(n)x(n) h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) D:1 2D:1

Filter Bank Implementation (Practical Implementation) ADPCM CODEC h(n)h(n) h(n)h(n) h(n)h(n) h(n)h(n) 2D:1 f(n)f(n) f(n)f(n) f(n)f(n) f(n)f(n) Filter Bank Analysis Filter Bank Analysis Sub-Band Coder Modification Sub-Band Coder Modification Filter Bank Synthesis Filter Bank Synthesis