Linear Predictive Coding Methods

Presentation transcript:

Linear Predictive Coding Methods ECE 5525 By: Mohamed M. Eljhani Fall 2010

Problem Description
The main purpose of this project is to show the differences between three linear predictive methods by implementing a MATLAB program that converts a frame of speech into a set of linear prediction coefficients, using three basic methods:
1. Autocorrelation method
2. Covariance method
3. Lattice method
The program plots the LPC spectra from the three methods along with the normal spectrum from the Hamming-window-weighted frame. 11/28/2018 LPC Methods

The same parameters, N = 400 and p = 12, with Hamming window weighting, are used for all three methods so that the differences between the methods can be compared directly. The file ah.wav supplies a steady-state vowel sound beginning at sample 3000, and the file test 16k.wav supplies a fricative beginning at sample 3000.
• Covariance and lattice methods: the p samples before the starting sample at n = 3000 are preserved for computing correlations and error signals.
• Autocorrelation method: speech samples are assumed to be zero outside the interval.
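The framing described above can be sketched in a few lines. This is an illustrative Python sketch, not the MATLAB code the slides refer to. The function names and the synthetic test signal are assumptions, and because the slides do not say how the p leading samples are weighted, the covariance/lattice frame is returned unweighted here.

```python
import math

def hamming(n_len):
    """Hamming window: w(m) = 0.54 - 0.46*cos(2*pi*m/(n_len - 1))."""
    return [0.54 - 0.46 * math.cos(2.0 * math.pi * m / (n_len - 1))
            for m in range(n_len)]

def autocorrelation_frame(signal, start, n_len):
    """n_len samples from `start`, Hamming weighted; treated as zero elsewhere."""
    w = hamming(n_len)
    return [signal[start + m] * w[m] for m in range(n_len)]

def covariance_frame(signal, start, n_len, p):
    """Keeps the p samples that precede `start`: p + n_len samples in total."""
    if start < p:
        raise ValueError("need p samples before the starting sample")
    return list(signal[start - p:start + n_len])

# Synthetic stand-in for the speech file (an assumption, not ah.wav).
signal = [math.sin(2.0 * math.pi * 0.01 * n) for n in range(4000)]
frame_a = autocorrelation_frame(signal, 3000, 400)   # N = 400
frame_c = covariance_frame(signal, 3000, 400, 12)    # p = 12 leading samples
```

With N = 400 and p = 12, the autocorrelation frame has 400 samples and the covariance/lattice frame has 412.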

Introduction
What does linear predictive coding mean? Linear predictive coding (LPC) is a digital method for encoding an analog signal in which a particular value is predicted by a linear function of the past values of the signal. Human speech is produced in the vocal tract, which can be approximated as a tube of varying diameter. The LPC model is based on a mathematical approximation of the vocal tract represented by this tube. At a particular time t, the speech sample s(t) is represented as a linear sum of the p previous samples. The most important aspect of LPC is the linear predictive filter, which allows the value of the next sample to be determined by a linear combination of previous samples.

LPC Algorithm
The general algorithm for linear predictive coding involves:
• An analysis or encoding part.
• A synthesis or decoding part.
In the encoding, LPC takes the speech signal in blocks or frames and determines the input signal and the filter coefficients that will be capable of reproducing the current block of speech. This information is quantized and transmitted. In the decoding, LPC rebuilds the filter from the coefficients received. The filter can be thought of as a tube which, when given an input signal, attempts to output speech. Additional information about the original speech signal is used by the decoder to determine the input or excitation signal that is sent to the filter for synthesis.
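The relationship between the analysis filter and the synthesis filter can be illustrated with a minimal Python sketch. The coefficient values and the short test sequence are hypothetical, and both filters are assumed to start from a zero state. A real LPC coder does not transmit the exact residual computed here; it sends a compact description of the excitation instead.

```python
def analysis_filter(s, a):
    """Prediction error e(n) = s(n) - sum_k a[k-1] * s(n-k), zero initial state."""
    p = len(a)
    e = []
    for n in range(len(s)):
        pred = sum(a[k - 1] * s[n - k] for k in range(1, p + 1) if n - k >= 0)
        e.append(s[n] - pred)
    return e

def synthesis_filter(e, a):
    """All-pole synthesis s(n) = e(n) + sum_k a[k-1] * s(n-k), zero initial state."""
    p = len(a)
    s = []
    for n in range(len(e)):
        pred = sum(a[k - 1] * s[n - k] for k in range(1, p + 1) if n - k >= 0)
        s.append(e[n] + pred)
    return s

a = [1.2, -0.5]                       # hypothetical 2nd-order predictor
speech = [0.0, 1.0, 0.7, -0.2, -0.9, -0.4, 0.3, 0.8]
residual = analysis_filter(speech, a)
rebuilt = synthesis_filter(residual, a)   # matches `speech` up to rounding
```

Driving the synthesis filter with the exact residual reproduces the input, which is why the quality of an LPC decoder depends on how well the excitation is modeled.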

LPC Model
The particular source-filter model used in LPC is known as the linear predictive coding model. It has two key components:
• Analysis or encoding
• Synthesis or decoding
The analysis part of LPC involves examining the speech signal and breaking it down into segments or blocks. Each segment is then examined further to answer several key questions:
• Is the segment voiced or unvoiced?
• What is the pitch of the segment?
• What parameters are needed to build a filter that models the vocal tract for the current segment?
Figure: Human vs. voice coder speech production.

LPC Methods
• LPC methods are the most widely used methods in speech coding, speech synthesis, speech recognition, speaker recognition and verification, and speech storage.
– LPC methods provide extremely accurate estimates of speech parameters, and do so extremely efficiently.
– Basic idea of linear prediction: the current speech sample can be closely approximated as a linear combination of past samples, i.e. s(n) ≈ α1 s(n-1) + α2 s(n-2) + … + αp s(n-p)

LPC Methods
• In control and information theory, LPC methods are known as methods of system estimation and system identification.
– They are used extensively in speech under a group of names, including:
1. Autocorrelation method
2. Covariance method
3. Lattice method
4. Inverse filter formulation method
5. Spectral estimation formulation method
6. Maximum likelihood method
7. Inner product method

LPC Estimation Issues
• Need to determine {αk} directly from speech such that they give good estimates of the time-varying spectrum
• Need to estimate {αk} from short segments of speech
• Need to minimize the mean-squared prediction error over short segments of speech
• The resulting {αk} are assumed to be the actual {ak} in the speech production model
=> The intent is to show that all of this can be done efficiently, reliably, and accurately for speech.

Autocorrelation Method
Assume the segment s_n(m) exists for 0 ≤ m ≤ L-1 and is exactly zero everywhere else (i.e., a window of length L samples):
s_n(m) = s(m + n) w(m), 0 ≤ m ≤ L-1
where w(m) is a finite-length window of length L samples.

If the windowed segment is non-zero only for 0 ≤ m ≤ L-1, then the prediction error is non-zero over 0 ≤ m ≤ L-1+p:
• At values of m near 0 (i.e., m = 0, 1, …, p-1) we are predicting the signal from zero-valued samples (outside the window range) => the prediction error will be (relatively) large.
• At values of m near L (i.e., m = L, L+1, …, L+p-1) we are predicting zero-valued samples (outside the window range) from non-zero samples => the prediction error will again be (relatively) large.
For these reasons, windows that taper the segment to zero (e.g., the Hamming window) are normally used.

Autocorrelation Method
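The equations on this slide did not survive in the transcript. As a stand-in, the following Python sketch shows the standard short-time autocorrelation and the Levinson-Durbin recursion that the autocorrelation method is usually solved with. It follows the convention s(n) ≈ Σ αk s(n-k); the function names are illustrative and this is not the author's durbin.m.

```python
def autocorr(frame, p):
    """Short-time autocorrelation R(k) = sum_m frame[m] * frame[m + k], k = 0..p."""
    n = len(frame)
    return [sum(frame[m] * frame[m + k] for m in range(n - k))
            for k in range(p + 1)]

def durbin(r):
    """Levinson-Durbin recursion on R(0..p).

    Returns (alpha, err, ks): predictor coefficients alpha[j-1] = alpha_j,
    the final prediction error, and the reflection coefficients k_1..k_p.
    """
    if r[0] <= 0.0:
        raise ValueError("R(0) must be positive (frame has no energy)")
    p = len(r) - 1
    a = [0.0] * (p + 1)        # a[j] holds alpha_j of the current order
    err = r[0]
    ks = []
    for i in range(1, p + 1):
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err
        ks.append(k)
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]   # order-update of the predictor
        a = new_a
        err *= (1.0 - k * k)                 # error shrinks at every order
    return a[1:], err, ks

# R(k) = 0.5**k is the autocorrelation shape of a first-order process.
alpha, err, ks = durbin([1.0, 0.5, 0.25])
```

For this input the second reflection coefficient is zero, so the order-2 predictor reduces to the order-1 predictor with coefficient 0.5.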

Covariance Method
The covariance method is a second basic approach to defining the speech segment and the limits on the sums: fix the interval over which the mean-squared error is computed. The key difference from the autocorrelation method is that the limits of summation include terms before m = 0 => the window extends p samples backwards from m = 0. Since the window is extended backwards, there is no need to taper it with a Hamming window, because there is no transition at the window edges.


Autocorrelation/Covariance Summary
• Use a p-th order linear predictor to predict s(n) from the p previous samples.
• Minimize the mean-squared error over an analysis window of duration L samples.
• The solution for the optimum predictor coefficients is based on solving a matrix equation => two solutions have evolved.
1. Autocorrelation method => the signal is windowed by a tapering window in order to minimize discontinuities at the beginning (predicting speech from zero-valued samples) and end (predicting zero-valued samples from speech samples) of the interval; the matrix elements are shown to be values of an autocorrelation function, and the resulting autocorrelation matrix can be readily solved using standard matrix solutions.

Autocorrelation/Covariance Summary
2. Covariance method => the signal is extended by p samples outside the normal range of 0 ≤ m ≤ L-1 to include the p samples occurring prior to m = 0 (they are available), which eliminates the need for a tapering window; the resulting matrix of correlations is symmetric => a different method of solution, with a somewhat different set of optimal prediction coefficients.
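The covariance method summarized above can be sketched as follows. This Python sketch assumes the standard formulation, in which φ(i,k) = Σ s(m-i) s(m-k) over 0 ≤ m ≤ L-1 and the normal equations Σk αk φ(i,k) = φ(i,0) are solved by Cholesky decomposition. The function name and the synthetic test signal are hypothetical, and this is not the author's cholesky_full.m.

```python
import math

def covariance_lpc(x, p):
    """x holds p leading samples followed by the L-sample analysis interval."""
    L = len(x) - p
    # phi[i][k] = sum over m = 0..L-1 of s(m - i) * s(m - k)
    phi = [[sum(x[p + m - i] * x[p + m - k] for m in range(L))
            for k in range(p + 1)] for i in range(p + 1)]
    A = [[phi[i][k] for k in range(1, p + 1)] for i in range(1, p + 1)]
    b = [phi[i][0] for i in range(1, p + 1)]
    # Cholesky factorisation A = C * C^T (A must be positive definite).
    C = [[0.0] * p for _ in range(p)]
    for i in range(p):
        for j in range(i + 1):
            s = A[i][j] - sum(C[i][t] * C[j][t] for t in range(j))
            if i == j:
                if s <= 0.0:
                    raise ValueError("matrix is not positive definite")
                C[i][j] = math.sqrt(s)
            else:
                C[i][j] = s / C[j][j]
    # Forward substitution C y = b, then back substitution C^T alpha = y.
    y = [0.0] * p
    for i in range(p):
        y[i] = (b[i] - sum(C[i][t] * y[t] for t in range(i))) / C[i][i]
    alpha = [0.0] * p
    for i in reversed(range(p)):
        alpha[i] = (y[i] - sum(C[t][i] * alpha[t]
                               for t in range(i + 1, p))) / C[i][i]
    return alpha

# Synthetic signal that obeys x(n) = 1.2 x(n-1) - 0.5 x(n-2) exactly.
x = [1.0, 1.2]
for n in range(2, 30):
    x.append(1.2 * x[n - 1] - 0.5 * x[n - 2])
alpha = covariance_lpc(x, 2)   # approximately [1.2, -0.5]
```

Because no window is applied and the p leading samples are real data, the covariance method recovers the generating coefficients of this noiseless signal up to rounding.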

Lattice Method
Both the covariance and autocorrelation methods use two-step solutions:
1. Computation of a matrix of correlation values
2. Efficient solution of a set of linear equations
Another class of LPC methods, called lattice methods, has evolved in which the two steps are combined into a recursive algorithm for determining the LP parameters.
• Predictor coefficients are obtained directly from the speech samples => no calculation of the autocorrelation function is required.
• The lattice method is guaranteed to yield stable filters without using a window.
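One member of the lattice family can be sketched in Python as follows. The sketch uses Burg's formula for the reflection coefficients, which is an assumption: the slides refer only to a "traditional lattice method" and do not give the formula, so this may differ from the author's lattice.m. With Burg's formula every reflection coefficient satisfies |k| ≤ 1, which is the property behind the stability guarantee.

```python
def burg_lattice(x, p):
    """Returns (alpha, ks) computed directly from the samples x."""
    f = list(x)    # forward prediction error of order 0
    b = list(x)    # backward prediction error of order 0
    a = []         # alpha_1..alpha_i for the current order i
    ks = []
    for i in range(1, p + 1):
        ff = f[1:]     # forward error at time m
        bb = b[:-1]    # backward error at time m - 1
        num = 2.0 * sum(u * v for u, v in zip(ff, bb))
        den = sum(u * u for u in ff) + sum(v * v for v in bb)
        k = num / den if den > 0.0 else 0.0
        ks.append(k)
        # Lattice stage: update both error sequences with the same k.
        f = [u - k * v for u, v in zip(ff, bb)]
        b = [v - k * u for u, v in zip(ff, bb)]
        # Step-up: alpha_j <- alpha_j - k * alpha_(i-j), then alpha_i = k.
        a = [a[j] - k * a[i - 2 - j] for j in range(i - 1)] + [k]
    return a, ks

alpha1, ks1 = burg_lattice([1.0, 2.0, 3.0], 1)   # k_1 = 16/18
```

No correlation matrix is formed at any point; each stage works only on the error sequences produced by the previous stage.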

Lattice Formulations of LP

Problem Solution
The MATLAB program LPC Matlab Files\test_lpc.m reads in a file of speech, computes the original spectrum (of the signal weighted by a Hamming window), and plots on top of it the LPC spectra from the:
• Autocorrelation method
• Covariance method
• Lattice method
There is a main program and four functions:
1. Durbin for the autocorrelation method, LPC Matlab Files\durbin.m
2. Cholesky for the covariance method, LPC Matlab Files\cholesky_full.m
3. Lattice for the traditional lattice method, LPC Matlab Files\lattice.m
4. LPC Matlab Files\autolpc.m, which is used to generate the LPC parameters.
The problem was solved successfully.
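The LPC spectrum that is overlaid on the signal spectrum is the magnitude response of the all-pole model, G / A(e^{jω}) with A(z) = 1 - Σ αk z^{-k}. A minimal Python sketch of that evaluation follows. The function name and the gain argument are assumptions; the gain is commonly taken as the square root of the minimum prediction error, but the slides do not state which value the MATLAB program uses.

```python
import cmath
import math

def lpc_spectrum_db(alpha, gain, n_freq):
    """20*log10 |gain / A(e^{jw})| at n_freq frequencies w in [0, pi)."""
    out = []
    for i in range(n_freq):
        w = math.pi * i / n_freq
        # A(e^{jw}) = 1 - sum_k alpha_k * e^{-jwk}
        a_w = 1.0 - sum(alpha[k] * cmath.exp(-1j * w * (k + 1))
                        for k in range(len(alpha)))
        out.append(20.0 * math.log10(gain / abs(a_w)))
    return out

# First-order example: A(z) = 1 - 0.5 z^-1, so |H| = 2 at w = 0.
spec = lpc_spectrum_db([0.5], 1.0, 256)
```

Plotting this curve against the log-magnitude spectrum of the Hamming-weighted frame shows how closely each method's coefficients follow the spectral envelope.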

LPC Comparisons

Conclusion
Linear predictive coding is an analysis/synthesis technique for lossy speech compression that attempts to model the human production of sound instead of transmitting an estimate of the sound wave. LPC encoders break a sound signal into segments and then send information about each segment to the decoder. The encoder sends information on whether the segment is voiced or unvoiced, along with the pitch period for voiced segments, which is used to create an excitation signal in the decoder. The encoder also sends information about the vocal tract, which is used to build a filter on the decoder side; when given the excitation signal as input, this filter can reproduce an approximation of the original speech.

Bibliography & References
• Dr. Veton Kepuska, lecture slides, "Analysis and Synthesis of Pole-Zero Speech Models", http://my.fit.edu/~vkepuska/ece5525/
• Dr. Lawrence Rabiner, Rutgers University, course website: www.caip.rutgers.edu/~lrr (being changed to cronos.rutgers.edu/~lrr)
• L. R. Rabiner and R. W. Schafer, Theory and Applications of Digital Speech Processing, Prentice-Hall Inc., 2010.
• Jeremy Bradbury, Linear Predictive Coding, December 5, 2000.