Uses of the pitch-scaled harmonic filter in speech processing by Philip Jackson * and Christine Shadle † *School of Electronic and Electrical Engineering,

Slides:



Advertisements
Similar presentations
ON THE REPRESENTATION OF VOICE SOURCE APERIODICITIES IN THE MBE SPEECH CODING MODEL Preeti Rao and Pushkar Patwardhan Department of Electrical Engineering,
Advertisements

Acoustic/Prosodic Features
Improved ASR in noise using harmonic decomposition Introduction Pitch-Scaled Harmonic Filter Recognition Experiments Results Conclusion aperiodic contribution.
Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 1 IEEE Speech Coding Workshop Sept 17–20, 2000 Lake Lawn Resort Delavan, WI Jean-Marc Valin,
aperiodic periodic Production of /z/: Covariation and weighting of harmonically decomposed streams for ASR Introduction Pitch-scaled harmonic filter Recognition.
Chapter 5 The Fourier Transform. Basic Idea We covered the Fourier Transform which to represent periodic signals We assumed periodic continuous signals.
1 A Spectral-Temporal Method for Pitch Tracking Stephen A. Zahorian*, Princy Dikshit, Hongbing Hu* Department of Electrical and Computer Engineering Old.
Liner Predictive Pitch Synchronization Voiced speech detection, analysis and synthesis Jim Bryan Florida Institute of Technology ECE5525 Final Project.
Advanced Speech Enhancement in Noisy Environments
Eeng Chapter 2 Orthogonal Representation, Fourier Series and Power Spectra  Orthogonal Series Representation of Signals and Noise Orthogonal Functions.
Physics 1251 The Science and Technology of Musical Sound Unit 1 Session 8 Harmonic Series Unit 1 Session 8 Harmonic Series.
Prosody modification in speech signals Project by Edi Fridman & Alex Zalts supervision by Yizhar Lavner.
Page 0 of 34 MBE Vocoder. Page 1 of 34 Outline Introduction to vocoders MBE vocoder –MBE Parameters –Parameter estimation –Analysis and synthesis algorithm.
Basic Spectrogram Lab 8. Spectrograms §Spectrograph: Produces visible patterns of acoustic energy called spectrograms §Spectrographic Analysis: l Acoustic.
Speech in Multimedia Hao Jiang Computer Science Department Boston College Oct. 9, 2007.
SOME SIMPLE MANIPULATIONS OF SOUND USING DIGITAL SIGNAL PROCESSING Richard M. Stern demo August 31, 2004 Department of Electrical and Computer.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Speech Enhancement Based on a Combination of Spectral Subtraction and MMSE Log-STSA Estimator in Wavelet Domain LATSI laboratory, Department of Electronic,
Paul Heckbert Computer Science Department Carnegie Mellon University
System Microphone Keyboard Output. Cross Synthesis: Two Implementations.
Communications & Multimedia Signal Processing Formant Based Synthesizer Qin Yan Communication & Multimedia Signal Processing Group Dept of Electronic.
Digital Signal Processing A Merger of Mathematics and Machines 2002 Summer Youth Program Electrical and Computer Engineering Michigan Technological University.
Speech Recognition in Noise
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Warped Linear Prediction Concept: Warp the spectrum to emulate human perception; then perform linear prediction on the result Approaches to warp the spectrum:
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
Multiresolution STFT for Analysis and Processing of Audio
Much Ado About Nothing: The Masks In Love 王勝雨.
Speech Perception1 Fricatives and Affricates We will be looking at acoustic cues in terms of … –Manner –Place –voicing.
Acoustic Analysis of Speech Robert A. Prosek, Ph.D. CSD 301 Robert A. Prosek, Ph.D. CSD 301.
1 ( قالوا سبحانك لا علم لنا الإ ما علمتنا إنك أنت العليم الحكيم ) صدق الله العظيم سورة البقرة آيه 32.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Speech Signal Representations I Seminar Speech Recognition 2002 F.R. Verhage.
NOISE DETECTION AND CLASSIFICATION IN SPEECH SIGNALS WITH BOOSTING Nobuyuki Miyake, Tetsuya Takiguchi and Yasuo Ariki Department of Computer and System.
Recognition of Speech Using Representation in High-Dimensional Spaces University of Washington, Seattle, WA AT&T Labs (Retd), Florham Park, NJ Bishnu Atal.
Linear Predictive Analysis 主講人:虞台文. Contents Introduction Basic Principles of Linear Predictive Analysis The Autocorrelation Method The Covariance Method.
EE Audio Signals and Systems Linear Prediction Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Song from William Shakespeare’s Plays: Much Ado About Nothing ‘Sigh No More, Ladies, Sigh No More’
Ways to generate computer speech Record a human speaking every sentence HAL will ever speak (not likely) Make a mathematical model of the human vocal.
EEL 6586: AUTOMATIC SPEECH PROCESSING Speech Features Lecture Mark D. Skowronski Computational Neuro-Engineering Lab University of Florida February 27,
Automatic Speech Recognition A summary of contributions from multiple disciplines Mark D. Skowronski Computational Neuro-Engineering Lab Electrical and.
By Sarita Jondhale 1 Signal preprocessor: “conditions” the speech signal s(n) to new form which is more suitable for the analysis Postprocessor: operate.
1 Acoustic Phonetics 3/28/00. 2 Nasal Consonants Produced with nasal radiation of acoustic energy Sound energy is transmitted through the nasal cavity.
Acoustic Phonetics 3/14/00.
EEL 6586: AUTOMATIC SPEECH PROCESSING Speech Features Lecture Mark D. Skowronski Computational Neuro-Engineering Lab University of Florida February 20,
Fourier Transform and Spectra
Fourier Transform and Spectra
HIGH-RESOLUTION SINUSOIDAL MODELING OF UNVOICED SPEECH GEORGE P. KAFENTZIS, YANNIS STYLIANOU MULTIMEDIA INFORMATICS LABORATORY DEPARTMENT OF COMPUTER SCIENCE.
THE AGES OF SHAKESPEARE Recital de canciones y textos de William Shakespeare en conmemoración del 400 aniversario de su muerte Madrid, 21 de abril de 2016.
PATTERN COMPARISON TECHNIQUES
ARTIFICIAL NEURAL NETWORKS
Vocoders.
Copyright © American Speech-Language-Hearing Association
Breathy Voice Note that you can hear both a buzzy (periodic) component and a hissy (aperiodic) component.
HARMONICS AND FILTERS.
1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.
Speech and Audio Processing
Speech Perception CS4706.
Chapter 3: PCM Noise and Companding
Speech Pathologist #10.
Fourier Transform and Spectra
Richard M. Stern demo January 12, 2009
DCT-based Processing of Dynamic Features for Robust Speech Recognition Wen-Chi LIN, Hao-Teng FAN, Jeih-Weih HUNG Wen-Yi Chu Department of Computer Science.
Julia Hirschberg and Sarah Ita Levitan CS 6998
INTRODUCTION TO THE SHORT-TIME FOURIER TRANSFORM (STFT)
Music Signal Processing
S21 (at center frequency) 19 dB
Presentation transcript:

Uses of the pitch-scaled harmonic filter in speech processing by Philip Jackson * and Christine Shadle † *School of Electronic and Electrical Engineering, University of Birmingham †Department of Electronics and Computer Science, University of Southampton

Prologue Much Ado About Nothing (Act 2) Balthasar: “Note this before my notes: There’s not a note of mine that’s worth the noting.”

Pitch-scaled harmonic filter Developed to study turbulence noise during voicing Many potential speech applications: coding analysis perception recognition

Spectral smearing Effect of rectangular windowing

Spectral smearing Effect of Hann windowing

Minimising spectral smearing Influence of negative frequencies

Minimising spectral smearing Influence of higher harmonics

Interpolation: Harmonic filter: Decomposition

PSHF decomposition

Performance measure Change in Signal-to-Error Ratio: Synthesis: where

Evaluation - HNR periodic aperiodic

Evaluation (periodic) +10 dB

Evaluation (aperiodic) +10 dB

Time series for /ax-v:/

Short-time power for /ax-v:/

Power spectra for /v:/ F0F1F2F3

LPC spectra for /v:/ F0F1F2F3

MFCC spectra for /v:/ LPCMFCC F0F1F2F3F0F1F2F3

Power spectra for /zh:/ F2

Fricative spectra for /zh:/ F2

LPC spectra for /zh:/ MFCCLPC Z1

Summary Pitch-scaled harmonic filter: –Benefits of pitch scaling –Case for interpolation –Periodic + aperiodic for TD and FD –Evaluated on synthetic speech Applied to real data: –Time series –Power spectra –Short-time power –LPC –MFCC

Epilogue Much Ado About Nothing (Act 2) Balthasar: “Then sigh not so, but let them go, And be you blithe and bonny; Converting all your sounds of woe Into Hey nonny nonny [aka. HNNs].”