Pitch Tracking MUMT 611 Philippe Zaborowski February 2005.

Slides:



Advertisements
Similar presentations
Acoustic/Prosodic Features
Advertisements

10.1 fourier analysis of signals using the DFT 10.2 DFT analysis of sinusoidal signals 10.3 the time-dependent fourier transform Chapter 10 fourier analysis.
ACHIZITIA IN TIMP REAL A SEMNALELOR. Three frames of a sampled time domain signal. The Fast Fourier Transform (FFT) is the heart of the real-time spectrum.
Sampling theory Fourier theory made easy
1 Chapter 16 Fourier Analysis with MATLAB Fourier analysis is the process of representing a function in terms of sinusoidal components. It is widely employed.
DFT/FFT and Wavelets ● Additive Synthesis demonstration (wave addition) ● Standard Definitions ● Computing the DFT and FFT ● Sine and cosine wave multiplication.
An Overview of Pitch Detection Algorithms Alexandre Savard MUMT611: Music Information Acquisition, Preservation, and Retrieval February 2006.
Overview of Real-Time Pitch Tracking Approaches Music information retrieval seminar McGill University Francois Thibault.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Pitch Detection and Tracking Juhan Nam 1.
Page 0 of 34 MBE Vocoder. Page 1 of 34 Outline Introduction to vocoders MBE vocoder –MBE Parameters –Parameter estimation –Analysis and synthesis algorithm.
Speech and Audio Processing and Recognition
Pitch Recognition with Wavelets Final Presentation by Stephen Geiger.
Gerald Leung.  Implementation Goal of Phase Vocoder  Spectral Analysis and Manipulation  Matlab Implementation  Result Discussion and Conclusion.
Classification of Music According to Genres Using Neural Networks, Genetic Algorithms and Fuzzy Systems.
FFT-based filtering and the Short-Time Fourier Transform (STFT) R.C. Maher ECEN4002/5002 DSP Laboratory Spring 2003.
Communications & Multimedia Signal Processing Report of Work on Formant Tracking LP Models and Plans on Integration with Harmonic Plus Noise Model Qin.
4/25/2001ECE566 Philip Felber1 Speech Recognition A report of an Isolated Word experiment. By Philip Felber Illinois Institute of Technology April 25,
DEVON BRYANT CS 525 SEMESTER PROJECT Audio Signal MIDI Transcription.
Classification of Music According to Genres Using Neural Networks, Genetic Algorithms and Fuzzy Systems.
Discrete Fourier Transform(2) Prof. Siripong Potisuk.
Wireless communication channel
Representing Acoustic Information
Numerical algorithms for power system protection Prof. dr. sc. Ante Marušić, doc. dr. sc. Juraj Havelka University of Zagreb Faculty of Electrical Engineering.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
Lecture 1 Signals in the Time and Frequency Domains
Artificial Intelligence 2004 Speech & Natural Language Processing Natural Language Processing written text as input sentences (well-formed) Speech.
Time-Domain Methods for Speech Processing 虞台文. Contents Introduction Time-Dependent Processing of Speech Short-Time Energy and Average Magnitude Short-Time.
Multiresolution STFT for Analysis and Processing of Audio
1 CS 551/651: Structure of Spoken Language Lecture 8: Mathematical Descriptions of the Speech Signal John-Paul Hosom Fall 2008.
Chapter 2. Fourier Representation of Signals and Systems
8.1 representation of periodic sequences:the discrete fourier series 8.2 the fourier transform of periodic signals 8.3 properties of the discrete fourier.
Electronic Communications: A Systems Approach Beasley | Hymer | Miller Copyright © 2014 by Pearson Education, Inc. All Rights Reserved Information and.
The Discrete Fourier Transform 主講人:虞台文. Content Introduction Representation of Periodic Sequences – DFS (Discrete Fourier Series) Properties of DFS The.
Speech Signal Representations I Seminar Speech Recognition 2002 F.R. Verhage.
Spectral analysis of multiple timeseries Kenneth D. Harris 18/2/15.
CHAPTER 10 Applications of Digital Signal Processing
Chapter 9 Frequency Response and Transfer Function
Chapter 6 Spectrum Estimation § 6.1 Time and Frequency Domain Analysis § 6.2 Fourier Transform in Discrete Form § 6.3 Spectrum Estimator § 6.4 Practical.
Pre-Class Music Paul Lansky Six Fantasies on a Poem by Thomas Campion.
Extends Euclidean space algebra to higher dimensions
Artificial Intelligence 2004 Speech & Natural Language Processing Speech Recognition acoustic signal as input conversion into written words Natural.
EE Audio Signals and Systems Linear Prediction Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Noise Reduction Two Stage Mel-Warped Weiner Filter Approach.
Pitch Estimation by Enhanced Super Resolution determinator By Sunya Santananchai Chia-Ho Ling.
Lecture#10 Spectrum Estimation
Z bigniew Leonowicz, Wroclaw University of Technology Z bigniew Leonowicz, Wroclaw University of Technology, Poland XXIX  IC-SPETO.
EEL 6586: AUTOMATIC SPEECH PROCESSING Speech Features Lecture Mark D. Skowronski Computational Neuro-Engineering Lab University of Florida February 27,
MSc Project Musical Instrument Identification System MIIS Xiang LI ee05m216 Supervisor: Mark Plumbley.
Chapter 2. Fourier Representation of Signals and Systems
Chapter 2 Harmonics and Interharmonics Theory
Query by Singing and Humming System
By Sarita Jondhale 1 Signal preprocessor: “conditions” the speech signal s(n) to new form which is more suitable for the analysis Postprocessor: operate.
The Discrete Fourier Transform
Digital Communications Chapter 1 Signals and Spectra Signal Processing Lab.
Chapter 2. Characteristics of Signal ※ Signal : transmission of information The quality of the information depends on proper selection of a measurement.
A NOVEL SIGNAL PROCESSING METHOD BASED ON THE FREQUENCY MODALITY FOR INTRA-BODY MEDICAL INSTRUMENT TRACKING Dr. DIMITRIS A. FOTIADIS TECHNOLOGICAL EDUCATIONAL.
Content: Distortion at electronic loads
Topic: Waveforms in Noesis
National Mathematics Day
Discrete Fourier Transform (DFT)
Spectral Analysis Spectral analysis is concerned with the determination of the energy or power spectrum of a continuous-time signal It is assumed that.
MECH 373 Instrumentation and Measurements
ARTIFICIAL NEURAL NETWORKS
Vocoders.
Pitch Estimation By Chih-Ti Shih 12/11/2006 Chih-Ti Shih.
1-D DISCRETE COSINE TRANSFORM DCT
Speech Processing Final Project
BeatBox Hero Ankit Gupta Rohan Jain.
Music Signal Processing
Presentation transcript:

Pitch Tracking MUMT 611 Philippe Zaborowski February 2005

Pitch Tracking Goal is to track the fundamental Vast area of research mostly focused on voice coding Dozens of different algorithms All algorithms have limitations None are ideal

Technical Difficulties: Piano

Technical Difficulties: E. Bass

Algorithm Classification Time Domain Spectral Domain Combined Time/Spectral Domain Neural Networks

Time Domain Common Features: Analysis performed on sample basis instead of buffered intervals No transformation needed Cheap on computation Common Drawbacks: Not suited for signals where the fundamental is weak and the harmonics are strong DC offset can be a problem

Time Domain Threshold Crossing (zero crossing)

Time Domain Dolansky (1954)

Time Domain Rabiner and Gold (1969)

Time Domain Autocorrelation (Rabiner 1977)

Time Domain Average Magnitude Difference Function (Ross 1974)

Time Domain Cooper and Ng (1994)

Time/Spectral Domain Least-Square (Choi 1995) Combines the reliability of frequency-domain with high resolution of time-domain Able to analyze shorter signal segments Suitable for real-time Uses constant Q tranform

Spectral Domain Common Features: Transformation from time to spectral domain is computationally intensive Superior control and analysis of formants Common Drawbacks: Simple study of spectrum not enough DFT based algorithms use equally spaced bins

Spectral Domain FFT with different harmonic analysis: Maximum of FFT (Division Method) Piszczalski and Galler (1979) Harmonic Product (Schroeder 1968)

Spectral Domain Constant Q transform (Brown and Puckette 1992)

Spectral Domain Cepstrum (Andrews 1990)

Conclusion Spectral Domain: Give good results Require a demanding analysis of spectrum Time Domain: Generally inferior to spectral domain Some have comparable results with less computation