Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

ECG Signal processing (2)
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
SVM - Support Vector Machines A new classification method for both linear and nonlinear data It uses a nonlinear mapping to transform the original training.
Pattern Recognition and Machine Learning
CHAPTER 10: Linear Discrimination
An Introduction of Support Vector Machine
Support Vector Machines and Kernels Adapted from slides by Tim Oates Cognition, Robotics, and Learning (CORAL) Lab University of Maryland Baltimore County.
Support Vector Machines
Machine learning continued Image source:
An Overview of Machine Learning
Vineel Pratap Girish Govind Abhilash Veeragouni. Human listeners are capable of extracting information from the acoustic signal beyond just the linguistic.
Speech in Multimedia Hao Jiang Computer Science Department Boston College Oct. 9, 2007.
Pattern Recognition and Machine Learning
Page 0 of 8 Time Series Classification – phoneme recognition in reconstructed phase space Sanjay Patil Intelligent Electronics Systems Human and Systems.
Speaker Adaptation for Vowel Classification
Machine Learning CMPT 726 Simon Fraser University
1 lBayesian Estimation (BE) l Bayesian Parameter Estimation: Gaussian Case l Bayesian Parameter Estimation: General Estimation l Problems of Dimensionality.
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
Authors: Anastasis Kounoudes, Anixi Antonakoudi, Vasilis Kekatos
Introduction to machine learning
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Ch. Eick: Support Vector Machines: The Main Ideas Reading Material Support Vector Machines: 1.Textbook 2. First 3 columns of Smola/Schönkopf article on.
Soft Margin Estimation for Speech Recognition Main Reference: Jinyu Li, " SOFT MARGIN ESTIMATION FOR AUTOMATIC SPEECH RECOGNITION," PhD thesis, Georgia.
Isolated-Word Speech Recognition Using Hidden Markov Models
Outline Separating Hyperplanes – Separable Case
0 Pattern Classification, Chapter 3 0 Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda,
Institute of Information Science, Academia Sinica, Taiwan Speaker Verification via Kernel Methods Speaker : Yi-Hsiang Chao Advisor : Hsin-Min Wang.
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Speech Recognition Pattern Classification. 22 September 2015Veton Këpuska2 Pattern Classification  Introduction  Parametric classifiers  Semi-parametric.
7-Speech Recognition Speech Recognition Concepts
International Conference on Intelligent and Advanced Systems 2007 Chee-Ming Ting Sh-Hussain Salleh Tian-Swee Tan A. K. Ariff. Jain-De,Lee.
1 SUPPORT VECTOR MACHINES İsmail GÜNEŞ. 2 What is SVM? A new generation learning system. A new generation learning system. Based on recent advances in.
Machine Learning Using Support Vector Machines (Paper Review) Presented to: Prof. Dr. Mohamed Batouche Prepared By: Asma B. Al-Saleh Amani A. Al-Ajlan.
Overview of Supervised Learning Overview of Supervised Learning2 Outline Linear Regression and Nearest Neighbors method Statistical Decision.
Signature with Text-Dependent and Text-Independent Speech for Robust Identity Verification B. Ly-Van*, R. Blouet**, S. Renouard** S. Garcia-Salicetti*,
An Introduction to Support Vector Machines (M. Law)
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: ML and Simple Regression Bias of the ML Estimate Variance of the ML Estimate.
Optimal Bayes Classification
Speaker Verification Speaker verification uses voice as a biometric to determine the authenticity of a user. Speaker verification systems consist of two.
Speaker Identification by Combining MFCC and Phase Information Longbiao Wang (Nagaoka University of Technologyh, Japan) Seiichi Nakagawa (Toyohashi University.
Linear Models for Classification
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Speech Communication Lab, State University of New York at Binghamton Dimensionality Reduction Methods for HMM Phonetic Recognition Hongbing Hu, Stephen.
Automatic Speech Recognition A summary of contributions from multiple disciplines Mark D. Skowronski Computational Neuro-Engineering Lab Electrical and.
Final Exam Review CS479/679 Pattern Recognition Dr. George Bebis 1.
Elements of Pattern Recognition CNS/EE Lecture 5 M. Weber P. Perona.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 12: Advanced Discriminant Analysis Objectives:
CS Statistical Machine learning Lecture 12 Yuan (Alan) Qi Purdue CS Oct
METU Informatics Institute Min720 Pattern Classification with Bio-Medical Applications Part 9: Review.
1 Electrical and Computer Engineering Binghamton University, State University of New York Electrical and Computer Engineering Binghamton University, State.
Speaker Verification System Middle Term Presentation Performed by: Barak Benita & Daniel Adler Instructor: Erez Sabag.
Next, this study employed SVM to classify the emotion label for each EEG segment. The basic idea is to project input data onto a higher dimensional feature.
Neural networks (2) Reminder Avoiding overfitting Deep neural network Brief summary of supervised learning methods.
High resolution product by SVM. L’Aquila experience and prospects for the validation site R. Anniballe DIET- Sapienza University of Rome.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Neural networks and support vector machines
LECTURE 11: Advanced Discriminant Analysis
Chapter 3: Maximum-Likelihood and Bayesian Parameter Estimation (part 2)
Pattern Recognition CS479/679 Pattern Recognition Dr. George Bebis
EE513 Audio Signals and Systems
Generally Discriminant Analysis
CSCE833 Machine Learning Lecture 9 Linear Discriminant Analysis
Parametric Methods Berlin Chen, 2005 References:
Linear Discrimination
Chapter 3: Maximum-Likelihood and Bayesian Parameter Estimation (part 2)
A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models Jeff A. Bilmes International.
NON-NEGATIVE COMPONENT PARTS OF SOUND FOR CLASSIFICATION Yong-Choon Cho, Seungjin Choi, Sung-Yang Bang Wen-Yi Chu Department of Computer Science &
A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models Jeff A. Bilmes International.
Presentation transcript:

Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System using SVM

Page 1 of 12 Research Progress: Jun-Won Suh Outline – Summary of Ph.d Dissertation of Vincent Wan Speaker verification system  Extracting features Creating models of speakers  Generative models, discriminative models  Making generative models discriminative Developing speaker verification using SVMs My interest to improve our system.

Page 2 of 12 Research Progress: Jun-Won Suh Speaker verification system Authenticate a person’s claimed identity Text dependent and independent  The system models the sound of the client’s voice. (based on physical characteristics of the client’s vocal tract.) A generic speaker verification system Feature extraction Enrolment  Creates a model for client’s voice Pattern matching Decision theory

Page 3 of 12 Research Progress: Jun-Won Suh Extracting features Building models of speakers depends on frequency analysis of the speaker’s voice. Linear predictive coding (LPC)  LPC assumes that speech can be modelled as the output of periodic pulses or random noise.  The solutions for these LPC coefficients is obtained by minimizing MSE. Perceptual linear prediction (PLP)  PLP combines LPC analysis with psychophysics knowledge of the human auditory system.  Ex: Human ear has a higher frequency resolution at low frequencies.

Page 4 of 12 Research Progress: Jun-Won Suh Creating models of speakers Generative models  Gaussian Mixture Model (GMM), Hidden Markov Model (HMM)  Models are probability density estimators that attempt to capture all of the fluctuations and variations of the data. Discriminative models  Polynomial classifiers, Support Vector Machines (SVM)  Models are optimized to minimize the error on a set of training samples.  Models draw the boundary between classes and ignores the fluctuations within each class. Generative models discriminative  Generative models use to estimate the within class probability densities and do not minimize a classification error.  Discriminative models achieves the highest performance in classification tasks.

Page 5 of 12 Research Progress: Jun-Won Suh Making generative models discriminative GMM-LR/SVM combination  GMM likelihood ratio  Bengio proposed that the probability estimates are not perfect and a better version would be  Bayes decision rule  The input to the SVM is the two dimensional vector made up of the log likelihoods of the client and world models.  A limitation of these approaches arises from frame basis discrimination.

Page 6 of 12 Research Progress: Jun-Won Suh Importance of kernels Early SVM using polynomial and RBF kernels  Optimization problems requiring significant computational resources that were unsustainable.  Employing cluster algorithms to reduce the accuracy.  Frame level training inputs discard the useful speaker classification information. SVM using score-space kernels  The variable length of utterance can be classified by sequence level.

Page 7 of 12 Research Progress: Jun-Won Suh Classifying sequences using score-space kernels The score-space kernel enables SVMs to classify whole sequences. A variable length sequence of input vectors is mapped explicitly onto a single point in a space of fixed dimension. The score-space is derived from the likelihood score. The likelihood ratio score-space

Page 8 of 12 Research Progress: Jun-Won Suh Computing the score-space vectors Define the global likelihood of a sequence X = {x 1, …, x N l }

Page 9 of 12 Research Progress: Jun-Won Suh Computing the score-space vectors The fixed length vectors of the likelihood ration kernel can be expressed as The final likelihood ratio kernel is The dimensionality of the score-space is equal to the total number of parameters in the generative models. Hence the SVM can classify the complete utterance sequences.

Page 10 of 12 Research Progress: Jun-Won Suh Experiment Results on PolyVar The data has a noise. The data has a much more clients tests than YOHO.

Page 11 of 12 Research Progress: Jun-Won Suh Conclusion Add GMM-LR/SVM model in our verification system Add score-space kernel on SVM  Need to compare the computation requirement for Fisher and LR kernels.

Page 12 of 12 Research Progress: Jun-Won Suh References V. Wan, Speaker Verification using Support Vector Machines, University of Sheffield, June 2003 V. Wan, Building Sequence Kernels for Speaker Verificaiton and Speech Recognition, University of Sheffield S. Bengio, and J. Marithoz, Learning the Decision Function for the Speaker Verification, IDIAP, 2001