Applying Recent Advances in Speaker Recognition to the Field of Document Classification CS294-5 Class Project Kofi Boakye Andrew Hatch.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Speed dating Classification What you should know about dating Stephen Cohen Rajesh Ranganath Te Thamrongrattanarit.
Known Non-targets for PLDA-SVM Training/Scoring Construction of Discriminative Kernels from Known and Unknown Non-targets for PLDA-SVM Scoring Results.
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
An Introduction of Support Vector Machine
Rajat Raina Honglak Lee, Roger Grosse Alexis Battle, Chaitanya Ekanadham, Helen Kwong, Benjamin Packer, Narut Sereewattanawoot Andrew Y. Ng Stanford University.
An Introduction of Support Vector Machine
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features Kristen Grauman Trevor Darrell MIT.
CVPR2013 Poster Representing Videos using Mid-level Discriminative Patches.
COMPUTER AIDED DIAGNOSIS: FEATURE SELECTION Prof. Yasser Mostafa Kadah –
Discriminative and generative methods for bags of features
HIWIRE MEETING Torino, March 9-10, 2006 José C. Segura, Javier Ramírez.
Presented by Li-Tal Mashiach Learning to Rank: A Machine Learning Approach to Static Ranking Algorithms for Large Data Sets Student Symposium.
Combining Labeled and Unlabeled Data for Multiclass Text Categorization Rayid Ghani Accenture Technology Labs.
1 LM Approaches to Filtering Richard Schwartz, BBN LM/IR ARDA 2002 September 11-12, 2002 UMASS.
9/20/2004Speech Group Lunch Talk Speaker ID Smorgasbord or How I spent My Summer at ICSI Kofi A. Boakye International Computer Science Institute.
EE225D Final Project Text-Constrained Speaker Recognition Using Hidden Markov Models Kofi A. Boakye EE225D Final Project.
SVM (Support Vector Machines) Base on statistical learning theory choose the kernel before the learning process.
Scalable Text Mining with Sparse Generative Models
Visual Speech Recognition Using Hidden Markov Models Kofi A. Boakye CS280 Course Project.
Discriminative and generative methods for bags of features
Introduction to machine learning
1  The goal is to estimate the error probability of the designed classification system  Error Counting Technique  Let classes  Let data points in class.
SoundSense: Scalable Sound Sensing for People-Centric Application on Mobile Phones Hon Lu, Wei Pan, Nocholas D. lane, Tanzeem Choudhury and Andrew T. Campbell.
Learning to Rank for Information Retrieval
Step 3: Classification Learn a decision rule (classifier) assigning bag-of-features representations of images to different classes Decision boundary Zebra.
1 International Computer Science Institute Data Sampling for Acoustic Model Training Özgür Çetin International Computer Science Institute Andreas Stolcke.
Age and Gender Classification using Modulation Cepstrum Jitendra Ajmera (presented by Christian Müller) Speaker Odyssey 2008.
A Phonotactic-Semantic Paradigm for Automatic Spoken Document Classification Bin MA and Haizhou LI Institute for Infocomm Research Singapore.
This week: overview on pattern recognition (related to machine learning)
1 Scaling multi-class Support Vector Machines using inter- class confusion Author:Shantanu Sunita Sarawagi Sunita Sarawagi Soumen Chakrabarti Soumen Chakrabarti.
©2008 Srikanth Kallurkar, Quantum Leap Innovations, Inc. All rights reserved. Apollo – Automated Content Management System Srikanth Kallurkar Quantum Leap.
Automatic Detection of Tags for Political Blogs Khairun-nisa Hassanali Vasileios Hatzivassiloglou The University.
Support Vector Machines Mei-Chen Yeh 04/20/2010. The Classification Problem Label instances, usually represented by feature vectors, into one of the predefined.
Multimodal Information Analysis for Emotion Recognition
Modern Topics in Multivariate Methods for Data Analysis.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Data Sampling & Progressive Training T. Shinozaki & M. Ostendorf University of Washington In collaboration with L. Atlas.
Object Recognition in Images Slides originally created by Bernd Heisele.
NOISE DETECTION AND CLASSIFICATION IN SPEECH SIGNALS WITH BOOSTING Nobuyuki Miyake, Tetsuya Takiguchi and Yasuo Ariki Department of Computer and System.
1 Opinion Retrieval from Blogs Wei Zhang, Clement Yu, and Weiyi Meng (2007 CIKM)
1 Modeling Long Distance Dependence in Language: Topic Mixtures Versus Dynamic Cache Models Rukmini.M Iyer, Mari Ostendorf.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
CS 1699: Intro to Computer Vision Support Vector Machines Prof. Adriana Kovashka University of Pittsburgh October 29, 2015.
Voice Activity Detection based on OptimallyWeighted Combination of Multiple Features Yusuke Kida and Tatsuya Kawahara School of Informatics, Kyoto University,
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
Support Vector Machines and Gene Function Prediction Brown et al PNAS. CS 466 Saurabh Sinha.
Guest lecture: Feature Selection Alan Qi Dec 2, 2004.
A DYNAMIC APPROACH TO THE SELECTION OF HIGH ORDER N-GRAMS IN PHONOTACTIC LANGUAGE RECOGNITION Mikel Penagarikano, Amparo Varona, Luis Javier Rodriguez-
GENDER AND AGE RECOGNITION FOR VIDEO ANALYTICS SOLUTION PRESENTED BY: SUBHASH REDDY JOLAPURAM.
Iterative similarity based adaptation technique for Cross Domain text classification Under: Prof. Amitabha Mukherjee By: Narendra Roy Roll no: Group:
Speaker Change Detection using Support Vector Machines V.Kartik, D.Srikrishna Satish and C.Chandra Sekhar Speech and Vision Laboratory Department of Computer.
Final Report (30% final score) Bin Liu, PhD, Associate Professor.
Statistical Models for Automatic Speech Recognition Lukáš Burget.
9.913 Pattern Recognition for Vision Class9 - Object Detection and Recognition Bernd Heisele.
A Comprehensive Comparative Study on Term Weighting Schemes for Text Categorization with SVM Lan Man 3 Nov, 2004.
1 ICASSP Paper Survey Presenter: Chen Yi-Ting. 2 Improved Spoken Document Retrieval With Dynamic Key Term Lexicon and Probabilistic Latent Semantic Analysis.
A Maximum Entropy Language Model Integrating N-grams and Topic Dependencies for Conversational Speech Recognition Sanjeev Khudanpur and Jun Wu Johns Hopkins.
Maximum Entropy techniques for exploiting syntactic, semantic and collocational dependencies in Language Modeling Sanjeev Khudanpur, Jun Wu Center for.
Support Vector Machines Optimization objective Machine Learning.
Discriminative n-gram language modeling Brian Roark, Murat Saraclar, Michael Collins Presented by Patty Liu.
CS791 - Technologies of Google Spring A Web­based Kernel Function for Measuring the Similarity of Short Text Snippets By Mehran Sahami, Timothy.
Roughly overview of Support vector machines Reference: 1.Support vector machines and machine learning on documents. Christopher D. Manning, Prabhakar Raghavan.
Qifeng Zhu, Barry Chen, Nelson Morgan, Andreas Stolcke ICSI & SRI
CRF &SVM in Medication Extraction
Machine Learning Week 1.
Learning with information of features
Recognizing Structure: Sentence, Speaker, andTopic Segmentation
Modeling IDS using hybrid intelligent systems
Presentation transcript:

Applying Recent Advances in Speaker Recognition to the Field of Document Classification CS294-5 Class Project Kofi Boakye Andrew Hatch

speech recognizer Here are some words and shit, and some phones… iuf aweufb aijf aiuwedf aiu faiuf awiuef awiue faiuw juie ui uji iuf aiuw iuaw iuaw fiuoaw efoi iua iu dfiouas doia doifab sjkbd iuas uijos djhg wei woie oie aiouw efoiwbef oiweb foiwbe oiw efiow fojaw wue jw8i output “document” expected word/phoneme counts input speech document classifier Speaker recognition can be framed as a document classification problem Motivation

SVM-based techniques for speaker recognition ● Use relative frequencies of phone/word n-grams as features for each utterance, utt i : – Symmetry-based feature transformations: ● variance normalization: ● rank normalization (Andreas Stolcke): – Kernelized log-likelihood ratio (Campbell et al., NIPS 2003):

The Task 20 Newsgroups: Data set consists of 20,000 articles partitioned nearly evenly across 20 Usenet newsgroups Examples: alt.atheism, comp.windows.x, talk.politics.misc For evaluation, data is divided into training (60%) and test (40%) Speaker recognition paradigm adopted for evaluation Obtain true/false decisions for test/target pairs Each document tested against all 20 topics Summary statistic is equal error rate (EER)

The Process SVM classifier used with unigram statistics as features Unigrams selected from a ranked list according to TFIDF score(w) = p(w) * log(N/d(w)) Experimented with: Vocabulary size Smoothed (Good-Turing) counts One-versus-All approach to classifier training taken Experimented with various scaling factors and normalizations for feature vectors Variance, rank, kernelized LLR

Results Note: Present results only from alt.atheism Smoothing yielded no performance improvement Vocabulary expansion made asymptotic gains Inclusion of word stems permitted gains beyond apparent asymptote

Future Work Expand analysis to documents from all topics More sophisticated stemming Reworking of smoothing Multiclass SVMs