1 Music Classification Using SVM Ming-jen Wang Chia-Jiu Wang.

Slides:



Advertisements
Similar presentations
Introduction to Support Vector Machines (SVM)
Advertisements

ECG Signal processing (2)
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
ONLINE ARABIC HANDWRITING RECOGNITION By George Kour Supervised by Dr. Raid Saabne.
An Introduction of Support Vector Machine
Classification / Regression Support Vector Machines
Masters Presentation at Griffith University Master of Computer and Information Engineering Magnus Nilsson
An Introduction of Support Vector Machine
Support Vector Machines
SVM—Support Vector Machines
Support vector machine
Machine learning continued Image source:
LOGO Classification IV Lecturer: Dr. Bo Yuan
Introduction The aim the project is to analyse non real time EEG (Electroencephalogram) signal using different mathematical models in Matlab to predict.
Face Recognition & Biometric Systems Support Vector Machines (part 2)
Support Vector Machines (and Kernel Methods in general)
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Fuzzy Support Vector Machines (FSVMs) Weijia Wang, Huanren Zhang, Vijendra Purohit, Aditi Gupta.
TRADING OFF PREDICTION ACCURACY AND POWER CONSUMPTION FOR CONTEXT- AWARE WEARABLE COMPUTING Presented By: Jeff Khoshgozaran.
Artificial Intelligence Statistical learning methods Chapter 20, AIMA (only ANNs & SVMs)
Speaker Adaptation for Vowel Classification
CS 4700: Foundations of Artificial Intelligence
A Kernel-based Support Vector Machine by Peter Axelberg and Johan Löfhede.
Classification of Music According to Genres Using Neural Networks, Genetic Algorithms and Fuzzy Systems.
Lecture outline Support vector machines. Support Vector Machines Find a linear hyperplane (decision boundary) that will separate the data.
Lecture 10: Support Vector Machines
An Introduction to Support Vector Machines Martin Law.
Audio Retrieval David Kauchak cs458 Fall Administrative Assignment 4 Two parts Midterm Average:52.8 Median:52 High:57 In-class “quiz”: 11/13.
Support Vector Machine & Image Classification Applications
Implementing a Speech Recognition System on a GPU using CUDA
Support Vector Machines Mei-Chen Yeh 04/20/2010. The Classification Problem Label instances, usually represented by feature vectors, into one of the predefined.
GA-Based Feature Selection and Parameter Optimization for Support Vector Machine Cheng-Lung Huang, Chieh-Jen Wang Expert Systems with Applications, Volume.
Support Vector Machines Reading: Ben-Hur and Weston, “A User’s Guide to Support Vector Machines” (linked from class web page)
CS Statistical Machine learning Lecture 18 Yuan (Alan) Qi Purdue CS Oct
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Classifiers Given a feature representation for images, how do we learn a model for distinguishing features from different classes? Zebra Non-zebra Decision.
An Introduction to Support Vector Machines (M. Law)
Hidden Markov Classifiers for Music Genres. Igor Karpov Rice University Comp 540 Term Project Fall 2002.
Look who’s talking? Project 3.1 Yannick Thimister Han van Venrooij Bob Verlinden Project DKE Maastricht University.
CISC667, F05, Lec22, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Support Vector Machines I.
컴퓨터 과학부 김명재.  Introduction  Data Preprocessing  Model Selection  Experiments.
An Introduction to Support Vector Machine (SVM)
CSSE463: Image Recognition Day 14 Lab due Weds, 3:25. Lab due Weds, 3:25. My solutions assume that you don't threshold the shapes.ppt image. My solutions.
CSSE463: Image Recognition Day 15 Announcements: Announcements: Lab 5 posted, due Weds, Jan 13. Lab 5 posted, due Weds, Jan 13. Sunset detector posted,
Musical Genre Categorization Using Support Vector Machines Shu Wang.
A Kernel Approach for Learning From Almost Orthogonal Pattern * CIS 525 Class Presentation Professor: Slobodan Vucetic Presenter: Yilian Qin * B. Scholkopf.
Support Vector Machines Reading: Ben-Hur and Weston, “A User’s Guide to Support Vector Machines” (linked from class web page)
A Parallel Mixture of SVMs for Very Large Scale Problems Ronan Collobert Samy Bengio Yoshua Bengio Prepared : S.Y.C. Neural Information Processing Systems,
SVMs in a Nutshell.
Support Vector Machine: An Introduction. (C) by Yu Hen Hu 2 Linear Hyper-plane Classifier For x in the side of o : w T x + b  0; d = +1; For.
Classification of Breast Cancer Cells Using Artificial Neural Networks and Support Vector Machines Emmanuel Contreras Guzman.
High resolution product by SVM. L’Aquila experience and prospects for the validation site R. Anniballe DIET- Sapienza University of Rome.
Non-separable SVM's, and non-linear classification using kernels Jakob Verbeek December 16, 2011 Course website:
Automatic Classification of Audio Data by Carlos H. L. Costa, Jaime D. Valle, Ro L. Koerich IEEE International Conference on Systems, Man, and Cybernetics.
CSSE463: Image Recognition Day 14
PREDICT 422: Practical Machine Learning
Recognition of bumblebee species by their buzzing sound
Schizophrenia Classification Using
Support Vector Machines (SVM)
An Introduction to Support Vector Machines
An Introduction to Support Vector Machines
Pattern Recognition CS479/679 Pattern Recognition Dr. George Bebis
Statistical Learning Dong Liu Dept. EEIS, USTC.
Hyperparameters, bias-variance tradeoff, validation
CSSE463: Image Recognition Day 14
Support Vector Machine _ 2 (SVM)
CSSE463: Image Recognition Day 15
CSE 802. Prepared by Martin Law
Linear Discrimination
Presentation transcript:

1 Music Classification Using SVM Ming-jen Wang Chia-Jiu Wang

2 Outline Introduction Support Vector Machine (SVM) Implementation with SVM Results Comparison with other algorithms Conclusion

3 Music Genre Classification Human can identify music genre easily. (play clips) How could machines perform this task? What would make it easier for machines? What are the differences between the genres?

4 Motivation Apple’s website iTunes MP3.com Napster.com All boast millions of songs and over 15 genres

5 Support Vector Machine Many decision boundaries between two classes of data How to find the optimal boundary? Class 2 Class 1

6 Support Vectors Linear SVM Class 2 Class 1 m w T x i +b = -1 w T x i +b = 0 w T x i +b = 1 x-x- x+x+

7 Optimal Boundary Optimal boundary should be as far away from data points in both classes Maximize margin or minimize w Class 2 Class 1 m w T x i +b = -1 w T x i +b = 0 w T x i +b = 1 x-x- x+x+

8 Constraint Problem Lagrange Multiplier Minimize the function with respect to w and b => => After solving the Quadratic Programming problem, many α are zero. X with non-zero α are called support vectors.

9 Kernel Functions Kernel functions transforms features to a linearly separable space K(x)

10 Common Kernel Functions Polynomial Radial Basis Function Sigmoid

11 Implementation Quadratic Programming MySVM by Stefan Rueping Matlab scripts

12 Example Training data points

13 Example Test data points

14 # svm example set dimension 3 number 20 b format xy

15 Classifying Music Genres Many features to choose from Using FFT spectrum Classical, Jazz and Rock Each genre has its dynamic range

16 Why FFT? Other features such as MFCC (Mel- Frequency Ceptral Coefficient), LPC (Linear Predictive Coding) have been used in other papers. Each sample is formed with only 22.7 ms worth of data. Small number of catagories.

17 Song Collection Total of 18 songs (6 songs per genre) About samples overall Over used for training samples were used for testing

18 Song Collection Artists include Nora Jones, Zoltan Tokos and Budapest Strings, Blink 182, Goo Goo Dolls, Green Day and MatchBox 20 Most of the files are recorded at 128kbps and sampled at 44.1kHz.

19 Feature Extraction Process flow MP3WAVConversion Utility FFT Partition the file into n-second clips Input Vectors

20 Feature Extraction Convert MP3 to Windows wav format Preprocess with Matlab scripts Partition into 1024 point clips Perform 1024-point FFT

21 Evaluation Samples are divided into two pools, training pool and testing pool. Samples in training pool are used to train all 3 SVM. Samples in testing pool are used to evaluate the accuracy.

22 1v1 and 1v2 SVM Instead of training with one class vs. another, train the SVM with one class vs. two classes. [ie: Classical (1) vs Jazz (-1), Classical (1) vs Jazz and Rock (-1)] 1v1 produces better result than 1v2.

23 Certain Combination Produces Better Result ClassicalJazzRock SVMCvJRvCCvJJvRRvCJvR Accuracy (%)

24 Classical Spectrum

25 Classical in Time Domain

26 Jazz Spectrum

27 Jazz in Time Domain

28 Rock Spectrum

29 Rock in Time Domain

30 Sample-Set Method 1 sample-set = 100 individual samples Average the scores for each class Take the class of maximum as the classifier

31 Decision Strategy Chart C CvJCvRJvCJvRRvCRvJ CvJ SVM RvC SVM JvR SVM Sample 90%85%10%45%15%55% Avg Max 87.5% 27.5% 35%

32 Another example R CvJCvRJvCJvRRvCRvJ CvJ SVM RvC SVM JvR SVM Sample 58%15%42%25%85%75% Avg Max 36.5% 33.5% 80%

33 Spreadsheet based on the chart SetclassicalJazzRockclassicaljazzrock CvJCvRJvCJvRRvCRvJaverage max C C C C C C C C C C C C C

34 Individual Result 600 SamplesClassicalJazzRock Classical Jazz41590 Roc00190 Accuracy98%79.5%95%

35 Sample Set Result 300 Sample-setClassicalJazzRock Classical9900 Jazz1966 Rock0494 Accuracy99%96%94%

36 Other Algorithms Neural Network Gaussian Classifier Hidden Markov Model

37 Gaussian Classifier [7] Feature vector used is a conglomeration of different types of features. (mean-centroid, mean-rolloff, mean-flux, mean-zero-crossing, std-centroid, std-rolloff, std-flux, std-zero- crossing and LowEnergy) 6 genres, Classical, Country, Disco, Hiphop, Jazz, Rock. Each classifier is trained by 50 samples each 30 seconds in length.

38 Neural Network Approach [8] Feature vector includes LPC taps, DFT amplitude, log DFT amplitude, IDFT of log DFT amplitude, MFC and Volume. 4 genres: Classical, Rock, Country and Soul/R&B. 8 CDs, 2 of each feature vectors. Half is used for training, half for testing.

39 Comparison with other algorithms AccuracyClassicalJazzRock Gaussian Classifier [7]86%38%49% Neural Network [8]97%n/a93% SVM (individual sample)98%79.5%95% SVM (sample-set)99%96%94%

40 Summary Sample-Set method produces better result than individual samples. SVM results are comparable to Neural Network results Only used one feature

41 Other Applications of SVM Optical Character Recognition Hand-Writing Recognition Image Classification Voice Recognition Protein Structure Prediction

42 Conclusion Viable approach for music classification More distinct features Larger scale evaluation Possible embedded application

43 Questions ???