Stochastic Text Models for Music Categorization Carlos Pérez-Sancho, José M. Iñesta, David Rizo Pattern Recognition and Artificial Intelligence group Department.

Slides:



Advertisements
Similar presentations
Mining customer ratings for product recommendation using the support vector machine and the latent class model William K. Cheung, James T. Kwok, Martin.
Advertisements

Florida International University COP 4770 Introduction of Weka.
Document Summarization using Conditional Random Fields Dou Shen, Jian-Tao Sun, Hua Li, Qiang Yang, Zheng Chen IJCAI 2007 Hao-Chin Chang Department of Computer.
Perceptron Learning Rule
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Combining Classification and Model Trees for Handling Ordinal Problems D. Anyfantis, M. Karagiannopoulos S. B. Kotsiantis, P. E. Pintelas Educational Software.
1 Semi-supervised learning for protein classification Brian R. King Chittibabu Guda, Ph.D. Department of Computer Science University at Albany, SUNY Gen*NY*sis.
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
Oct 14, 2014 Lirong Xia Recommender systems acknowledgment: Li Zhang, UCSC.
Outline Introduction Music Information Retrieval Classification Process Steps Pitch Histograms Multiple Pitch Detection Algorithm Musical Genre Classification.
A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts 04 10, 2014 Hyun Geun Soo Bo Pang and Lillian Lee (2004)
Iowa State University Department of Computer Science Artificial Intelligence Research Laboratory Research supported in part by grants from the National.
Functional Music Interim Presentation Simon McNeilly Supervisors Dr. Lloyd Allison Dr. Jon McCormack.
Tree structured representation of music for polyphonic music information retrieval David Rizo Departament of Software and Computing Systems University.
A Technique for Advanced Dynamic Integration of Multiple Classifiers Alexey Tsymbal*, Seppo Puuronen**, Vagan Terziyan* *Department of Artificial Intelligence.
Language Model. Major role: Language Models help a speech recognizer figure out how likely a word sequence is, independent of the acoustics. A lot of.
Final Presentation Simon McNeilly Supervisors Dr. Lloyd Allison Jon McCormack Melody Generation by Phrase.
5/30/2006EE 148, Spring Visual Categorization with Bags of Keypoints Gabriella Csurka Christopher R. Dance Lixin Fan Jutta Willamowski Cedric Bray.
Scalable Text Mining with Sparse Generative Models
Learning Programs Danielle and Joseph Bennett (and Lorelei) 4 December 2007.
DESIGN OF A SELF- ORGANIZING LEARNING ARRAY SYSTEM Dr. Janusz Starzyk Tsun-Ho Liu Ohio University School of Electrical Engineering and Computer Science.
Rotation Forest: A New Classifier Ensemble Method 交通大學 電子所 蕭晴駿 Juan J. Rodríguez and Ludmila I. Kuncheva.
Text Classification Using Stochastic Keyword Generation Cong Li, Ji-Rong Wen and Hang Li Microsoft Research Asia August 22nd, 2003.
Selective Sampling on Probabilistic Labels Peng Peng, Raymond Chi-Wing Wong CSE, HKUST 1.
JSymbolic and ELVIS Cory McKay Marianopolis College Montreal, Canada.
Polyphonic Queries A Review of Recent Research by Cory Mckay.
TINONS1 Nonlinear SP and Pattern recognition
1 Music Classification Using Significant Repeating Patterns Chang-Rong Lin, Ning-Han Liu, Yi-Hung Wu, Arbee L.P. Chen, Proc. of 9th International Conference,
Data mining and machine learning A brief introduction.
Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Program Computational Intelligence, Learning, and Discovery Program.
Time Series Data Analysis - I Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What are Time Series? How to.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall /14.
Study of Protein Prediction Related Problems Ph.D. candidate Le-Yi WEI 1.
Using Machine Learning Techniques in Stylometry Ramyaa, Congzhou He, Dr. Khaled Rasheed.
Music Software Projects New York University Adjunct Instructor Scott Burton.
Neural Text Categorizer for Exclusive Text Categorization Journal of Information Processing Systems, Vol.4, No.2, June 2008 Taeho Jo* 報告者 : 林昱志.
Mr. Jackson AP Music Theory
Comparative Experiments on Sentiment Classification for Online Product Reviews Hang Cui, Vibhu Mittal, and Mayur Datar AAAI 2006.
Issues in Automatic Musical Genre Classification Cory McKay.
1 Adaptive Subjective Triggers for Opinionated Document Retrieval (WSDM 09’) Kazuhiro Seki, Kuniaki Uehara Date: 11/02/09 Speaker: Hsu, Yu-Wen Advisor:
Validation methods.
Module –I Codes: Weighted and non-weighted codes
1 Hidden Markov Model: Overview and Applications in MIR MUMT 611, March 2005 Paul Kolesnik MUMT 611, March 2005 Paul Kolesnik.
Alex Stabile. Research Questions: Could a computer learn to distinguish between different composers? Why does music by different composers even sound.
BASS TRACK SELECTION IN MIDI FILES AND MULTIMODAL IMPLICATIONS TO MELODY gPRAI Pattern Recognition and Artificial Intelligence Group Computer Music Laboratory.
Metamidi: a tool for automatic metadata extraction from MIDI files Tomás Pérez-García, Jose M. Iñesta, and David Rizo Computer Music Laboratory University.
Learning to analyse tonal music Pl á cido Rom á n Illescas David Rizo Jos é Manuel I ñ esta Pattern recognition and Artificial Intelligence group University.
Melody Recognition with Learned Edit Distances Amaury Habrard Laboratoire d’Informatique Fondamentale CNRS Université Aix-Marseille José Manuel Iñesta,
Melody Characterization by a Fuzzy Rule System Pedro J. Ponce de León, David Rizo, José M. Iñesta (DLSI, Univ. Alicante) Rafael Ramírez (MTG, Univ. Pompeu.
Proximity based one-class classification with Common N-Gram dissimilarity for authorship verification task Magdalena Jankowska, Vlado Kešelj and Evangelos.
Does one size really fit all? Evaluating classifiers in a Bag-of-Visual-Words classification Christian Hentschel, Harald Sack Hasso Plattner Institute.
A Document-Level Sentiment Analysis Approach Using Artificial Neural Network and Sentiment Lexicons Yan Zhu.
A shallow description framework for musical style recognition Pedro J. Ponce de León, Carlos Pérez-Sancho and José Manuel Iñesta Departamento de Lenguajes.
Genre Classification of Music by Tonal Harmony Carlos Pérez-Sancho, David Rizo Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante,
Melody track identification in music symbolic files David Rizo Pedro J. Ponce de León Antonio Pertusa Carlos Pérez-Sancho José M. Iñesta The 19th International.
1 Tempo Induction and Beat Tracking for Audio Signals MUMT 611, February 2005 Assignment 3 Paul Kolesnik.
A Simple Approach for Author Profiling in MapReduce
Learning to Detect and Classify Malicious Executables in the Wild by J
Tomás Pérez-García, Carlos Pérez-Sancho, José M. Iñesta
Boosted Augmented Naive Bayes. Efficient discriminative learning of
A Melody Composer for both Tonal and Non-Tonal Languages
Module 1 Knowledge Organiser
Efficient Estimation of Word Representation in Vector Space
Automatic Language Identification – A Syntactic Approach
Texture Key Words.
Using Bayesian Network in the Construction of a Bi-level Multi-classifier. A Case Study Using Intensive Care Unit Patients Data B. Sierra, N. Serrano,
Automatic Handwriting Generation
Chord Recognition with Application in Melodic Similarity
Presentation transcript:

Stochastic Text Models for Music Categorization Carlos Pérez-Sancho, José M. Iñesta, David Rizo Pattern Recognition and Artificial Intelligence group Department of Software and Computing Systems University of Alicante, Spain

SSPR Outline ► Introduction ► Music encoding  Melody  Harmony ► Experiments  Plain classification  Classifier ensembles  Hierarchical classification ► Conclusions

SSPR Introduction ► Premise: music content can be used to model musical style ► We use language modeling techniques to classify symbolic digital scores ► For that, digital scores need to be encoded into sequences of symbols

SSPR Music encoding ► Two different sources of information Melody Harmony

SSPR Melody encoding ► Polyphonic sequences are reduced to monophonic using skyline ► Pitch intervals and duration ratios are computed for each pair of consecutive notes ► Numeric values are encoded into ASCII symbols (2,×½) → Bf (1,×1) → AZ (-1,×2) → aF (-2,×1) → bZ (2,×1) → BZ (-4,×1) → dZ (-2,×1) → bZ (4,×1) → DZ Bf AZ aF bZ BZ dZ bZ DZ ( interval, duration ratio ) input string

SSPR Harmony encoding ► Chords are encoded as degrees relative to the tonality for transposition invariance ► Only chord changes are encoded Key: E flat VIm V I input string

SSPR Experiments ► Dataset: music from 3 genres and 9 sub-genres (around 60 hours of music) ► Classification techniques  Naïve Bayes  Language modeling (n-grams), classifying by lowest perplexity

SSPR Experimental setup ► First step: plain classification  80% of the dataset used  10-fold cross validation ► Second step: hierarchical classification using classifier ensembles  Weights of the ensembles adjusted using previous results  Remaining 20% of dataset used for validation

SSPR Classification results ► Best classification rates in the 3-classes problem were obtained using harmonic information ► When classifying sub-genres, melody usually performs better ► Naïve Bayes performs better most of the times ► No significant differences for different context sizes in n-grams. Chords (harmony)Melody 2-grams 3-grams 4-grams N.B.

SSPR Confussion matrix ► Misclassifications occur more frequently within broad domains ► Try to prevent intra-domain errors by using a hierarchical classifier ×100 %

SSPR Hierarchical classification

SSPR Hierarchical classification ► Harmony (chord progressions) is used at the first level ► Melody is used at the second level ► Instead of using single classifiers, an ensemble of classifiers is used at each level to increase robustness

SSPR Classifier ensembles ► Decisions are made by weighted majority vote ► Two weighting schemes  Linear best-worst weighting vote  Quadratic best-words weighting vote # errors weight

SSPR Hierarchical classification results ► With the remaining 20% of the dataset Single classifiers 2-grams3-grams4-gramsN.Bayes 3 classes Using harmony classes Using melody Best singleLinearQuadratic 3 classes 88.4 (melody 2-grams) 90.1 Hierarchical classification 9 classes st level (3 classes): harmony 2-grams 2nd level (3x3 classes): academic – melody NB jazz – melody NB popular – melody 4-grams Classifier ensembles

SSPR Conclusions ► Harmony and melody are suitable features for music genre classification ► Harmony is better for classifying broad musical domains, while melody is better for distinguishing sub-genres ► Misclassifications occur more frequently within broad domains ► Hierarchical classification and classifier ensembles outperformed the best single classifiers

Stochastic Text Models for Music Categorization Carlos Pérez-Sancho, José M. Iñesta, David Rizo Pattern Recognition and Artificial Intelligence group Department of Software and Computing Systems University of Alicante, Spain