Metamidi: a tool for automatic metadata extraction from MIDI files Tomás Pérez-García, Jose M. Iñesta, and David Rizo Computer Music Laboratory University.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

KARAOKE FORMATION Pratik Bhanawat (10bec113) Gunjan Gupta Gunjan Gupta (10bec112)
LYRIC-BASED ARTIST NETWORK Derek Gossi CS 765 Fall 2014.
Outline Introduction Music Information Retrieval Classification Process Steps Pitch Histograms Multiple Pitch Detection Algorithm Musical Genre Classification.
Chapter 4: Linear Models for Classification
Probabilistic Generative Models Rong Jin. Probabilistic Generative Model Classify instance x into one of K classes Class prior Density function for class.
Assuming normally distributed data! Naïve Bayes Classifier.
Functional Music Interim Presentation Simon McNeilly Supervisors Dr. Lloyd Allison Dr. Jon McCormack.
Tree structured representation of music for polyphonic music information retrieval David Rizo Departament of Software and Computing Systems University.
People use words to describe music
Object Class Recognition Using Discriminative Local Features Gyuri Dorko and Cordelia Schmid.
ISE 582 Making Music (with MIDI) November 6, 2003 Arpi Mardirossian.
Honours project: Interim presentation Amanda Crawford – Comp. Sci. Honours
Distributed Representations of Sentences and Documents
Naive Bayes model Comp221 tutorial 4 (assignment 1) TA: Zhang Kai.
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
Improving Musical Genre Classification with RBF Networks Douglas Turnbull Department of Computer Science and Engineering University of California, San.
Postgraduate Department of Electrical Engineering PPGEE UFPR - Federal University of Paraná Luis Gustavo Weigert Machado
Image Classification and its Applications
Title Extraction from Bodies of HTML Documents and its Application to Web Page Retrieval Microsoft Research Asia Yunhua Hu, Guomao Xin, Ruihua Song, Guoping.
1 Music Classification Using Significant Repeating Patterns Chang-Rong Lin, Ning-Han Liu, Yi-Hung Wu, Arbee L.P. Chen, Proc. of 9th International Conference,
MACHINE LEARNING TECHNIQUES FOR MUSIC PREDICTION S. Grant Lowe Advisor: Prof. Nick Webb.
©2008 Srikanth Kallurkar, Quantum Leap Innovations, Inc. All rights reserved. Apollo – Automated Content Management System Srikanth Kallurkar Quantum Leap.
JSymbolic Cedar Wingate MUMT 621 Professor Ichiro Fujinaga 22 October 2009.
The identification of interesting web sites Presented by Xiaoshu Cai.
Review of the web page classification approaches and applications Luu-Ngoc Do Quang-Nhat Vo.
Naive Bayes Classifier
Aspects of Music Information Retrieval Will Meurer School of Information University of Texas.
Partially Supervised Classification of Text Documents by Bing Liu, Philip Yu, and Xiaoli Li Presented by: Rick Knowles 7 April 2005.
Learning from Multi-topic Web Documents for Contextual Advertisement KDD 2008.
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
Extracting Melody Lines from Complex Audio Jana Eggink Supervisor: Guy J. Brown University of Sheffield {j.eggink
Detection, Classification and Tracking in a Distributed Wireless Sensor Network Presenter: Hui Cao.
Music Information Retrieval Information Universe Seongmin Lim Dept. of Industrial Engineering Seoul National University.
Automatic music classification and the importance of instrument identification Cory McKay and Ichiro Fujinaga Music Technology Area Faculty of Music McGill.
Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall /14.
Limitations of Cotemporary Classification Algorithms Major limitations of classification algorithms like Adaboost, SVMs, or Naïve Bayes include, Requirement.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
HELUS Middle School Elective Mr. Nosik
Quadratic Classifiers (QC) J.-S. Roger Jang ( 張智星 ) CS Dept., National Taiwan Univ Scientific Computing.
Duraid Y. Mohammed Philip J. Duncan Francis F. Li. School of Computing Science and Engineering, University of Salford UK Audio Content Analysis in The.
Active learning Haidong Shi, Nanyi Zeng Nov,12,2008.
Issues in Automatic Musical Genre Classification Cory McKay.
From “Music” to MIDI Music 253/CS 275A Stanford University.
Improving Music Genre Classification Using Collaborative Tagging Data Ling Chen, Phillip Wright *, Wolfgang Nejdl Leibniz University Hannover * Georgia.
1 Hidden Markov Model: Overview and Applications in MIR MUMT 611, March 2005 Paul Kolesnik MUMT 611, March 2005 Paul Kolesnik.
 Effective Multi-Label Active Learning for Text Classification Bishan yang, Juan-Tao Sun, Tengjiao Wang, Zheng Chen KDD’ 09 Supervisor: Koh Jia-Ling Presenter:
Classification of melody by composer using hidden Markov models Greg Eustace MUMT 614: Music Information Acquisition, Preservation, and Retrieval.
BASS TRACK SELECTION IN MIDI FILES AND MULTIMODAL IMPLICATIONS TO MELODY gPRAI Pattern Recognition and Artificial Intelligence Group Computer Music Laboratory.
Tree structured and combined methods for comparing metered polyphonic music Kjell Lëmstrom David Rizo Valero José Manuel Iñesta CMMR’08 May 21, 2008.
Melody Recognition with Learned Edit Distances Amaury Habrard Laboratoire d’Informatique Fondamentale CNRS Université Aix-Marseille José Manuel Iñesta,
Melody Characterization by a Fuzzy Rule System Pedro J. Ponce de León, David Rizo, José M. Iñesta (DLSI, Univ. Alicante) Rafael Ramírez (MTG, Univ. Pompeu.
Stochastic Text Models for Music Categorization Carlos Pérez-Sancho, José M. Iñesta, David Rizo Pattern Recognition and Artificial Intelligence group Department.
PREDICTING SONG HOTNESS
A content-based System for Music Recommendation and Visualization of User Preference Working on Semantic Notions Dmitry Bogdanov, Martin Haro, Ferdinand.
A Tutorial on Speaker Verification First A. Author, Second B. Author, and Third C. Author.
What is automatic music transcription? Transforming an audio signal of a music performance in a symbolic representation (MIDI or score). Aim: This prototype.
A shallow description framework for musical style recognition Pedro J. Ponce de León, Carlos Pérez-Sancho and José Manuel Iñesta Departamento de Lenguajes.
Genre Classification of Music by Tonal Harmony Carlos Pérez-Sancho, David Rizo Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante,
Melody track identification in music symbolic files David Rizo Pedro J. Ponce de León Antonio Pertusa Carlos Pérez-Sancho José M. Iñesta The 19th International.
Unsupervised Learning Part 2. Topics How to determine the K in K-means? Hierarchical clustering Soft clustering with Gaussian mixture models Expectation-Maximization.
The Greek Audio Dataset
Tomás Pérez-García, Carlos Pérez-Sancho, José M. Iñesta
Julián ALARTE DAVID INSA JOSEP SILVA
Introduction to Music Information Retrieval (MIR)
Aspects of Music Information Retrieval
Presented by Steven Lewis
AUDIO SURVEILLANCE SYSTEMS: SUSPICIOUS SOUND RECOGNITION
Ping LUO*, Fen LIN^, Yuhong XIONG*, Yong ZHAO*, Zhongzhi SHI^
Training Presentation For
Presentation transcript:

metamidi: a tool for automatic metadata extraction from MIDI files Tomás Pérez-García, Jose M. Iñesta, and David Rizo Computer Music Laboratory University of Alicante

Introduction Metadata permit organization, indexing, and retrieval of music information. Metadata are an important subject of research. metamidi is introduced as a tool for metadata extraction from MIDI files: – Global technical – Track content description and metaevents

MIDI Metadata File global metadata: – filename, text metaevents, size, format, number of tracks, resolution, tempo, meter, key, timbral, … Track metatada: – Text metaevents: track name, instrument name, lyrics, markers, … – Content description: duration, occupation, polyphony, pitch range, …

The metamidi tool Downloadable from our group web site – Source code, DTD, XSD Developed in ANSI C version Open source Tested under Linux.

The metamidi tool metamidi –{r|x|l} file [-o fileoutput] Metadata output formats: -r: register -l: line -x: XML -r Midi features name: /Users/inesta/metamidi/straussmyrthenbluten.mid text metaevent: $ size: format: 1 num tracks: 5 resolution: 1024 tempo: tempo changes: 70 meter: 6/8(0),1/4(15360),6/8(24576),1/4(113664),3/4(114688) meter changes: 5 key: FM(0),CM(225280),FM(335872),BbM(385024),FM(495616),BbM(606208) key changes: 6 instruments: 1 percussion: -1 duration: has sysex: Features of track text metaevent: $ channel: -1 duration: duration rate: 1.00 occupation: 0 occupation rate: 0.00 polyphony duration rate: 0.00 max polyphony: 0 avg polyphony: 0.00 low pitch: -1 high pitch: -1 modulations: 0 aftertouches: 0 pitch bends: 0 program changes: $

The metamidi tool metamidi –{r|x|l} file [-o fileoutput] Metadata output formats: -r: register -l: line -x: XML -l '/Users/inesta/metamidi/straussmyrthenbluten.mid';'$';48323;1;5;1024;176.00;70;6/8(0), 1/4(15360),6/8(24576),1/4(113664),3/4(114688);5;FM(0),CM(225280),FM(335872),BbM( ),FM(495616),BbM(606208);6;1;-1; ;0;|0;'$';-1; ;1.00;0;0.00;0.00;0;0.00;- 1;-1;0;0;0;$;|1;'Acoustic Grand Piano';1; ;1.00;606315;0.40;0.47;4;1.69;33;69;0;0;0;1(0);|2;'ACP 4';4; ;1.00; ;0.83;0.71;4;1.85;43;93;0;0;0;1(0);|3;'ACP 3';3; ;1.00;127284;0.08;0.27;9;1.71;48;84;0;0;0;1(0);|4;'B.S. Lengton';2; ;1.00;760688;0.50;0.96;4;2.87;29;70;0;0;0;1(0);

The metamidi tool metamidi –{r|x|l} file [-o fileoutput] Metadata output formats: -r: register -l: line -x: XML -x

Applications Metadata can help in MIDI file indexing, organizing or classifying tasks. In this paper we have presented two examples of the use of the metamidi output in MIR problems: – Melody track selection using track content features – Genre classification using timbral metadata

Melody track selection Metadata are used to compute the probability of a track to contain the melody of a MIDI file. What metadata should be used? – Amount of music information: occupation rate – Melody are usually monophonic: polyphony duration rate, max polyphony, average polypohony – Pitch ranges: lowest pitch, highest pitch

Melody track selection Training phase: assuming a Gaussian distribution for the values of a descriptor d, P(d|M) and P(d|  M) are obtained from the tracks labeled as melodies and not melodies. Test phase: Bayes theorem is used to compute the probability of a track to be a melody:

Melody track selection The decision is taken using a maximum likelihood criterion for all the tracks ( i ) in the file with probability over a threshold,  : CorpusDescr.  S%PRF Clas200High pitch ClasAvg.Poly Jazz200Max.Poly & Low p JazzMax.Poly & Low p Kar200Avg.Poly KarAvg. Poly

Timbre-based genre classification Use timbral information provided by metamidi to classify MIDI files into musical genres also under a probabilistic approach. File representation: x = (1,0,0,0,0,0,0,0,0,1,0,0,0,0,1,0,0,0,0 … 0,0,1,0,0) (128 GM + 3 percussions) Grand pianoElectric bassElectric guitarDrum kit

Timbre-based genre classification The probability for instrument t i in class c j is computed from the training set A MIDI file is assigned to the class c j that maximizes the a posteriori probability # files of c j with t i # files of c j

Timbre-based genre classification Data set: – Hierarchical classes: 3  9 (235) Classical (338) Jazz (283) Popular Baroque (56) Classic (50) Romantic (129) Pre-bop (178) Bop (94) Bossanova (66) Celtic (99) Blues (84) Pop (100)

Timbre-based genre classification AcademicJazzPopular Academic22861 Jazz Popular class experiment – (10 fold crossvalidation) – 93  2 % (39.5 % baseline) (+53.5 %)

Timbre-based genre classification barclasRomPreBopBosCelBlupop Baroque Classic Romantic Prebop Bop bossa Celtic Blues pop class experiment – (10 fold crossvalidation) – 68  5 % (20.8 % baseline) (+47.2 %)

Conclusions metamidi has been designed as a tool for metadata extraction from standard MIDI files. The good performances of the presented examples show the power of metadata automatc extraction tools for helping in the solution of MIR problems.

metamidi: a tool for automatic metadata extraction from MIDI files Tomás Pérez-García, Jose M. Iñesta, and David Rizo Computer Music Laboratory University of Alicante