Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14.

Slides:



Advertisements
Similar presentations
Speaker Associate Professor Ning-Han Liu. What’s MIR  Music information retrieval (MIR) is the interdisciplinary science of retrieving information from.
Advertisements

Note-level Music Transcription by Maximum Likelihood Sampling Zhiyao Duan ¹ & David Temperley ² 1.Department of Electrical and Computer Engineering 2.Eastman.
Franz de Leon, Kirk Martinez Web and Internet Science Group  School of Electronics and Computer Science  University of Southampton {fadl1d09,
Toward Automatic Music Audio Summary Generation from Signal Analysis Seminar „Communications Engineering“ 11. December 2007 Patricia Signé.
Enabling Access to Sound Archives through Integration, Enrichment and Retrieval Report about polyphonic music transcription.
Outline Introduction Music Information Retrieval Classification Process Steps Pitch Histograms Multiple Pitch Detection Algorithm Musical Genre Classification.
tracking beat tracking beat Mcgill university :: music technology :: mumt 611>>
ECE 8443 – Pattern Recognition Objectives: Course Introduction Typical Applications Resources: Syllabus Internet Books and Notes D.H.S: Chapter 1 Glossary.
1 Machine learning for note onset detection. Alexandre Lacoste & Douglas Eck.
Rhythmic Similarity Carmine Casciato MUMT 611 Thursday, March 13, 2005.
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise T. Scott Brandes IEEE Transactions.
Pitch Recognition with Wavelets Final Presentation by Stephen Geiger.
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
Multiple Pitch Tracking for Blind Source Separation Using a Single Microphone Joseph Tabrikian Dept. of Electrical and Computer Engineering Ben-Gurion.
Learning to Align Polyphonic Music. Slide 1 Learning to Align Polyphonic Music Shai Shalev-Shwartz Hebrew University, Jerusalem Joint work with Yoram.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Introduction to MIR Course Overview 1.
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING MARCH 2010 Lan-Ying Yeh
/14 Automated Transcription of Polyphonic Piano Music A Brief Literature Review Catherine Lai MUMT-611 MIR February 17,
Harmonically Informed Multi-pitch Tracking Zhiyao Duan, Jinyu Han and Bryan Pardo EECS Dept., Northwestern Univ. Interactive Audio Lab,
1 AUTOMATIC TRANSCRIPTION OF PIANO MUSIC - SARA CORFINI LANGUAGE AND INTELLIGENCE U N I V E R S I T Y O F P I S A DEPARTMENT OF COMPUTER SCIENCE Automatic.
Polyphonic Queries A Review of Recent Research by Cory Mckay.
Instrument Recognition in Polyphonic Music Jana Eggink Supervisor: Guy J. Brown University of Sheffield
A Time Based Approach to Musical Pattern Discovery in Polyphonic Music Tamar Berman Graduate School of Library and Information Science University of Illinois.
9 th Conference on Telecommunications – Conftele 2013 Castelo Branco, Portugal, May 8-10, 2013 Sara Candeias 1 Dirce Celorico 1 Jorge Proença 1 Arlindo.
Theoretic and artistic research studying opportunities of symbiosis of Western and non-Western musical idiom Olmo Cornelis.
7-Speech Recognition Speech Recognition Concepts
Polyphonic Music Transcription Using A Dynamic Graphical Model Barry Rafkind E6820 Speech and Audio Signal Processing Wednesday, March 9th, 2005.
Fundamentals of Music Processing
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
TEMPLATE DESIGN © Zhiyao Duan 1,2, Lie Lu 1, and Changshui Zhang 2 1. Microsoft Research Asia (MSRA), Beijing, China.2.
Rhythmic Transcription of MIDI Signals Carmine Casciato MUMT 611 Thursday, February 10, 2005.
Structure Discovery of Pop Music Using HHMM E6820 Project Jessie Hsu 03/09/05.
Artificial Neural Networks Bruno Angeles McGill University – Schulich School of Music MUMT-621 Fall 2009.
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise T. Scott Brandes IEEE Transactions.
Introduction to Onset Detection Functions HAO-HSUN LI 1/30.
Audio Tempo Extraction Presenter: Simon de Leon Date: February 9, 2006 Course: MUMT611.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
PhD Candidate: Tao Ma Advised by: Dr. Joseph Picone Institute for Signal and Information Processing (ISIP) Mississippi State University Linear Dynamic.
THE 2005 MUSIC INFORMATION RETRIEVAL EVALUATION EXCHANGE (MIREX 2005) RESULTS OVERVIEW Graduate School of Library and Information Science J. Stephen DownieKris.
Mestrado em Ciência de Computadores Mestrado Integrado em Engenharia de Redes e Sistemas Informáticos VC 15/16 – TP14 Pattern Recognition Miguel Tavares.
Arlindo Veiga Dirce Celorico Jorge Proença Sara Candeias Fernando Perdigão Prosodic and Phonetic Features for Speaking Styles Classification and Detection.
Automatic Transcription System of Kashino et al. MUMT 611 Doug Van Nort.
Query by Singing and Humming System
1 Hidden Markov Model: Overview and Applications in MIR MUMT 611, March 2005 Paul Kolesnik MUMT 611, March 2005 Paul Kolesnik.
Piano Music Transcription Wes “Crusher” Hatch MUMT-614 Thurs., Feb.13.
Statistical techniques for video analysis and searching chapter Anton Korotygin.
Classification of melody by composer using hidden Markov models Greg Eustace MUMT 614: Music Information Acquisition, Preservation, and Retrieval.
BASS TRACK SELECTION IN MIDI FILES AND MULTIMODAL IMPLICATIONS TO MELODY gPRAI Pattern Recognition and Artificial Intelligence Group Computer Music Laboratory.
Automatic Music Transcription: Employing Hidden Markov Models to assist with Multiple Fundamental Pitch Estimation ASHWIN D. D’CRUZ Acting Supervisor:
Stochastic Text Models for Music Categorization Carlos Pérez-Sancho, José M. Iñesta, David Rizo Pattern Recognition and Artificial Intelligence group Department.
Elements of Music. Melody Single line of notes heard in succession as unit Phrases Cadences—Points of arrival/rest Conjunct vs. disjunct motion Contour:
Christoph Prinz / Automatic Speech Recognition Research Progress Hits the Road.
Genre Classification of Music by Tonal Harmony Carlos Pérez-Sancho, David Rizo Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante,
Audio Processing Mitch Parry. Resource! Sound Waves and Harmonic Motion.
1 Tempo Induction and Beat Tracking for Audio Signals MUMT 611, February 2005 Assignment 3 Paul Kolesnik.
Automatic Transcription of Polyphonic Music
Course Outline (6 Weeks) for Professor K.H Wong
Introduction to ISMIR/MIREX
David Sears MUMT November 2009
Rhythmic Transcription of MIDI Signals
Catherine Lai MUMT-611 MIR February 17, 2005
Introduction to Music Information Retrieval (MIR)
Term Project Presentation By: Keerthi C Nagaraj Dated: 30th April 2003
Presenter: Simon de Leon Date: March 2, 2006 Course: MUMT611
Voice Separation: A 15-minute Introduction
Measuring the Similarity of Rhythmic Patterns
Harmonically Informed Multi-pitch Tracking
Music Signal Processing
Presentation transcript:

Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall /14

Outline Polyphonic vs Monophonic The Human Method Issues Methods Evaluation of Transcription Methods 2/14

Polyphonic vs Monotonic Transcription Audio signal with musical content  symbolic format Monophonic: a single instrument playing individual notes Polyphonic transcription: A single instrument playing several concurrent notes An instrument playing individual notes with other instruments An instrument playing several concurrent notes with other instruments Several instruments playing concurrent notes 3/14

The Human Method 1.Initial sketch  Sections and key phrases 2.Chord scheme or bass line 3.Melody and counter–melodies Instrument playback Musical knowledge Beat tracking Style detection Instrument identification Hainsworth and Macleod (2004) 4/14

Issues with Polyphonic Transcription Restrictions: instrument, genre, maximum polyphony Transcription: pitch + timing + attack + release Overlapping harmonics  difficult frequency–based analysis Solution: better understanding and reproduction of the human method? 5/14

A Few Definitions Musical note: “a discrete note pitch with a specific onset and an offset time” Melodies: “consecutive note sequences with organized and recognizable shape” Chords: “combinations of simultaneously sounding notes” Ryynänen and Klapuri (2005) 6/14

Methods Bottom–up methods: no high–level analysis Blackboard systems: develop hypotheses at various levels Model–based algorithms: high– and low–level analysis + parameter extraction Common method: 1.Preprocessing (e.g., lowpass filtering) 2.Event extraction 3.Classification: Support Vector Machines, Hidden Markov Models, Neural Networks, Gaussian Mixture Models, etc. 4.Postprocessing (e.g., remove outliers) Other methods: probabilistic note-based, matrix factorization 7/14

Evaluation of Transcription Methods N C : number of correct events detected N D : total number of events detected N: actual number of events Precision: P = N C / N D Recall: R = N C / N F–Measure: F = 2RP/(R+P) = 2 N C / (N + N D ) 8/14

Evaluation of Transcription Methods Gillet and Richard (2005) 2 drummers Different musical sequences Celtic & Groove 5/4: Many ghost notes Table 4 of Gillet and Richard (2005) 9/14

Evaluation of Transcription Methods Ryynänen and Klapuri (2005): R ≈ P ≈ 40% Multiple instruments Multiple concurrent notes Hainsworth and Macleod (2005): P = 78.7% Bass guitar Polyphonic context Gillet and Richard (2005): R ≈ P ≈ 84% Drums 10/14 Ryynänen and Klapuri (2005):

Conclusion Analogy with speech recognition The problem has not yet been solved Still easier for humans to do – although complex Combine with source separation? 11/14

Thank you! 12/14

References Bruno, I., S. L. Monni, and P. Nesi Automatic music transcription supporting different instruments. In Proceedings of the Third International Conference on Web Delivering of Music. Leeds, UK. 37–44. Cemgil, A. T., B. Kappen, and D. Barber Generative model based polyphonic music transcription. In Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. New Paltz, NY, USA. 181–4. Gillet, O. K., and G. Richard Drum Track Transcription of Polyphonic Music Using Noise Subspace Projection. In Proceedings of the International Conference on Music Information Retrieval. London, UK. 92–99. Hainsworth, S. W Analysis of musical audio for polyphonic transcription. In First Year Ph.D. Report. Hainsworth, S. W., and M. D. Macleod Automatic bass line transcription from polyphonic music. In Proceedings of the International Computer Music Conference. Havana, Cuba. Hainsworth, S. W., and M. D. Macleod The automated music transcription problem. In Cambridge University Engineering Department. Klapuri, A Signal processing methods for the automatic transcription of music. In Ph.D. Dissertation. 13/14

References Lidy, T., A. Rauber, A. Pertusa, and J. M. Iñesta Improving genre classification by combination of audio and symbolic descriptors using a transcription system. In Proceedings of the International Conference on Music Information Retrieval. Vienna, Austria. 61–6. Marolt, M SONIC: Transcription of polyphonic piano music with neural networks. In Proceedings of the Workshop on Current Research Directions in Computer Music. Barcelona, Spain. 217–24. Nichols, E., and C. Raphael Automatic transcription of music audio through continuous parameter tracking. In Proceedings of the International Conference on Music Information Retrieval. Vienna, Austria. 387–92. Niedermayer, B Non-negative matrix division for the automatic transcription of polyphonic music. In Proceedings of the International Conference on Music Information Retrieval. Philadelphia, PA, USA. 544–9. Paulus, J Acoustic modelling of drum sounds with Hidden Markov Models for music transcription. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. Toulouse, France. Pertusa, A., and J. M. Iñesta Polyphonic music transcription through dynamic networks and spectral pattern identification. In IAPR International Workshop on Artificial Neural Networks in Pattern Recognition. 19–25. Ryynänen, M., and A. Klapuri Transcription of the singing melody in polyphonic music. In Proceedings of the International Conference on Music Information Retrieval. Victoria, BC, Canada. 14/14