Rhythmic Similarity Carmine Casciato MUMT 611 Thursday, March 13, 2005.

Slides:



Advertisements
Similar presentations
Matthias Gruhne, Page 1 Fraunhofer Institut Integrierte Schaltungen Robust Audio Identification for Commercial Applications Matthias.
Advertisements

Robust Speech recognition V. Barreaud LORIA. Mismatch Between Training and Testing n mismatch influences scores n causes of mismatch u Speech Variation.
A Musical Data Mining Primer CS235 – Spring ’03 Dan Berger
Local Discriminative Distance Metrics and Their Real World Applications Local Discriminative Distance Metrics and Their Real World Applications Yang Mu,
Creating Music Videos using Automated Media Analysis Authored by Jonathan Foote, Matthew Cooper, and Andreas Girgensohn Presented by Sukhyung Shin, Ninad.
Computational Rhythm and Beat Analysis Nick Berkner.
Franz de Leon, Kirk Martinez Web and Internet Science Group  School of Electronics and Computer Science  University of Southampton {fadl1d09,
Toward Automatic Music Audio Summary Generation from Signal Analysis Seminar „Communications Engineering“ 11. December 2007 Patricia Signé.
Content-based retrieval of audio Francois Thibault MUMT 614B McGill University.
tracking beat tracking beat Mcgill university :: music technology :: mumt 611>>
LAM: Musical Audio Similarity Michael Casey Centre for Cognition, Computation and Culture Department of Computing Goldsmiths College, University of London.
Energy Characterization and Optimization of Embedded Data Mining Algorithms: A Case Study of the DTW-kNN Framework Huazhong University of Science & Technology,
Content-Based Classification, Search & Retrieval of Audio Erling Wold, Thom Blum, Douglas Keislar, James Wheaton Presented By: Adelle C. Knight.
“Random Projections on Smooth Manifolds” -A short summary
Toward Semantic Indexing and Retrieval Using Hierarchical Audio Models Wei-Ta Chu, Wen-Huang Cheng, Jane Yung-Jen Hsu and Ja-LingWu Multimedia Systems,
Effective Image Database Search via Dimensionality Reduction Anders Bjorholm Dahl and Henrik Aanæs IEEE Computer Society Conference on Computer Vision.
Principal Component Analysis
An Exploration of timbre: its perception, analysis and representation Dr. Deirdre Bolger CNRS-LMS,Paris Invited lecture, Institut für Musikwissenschaft,
CS292 Computational Vision and Language Visual Features - Colour and Texture.
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
Hand Signals Recognition from Video Using 3D Motion Capture Archive Tai-Peng Tian Stan Sclaroff Computer Science Department B OSTON U NIVERSITY I. Introduction.
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING MARCH 2010 Lan-Ying Yeh
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Sound Applications Advanced Multimedia Tamara Berg.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
AdvisorStudent Dr. Jia Li Shaojun Liu Dept. of Computer Science and Engineering, Oakland University 3D Shape Classification Using Conformal Mapping In.
Audio classification Discriminating speech, music and environmental audio Rajas A. Sambhare ECE 539.
Educational Software using Audio to Score Alignment Antoine Gomas supervised by Dr. Tim Collins & Pr. Corinne Mailhes 7 th of September, 2007.
Gwangju Institute of Science and Technology Intelligent Design and Graphics Laboratory Multi-scale tensor voting for feature extraction from unstructured.
Centre for Computational Creativity Semantic Audio Studio Tools and Techniques using MPEG-7 Dr. Michael Casey Centre for Computational Creativity Department.
BACKGROUND LEARNING AND LETTER DETECTION USING TEXTURE WITH PRINCIPAL COMPONENT ANALYSIS (PCA) CIS 601 PROJECT SUMIT BASU FALL 2004.
Student: Mike Jiang Advisor: Dr. Ras, Zbigniew W. Music Information Retrieval.
Music Information Retrieval -or- how to search for (and maybe find) music and do away with incipits Michael Fingerhut Multimedia Library and Engineering.
Jacob Zurasky ECE5526 – Spring 2011
Fundamentals of Music Processing
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
Rhythmic Transcription of MIDI Signals Carmine Casciato MUMT 611 Thursday, February 10, 2005.
Understanding The Semantics of Media Chapter 8 Camilo A. Celis.
Data Extraction using Image Similarity CIS 601 Image Processing Ajay Kumar Yadav.
Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall /14.
A Toolkit for Remote Sensing Enviroinformatics Clustering Fazlul Shahriar, George Bonev Advisors: Michael Grossberg, Irina Gladkova, Srikanth Gottipati.
Dengsheng Zhang and Melissa Chen Yi Lim
Audio Tempo Extraction Presenter: Simon de Leon Date: February 9, 2006 Course: MUMT611.
Singer similarity / identification Francois Thibault MUMT 614B McGill University.
Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.
Duraid Y. Mohammed Philip J. Duncan Francis F. Li. School of Computing Science and Engineering, University of Salford UK Audio Content Analysis in The.
An Approximate Nearest Neighbor Retrieval Scheme for Computationally Intensive Distance Measures Pratyush Bhatt MS by Research(CVIT)
Melodic Similarity Presenter: Greg Eustace. Overview Defining melody Introduction to melodic similarity and its applications Choosing the level of representation.
QBSH Corpus The QBSH corpus provided by Roger Jang [1] consists of recordings of children’s songs from students taking the course “Audio Signal Processing.
MSc Project Musical Instrument Identification System MIIS Xiang LI ee05m216 Supervisor: Mark Plumbley.
Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.
Chapter 15: Classification of Time- Embedded EEG Using Short-Time Principal Component Analysis by Nguyen Duc Thang 5/2009.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching.
Harmonic Models CS 275B/Music 254.
1/16 Dynamic Programming Carmine Casciato MUMT 611 Thursday March 31 st 2005.
Audio Processing Mitch Parry. Resource! Sound Waves and Harmonic Motion.
Audio Processing Mitch Parry. Similar to Image Processing? For images a pixel is the smallest unit The color is a distribution of the spectrum of visible.
Rhythmic Transcription of MIDI Signals
Audio Segmentation, Classification, and Retrieval
Calypso Goes to School Additional resource required: Audio Sound file ‘Calypso Goes to School’
Carmine Casciato MUMT 611 Thursday, March 13, 2005
Artist Identification Based on Song Analysis
Recognition with Expression Variations
Supervised Time Series Pattern Discovery through Local Importance
Carmine Casciato MUMT 611 Thursday, March 13, 2005
EE513 Audio Signals and Systems
Presenter: Simon de Leon Date: March 2, 2006 Course: MUMT611
Presentation on Timbre Similarity
NON-NEGATIVE COMPONENT PARTS OF SOUND FOR CLASSIFICATION Yong-Choon Cho, Seungjin Choi, Sung-Yang Bang Wen-Yi Chu Department of Computer Science &
Measuring the Similarity of Rhythmic Patterns
Presentation transcript:

Rhythmic Similarity Carmine Casciato MUMT 611 Thursday, March 13, 2005

Overview  Music Tech research into rhythmic similarity  Representations  Problem of segmentation

Usage

 Computer accompaniment systems  Ethno-musicological research  Database management  Queries by humming  Genre classification  Speech processing

Paulus and Klapuri (2002)  Detects rhythmic similarity between musical samples containing arbitrary drum/percussive sounds  Removes sinusoidal components from sound to detect noisy percussive sounds, from this produces beat and measure estimations  Extracts loudness and brightness (mean square energy and spectral centroid, respectively) per measure  Dynamic Time Warping (DTW) finds optimal path between feature vectors  No training of system required

Foote, Cooper, and Nam (2002)  Derivation of ‘beat spectrum’  Feature used is power of FFT bins embedded into similarity matrix  Euclidean distance is used as similarity metric, tested against cosine angles between feature vectors, and FFT co- efficients

Ellis and Arroyo (2004)  Principal Component Analysis, a dimension reduction tool  Requires robust segmentation  Poor classification results

EigenRhythms (Ellis and Arroyo 2004)  Useful for generating variations?

Toussaint (2002)  Geometric representations of rhythms  Require extensive segmentation  Tests various distance metrics  Minimum Spanning Trees offer a framework for rhythmic development analysis

Toussaint Representations (2002)

Music Technology References  Ellis, D., and J. Arroyo Eigenrhythms: Drum pattern basis sets for classification and generation. In Proceedings of the International Conference on Music Information Retrieval.  Foote, J., M. Cooper, and U. Nam Audio retrieval by rhythmic similarity. In Proceedings of the International Conference on Music Information Retrieval.  Paulus, J., and A. Klapuri Measuring the similarity of rhythmic patterns. In Proceedings of the International Conference on Music Information Retrieval.  Toussaint, G A mathematical analysis of African, Brazilian, and Cuban clave rhythms. In Proceedings of BRIDGES: Mathematical Connections in Art, Music, and Science.

Other Disciplines  Gabrielsson, A Similarity ratings and dimension analyses of auditory rhythm patterns. Scandinavian Journal of Psychology 14: 138  60.  Lerdahl F., and R. Jackendoff A generative theory of tonal music. Cambridge: The MIT Press.  Powel, D., and P. Essens Perception of temporal patterns. Music Perception 2(4): 411  40.