Introduction to algorithmic models of music cognition David Meredith Aalborg University.



Algorithmic models of music cognition
Most recent theories of music cognition have been rule systems, algorithms or computer programs
Take representation of musical passage as input and output a structural description
Structural description should correctly describe aspects of how a listener interprets the passage

Algorithmic models of music cognition
Models take different types of input
– audio signals representing sound
– representations of notated scores
– piano-roll representations
Type of input depends on purpose of model

Algorithmic models of music cognition
A structural description represents a listener’s interpretation
– so cannot be tested directly
Need to hypothesise how the listener’s interpretation will influence his or her behaviour

Longuet-Higgins’ model (1976)
Computer program that takes a performance of a melody as input and predicts key, pitch names, metre, notated note durations and onsets, phrasing and articulation
OUTPUT: [[[24 C STC] [[-5 G STC] [0 G STC]]] [[1 AB] [-1 G TEN]]] [[[REST] [4 B STC]] [1 C TEN]]

Longuet-Higgins’ model (1976)
Uses score as a ground truth
– Assumes pitch names, metre, phrasing, key, etc. should be as notated in an authoritative score of the passage performed
Note that the fourth note in the output is spelt as an Ab, not a G#

Longuet-Higgins’ model (1976)
Even calculating the notated duration and onset of each note is not trivial, because performed durations and onsets will not correspond exactly to those in the score
– e.g., need to decide whether a timing difference is due to a tempo change or a change in notated value

Longuet-Higgins’ model (1976)
Program assumes that perception of rhythm is independent of perception of tonality
– so the rhythm perceived is not affected by pitch
– actually not strictly true (cf. compound melody)
Assumes metre is independent of dynamics
– can perceive metre on harpsichord and organ, where dynamics are not controlled
Only considers metres in which beats within a single level are equally spaced
– one or two equally spaced beats between consecutive beats at the next higher level
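
The restriction on metres can be pictured with a short sketch. "One or two beats between consecutive beats at the next higher level" means each beat is split into three or two equal parts one level down. The function name `beat_levels` and the list-of-divisors representation are assumptions made for illustration; this is not Longuet-Higgins’ program.

```python
from fractions import Fraction

def beat_levels(bar_length, divisions):
    """Return the beat onsets within one bar for each metrical level.

    bar_length: duration of the bar in any unit.
    divisions: list of 2s and 3s, highest level first; each entry says
    into how many equal parts a beat is split at the next level down
    (hypothetical representation).
    """
    if any(d not in (2, 3) for d in divisions):
        raise ValueError("each level must divide the one above into 2 or 3")
    levels = [[Fraction(0)]]              # top level: the downbeat only
    span = Fraction(bar_length)
    for d in divisions:
        span /= d                         # beats stay equally spaced
        count = int(Fraction(bar_length) / span)
        levels.append([i * span for i in range(count)])
    return levels

# A bar of six quavers heard as two beats of three quavers each
for level in beat_levels(6, [2, 3]):
    print([str(t) for t in level])
```

Swapping the divisors to `[3, 2]` gives the same six quaver onsets at the lowest level but three beats at the middle level, which is the kind of alternative interpretation a metre-finding model has to choose between.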

Longuet-Higgins’ model of rhythm
To start, listener assumes binary metre
Changes interpretation if given enough evidence
– current metre implies a syncopation
– current metre implies excessive change in tempo
If enough evidence, then changes to a metre where no syncopation and/or smaller change in tempo implied

Longuet-Higgins’ model of tonality
Estimates value of sharpness of each note
– i.e., position on line of fifths
Theory has six rules
– First rule says that notes should be spelt so they are as close as possible to the tonic on the line of fifths
– Other rules control how algorithm deals with chromatic intervals and modulations
– e.g., second rule says that if current key implies two consecutive chromatic intervals, then change key so that both become diatonic
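
The first rule can be illustrated with a small sketch. It assumes line-of-fifths positions are numbered with C = 0, G = 1 and F = -1, and that a tie between two equally distant spellings is broken towards the sharp side. Both conventions and the function names are assumptions for illustration, not taken from Longuet-Higgins’ program, and the other five rules are not modelled.

```python
LETTERS = "FCGDAEB"  # one turn of the line of fifths, starting on F

def fifths_to_name(k):
    """Pitch name at line-of-fifths position k (C = 0, G = 1, F = -1)."""
    letter = LETTERS[(k + 1) % 7]
    accidentals = (k + 1) // 7        # > 0 means sharps, < 0 means flats
    if accidentals > 0:
        return letter + "#" * accidentals
    return letter + "b" * -accidentals

def spell(pitch_class, tonic_position):
    """Spell a pitch class (0-11, C = 0) as close as possible to the tonic."""
    # 7 * 7 = 49 = 1 (mod 12), so 7 * pc is one position with this pitch class
    base = (7 * pitch_class) % 12
    # Shift by whole multiples of 12 into the window tonic-5 .. tonic+6
    k = base + 12 * ((tonic_position + 6 - base) // 12)
    return fifths_to_name(k)

print(spell(8, 0))  # tonic C: pitch class 8 is spelt Ab
print(spell(8, 4))  # tonic E: the same pitch class is spelt G#
```

With a tonic of C, pitch class 8 comes out as Ab rather than G#, which is the spelling shown for the fourth note in the OUTPUT example.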

Longuet-Higgins’ model: Output
Section of cor anglais solo from Act III of Wagner’s Tristan und Isolde
– Triplets in first beat of fifth bar
– Grace note in seventh bar
– Output agrees with original score here
In a larger study (Meredith 2006, 2007) LH’s model correctly predicts 98.21% of pitch names in a note corpus
– cf % spelt correctly by Meredith’s PS13s1 algorithm

Lerdahl and Jackendoff’s (1983) Generative Theory of Tonal Music (GTTM)
Probably the most influential and frequently-cited theory in music cognition
Takes a musical surface as input and produces a structural description that predicts aspects of an expert listener’s interpretation
– not entirely clear what information assumed in input
– predicts “final state” of listener’s interpretation, not “real-time” experience of listening

GTTM
Four interacting modules
– Grouping structure: motives, themes, phrases, sections
– Metrical structure: “hierarchical pattern of beats”
– Time-span reduction: how some events elaborate or depend on other events
– Prolongational reduction: the “ebb-and-flow of tension”

GTTM
Each module contains two types of rule
– Well-formedness rules: define a class of possible analyses
– Preference rules: isolate best well-formed analyses
Modules depend on each other (sometimes circularly!)
– Metre requires grouping
– Grouping requires time-span reduction
– Time-span reduction requires metre
Therefore not trivial to implement the theory computationally
– Though some have tried (e.g., Temperley (2001), Hamanaka et al. (2005, 2007))
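
The division of labour between the two rule types can be sketched as "generate, then rank". The toy below groups a melody by onset times only: well-formedness is taken to mean "any partition into contiguous groups", and the two preference rules and their weights are invented for illustration. It is not an implementation of GTTM’s grouping rules.

```python
from itertools import combinations

def well_formed_groupings(n):
    """All partitions of n notes into contiguous groups, each given as a
    tuple of boundary indices (a boundary at i separates notes i-1 and i)."""
    positions = range(1, n)
    for r in range(n):
        yield from combinations(positions, r)

def preference_score(boundaries, onsets):
    """Toy preference rules with assumed weights: reward boundaries at
    long inter-onset gaps and penalise one-note groups."""
    gaps = [b - a for a, b in zip(onsets, onsets[1:])]
    mean_gap = sum(gaps) / len(gaps)
    score = sum(gaps[i - 1] - mean_gap for i in boundaries)
    edges = [0, *boundaries, len(onsets)]
    score -= sum(1.0 for a, b in zip(edges, edges[1:]) if b - a == 1)
    return score

def best_grouping(onsets):
    """Pick the well-formed grouping that the preference rules rate highest."""
    candidates = well_formed_groupings(len(onsets))
    return max(candidates, key=lambda b: preference_score(b, onsets))

onsets = [0.0, 0.5, 1.0, 2.5, 3.0, 3.5]   # a long gap before the fourth note
print(best_grouping(onsets))              # (3,): one boundary, at the long gap
```

Enumerating every candidate only works for very short inputs; it is used here to keep the two rule types visibly separate.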

Temperley and Sleator’s Melisma system
Temperley (2001) presents a computational theory of music cognition, deeply influenced by GTTM
– see Meredith (2002) for a detailed review
Uses well-formedness rules and preference rules like GTTM
Models six aspects of musical structure
– metre
– phrasing
– contrapuntal structure
– pitch-spelling
– harmonic structure
– key-structure

Melisma
Consists of five programs that should be piped as shown at left
Evaluated output by comparison with scores
– 46 excerpts from a harmony text book (Kostka and Payne, 1995a, 1995b)

Melisma
Input in the form of a note-list or piano-roll giving onset time, duration and MIDI note number of each note
Must first infer metre using meter program
But harmony can influence metre and vice-versa, so should use a “two-pass” method as shown
The notelist and beatlist are then given as input to the other programs
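
As a rough picture of this kind of input, the sketch below holds a note-list as (onset, duration, MIDI note number) triples and derives a coarse piano-roll from it. The `Note` type, the millisecond unit and the 250 ms grid are assumptions for illustration; this is not Melisma’s actual file format.

```python
from typing import NamedTuple

class Note(NamedTuple):
    onset: int      # milliseconds from the start (assumed unit)
    duration: int   # milliseconds
    pitch: int      # MIDI note number, 60 = middle C

def piano_roll(notes, step=250):
    """Map each MIDI pitch to the set of time-grid cells in which it sounds."""
    roll = {}
    for n in notes:
        first = n.onset // step
        last = (n.onset + n.duration - 1) // step
        roll.setdefault(n.pitch, set()).update(range(first, last + 1))
    return roll

notes = [Note(0, 500, 60), Note(500, 250, 62), Note(750, 250, 64)]
print(piano_roll(notes))   # {60: {0, 1}, 62: {2}, 64: {3}}
```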

Using Temperley’s model to explain listening, composition, performance and style
Melisma programs scan music from left to right, keeping note of the analyses that best satisfy the preference rules at each point
Ambiguity: Two or more best analyses at a given point
Revision: The best analysis at a given point is not part of the best analysis at a later point
Expectation: We most expect events that lead to an analysis that doesn’t conflict with the preference rules
Style: A piece is in the style of the preference rules if it satisfies them not too well (boring) and does not conflict with them too much (incomprehensible)
Composition: Compose a piece that optimally satisfies the preference rules
Performance: Temporal and dynamic expression aimed at conveying structure that best satisfies the preference rules
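
The definitions of ambiguity and revision can be made concrete with a toy left-to-right scan. An analysis here is one key label per event, and the `FIT` values, the change penalty and the brute-force enumeration are all invented for illustration. This is a stand-in, not a description of how the Melisma programs search or score.

```python
from itertools import product

# Hypothetical fit of a pitch class to a key: 0 = C, 6 = F sharp
FIT = {"C": {0: 2, 6: -1}, "G": {0: 1, 6: 1}}
CHANGE_PENALTY = 3   # assumed cost of switching key between events

def score(events, analysis):
    """Sum of fits minus a penalty for every change of key label."""
    fit = sum(FIT[key][e] for e, key in zip(events, analysis))
    changes = sum(1 for a, b in zip(analysis, analysis[1:]) if a != b)
    return fit - CHANGE_PENALTY * changes

def best_analyses(events):
    """All label sequences that reach the top score for these events."""
    candidates = list(product(FIT, repeat=len(events)))
    top = max(score(events, c) for c in candidates)
    return [c for c in candidates if score(events, c) == top]

def scan(events):
    """Return (point, ambiguous, revised) for each prefix, left to right."""
    report, previous = [], None
    for t in range(1, len(events) + 1):
        best = best_analyses(events[:t])
        ambiguous = len(best) > 1
        # Revised: no best analysis now extends a best analysis from before
        revised = previous is not None and not any(
            b[: t - 1] in previous for b in best)
        report.append((t, ambiguous, revised))
        previous = best
    return report

print(scan([0, 6, 0]))
# [(1, False, False), (2, False, True), (3, True, False)]
```

In this run the second event overturns the initial reading (a revision), and the third leaves two equally good readings (an ambiguity).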

Summary
Can model music cognition using algorithms that generate structural descriptions from musical surfaces
We can evaluate such algorithms by comparing their output with expert analyses and authoritative scores
Some well-developed theories of music cognition take the form of preference-rule systems containing
– Well-formedness rules that define a class of legal analyses
– Preference rules that identify the well-formed analyses that best describe the listener’s experience

References
Hamanaka, M., Hirata, K. & Tojo, S. (2005). ATTA: Automatic time-span tree analyzer based on extended GTTM. Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR 2005), London. pp. 358–
Hamanaka, M., Hirata, K. & Tojo, S. (2007). ATTA: Implementing GTTM on a computer. Proceedings of the Eighth International Conference on Music Information Retrieval (ISMIR 2007), Vienna.
Kostka, S. & Payne, D. (1995a). Tonal Harmony. New York: McGraw-Hill.
Kostka, S. & Payne, D. (1995b). Workbook for Tonal Harmony. New York: McGraw-Hill.
Lerdahl, F. & Jackendoff, R. (1983). A Generative Theory of Tonal Music. Cambridge, MA: MIT Press.
Longuet-Higgins, H. C. (1976). The perception of melodies. Nature, 263(5579).
Longuet-Higgins, H. C. (1987). The perception of melodies. In H. C. Longuet-Higgins (ed.), Mental Processes: Studies in Cognitive Science. London/Cambridge, MA: British Psychological Society/MIT Press.
Meredith, D. (2002). Review of David Temperley’s The Cognition of Basic Musical Structures (Cambridge, MA: MIT Press, 2001). Musicae Scientiae, 6(2).
Meredith, D. (2006). The ps13 pitch spelling algorithm. Journal of New Music Research, 35(2).
Meredith, D. (2007). Computing Pitch Names in Tonal Music: A Comparative Analysis of Pitch Spelling Algorithms. D.Phil. dissertation. Faculty of Music, University of Oxford.
Temperley, D. (2001). The Cognition of Basic Musical Structures. Cambridge, MA: MIT Press.