THE MANIFOLDS OF SPATIAL HEARING Ramani Duraiswami | Vikas C. Raykar Perceptual Interfaces and Reality Lab University of Maryland, College park.

Slides:



Advertisements
Similar presentations
Computer examples Tenenbaum, de Silva, Langford “A Global Geometric Framework for Nonlinear Dimensionality Reduction”
Advertisements

1 Manifold Alignment for Multitemporal Hyperspectral Image Classification H. Lexie Yang 1, Melba M. Crawford 2 School of Civil Engineering, Purdue University.
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Spatial Perception of Audio J. D. (jj) Johnston Neural Audio Corporation.
Extracting the pinna spectral notches Vikas C. Raykar | Ramani Duraiswami University of Maryland, CollegePark B. Yegnanaryana Indian Institute of Technology,
Manifold Learning Dimensionality Reduction. Outline Introduction Dim. Reduction Manifold Isomap Overall procedure Approximating geodesic dist. Dijkstra’s.
3-D Sound and Spatial Audio MUS_TECH 348. Wightman & Kistler (1989) Headphone simulation of free-field listening I. Stimulus synthesis II. Psychophysical.
Localizing Sounds. When we perceive a sound, we often simultaneously perceive the location of that sound. Even new born infants orient their eyes toward.
Presented by: Mingyuan Zhou Duke University, ECE April 3, 2009
Non-linear Dimensionality Reduction CMPUT 466/551 Nilanjan Ray Prepared on materials from the book Non-linear dimensionality reduction By Lee and Verleysen,
Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.
University of Joensuu Dept. of Computer Science P.O. Box 111 FIN Joensuu Tel fax Isomap Algorithm.
1 High dimensionality Evgeny Maksakov CS533C Department of Computer Science UBC.
“Random Projections on Smooth Manifolds” -A short summary
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
1 Manifold Clustering of Shapes Dragomir Yankov, Eamonn Keogh Dept. of Computer Science & Eng. University of California Riverside.
1 High dimensionality Evgeny Maksakov CS533C Department of Computer Science UBC.
LLE and ISOMAP Analysis of Robot Images Rong Xu. Background Intuition of Dimensionality Reduction Linear Approach –PCA(Principal Component Analysis) Nonlinear.
1 Numerical geometry of non-rigid shapes Spectral Methods Tutorial. Spectral Methods Tutorial 6 © Maks Ovsjanikov tosca.cs.technion.ac.il/book Numerical.
Manifold Learning: ISOMAP Alan O'Connor April 29, 2008.
Three Algorithms for Nonlinear Dimensionality Reduction Haixuan Yang Group Meeting Jan. 011, 2005.
Manifold Learning Using Geodesic Entropic Graphs Alfred O. Hero and Jose Costa Dept. EECS, Dept Biomed. Eng., Dept. Statistics University of Michigan -
Manifold learning and pattern matching with entropic graphs Alfred O. Hero Dept. EECS, Dept Biomed. Eng., Dept. Statistics University of Michigan - Ann.
Spectral centroid 6 harmonics: f0 = 100Hz E.g. 1: Amplitudes: 6; 5.75; 4; 3.2; 2; 1 [(100*6)+(200*5.75)+(300*4)+(400*3.2)+(500*2 )+(600*1)] / = 265.6Hz.
A Global Geometric Framework for Nonlinear Dimensionality Reduction Joshua B. Tenenbaum, Vin de Silva, John C. Langford Presented by Napat Triroj.
Atul Singh Junior Undergraduate CSE, IIT Kanpur.  Dimension reduction is a technique which is used to represent a high dimensional data in a more compact.
NonLinear Dimensionality Reduction or Unfolding Manifolds Tennenbaum|Silva|Langford [Isomap] Roweis|Saul [Locally Linear Embedding] Presented by Vikas.
Lightseminar: Learned Representation in AI An Introduction to Locally Linear Embedding Lawrence K. Saul Sam T. Roweis presented by Chan-Su Lee.
Random Projections of Signal Manifolds Michael Wakin and Richard Baraniuk Random Projections for Manifold Learning Chinmay Hegde, Michael Wakin and Richard.
Nonlinear Dimensionality Reduction by Locally Linear Embedding Sam T. Roweis and Lawrence K. Saul Reference: "Nonlinear dimensionality reduction by locally.
Nonlinear Dimensionality Reduction Approaches. Dimensionality Reduction The goal: The meaningful low-dimensional structures hidden in their high-dimensional.
Manifold learning: Locally Linear Embedding Jieping Ye Department of Computer Science and Engineering Arizona State University
On the Preprocessing and Postprocessing of HRTF Individualization Based on Sparse Representation of Anthropometric Features Jianjun HE, Woon-Seng Gan,
On the Preprocessing and Postprocessing of HRTF Individualization Based on Sparse Representation of Anthropometric Features Jianjun HE, Woon-Seng Gan,
Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.
Graph Embedding: A General Framework for Dimensionality Reduction Dong XU School of Computer Engineering Nanyang Technological University
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Adaptive nonlinear manifolds and their applications to pattern.
IEEE TRANSSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
42nd IEEE Conference on Decision and Control, Dec 9-12, 2003
Data Reduction. 1.Overview 2.The Curse of Dimensionality 3.Data Sampling 4.Binning and Reduction of Cardinality.
Learning a Kernel Matrix for Nonlinear Dimensionality Reduction By K. Weinberger, F. Sha, and L. Saul Presented by Michael Barnathan.
Computer Vision Lab. SNU Young Ki Baik Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)
Computer examples Tenenbaum, de Silva, Langford “A Global Geometric Framework for Nonlinear Dimensionality Reduction”
3-D Sound and Spatial Audio MUS_TECH 348. Physical Modeling Problem: Can we model the physical acoustics of the directional hearing system and thereby.
ISOMAP TRACKING WITH PARTICLE FILTER Presented by Nikhil Rane.
GRASP Learning a Kernel Matrix for Nonlinear Dimensionality Reduction Kilian Q. Weinberger, Fei Sha and Lawrence K. Saul ICML’04 Department of Computer.
Dimensionality Reduction
Manifold learning: MDS and Isomap
Dimensionality Reduction Part 2: Nonlinear Methods
Nonlinear Dimensionality Reduction Approach (ISOMAP)
Jan Kamenický.  Many features ⇒ many dimensions  Dimensionality reduction ◦ Feature extraction (useful representation) ◦ Classification ◦ Visualization.
H. Lexie Yang1, Dr. Melba M. Crawford2
Data Mining Course 0 Manifold learning Xin Yang. Data Mining Course 1 Outline Manifold and Manifold Learning Classical Dimensionality Reduction Semi-Supervised.
Non-Linear Dimensionality Reduction
Project 11: Determining the Intrinsic Dimensionality of a Distribution Okke Formsma, Nicolas Roussis and Per Løwenborg.
Tony Jebara, Columbia University Advanced Machine Learning & Perception Instructor: Tony Jebara.
Implicit Active Shape Models for 3D Segmentation in MR Imaging M. Rousson 1, N. Paragio s 2, R. Deriche 1 1 Odyssée Lab., INRIA Sophia Antipolis, France.
3-D Sound and Spatial Audio MUS_TECH 348. Are IID and ITD sufficient for localization? No, consider the “Cone of Confusion”
On the manifolds of spatial hearing
Fletcher’s band-widening experiment (1940)
SPATIAL HEARING Ability to locate the direction of a sound. Ability to locate the direction of a sound. Localization: In free field Localization: In free.
3-D Sound and Spatial Audio MUS_TECH 348. What do these terms mean? Both terms are very general. “3-D sound” usually implies the perception of point sources.
Spectral Methods for Dimensionality
Nonlinear Dimensionality Reduction
Auditory Localization in Rooms: Acoustic Analysis and Behavior
Unsupervised Riemannian Clustering of Probability Density Functions
Outline Nonlinear Dimension Reduction Brief introduction Isomap LLE
Localizing Sounds.
NonLinear Dimensionality Reduction or Unfolding Manifolds
3 primary cues for auditory localization: Interaural time difference (ITD) Interaural intensity difference Directional transfer function.
Presentation transcript:

THE MANIFOLDS OF SPATIAL HEARING Ramani Duraiswami | Vikas C. Raykar Perceptual Interfaces and Reality Lab University of Maryland, College park

HOW DO HUMANS LOCALIZE A SOUND SOURCE? Primary cues –Interaural Time Difference (ITD) –Interaural Level Difference (ILD) Can explain only localization in the horizontal plane. All points in the cone of confusion have the same ITD and ILD. Other cues –Pinna shape gives elevation cues for higher frequencies. –Torso and Head give elevation cues for lower frequencies.

Source HEAD Left earRight ear

HRTF  The spectral filtering caused by the head, torso and the pinna can be described by the Head Related Transfer Function (HRTF).  Can experimentally measure HRTF for all elevation and azimuth for both ears for different persons. Convolve the source signal with the measured HRIR to create virtual audio

Sample HRIR and HRTF

MANIFOLD REPRESENTATION A HRIR of N samples can be considered as a point in N dimensional space. As the elevation is varied smoothly, the points essentially trace out a one-dimensional manifold in the N-dimensional space.

PERCEPTUAL MANIFOLDS In the N dimensional Euclidean space of the original HRIRs, two HRIRs corresponding to far apart elevations may still be very close to each other. However on the one-dimensional manifold, where we measure the distance between two points as the length of the geodesic on the manifold, they are far apart. If we can unfold this low-dimensional manifold we have a good perceptual representation of the signal.

A Unroll the manifold PCA and MDS see the Euclidean distance What is important is the geodesic distance

NON-LINEAR MANIFOLD LEARNING Nonlinear manifold techniques essentially help to unfold the manifold giving a low dimensional representation. Isomap and Locally Linear Embedding (LLE) are two popular techniques. We use the LLE, since it has a good representational capacity and does not make any assumptions regarding manifold structure.

LLE LLE models local neighborhoods as linear patches and then embeds them in a lower dimensional manifold.

FIT LOCALLY We expect each data point and its neighbors to lie on or close to a locally linear patch of the manifold. Each point can be written as a linear combination of its neighbors. The weights chosen to minimize the reconstruction Error.

CRUX OF LLE The weights that minimize the reconstruction errors are invariant to rotation, rescaling and translation of the data points. The same weights that reconstruct the data points in D dimensions should reconstruct it in the manifold in d dimensions. –The weights characterize the intrinsic geometric properties of each neighborhood

THINK GLOBALLY

HRIR/HRTF MANIFOLDS We used the public domain CIPIC database. We tried to recover the elevation manifolds for both the HRIR and the HRTF with K=2 neighbors.

HRIR MANIFOLD The one-dimensional HRIR manifold recovered by the LLE technique using K=2 neighbors. The same manifold embedded in three dimensions recovered using K=4 neighbors is also shown.

HRTF MANIFOLD The one-dimensional HRTF manifold recovered by the LLE technique using K=2 neighbors. The same manifold embedded in three dimensions recovered using K=4 neighbors is also shown.

CHOICE K The only free parameter that needs to be selected in the LLE algorithm is the number of neighbors K. Large K leads to short-circuit errors. Small K leads to fragmented manifold. K related to the intrinsic dimensionality.

The manifold recovered for different values of K. DIFFERENT K

DIFFERENT AZIMUTHS

DIFFERENT SUBJECTS

A NEW DISTANCE METRIC How to compare any two given HRIRs i.e. how to formulate a distance metric in the space of HRIRs. The distance metric has to be perceptually inspired. The absolute justification however is to do psycho acoustical tests. In the absence of any good perceptual error metric the most commonly used one is the squared log-magnitude error of the spectrum of the HRIRs.

DISTANCE ON THE MANIFOLD It is tough to decide what aspects of a given signal are perceptually relevant. For our case of all HRIRs for different elevation angles, the obvious perceptual information to be extracted is the elevation of the source. A natural measure of distance would be the distance on the extracted one dimensional manifold

(a) The distance matrix using the metric defined in Equation 1 (b) using the distance on the manifold.

HRIR INTERPOLATION It is also possible to go from the manifold to the signal representation. If we want the HRTF for a new elevation we find the value of the lower-dimensional manifold at the required angle. Once we know the value on the manifold it can be written as a linear combination of its neighbors and compute the weights that best linearly reconstructs it from its neighbors. The same weights reconstruct the HRTF in the higher dimensional space.

The actual and the reconstructed HRTF for elevation 0 o and azimuth 0 o.

CONCLUSION We presented a new representation for the HRTFs in terms of the elevation manifold they lie on. We also proposed a new distance metric and a new scheme for HRIR interpolation.

REFERENCES S. Roweis and L. Saul, “Nonlinear dimensionality reduction by locally linear embedding,” Science, vol. 290, pp. 2323– 2326, december J.P. Blauert, Spatial Hearing (Revised Edition), MIT Press, Cambridge, MA, J. C. Middlebrooks and D. M. Green, “Sound localization by human listeners,” Annual Review of Psychology, vol. 42, pp. 135–159, F. L. Wightman and D. J. Kistler, “Monaural sound localization revisited,” Journal of the Acoustical Society of America,vol. 101, no. 2, pp. 1050–1063, Feb H. S. Seung and D. D. Lee, “The manifold ways of perception,” Science, vol. 290, pp. 2268, december J. B. Tenenbaum, V. de Silva, and J. C. Langford, “A global geometric framework for nonlinear dimensionality reduction,” Science, vol. 290, pp. 2319–2323, december V. de Silva and J. B. Tenenbaum, “Local versus global methods for nonlinear dimensionality reduction,” Advances in Neural Information Processing Systems, vol. 15, V. R. Algazi, R. O. Duda, D. M. Thompson, and C. Avendano, “The CIPIC HRTF database,” Proc.2001 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Mohonk Mountain House, New Paltz, NY, pp. 99–102, October M. Balasubramanian, E. L. Schwartz, J. B. Tenenbaum, V. de Silva, and J. C. Langford, “The isomap algorithm and topological stability,” Science, vol. 295, 2002.