Automatic Position Calibration of Multiple Microphones

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

Component Analysis (Review)
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
A Geometric Perspective on Machine Learning 何晓飞 浙江大学计算机学院 1.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: The FIR Adaptive Filter The LMS Adaptive Filter Stability and Convergence.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Principal Component Analysis CMPUT 466/551 Nilanjan Ray.
Sam Pfister, Stergios Roumeliotis, Joel Burdick
Volkan Cevher, Marco F. Duarte, and Richard G. Baraniuk European Signal Processing Conference 2008.
Principal Component Analysis
1 Distributed localization of networked cameras Stanislav Funiak Carlos Guestrin Carnegie Mellon University Mark Paskin Stanford University Rahul Sukthankar.
Pattern Recognition Topic 1: Principle Component Analysis Shapiro chap
Uncalibrated Geometry & Stratification Sastry and Yang
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Dimension reduction : PCA and Clustering Slides by Agnieszka Juncker and Chris Workman.
Independent Component Analysis (ICA) and Factor Analysis (FA)
Project Presentation: March 9, 2006
Multiple View Geometry Marc Pollefeys University of North Carolina at Chapel Hill Modified by Philippos Mordohai.
Single View Metrology Class 3. 3D photography course schedule (tentative) LectureExercise Sept 26Introduction- Oct. 3Geometry & Camera modelCamera calibration.
APPROXIMATE EXPRESSIONS FOR THE MEAN AND COVARIANCE OF THE ML ESTIMATIOR FOR ACOUSTIC SOURCE LOCALIZATION Vikas C. Raykar | Ramani Duraiswami Perceptual.
Algorithm Evaluation and Error Analysis class 7 Multiple View Geometry Comp Marc Pollefeys.
NonLinear Dimensionality Reduction or Unfolding Manifolds Tennenbaum|Silva|Langford [Isomap] Roweis|Saul [Locally Linear Embedding] Presented by Vikas.
Nonlinear Dimensionality Reduction by Locally Linear Embedding Sam T. Roweis and Lawrence K. Saul Reference: "Nonlinear dimensionality reduction by locally.
Principles of the Global Positioning System Lecture 10 Prof. Thomas Herring Room A;
Multi-view geometry. Multi-view geometry problems Structure: Given projections of the same 3D point in two or more images, compute the 3D coordinates.
Nonlinear Dimensionality Reduction Approaches. Dimensionality Reduction The goal: The meaningful low-dimensional structures hidden in their high-dimensional.
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Position Calibration of Acoustic Sensors and Actuators on Distributed General Purpose Computing Platforms Vikas Chandrakant Raykar | University of Maryland,
PATTERN RECOGNITION AND MACHINE LEARNING
Computing the Fundamental matrix Peter Praženica FMFI UK May 5, 2008.
July, 2005 Doc: IEEE a Qi, Li, Hara, Kohno (NICT) SlideTG4a1 Project: IEEE P Working Group for Wireless Personal Area Networks (WPANs)
Speech Recognition Pattern Classification. 22 September 2015Veton Këpuska2 Pattern Classification  Introduction  Parametric classifiers  Semi-parametric.
Intel Labs Self Localizing sensors and actuators on Distributed Computing Platforms Vikas Raykar Igor Kozintsev Igor Kozintsev Rainer Lienhart.
November 1, 2012 Presented by Marwan M. Alkhweldi Co-authors Natalia A. Schmid and Matthew C. Valenti Distributed Estimation of a Parametric Field Using.
Chapter 6 BEST Linear Unbiased Estimator (BLUE)
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
Modern Navigation Thomas Herring
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 3: LINEAR MODELS FOR REGRESSION.
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
Computer Vision Lab. SNU Young Ki Baik Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)
ECE 8443 – Pattern Recognition LECTURE 10: HETEROSCEDASTIC LINEAR DISCRIMINANT ANALYSIS AND INDEPENDENT COMPONENT ANALYSIS Objectives: Generalization of.
ELEC 303 – Random Signals Lecture 18 – Classical Statistical Inference, Dr. Farinaz Koushanfar ECE Dept., Rice University Nov 4, 2010.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: ML and Simple Regression Bias of the ML Estimate Variance of the ML Estimate.
Affine Structure from Motion
Modern Navigation Thomas Herring MW 11:00-12:30 Room
A Passive Approach to Sensor Network Localization Rahul Biswas and Sebastian Thrun International Conference on Intelligent Robots and Systems 2004 Presented.
Full-rank Gaussian modeling of convolutive audio mixtures applied to source separation Ngoc Q. K. Duong, Supervisor: R. Gribonval and E. Vincent METISS.
A Flexible New Technique for Camera Calibration Zhengyou Zhang Sung Huh CSPS 643 Individual Presentation 1 February 25,
Two-view geometry. Epipolar Plane – plane containing baseline (1D family) Epipoles = intersections of baseline with image planes = projections of the.
Tony Jebara, Columbia University Advanced Machine Learning & Perception Instructor: Tony Jebara.
1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Analyzing Expression Data: Clustering and Stats Chapter 16.
September, 2005 Doc: IEEE a Qi, Li, Umeda, Hara and Kohno (NICT) SlideTG4a1 Project: IEEE P Working Group for Wireless Personal.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 12: Advanced Discriminant Analysis Objectives:
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: MLLR For Two Gaussians Mean and Variance Adaptation MATLB Example Resources:
Position Calibration of Audio Sensors and Actuators in a Distributed Computing Platform Vikas C. Raykar | Igor Kozintsev | Rainer Lienhart University of.
Parameter estimation class 5 Multiple View Geometry CPSC 689 Slides modified from Marc Pollefeys’ Comp
Learning Theory Reza Shadmehr Distribution of the ML estimates of model parameters Signal dependent noise models.
Computacion Inteligente Least-Square Methods for System Identification.
LECTURE 11: Advanced Discriminant Analysis
LECTURE 09: BAYESIAN ESTIMATION (Cont.)
LECTURE 10: DISCRIMINANT ANALYSIS
Estimating 2-view relationships
Filtering and State Estimation: Basic Concepts
EE513 Audio Signals and Systems
Dimension reduction : PCA and Clustering
LECTURE 15: REESTIMATION, EM AND MIXTURES
LECTURE 09: DISCRIMINANT ANALYSIS
NonLinear Dimensionality Reduction or Unfolding Manifolds
Presentation transcript:

Automatic Position Calibration of Multiple Microphones Vikas Chandrakant Raykar | Ramani Duraiswami Perceptual Interfaces and Reality Lab. | University of Maryland, CollegePark

Motivation Multiple microphones are widely used for applications like source localization, tracking and beamforming. Most applications need to know the precise locations of the microphones. Small uncertainity in the sensor location could make substantial contribution to the overall localization error. In ad-hoc deployed arrays it is tedious and often inaccurate to manually measure using a tape or a laser device. In this paper we describe a method to automatically determine the three dimensional positions of multiple microphones.

Automatically fix a coordinate system Z

If we know the positions of 3 speakers…. Y Distances are not exact Need atleast 3 speakers in 2D. Can use more speakers Find the intersection in the least square sense ? X

If positions of speakers unknown… Consider M Microphones and S speakers. What can we measure? Distance between each speaker and all microphones. Or Time Of Flight (TOF) MxS TOF matrix Assume TOF corrupted by Gaussian noise. Can derive the ML estimate. Calibration signal

Nonlinear Least Squares.. More formally can derive the ML estimate using a Gaussian Noise model speed of sound Find the coordinates of both the microphones as speakers which minimizes

Maximum Likelihood (ML) Estimate.. If noise is Gaussian and independent ML is same as Least squares we can define a noise model and derive the ML estimate i.e. maximize the likelihood ratio observation model parameters to be estimated Gaussian noise

Reference Coordinate System Positive Y axis Similarly in 3D 1.Fix origin (0,0,0) 2.Fix X axis (x1,0,0) 3.Fix Y axis (x2,y2,0) 4.Fix positive Z axis x1,x2,y2>0 Origin X axis Which to choose? Later…

Nonlinear least squares.. Levenberg Marquadrat method Function of a large number of parameters [ 3(M+S)-6 ] Unless we have a good initial guess may not converge to the minima. Approximate initial guess required. If we have M microphones and S speakers [ 3M+3S–6 ] parameters to estimate. [ MS ] TOF observations [ MS ] >= [ 3M+3S – 6 ] If M=S=K then K>=5 Why do we consider M=S ? Later..

Closed form Solution.. Say if we are given all pairwise distances between N points can we get the coordinates. 1 2 3 4 X

Classical Metric Multi Dimensional Scaling dot product matrix Symmetric positive definite rank 3 Say given B can you get X ?....Singular Value Decomposition Same as Principal component Analysis One hitch.. we can measure only the pairwise distance matrix

How to get dot product from the pairwise distance matrix…Cosine Law k j

MDS... If given pairwise distances between cities we can build a map. Instead of pairwise distances we can use pairwise “dissimilarities”. When the distances are Euclidean MDS is equivalent to PCA. Eg. Face recognition, wine tasting Can get the significant cognitive dimensions. Steyvers, M., & Busey, T. (2000). Predicting Similarity Ratings to Faces using Physical Descriptions. In M. Wenger, & J. Townsend (Eds.), Computational, geometric, and process perspectives on facial cognition: Contexts and challenges. Lawrence Erlbaum Associates

Can we use MDS.. ? 1. We do not have the complete pairwise distances X 1. We do not have the complete pairwise distances UNKNOWN UNKNOWN

Forming microphone speaker pairs… Now we know the locations of speakers and microphones close to them. Problem is essentially same as with position of speakers known. Can get a closed form solution using least squares technique. Can refine all the values further by a further ML estimation.

Microphone and speaker The complete algorithm… Approximation Approx Distance matrix Between Microphone Speaker pairs TOF matrix MDS Approx. microphone and speaker locations Nonlinear minimization Nonlinear minimization Exact. microphone and speaker locations Microphone and speaker locations Approx. Microphone locations

Sample result in 2D…

Algorithm Performance… The performance of our algorithm depends on Noise variance in the estimated distances. Number of microphones and speakers. Microphone and speaker geometry One way to study the dependence is to do a lot of monte carlo simulations. Else can derive the covariance matrix and bias of the estimator. The ML estimate is implicitly defined as the minimum of a certain error function. Cannot get an exact analytical expression for the mean and variance. Can use implicit function theorem and Taylors series expansion to get approximate expressions for bias and variance.

Where to place loudspeakers..

Monte Carlo Simulations…

Calibration Signal…

Time Delay Estimation… Compute the cross-correlation between the signals received at the two microphones. The location of the peak in the cross correlation gives an estimate of the delay. Task complicated due to two reasons 1.Background noise. 2.Channel multi-path due to room reverberations. Use Generalized Cross Correlation(GCC). W(w) is the weighting function. PHAT(Phase Transform) Weighting

Experimental Setup…

Results

Related Previous work… J. M. Sachar, H. F. Silverman, and W. R. Patterson III. Position calibration of large-aperture microphone arrays. ICASSP 2002 Y. Rockah and P. M. Schultheiss. Array shape calibration using sources in unknown locations Part II: Near-field sources and estimator implementation. IEEE Trans. Acoust.,Speech, Signal Processing, ASSP-35(6):724-735, June 1987. R. Moses, D. Krishnamurthy, and R. Patterson. A self-localization method for wireless sensor networks. Eurasip Journal on Applied Signal Processing Special Issue on SensorNetworks, 2003(4):348-358, March 2003. J. Weiss and B. Friedlander. Array shape calibration using sources in unknown locations a maximum likelihood approach. IEEE Trans. Acoust., Speech, Signal Processing , 37(12):1958-1966, December 1989.

Our Contributions… Locations of the speakers need not be known. Only constraint is that there showld be a microphone close to a loud speaker. In a practical setup attach a microphone to a louspeaker. Derived the theoretical variance of the estimator. Where to place the loudspeakers?

Acknowledgements… Dr. Dmitry Zotkin for building the microphone array. Dr. Elena Grassi and Zhiyun Li for the data capture boards,

Thank You ! | Questions ?