Intel Labs
Self-Localizing Sensors and Actuators on Distributed Computing Platforms
Vikas Raykar, Igor Kozintsev, Rainer Lienhart
Motivation
Many multimedia applications are emerging which use multiple audio/video sensors and actuators:
- Microphones and cameras: distributed capture
- Speakers and displays: distributed rendering
- Other applications: number crunching
Applications
- Audio/video surveillance
- Hands-free voice communication
- Multi-channel speech enhancement
- Smart conference rooms
- Audio/image-based rendering
- Object localization and tracking
- Meeting recording
- Distributed audio/video capture
- Interactive audio-visual interfaces
- Multi-channel echo cancellation
- Speech recognition
- Source separation and dereverberation
Additional Motivation
- Current work has focused on setting up all the sensors and actuators on a single dedicated computing platform, which requires dedicated infrastructure in terms of sensors, multi-channel interface cards, and computing power.
- On the other hand, computing devices such as laptops, PDAs, tablets, cellular phones, and camcorders have become pervasive.
- The audio/video sensors on different laptops can therefore be used to form a distributed network of sensors.
Problem Formulation
Put all the distributed audio-visual I/O capabilities into a common time and space. In this paper:
- Focus on providing a common space by actively estimating the 3D positions of the sensors (microphones) and actuators (speakers).
- Account for the errors due to the lack of temporal synchronization among the various sensors and actuators (A/Ds and D/As) on distributed general-purpose computing platforms.
Our View of a Distributed Sensor Network
[Figure: distributed sensors and actuators placed in a common X-Y-Z coordinate frame]
Localization with Known Positions of Speakers
The measured distances are not exact, and there are more speakers than the minimum required, so the position is estimated in a least-squares sense.
If the Positions of the Speakers Are Unknown…
Consider M microphones and S speakers. What can we measure? The distance between each speaker and all microphones (Time Of Flight, TOF), obtained using a calibration signal, giving an M×S TOF matrix. Assuming the TOF is corrupted by AWGN, we can derive the ML estimate.
Nonlinear Least Squares
Find the microphone and speaker coordinates that minimize the sum of squared errors between the measured and predicted distances:
F = Σ_i Σ_j ( d̂_ij − ‖m_i − s_j‖ )²
where d̂_ij is the distance implied by the measured TOF, m_i is the position of microphone i, and s_j the position of speaker j.
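The least-squares objective can be sketched as follows; the function names, array shapes, and the speed-of-sound value are illustrative assumptions, not part of the original method description:

```python
import numpy as np

def tof_residuals(mics, spks, tof, c=342.0):
    """Residuals between measured and predicted times of flight.

    mics: (M, 3) microphone coordinates, spks: (S, 3) speaker coordinates,
    tof: (M, S) measured time-of-flight matrix, c: assumed speed of sound (m/s).
    """
    # Predicted TOF is the Euclidean distance divided by the speed of sound.
    dists = np.linalg.norm(mics[:, None, :] - spks[None, :, :], axis=2)
    return (tof - dists / c).ravel()

def ls_cost(mics, spks, tof, c=342.0):
    # Nonlinear least-squares objective: sum of squared TOF residuals.
    return np.sum(tof_residuals(mics, spks, tof, c) ** 2)
```

With noiseless TOF measurements the cost is zero at the true coordinates; with noisy measurements, the minimizer is the least-squares estimate.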
Maximum Likelihood (ML) Estimate
More rigorously, we can define a noise model and derive the ML estimate, i.e., maximize the likelihood. If the noise is i.i.d. Gaussian, the ML estimate is the same as the least-squares solution.
Reference Coordinate System
The solution is determined only up to a rigid transform, so we fix a reference frame (shown for 2D; similarly in 3D):
1. Fix the origin: (0, 0, 0)
2. Fix the X axis: (x1, 0, 0)
3. Fix the Y axis: (x2, y2, 0)
4. Fix the positive Z axis: x1, x2, y2 > 0
Which nodes to choose? Later…
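The four frame-fixing steps above can be sketched as a coordinate transform; the function name is a hypothetical helper, and the sketch assumes the first three points are not collinear:

```python
import numpy as np

def to_canonical_frame(pts):
    """Map a 3-D point set into the canonical frame:
    pts[0] -> origin, pts[1] -> positive x-axis, pts[2] -> x-y plane (y > 0)."""
    p = pts - pts[0]                      # 1. fix the origin at the first point
    ex = p[1] / np.linalg.norm(p[1])      # 2. x-axis along the second point
    ey = p[2] - np.dot(p[2], ex) * ex     # 3. y-axis in the plane of the third
    ey /= np.linalg.norm(ey)
    ez = np.cross(ex, ey)                 # 4. right-handed z-axis fixes the sign
    return p @ np.column_stack([ex, ey, ez])
```

Any rigidly transformed copy of the same geometry maps to the same canonical coordinates (up to the remaining reflection ambiguity resolved by step 4).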
On a synchronized platform, all is well…
However, on a distributed system…
PC Platform Overview
[Figure: block diagram of a PC platform — CPU, MCH (memory, FSB), ICH (hub), AGP, PCI slots, ATA, LAN, USB, AC'97]
The audio/video I/O devices sit behind the I/O bus, the MCH/ICH hubs, the operating system, and the multimedia/multistream applications.
Timing on a Distributed System
[Figure: timelines showing, for each platform, the time origin, the playback start time, the capture start time, the signal emitted by source j, and the signal received by microphone i]
Joint Estimation
Unknowns:
- Speaker emission start times: S
- Microphone capture start times: M − 1 (assume tm_1 = 0)
- Microphone and speaker coordinates: DM + DS − D(D+1)/2 in D dimensions (after fixing the reference frame)
Measurements: the M×S TOF matrix.
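The bookkeeping above can be checked with a small counting sketch; the function name is a hypothetical helper, not from the original:

```python
def joint_estimation_counts(M, S, D=3):
    """Count unknowns vs. measurements for the joint estimation problem.

    Unknowns: D*M + D*S coordinates minus D*(D+1)/2 fixed by the choice of
    reference frame, plus S emission start times, plus M - 1 capture start
    times (tm_1 is fixed to 0). Measurements: the M x S TOF matrix.
    """
    unknowns = D * M + D * S - D * (D + 1) // 2 + S + (M - 1)
    measurements = M * S
    return unknowns, measurements
```

The problem is solvable in a least-squares sense only when the measurement count is at least the unknown count, which dictates the minimum network size.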
Time Difference of Arrival (TDOA)
The formulation is the same as above, but with fewer parameters.
Nonlinear Least Squares: Levenberg-Marquardt Method
The cost is a multidimensional function; unless we have a good initial guess, the minimization may not converge to the global minimum. An approximate initial guess is required.
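A minimal sketch of the Levenberg-Marquardt refinement, using SciPy's `least_squares` with `method='lm'`; the problem sizes, the synthetic noiseless TOF data, and the speed-of-sound value are all illustrative assumptions:

```python
import numpy as np
from scipy.optimize import least_squares

# Hypothetical setup: recover microphone/speaker positions from a TOF matrix.
rng = np.random.default_rng(0)
M, S, c = 6, 6, 342.0
mics_true = rng.uniform(0.0, 3.0, (M, 3))
spks_true = rng.uniform(0.0, 3.0, (S, 3))
tof = np.linalg.norm(mics_true[:, None] - spks_true[None, :], axis=2) / c

def residuals(theta):
    # Unpack the parameter vector into microphone and speaker coordinates.
    mics = theta[:3 * M].reshape(M, 3)
    spks = theta[3 * M:].reshape(S, 3)
    pred = np.linalg.norm(mics[:, None] - spks[None, :], axis=2)
    return (tof * c - pred).ravel()   # residuals in meters

# LM needs an initial guess reasonably close to the minimum: here, the true
# positions perturbed by a few centimeters (in practice, the MDS output).
x0 = np.concatenate([mics_true.ravel(), spks_true.ravel()])
x0 += rng.normal(0.0, 0.05, x0.size)
sol = least_squares(residuals, x0, method='lm')
```

Starting from an arbitrary guess instead of `x0` illustrates the slide's point: the minimizer can get stuck far from the global minimum.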
Multidimensional Scaling
B is the dot-product matrix of the node coordinates X: symmetric, positive semidefinite, of rank 3 (for points in 3D). Given B, can you get X? Yes, via the singular value decomposition.
Clustering Approximation
Approximate the microphone and the speaker on each GPC as co-located at a single point, so that pairwise distances between GPCs i and j stand in for the individual sensor distances.
How to Get the Dot-Product Matrix from the Pairwise Distance Matrix
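The standard recipe (classical MDS) double-centers the squared distance matrix to obtain the dot-product matrix, then factors it. A sketch, with an illustrative function name:

```python
import numpy as np

def classical_mds(D, dim=3):
    """Recover coordinates from a pairwise distance matrix via classical MDS.

    Double-centering turns squared distances into the dot-product (Gram)
    matrix B = -0.5 * J @ D^2 @ J, with J = I - (1/n) 11^T. Since B is
    symmetric PSD of rank <= dim, an SVD of B yields coordinates that are
    unique up to rotation, reflection, and translation.
    """
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n       # centering matrix
    B = -0.5 * J @ (D ** 2) @ J               # dot-product matrix
    U, s, _ = np.linalg.svd(B)                # B = U diag(s) U^T (B symmetric)
    return U[:, :dim] * np.sqrt(s[:dim])      # coordinates in the top subspace
```

The recovered coordinates reproduce the input distances exactly when D comes from true Euclidean points; they can then be moved into the chosen reference frame.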
Centroid as the Origin
1. Take the centroid as the origin.
2. Slightly perturb each GPC location into two points to get the initial guess for the microphone and speaker coordinates.
3. Later, shift the solution back to our original reference frame.
Sample result in 2D
Algorithm
1. TOF matrix → clustering approximation → approximate distance matrix between the GPCs.
2. Dot-product matrix → MDS (choosing the dimension and coordinate system) → approximate GPC locations.
3. Perturb the GPC locations → approximate microphone and speaker locations, plus approximate tm and ts.
4. TDOA-based nonlinear minimization → final microphone and speaker locations and tm.
Cramer-Rao Bound
Gives a lower bound on the variance of any unbiased estimator. It does not depend on the estimator, just on the data and the noise model. Basically, it tells us to what extent the noise limits our performance: you cannot get a variance lower than the CR bound. The Jacobian is rank deficient, so we remove the known parameters.
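For Gaussian noise, the bound follows directly from the Jacobian of the measurement model. A sketch with a hypothetical helper name; the caller is assumed to have already dropped the columns for the known (fixed) parameters, as the slide notes:

```python
import numpy as np

def crb_diagonal(jacobian, sigma):
    """Cramer-Rao lower bound on the variance of each free parameter.

    For i.i.d. Gaussian noise with standard deviation sigma, the Fisher
    information matrix is F = J^T J / sigma^2 and the CRB is diag(F^{-1}).
    The Jacobian must contain only the free parameters' columns; otherwise
    F is singular (the rank deficiency mentioned on the slide).
    """
    F = jacobian.T @ jacobian / sigma ** 2    # Fisher information matrix
    return np.diag(np.linalg.inv(F))          # per-parameter variance bound
```

For a toy linear model y = a·x with unit-variance noise, the bound reduces to 1/Σx², matching the textbook result.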
Performance comparison
Dependence on number of nodes
Geometry matters
Experimental Setup
[Figure: 4 microphones and 4 speakers positioned in a room, X-Z view. Room length = 4.22 m, width = 2.55 m, height = 2.03 m]
Results: bias 0.08 cm, sigma 3.8 cm.
Summary
- General-purpose computers can be used for distributed array processing.
- It is possible to define a common time and space for a network of distributed sensors and actuators.
- For more information, please see our two papers in ACM MM in November, or contact us.
- Let us know if you would be interested in testing/using our time and space synchronization software for developing distributed algorithms on GPCs (available in November).
Backup
Calibration signal
Results (contd.)