Explaining human multiple object tracking as resource-constrained approximate inference in a dynamic probabilistic model Ed Vul Mike Frank George Alvarez.

Slides:



Advertisements
Similar presentations
14 de Fevereiro de 2004, Instituto Sistemas e Robótica Emotion-Based Decision and Learning Bruno Damas.
Advertisements

David Rosen Goals  Overview of some of the big ideas in autonomous systems  Theme: Dynamical and stochastic systems lie at the intersection of mathematics.
Jose-Luis Blanco, Javier González, Juan-Antonio Fernández-Madrigal University of Málaga (Spain) Dpt. of System Engineering and Automation May Pasadena,
State Estimation and Kalman Filtering CS B659 Spring 2013 Kris Hauser.
Effective Reinforcement Learning for Mobile Robots Smart, D.L and Kaelbing, L.P.
Probabilistic Robotics
Monte Carlo Localization for Mobile Robots Karan M. Gupta 03/10/2004
Lab 2 Lab 3 Homework Labs 4-6 Final Project Late No Videos Write up
Formation et Analyse d’Images Session 8
Tracking Objects with Dynamics Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/21/15 some slides from Amin Sadeghi, Lana Lazebnik,
Patrick Cavanagh Department of Psychology Harvard University Attention Routines.
Segmentation and Tracking of Multiple Humans in Crowded Environments Tao Zhao, Ram Nevatia, Bo Wu IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,
Adaptive Rao-Blackwellized Particle Filter and It’s Evaluation for Tracking in Surveillance Xinyu Xu and Baoxin Li, Senior Member, IEEE.
Motion based Correspondence for Distributed 3D tracking of multiple dim objects Ashok Veeraraghavan.
1 Integration of Background Modeling and Object Tracking Yu-Ting Chen, Chu-Song Chen, Yi-Ping Hung IEEE ICME, 2006.
Goal: Fast and Robust Velocity Estimation P1P1 P2P2 P3P3 P4P4 Our Approach: Alignment Probability ●Spatial Distance ●Color Distance (if available) ●Probability.
Helsinki University of Technology Adaptive Informatics Research Centre Finland Variational Bayesian Approach for Nonlinear Identification and Control Matti.
Gaussian process modelling
The impact of localization and observation averaging for convective-scale data assimilation in a simple stochastic model Michael Würsch, George C. Craig.
Belief space planning assuming maximum likelihood observations Robert Platt Russ Tedrake, Leslie Kaelbling, Tomas Lozano-Perez Computer Science and Artificial.
/09/dji-phantom-crashes-into- canadian-lake/
PSYC 368 – ATTENTION TO SCENES JOHN SECEN MSC CANDIDATE GRADUATE PROGRAM IN NEUROSCIENCE MARCH 11, 2010.
ANTs PI Meeting, Nov. 29, 2000W. Zhang, Washington University1 Flexible Methods for Multi-agent distributed resource Allocation by Exploiting Phase Transitions.
Jamal Saboune - CRV10 Tutorial Day 1 Bayesian state estimation and application to tracking Jamal Saboune VIVA Lab - SITE - University.
Loris Bazzani*, Marco Cristani*†, Vittorio Murino*† Speaker: Diego Tosato* *Computer Science Department, University of Verona, Italy †Istituto Italiano.
Feature Integration Theory Project guide: Amitabha Mukerjee Course: SE367 Presented by Harmanjit Singh.
Young Ki Baik, Computer Vision Lab.
MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.
Center for Radiative Shock Hydrodynamics Fall 2011 Review Assessment of predictive capability Derek Bingham 1.
Learning the Appearance and Motion of People in Video Hedvig Sidenbladh, KTH Michael Black, Brown University.
Processing Sequential Sensor Data The “John Krumm perspective” Thomas Plötz November 29 th, 2011.
Stable Multi-Target Tracking in Real-Time Surveillance Video
Design, Optimization, and Control for Multiscale Systems
NCAF Manchester July 2000 Graham Hesketh Information Engineering Group Rolls-Royce Strategic Research Centre.
Chapter 7. Learning through Imitation and Exploration: Towards Humanoid Robots that Learn from Humans in Creating Brain-like Intelligence. Course: Robots.
Boosted Particle Filter: Multitarget Detection and Tracking Fayin Li.
Motion Illusions As Optimal Percepts. What’s Special About Perception? Visual perception important for survival  Likely optimized by evolution  at least.
- 1 - Overall procedure of validation Calibration Validation Figure 12.4 Validation, calibration, and prediction (Oberkampf and Barone, 2004 ). Model accuracy.
Beam Sampling for the Infinite Hidden Markov Model by Jurgen Van Gael, Yunus Saatic, Yee Whye Teh and Zoubin Ghahramani (ICML 2008) Presented by Lihan.
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
Exploiting Cognitive Constraints To Improve Machine-Learning Memory Models Michael C. Mozer Department of Computer Science University of Colorado, Boulder.
1 Hidra: History Based Dynamic Resource Allocation For Server Clusters Jayanth Gummaraju 1 and Yoshio Turner 2 1 Stanford University, CA, USA 2 Hewlett-Packard.
Classification Ensemble Methods 1
Tracking with dynamics
IEEE International Conference on Multimedia and Expo.
6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,
MACHINE LEARNING 3. Supervised Learning. Learning a Class from Examples Based on E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1)
Geogg124: Data assimilation P. Lewis. What is Data Assimilation? Optimal merging of models and data Models Expression of current understanding about process.
Matrix Models for Population Management & Conservation March 2014 Lecture 10 Uncertainty, Process Variance, and Retrospective Perturbation Analysis.
Artificial Intelligence in Game Design Lecture 20: Hill Climbing and N-Grams.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
Maxlab proprietary information – 5/4/09 – Maximilian Riesenhuber CT2WS at Georgetown: Letting your brain be all that.
Unified Adaptivity Optimization of Clock and Logic Signals Shiyan Hu and Jiang Hu Dept of Electrical and Computer Engineering Texas A&M University.
Introduction to Sampling based inference and MCMC
Tracking Objects with Dynamics
Velocity Estimation from noisy Measurements
Circuits and Interconnects In Aggressively Scaled CMOS
Video Google: Text Retrieval Approach to Object Matching in Videos
Lecture 10 Causal Estimation of 3D Structure and Motion
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Using Time-Varying Motion Stimuli to Explore Decision Dynamics
Neural Networks ICS 273A UC Irvine Instructor: Max Welling
Kalman Filtering COS 323.
Finding Periodic Discrete Events in Noisy Streams
Video Google: Text Retrieval Approach to Object Matching in Videos
Homework 2 Let us simulate the savings experiment of Kojima et al. (2004) assuming that the learner models the hidden state of the world with a 3x1 vector.
Unsupervised Perceptual Rewards For Imitation Learning
Volume 23, Issue 21, Pages (November 2013)
Overview: Chapter 2 Localization and Tracking
Nome Sobrenome. Time time time time time time..
Presentation transcript:

Explaining human multiple object tracking as resource-constrained approximate inference in a dynamic probabilistic model Ed Vul Mike Frank George Alvarez Josh Tenenbuam Support from: ONR-MURI (PI: Bavelier); NDSEG (Vul); NSF (Vul)

XNL PFJ DQR GAU

Multiple object tracking What limits performance?

What limits our performance? Juice? (Sperling, 1960; Vogel Machizawa 2003) Number tracked out of 16 total (Alvarez, Franconeri, Cavanagh) Slots?

Explanations of success and failure Harder when objects are faster (Alvarez & Franconeri, 2007) Easier when they are further apart. (Franconeri et al., 2008) We remember object velocity (Horowitz, 2008) But we don’t seem to use it (Keane & Pylyshyn, 2005) Well, maybe sometimes (Fencsik et al. 2005) Keeping track of features is hard (Pylyshyn, 2004) But we can track in color space (Blaser, Pylyshyn, & Holcombe, 2000) And unique colors help (Makovski & Jiang, 2009) We can only track a few objects (Pylyshyn & Storm, 1988; Intrilligator et al., 2001) But we can track more if they are slower (Alvarez & Franconeri, 2007) Crowding? Spotlights? Inhibitory surrounds? FINSTs? Slots? Juice? Juice boxes? Speed limits? Must specify performance of an unconstrained observer before postulating limitations. PhenomenaExplanations

Stark contrast with the real world

Puzzles What limits human performance in object tracking? Discrepancy between poor performance in the lab and robust behavior in the real world?

Our approach Look to engineering to see how people should track objects. Do people track objects this way? What resources would limit performance? How should these resources be allocated? Do people allocate resources in this manner?

An ideal observer for MOT ? ? ? ? … ? ? ? ? A B C D S0S0 m0m0 α0α0 S1S1 m1m1 α1α1 dynamics StSt mtmt αtαt … S: states m: observations α: data association

Dynamics and process uncertainty S0S0 m0m0 α0α0 S1S1 m1m1 α1α1 dynamics StSt mtmt αtαt … Parameterize by inertia and spacing / velocity variance of stationary distribution.

Measurement uncertainty Eccentricity scaling of position errors. Weber scaling of velocity errors S0S0 m0m0 α0α0 S1S1 m1m1 α1α1 dynamics StSt mtmt αtαt … S0S0 m0m0 α0α0 S1S1 m1m1 α1α1 StSt mtmt αtαt … ? ? ? ? Unknown data-associations

An ideal observer for MOT A B C D A B C D A B C D A B C D Measurement Assignment Estimate Prediction P(α t | m t, S’ t ) (sampled by particle filter) P(S t | α t, m t, S’ t ) (Kalman filter) P(S’ t+1 | S t, dynamics) (Kalman filter) mtmt

An ideal observer for MOT

Speed/space for people (Franconeri et al., 2008)

Speed/space for the ideal observer The correspondence problem is harder as speed increases or spacing decreases. faster closer

Speed/space for the ideal observer 10 0 Model thresholds Faster  Closer  Spatial concentration (1/σ x ) Velocity variance (σ v )

Speed/space tradeoff experiment Adjust minimal spacing for given a fixed speed. Track 3 of 6

Speed/space tradeoff Model thresholds Human thresholds 10 0 Faster  Closer  Spatial concentration (1/σ x ) Velocity variance (σ v ) People make similar tradeoffs between speed and space as the ideal observer.

Incorporating extra dynamic cues

10 0 Model thresholds Human thresholds Faster  Closer  Spatial concentration (1/σ x ) Velocity variance (σ v ) People make similar tradeoffs between speed/space and color as the ideal observer. Color-drift = 0.02π Color-drift = 0.2π

Predictive use of inertia 10 0 Inertia stable Inertia unstable Model thresholds Human thresholds Inertia = 0.7 Inertia = 0.9 Faster  Closer  Spatial concentration (1/σ x ) Velocity variance (σ v ) People use velocity when appropriate

Number tracked out of 16 total Ideal object tracking Speed Spacing Intermittent use of inertia Use of additional cues Track duration Additional distracters Phenomena explained We can only track a few objects (Pylyshyn & Storm, 1988; Intrilligator et al., 2001) But we can track more if they are slower (Alvarez & Franconeri, 2007) ???

Resources in tracking objects S t+1 StSt mtmt m t+1 α t+1 αtαt dynamics Limited ability to propagate state estimates: Limited memory fidelity P(S’ t+1 | S t + w m, dynamics) S w[i] = S w / ( 1 + √a i ) A resource that reduces noise in propagation of state estimates:

Number of targets Limited memory produces the speed-number tradeoff. Number tracked out of 16 total Human thresholds (Alvarez & Franconeri, 2007) Number of targets Number tracked out of 16 total Model thresholds Faster  Velocity variance (σ v )

Strategic allocation of resources Allocating resources Planning by greedy 1-step look-ahead sampling. Make one object more demanding. modelhuman Position Velocity TargetsDistractors Object 1 Object n … TargetsDistractors Position Velocity Object 1 Object n …

Contributions Ideal observer for human object tracking based on common engineering models… …Accounts for effects of speed, space, inertia, time, additional cues, distracters. A resource such as finite memory for state estimates accounts for target # effects. Teaser about strategic allocation of resources.

Position Velocity TargetsDistractors Object 1 Object n … TargetsDistractors Position Velocity Object 1 Object n …

How do people track objects? Harder to track when objects are faster (Alvarez & Franconeri, 2007) Easier when they are further apart. (Franconeri et al., 2008) We remember object velocity (Horowitz, 2008) But we don’t seem to use it (Keane & Pylyshyn, 2005) Yet it helps us track (Fencsik et al. 2005) Keeping track of identity is hard (Pylyshyn, 2004) But we can track in color space (Blaser, Pylyshyn, & Holcombe, 2000) And unique colors help (Makovski & Jiang, 2009) We can only track a few objects (Pylyshyn & Storm, 1988; Intrilligator et al., 2001) But we can track more if they are slower (Alvarez & Franconeri, 2007)

Inertia and trajectories Assumed “inertia” predicts future dot position. With inertia Without inertia

Inertia and trajectories With inertia Without inertia Model without inertia Inertia = 0.7 Inertia = 0.8 Inertia = σxσx Model with inertia Inertia = 0.7 Inertia = 0.8 Inertia = σxσx faster  Velocity std. dev. σvσv σxσx more space  Position std. dev. Inertia = 0.7 Inertia = 0.9 Inertia varied Across subjects Inertia varied Across trials People use inertia / extrapolated trajectories to track like the ideal observer: Only when it is stable and predictive.

Why only use inertia sometimes? Underestimating inertia under uncertainty?

Allocating a finite resource What allocation policies do people consider? Just Target vs. Non-Target? Or can people more flexibly adapt to task demands –Allocate to individual items Position Velocity TargetsDistractors Beta(p,v) Beta(T, D) Dirichlet(Α) Object 1 Object n …

How flexible is resource allocation? Track 4 of 8 1 tracked object (and one distractor) is “crazy charlie”: –Charlie moves faster/slower than the other targets. –Measure performance on other 3 targets. Key question: –Can we flexibly allocate to individual objects based on need? If so, accuracy on other targets should decrease as Charlie’s speed increases. –If we only allocate to targets vs. non-targets, Charlie’s speed should not matter.

Optimal Crazy Charlie performance Log10(Charlie speed / Other speed) Accuracy on non-Charlie targets

Crazy Charlie experimental results Slow “Charlie” Fast “Charlie” Speed of “key” target alters performance on other targets Tracking one harder target “steals” resources from the others

Multiple object tracking (Pylyshyn & Storm, 1988; and many others.)

Tracking in non-spatial dimensions Are spatial and non-spatial features combined to track objects? Blaser, Holcombe, Pylyshyn, 2000

Tracking in non-spatial dimensions

Are spatial and non-spatial features combined to track objects? Blaser, Holcombe, Pylyshyn, 2000 Model thresholds Color-drift = 0.02π Color-drift = 0.2π 10 0 σxσx Color-drift = 0.02π Color-drift = 0.2π Human settings faster  Velocity std. dev. σvσv σxσx more space  Position std. dev. People trade off spatial and non- spatial information when tracking.

An ideal observer for MOT S0S0 m0m0 α0α0 S1S1 m1m1 α1α1 dynamics StSt mtmt αtαt … Given: 1) Starting state: α 0, m 0 2) Unlabeled measurements: m 1, … m t 3) Model of the dynamics. Find: Final labels: α t Final state: S t Invert to compute: P(S t, α t |m 0,…m t-1, α 0 ) ? ? ? ? … ? ? ? ? A B C D How should people track objects given available information?