Learning Image Statistics for Bayesian Tracking Hedvig Sidenbladh KTH, Sweden Michael Black Brown University, RI, USA

Slides:



Advertisements
Similar presentations
Bayesian Belief Propagation
Advertisements

Probabilistic Tracking and Recognition of Non-rigid Hand Motion
CSCE643: Computer Vision Bayesian Tracking & Particle Filtering Jinxiang Chai Some slides from Stephen Roth.
1 Schedule. May 4, Monday, May 4: –Lecture on tracking objects and people. Wednesday, May 6: –Lecture on writing papers and giving talks. Monday,
Patch to the Future: Unsupervised Visual Prediction
1 Video Processing Lecture on the image part (8+9) Automatic Perception Volker Krüger Aalborg Media Lab Aalborg University Copenhagen
Learning to estimate human pose with data driven belief propagation Gang Hua, Ming-Hsuan Yang, Ying Wu CVPR 05.
Implicit Probabilistic Models of Human Motion for Synthesis and Tracking Hedvig Sidenbladh, KTH, Sweden (now FOI, Sweden) Michael J. Black, Brown University,
Qualifying Exam: Contour Grouping Vida Movahedi Supervisor: James Elder Supervisory Committee: Minas Spetsakis, Jeff Edmonds York University Summer 2009.
Foreground Modeling The Shape of Things that Came Nathan Jacobs Advisor: Robert Pless Computer Science Washington University in St. Louis.
Forward-Backward Correlation for Template-Based Tracking Xiao Wang ECE Dept. Clemson University.
Robust Object Tracking via Sparsity-based Collaborative Model
2D Human Pose Estimation in TV Shows Vittorio Ferrari Manuel Marin Andrew Zisserman Dagstuhl Seminar July 2008.
Stochastic Tracking of Humans Michael J. Black Department of Computer Science Brown University.
A KLT-Based Approach for Occlusion Handling in Human Tracking Chenyuan Zhang, Jiu Xu, Axel Beaugendre and Satoshi Goto 2012 Picture Coding Symposium.
Optimization & Learning for Registration of Moving Dynamic Textures Junzhou Huang 1, Xiaolei Huang 2, Dimitris Metaxas 1 Rutgers University 1, Lehigh University.
Robust Multi-Pedestrian Tracking in Thermal-Visible Surveillance Videos Alex Leykin and Riad Hammoud.
Local Descriptors for Spatio-Temporal Recognition
Formation et Analyse d’Images Session 8
1 Hierarchical Image-Motion Segmentation using Swendsen-Wang Cuts Adrian Barbu Siemens Corporate Research Princeton, NJ Acknowledgements: S.C. Zhu, Y.N.
Tracking Objects with Dynamics Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/21/15 some slides from Amin Sadeghi, Lana Lazebnik,
3-D Depth Reconstruction from a Single Still Image 何開暘
Segmentation and Tracking of Multiple Humans in Crowded Environments Tao Zhao, Ram Nevatia, Bo Wu IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,
Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.
Particle filters (continued…). Recall Particle filters –Track state sequence x i given the measurements ( y 0, y 1, …., y i ) –Non-linear dynamics –Non-linear.
Object Detection and Tracking Mike Knowles 11 th January 2005
Particle Filtering for Non- Linear/Non-Gaussian System Bohyung Han
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
A Bayesian Approach For 3D Reconstruction From a Single Image
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
BraMBLe: The Bayesian Multiple-BLob Tracker By Michael Isard and John MacCormick Presented by Kristin Branson CSE 252C, Fall 2003.
Tracking by Sampling Trackers Junseok Kwon* and Kyoung Mu lee Computer Vision Lab. Dept. of EECS Seoul National University, Korea Homepage:
1 Mean shift and feature selection ECE 738 course project Zhaozheng Yin Spring 2005 Note: Figures and ideas are copyrighted by original authors.
Object Tracking using Particle Filter
1. Introduction Motion Segmentation The Affine Motion Model Contour Extraction & Shape Estimation Recursive Shape Estimation & Motion Estimation Occlusion.
A General Framework for Tracking Multiple People from a Moving Camera
Jamal Saboune - CRV10 Tutorial Day 1 Bayesian state estimation and application to tracking Jamal Saboune VIVA Lab - SITE - University.
A Statistically Selected Part-Based Probabilistic Model for Object Recognition Zhipeng Zhao, Ahmed Elgammal Department of Computer Science, Rutgers, The.
An Information Fusion Approach for Multiview Feature Tracking Esra Ataer-Cansizoglu and Margrit Betke ) Image and.
Michael J. BlackFebruary 2002 Learning the Appearance and Motion of People in Video Hedvig Sidenbladh Michael J. Black Department.
Encoding/Decoding of Arm Kinematics from Simultaneously Recorded MI Neurons Y. Gao, E. Bienenstock, M. Black, S.Shoham, M.Serruya, J. Donoghue Brown Univ.,
Tracking People by Learning Their Appearance Deva Ramanan David A. Forsuth Andrew Zisserman.
Computer Vision Michael Isard and Dimitris Metaxas.
Learning the Appearance and Motion of People in Video Hedvig Sidenbladh, KTH Michael Black, Brown University.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Fields of Experts: A Framework for Learning Image Priors (Mon) Young Ki Baik, Computer Vision Lab.
Chapter 5 Multi-Cue 3D Model- Based Object Tracking Geoffrey Taylor Lindsay Kleeman Intelligent Robotics Research Centre (IRRC) Department of Electrical.
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
Michael Isard and Andrew Blake, IJCV 1998 Presented by Wen Li Department of Computer Science & Engineering Texas A&M University.
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Jiu XU, Axel BEAUGENDRE and Satoshi GOTO Computer Sciences and Convergence Information Technology (ICCIT), th International Conference on 1 Real-time.
Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team
Sequential Monte-Carlo Method -Introduction, implementation and application Fan, Xin
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
Introduction to Sampling Methods Qi Zhao Oct.27,2004.
The Science of Silly Walks Hedvig Sidenbladh Michael J. Black Department of Computer Science Brown University Royal Inst.
Multi-view Synchronization of Human Actions and Dynamic Scenes Emilie Dexter, Patrick Pérez, Ivan Laptev INRIA Rennes - Bretagne Atlantique
Representation in Vision Derek Hoiem CS 598, Spring 2009 Jan 22, 2009.
Tracking Hands with Distance Transforms Dave Bargeron Noah Snavely.
ICCV 2007 Optimization & Learning for Registration of Moving Dynamic Textures Junzhou Huang 1, Xiaolei Huang 2, Dimitris Metaxas 1 Rutgers University 1,
Detecting Occlusion from Color Information to Improve Visual Tracking
ROBUST SUBSPACE LEARNING FOR VISION AND GRAPHICS
Tracking Objects with Dynamics
Particle Filtering for Geometric Active Contours
Dynamical Statistical Shape Priors for Level Set Based Tracking
Combining Geometric- and View-Based Approaches for Articulated Pose Estimation David Demirdjian MIT Computer Science and Artificial Intelligence Laboratory.
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
Liyuan Li, Jerry Kah Eng Hoe, Xinguo Yu, Li Dong, and Xinqi Chu
Introduction to Object Tracking
Tracking Many slides adapted from Kristen Grauman, Deva Ramanan.
Presentation transcript:

Learning Image Statistics for Bayesian Tracking Hedvig Sidenbladh KTH, Sweden Michael Black Brown University, RI, USA Learning Image Statistics for Bayesian Tracking Hedvig Sidenbladh KTH, Sweden Michael Black Brown University, RI, USA 2. Why is it Hard? People have different appearance due to clothing, shape etc. Need to discriminate between “people” and “non-people”, but allow for variation in human appearance Occlusion 2D-3D projection ambiguities Unexpected position Similarity between different limbs 1. Introduction Goal: Tracking and reconstruction of people in 3D from a monocular video sequence. Articulated human model. Approach: Learn models of the appearance of humans in image sequences. What are the image statistics of human limbs? How do they differ for general scenes? 3. Previous Approaches Edge Detection: Probabilistic model? Under/over-segmentation, thresholds,... Image flow, templates, color, stereo,... Instead, model the appearance of people in terms of filter responses 4. Key idea #1 Steered edge responses Use the 3D model to predict the location of limb boundaries in the scene. Compute various filter responses steered to the predicted orientation of the limb. Compute likelihood of filter responses given 3D model, using a statistical model learned from examples. 10. Key idea #2 ForegroundBackground “Explain” the entire image Konishi et al. 00: p(image | foreground,background) p(image | foreground,background) = p(fgr part of image | background) p(fgr part of image | foreground) const  14. Related Work Statistics of natural images: S. Konishi, A. Yuille, J. Coughlan, and S. Zhu. Fundamental bounds on edge detection: An information theoretic evaluation of different edge cues. submitted: PAMI T. Lindeberg. Edge detection and ridge detection with automatic scale selection. IJCV, 30(2): , 1998 J. Sullivan, A. Blake, and J. Rittscher. Statistical foreground modelling for object localization. In ECCV, pp , 2000 S. Zhu and D. Mumford. Prior learning and Gibbs reaction-diffusion. PAMI, 19(11), 1997 Tracking: J. Deutscher, A. Blake, and I. Reid. Articulated motion capture by annealed particle filtering. In CVPR, vol 2, pp , 2000 M. Isard and J. MacCormick. BraMBLe: A Bayesian multi-blob tracker. In ICCV, 2001 M. Isard and A. Blake. Contour tracking by stochastic propagation of conditional density. In ECCV, pp , Conclusions Generic, learned, model of appearance Combines multiple cues, exploits work on image statistics Use the 3D model to predict position and orientation of features Model of foreground and background Exploits the ratio between foreground and background likelihood, improves tracking 5. Learning Distributions Training data: ForegroundBackground Statistical differences. P on /P off can be used for discriminating foreground from background. P on = probability of filter response on limbs P off = probability of filter response on general background Similar to Konishi et al. 00, but object specific - responses may be present or not. 6. Normalized Derivatives Image derivatives f x, f y, f xx, f xy, f yy normalized with a non-linear transfer function. High  1, low  0, Vertical derivative Normalized vertical derivative Parameters of transfer function optimized to maximize Bhattacharyya distance between P on and P off.: 7. Edge Steered edge response: P on /P off P on, P off 8. Ridge Scale specific! Steered ridge response: P on /P off P on, P off |2 nd derivative|  arm |2 nd derivative| // arm 9. Motion Motion response = I(x, t+1) - I(x+u, t) xx+u Training data: Consecutive images from sequence P on P off 11. Bayesian Formulation Posterior propagated with Condensation (Isard & Blake 96) Likelihood independence assumption: Posterior Likelihood Temporal prior 12. Results Last frame, EdgeRidgeMotionAll Tracking with self occlusion (see video) Tracking is improved using multiple cues (see video) Last frame