Modeling 3D Deformable and Articulated Shapes. Yu Chen, Tae-Kyun Kim, Roberto Cipolla. Department of Engineering, University of Cambridge
Roadmap Brief Introductions Our Framework Experimental Results Summary
Motivation Task: –To recover dense 3D shapes, together with uncertainty measurements, of deformable objects from a single 2D image taken from an arbitrary camera viewpoint.
Previous Work Rigid shapes [Prasad’05, Rother’09, Yu’09, etc.] Problems: –Cannot handle self-deformation or articulation. Category-specific articulated shapes, e.g., human bodies [Anguelov’05, Balan’07, etc.] Problems: –Require strong shape or anatomical knowledge of the category, such as skeletons and joint angles; –Too many parameters to estimate; –Hard to generalise to other object categories.
Our Contribution A probabilistic framework for: –Modelling different shape variations of general categories; –Synthesizing new shapes of the category from limited training data; –Inferring dense 3D shapes of deformable or articulated objects from a single silhouette.
Explanation of the Graphical Model Components: Pose Generator, Shape Generator, Shape Synthesis, and Silhouette Matching. Joint distribution: factorised over the two generators, the synthesis step, and the silhouette-matching likelihood.
Generating Shapes Target: Simultaneously modelling two types of shape variation: –Phenotype variation: fat vs. thin, tall vs. short, ... –Pose variation: articulation, self-deformation, ... Training two GPLVMs: –Shape generator (MS) for phenotype variation; –Pose generator (MA) for pose variation.
Generating Shapes Shape Generator (MS) –Training Set: Shapes in the canonical pose. –Pre-processing: Automatically register each instance with a common 3D template, using 3D shape context matching and thin-plate spline interpolation; perform PCA on all registered 3D shapes. –Input: PCA coefficients of the registered vertex positions.
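The PCA step above can be sketched numerically; the helper below is an illustrative assumption (the function name and array layout are not from the talk), showing how registered meshes in vertex-wise correspondence are flattened and projected onto the top-k principal components to obtain the coefficients fed to the GPLVM:

```python
import numpy as np

def pca_coefficients(shapes, k):
    """Project registered meshes onto the top-k principal components.

    shapes: (M, N, 3) array of M registered meshes with N vertices each,
            all in vertex-wise correspondence after registration.
    Returns (mean, basis, coeffs), where coeffs is the (M, k) matrix of
    PCA coefficients used as training input.
    """
    X = shapes.reshape(len(shapes), -1)    # flatten each mesh to one row
    mean = X.mean(axis=0)
    Xc = X - mean
    # Thin SVD of the centred data: rows of Vt are principal directions.
    U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    basis = Vt[:k]                         # (k, 3N)
    coeffs = Xc @ basis.T                  # (M, k)
    return mean, basis, coeffs
```

With k large enough to cover the data's rank, `coeffs @ basis + mean` reconstructs the flattened training shapes exactly.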
Pose Generator (MA) –Training Set: Synthetic 3D pose sequences. –Pre-processing: Perform PCA on both the spatial positions of vertices and all vertex-wise Jacobian matrices. –Input: PCA coefficients of the vertex positions and the vertex-wise Jacobian matrices.
Shape Synthesis [Diagram: the pose generator MA deforms the zero shape V0 into the pose-varied shape VA; the shape generator MS deforms V0 into the phenotype-varied shape VS; shape synthesis combines VA and VS into the final shape V.]
Shape Synthesis Modelling the local shape transfer: –Computing the Jacobian matrices Ji of the pose deformation vertex-wise on the zero shape.
Shape Synthesis Synthesizing the fully-varied shape V from the phenotype-varied shape VS and the pose-varied shape VA. Probabilistic formulation: a Gaussian approximation.
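One plausible deterministic form of this local shape transfer can be sketched as follows; the exact probabilistic formulation is in the paper, and the per-vertex rule below (transforming each phenotype displacement by the local pose Jacobian before adding it to the posed vertex) is an assumption for illustration:

```python
import numpy as np

def synthesize_shape(V0, VS, VA, J):
    """Combine phenotype and pose variation vertex-wise.

    V0: (N, 3) zero (template) shape
    VS: (N, 3) phenotype-varied shape in the canonical pose
    VA: (N, 3) pose-varied shape
    J:  (N, 3, 3) per-vertex Jacobians of the pose deformation
    Each phenotype displacement (VS - V0) is mapped through the local
    Jacobian into the posed frame and added to the posed vertex.
    """
    disp = VS - V0                              # displacement in canonical pose
    return VA + np.einsum('nij,nj->ni', J, disp)
```

When every Jacobian is the identity (no local rotation or scaling from the pose), this reduces to simply adding the phenotype displacement to the posed shape.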
Matching Silhouettes A two-stage process: o Projecting the 3D shape onto the image plane; o Chamfer matching of silhouettes. Maximizing the likelihood over the latent coordinates xA, xS and the camera parameters γ: o Optimizing the closed-form lower bound; o Adaptive line search with multiple initialisations.
Experiments on Shape Synthesis Task: –To synthesize shapes in different phenotypes and poses from the mean shape μV.
Shape Synthesis: Demo Shape Generator Pose Generator (Running)
Experiments on Single View Reconstruction Training datasets: –Shark data: MS: 11 3D models of different shark species. MA: an 11-frame tail-waving sequence from an animatable 3D MEX model. –Human data: MS: the CAESAR dataset. MA: animations of different 3D poses of Sydney in Poser 7. Testing: –Internet images (22 sharks and 20 humans in different poses and camera viewpoints). Segmentation: GrabCut [Rother’04]
Experiments on Single View Reconstruction Sharks:
Experiments on Single View Reconstruction Humans:
Experiments on Single View Reconstruction Examples of multi-modality
Experiments on Single View Reconstruction Quantitative Results: Precision-Recall Ratios –SF: foreground regions –SR: image projection of our result Our results closely approximate those given by parametric models.
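The precision-recall ratios over SF and SR can be computed directly from the binary masks; a minimal sketch, assuming the standard mask-overlap definitions of precision and recall:

```python
import numpy as np

def precision_recall(S_F, S_R):
    """Silhouette overlap scores between two boolean masks.

    S_F: foreground mask from segmentation (e.g., GrabCut)
    S_R: mask of the reconstructed model's image projection
    """
    inter = np.logical_and(S_F, S_R).sum()
    precision = inter / S_R.sum()  # fraction of the projection on the object
    recall = inter / S_F.sum()     # fraction of the object covered
    return precision, recall
```

Both scores equal 1.0 exactly when the projected silhouette coincides with the segmented foreground.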
Pros and Cons Advantages: –Fully data-driven; –Requires no strong class-specific prior knowledge, e.g., skeletons or joint angles; –Capable of modelling general categories; –Compact shape representation with much lower dimensionality for efficient optimization; –Provides uncertainty measurements. Disadvantages: –Inaccurate at fine parts, e.g., hands; –Lower descriptive power on poses than parametric models when training instances are insufficient; –Training data are sometimes difficult to obtain.
Future Work A compatible framework that allows incorporating category-specific knowledge; Incorporating more cues: internal edges, texture, and colour; Multiple-view settings and video sequences; 3D object recognition and action recognition tasks.
Thanks!