Reconstructing Relief Surfaces George Vogiatzis, Philip Torr, Steven Seitz and Roberto Cipolla BMVC 2004.

Slides:



Advertisements
Similar presentations
Using Strong Shape Priors for Multiview Reconstruction Yunda SunPushmeet Kohli Mathieu BrayPhilip HS Torr Department of Computing Oxford Brookes University.
Advertisements

POSE–CUT Simultaneous Segmentation and 3D Pose Estimation of Humans using Dynamic Graph Cuts Mathieu Bray Pushmeet Kohli Philip H.S. Torr Department of.
Mean-Field Theory and Its Applications In Computer Vision1 1.
Primal-dual Algorithm for Convex Markov Random Fields Vladimir Kolmogorov University College London GDR (Optimisation Discrète, Graph Cuts et Analyse d'Images)
Bayesian Belief Propagation
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Multi-view Stereo via Volumetric Graph-cuts George Vogiatzis, Philip H. S. Torr Roberto Cipolla.
Multi-view Stereo via Volumetric Graph-cuts
Wheres Waldo: Matching People in Images of Crowds Rahul GargDeva RamananSteven M. Seitz Noah Snavely Problem Definition University of Washington University.
Linear Time Methods for Propagating Beliefs Min Convolution, Distance Transforms and Box Sums Daniel Huttenlocher Computer Science Department December,
Stereo matching Class 7 Read Chapter 7 of tutorial Tsukuba dataset.
Multiview Reconstruction. Why More Than 2 Views? BaselineBaseline – Too short – low accuracy – Too long – matching becomes hard.
Introduction to Markov Random Fields and Graph Cuts Simon Prince
ICCV 2007 tutorial Part III Message-passing algorithms for energy minimization Vladimir Kolmogorov University College London.
Markov Random Fields (MRF)
Belief Propagation on Markov Random Fields Aggeliki Tsoli.
CS774. Markov Random Field : Theory and Application Lecture 04 Kyomin Jung KAIST Sep
Graphical models, belief propagation, and Markov random fields 1.
MultiView Stereo Steve Seitz CSE590SS: Vision for Graphics.
Multiview Reconstruction. Why More Than 2 Views? BaselineBaseline – Too short – low accuracy – Too long – matching becomes hard.
Shape from Contours and Multiple Stereo A Hierarchical, Mesh-Based Approach Hendrik Kück, Wolfgang Heidrich, Christian Vogelgsang.
Silhouettes in Multiview Stereo Ian Simon. Multiview Stereo Problem Input: – a collection of images of a rigid object (or scene) – camera parameters for.
Distributed Message Passing for Large Scale Graphical Models Alexander Schwing Tamir Hazan Marc Pollefeys Raquel Urtasun CVPR2011.
Last Time Pinhole camera model, projection
Computational Photography: Image-based Modeling Jinxiang Chai.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
P 3 & Beyond Solving Energies with Higher Order Cliques Pushmeet Kohli Pawan Kumar Philip H. S. Torr Oxford Brookes University CVPR 2007.
Stereo & Iterative Graph-Cuts Alex Rav-Acha Vision Course Hebrew University.
Multiview stereo. Volumetric stereo Scene Volume V Input Images (Calibrated) Goal: Determine occupancy, “color” of points in V.
Multi-view stereo Many slides adapted from S. Seitz.
Learning Low-Level Vision William T. Freeman Egon C. Pasztor Owen T. Carmichael.
High-Quality Video View Interpolation
Efficiently Solving Convex Relaxations M. Pawan Kumar University of Oxford for MAP Estimation Philip Torr Oxford Brookes University.
The plan for today Camera matrix
Understanding Belief Propagation and its Applications Dan Yuan June 2004.
Stereo Computation using Iterative Graph-Cuts
An Iterative Optimization Approach for Unified Image Segmentation and Matting Hello everyone, my name is Jue Wang, I’m glad to be here to present our paper.
Project 1 grades out Announcements. Multiview stereo Readings S. M. Seitz and C. R. Dyer, Photorealistic Scene Reconstruction by Voxel Coloring, International.
Multiview Reconstruction. Why More Than 2 Views? BaselineBaseline – Too short – low accuracy – Too long – matching becomes hard.
Measuring Uncertainty in Graph Cut Solutions Pushmeet Kohli Philip H.S. Torr Department of Computing Oxford Brookes University.
Lecture 25: Multi-view stereo, continued
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
Review: Binocular stereo If necessary, rectify the two stereo images to transform epipolar lines into scanlines For each pixel x in the first image Find.
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/31/15.
Lecture 12 Stereo Reconstruction II Lecture 12 Stereo Reconstruction II Mata kuliah: T Computer Vision Tahun: 2010.
Lighting-up geometry: accurate 3D modelling with a torch and a camera George Vogiatzis, Carlos Hernández, Roberto Cipolla University of Cambridge, Toshiba.
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/24/10.
Graph Cut Algorithms for Binocular Stereo with Occlusions
Y. Moses 11 Combining Photometric and Geometric Constraints Yael Moses IDC, Herzliya Joint work with Ilan Shimshoni and Michael Lindenbaum, the Technion.
Markov Random Fields Probabilistic Models for Images
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Wei Sun and KC Chang George Mason University March 2008 Convergence Study of Message Passing In Arbitrary Continuous Bayesian.
Probabilistic Inference Lecture 2 M. Pawan Kumar Slides available online
875 Dynamic Scene Reconstruction
Markov Random Fields & Conditional Random Fields
The University of Ontario From photohulls to photoflux optimization Yuri Boykov University of Western Ontario Victor Lempitsky Moscow State University.
A global approach Finding correspondence between a pair of epipolar lines for all pixels simultaneously Local method: no guarantee we will have one to.
Efficient Belief Propagation for Image Restoration Qi Zhao Mar.22,2006.
Tightening LP Relaxations for MAP using Message-Passing David Sontag Joint work with Talya Meltzer, Amir Globerson, Tommi Jaakkola, and Yair Weiss.
Markov Networks: Theory and Applications Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208
Photoconsistency constraint C2 q C1 p l = 2 l = 3 Depth labels If this 3D point is visible in both cameras, pixels p and q should have similar intensities.
Markov Random Fields in Vision
Energy minimization Another global approach to improve quality of correspondences Assumption: disparities vary (mostly) smoothly Minimize energy function:
CSE 185 Introduction to Computer Vision Stereo 2.
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/27/12.
From Photohulls to Photoflux Optimization
Integration and Graphical Models
Announcements Midterm due now Project 2 artifacts: vote today!
Announcements Project 3 out today (help session at end of class)
Occlusion and smoothness probabilities in 3D cluttered scenes
Presentation transcript:

Reconstructing Relief Surfaces George Vogiatzis, Philip Torr, Steven Seitz and Roberto Cipolla BMVC 2004

Stereo reconstruction problem: Input Input  Set of images of a scene I={I 1,…,I K }  Camera matrices P 1,…,P K Output Output  Surface model

Shape parametrisation Disparity-map parametrisation   MRF formulation – good optimisation techniques exist (Graph-cuts, Loopy BP)   MRF smoothness is viewpoint dependent   Disparity is unique per pixel – only functions represented

Shape parametrisation Volumetric parametrisation – e.g. Level- sets, Space carving etc.   Able to cope with non-functions   Convergence properties not well understood, Local minima   Memory intensive   For Space carving, no simple way to impose surface smoothness

Solution ? Cast volumetric methods in MRF framework Cast volumetric methods in MRF framework Key assumption: Approximate scene geometry given Key assumption: Approximate scene geometry given Benefits: Benefits:  General surfaces can be represented  Optimisation is tractable (MRF solvers)  Occlusions are approximately modelled  Smoothness is viewpoint independent

MRFs The labelling problem: The labelling problem:

MRFs A set of random variables h 1,…,h M A set of random variables h 1,…,h M A binary neighbourhood relation N defined on the variables A binary neighbourhood relation N defined on the variables Each can take a label out of a set H 1,…,H L Each can take a label out of a set H 1,…,H L C i (h i ) (Labelling cost) C i (h i ) (Labelling cost) C i,j (h i,h j ) for (i,j)  N (Compatibility cost) C i,j (h i,h j ) for (i,j)  N (Compatibility cost) -log P(h 1,…,h M ) =  C i (h i ) +  C i,j (h i,h j )

MRF inference Minimise  C i (h i ) +  C i,j (h i,h j ) Minimise  C i (h i ) +  C i,j (h i,h j ) Not in polynomial time in general case Not in polynomial time in general case Special cases (e.g. no loops or 2 label MRF) solved exactly Special cases (e.g. no loops or 2 label MRF) solved exactly General cases solved approximately via Graph-cuts or Loopy Belief Propagation. Approx mins for MRF with 250,000 nodes. General cases solved approximately via Graph-cuts or Loopy Belief Propagation. Approx mins for MRF with 250,000 nodes.

Relief Surfaces Approximate base surface Approximate base surface  Triangulated feature matches  Visual hull from silhouettes  Initialised by hand

Relief Surfaces labels :

Relief Surfaces labelling cost : Low cost High cost XiXi nini X i +h i n i C i (h i )=photoconsistency(X i +h i n i )

Relief Surfaces Compatibility cost : Low cost XiXi nini njnj XjXj X i +h i n i X j +h j n j

Relief Surfaces Neighbour cost : C i,j (h i, h j )= ||(X i +h i n i )-(X j +h j n j )|| XiXi nini X i +h i n i X j +h j n j High cost

Relief Surfaces Base surface is the occluding volume Base surface is the occluding volume If base surface ‘contains’ true surface (e.g. visual hull) then If base surface ‘contains’ true surface (e.g. visual hull) then  Points on the base surface X i are not visible by cameras they shouldn’t be [Kutulakos, Seitz 2000] Approximation: Approximation:  Visibility is propagated from X i to X i +h i n i

Loopy Belief Propagation Iterative message passing algorithm Iterative message passing algorithm m (t) i,j (h j ) is the message passed from i to j at time step t m (t) i,j (h j ) is the message passed from i to j at time step t It is a L-dimensional vector It is a L-dimensional vector Represents what node i ‘believes’ about the true state of node j. Represents what node i ‘believes’ about the true state of node j. min  C i (h i ) +  C i,j (h i,h j ) i j m i,j

Loopy Belief Propagation Message passing rule: Message passing rule: After convergence, optimal state is given by After convergence, optimal state is given by m (t+1) i,j (h j )= min { C ij (h i,h j ) +C i (h i ) +  m (t) k,i (h i ) } k  N(i) hihi h i * = min { C i (h i ) +  m (  ) k,i (h i ) } k  N(i) hihi i j m i,j

Loopy Belief Propagation O(L 2 ) to compute a message (L is number of allowable heights) O(L 2 ) to compute a message (L is number of allowable heights) Message passing schedule can be asynchronous which can accelerate convergence [Tappen & Freeman ICCV 03] Message passing schedule can be asynchronous which can accelerate convergence [Tappen & Freeman ICCV 03]

Iterative Scheme BP is memory intensive. BP is memory intensive. Can consider few possible labels at a time Can consider few possible labels at a time After convergence we ‘zoom in’ to heights close to the optimal After convergence we ‘zoom in’ to heights close to the optimal

Evaluation Artificial deformed sphere Artificial deformed sphere Textured with random patern Textured with random patern 20 images 20 images 40,000 sample points on sphere base surface 40,000 sample points on sphere base surface True surfaceTexture-mappedReconstruction

Evaluation Benchmark: 2-view, disparity based Loopy Belief Propagation [Sun et al ECCV02] Benchmark: 2-view, disparity based Loopy Belief Propagation [Sun et al ECCV02] BP run on 10 pairs of nearby views BP run on 10 pairs of nearby views Compare Disparity Maps given by Compare Disparity Maps given by  2-view BP  Relief surfaces  Ground truth

Evaluation 2-view BPRelief surface Ground truth 2-view BP Relief surf. MSE pixels pixels % of correct disparities 75.9%79.2%

Results Sarcophagus Sarcophagus

Results Sarcophagus Sarcophagus

Results Sarcophagus Sarcophagus

Results Building facade Building facade

Results Building facade Building facade

Results Base surfaceRelief surface with texture Stone carving Stone carving

Summary MRF methods can be extended in the volumetric domain MRF methods can be extended in the volumetric domain Advantages Advantages  General surfaces can be represented  Optimisation is tractable (MRF solvers)  Smoothness is viewpoint independent

Future work Photoconsistency beyond Lambertian surface models. (Optimise both height and surface normal fields) Photoconsistency beyond Lambertian surface models. (Optimise both height and surface normal fields) Change in topology Change in topology In cases where C mn (h m,h n )=|| h m -h n || or || h m - h n || 2 we can compute messages in O(L) time instead of O(L 2 ) (Felzenszwalb & Huttenlocher CVPR 04). In cases where C mn (h m,h n )=|| h m -h n || or || h m - h n || 2 we can compute messages in O(L) time instead of O(L 2 ) (Felzenszwalb & Huttenlocher CVPR 04).