Reconstructing Building Interiors from Images Yasutaka Furukawa Brian Curless Steven M. Seitz University of Washington, Seattle, USA 2011/01/16 蔡禹婷.

Slides:



Advertisements
Similar presentations
Using Strong Shape Priors for Multiview Reconstruction Yunda SunPushmeet Kohli Mathieu BrayPhilip HS Torr Department of Computing Oxford Brookes University.
Advertisements

Multi-view Stereo via Volumetric Graph-cuts George Vogiatzis, Philip H. S. Torr Roberto Cipolla.
Multi-view Stereo via Volumetric Graph-cuts
Multi-View Stereo for Community Photo Collections
Wheres Waldo: Matching People in Images of Crowds Rahul GargDeva RamananSteven M. Seitz Noah Snavely Problem Definition University of Washington University.
3D Model Matching with Viewpoint-Invariant Patches(VIP) Reporter :鄒嘉恆 Date : 10/06/2009.
Face Alignment by Explicit Shape Regression
For Internal Use Only. © CT T IN EM. All rights reserved. 3D Reconstruction Using Aerial Images A Dense Structure from Motion pipeline Ramakrishna Vedantam.
Recent work in image-based rendering from unstructured image collections and remaining challenges Sudipta N. Sinha Microsoft Research, Redmond, USA.
Tracking Multiple Occluding People by Localizing on Multiple Scene Planes Professor :王聖智 教授 Student :周節.
MATHIEU GAUTHIER PIERRE POULIN LIGUM, DEPT. I.R.O. UNIVERSITÉ DE MONTRÉAL GRAPHICS INTERFACE 2009 Preserving Sharp Edges in Geometry Images.
1 Minimum Ratio Contours For Meshes Andrew Clements Hao Zhang gruvi graphics + usability + visualization.
Silhouettes in Multiview Stereo Ian Simon. Multiview Stereo Problem Input: – a collection of images of a rigid object (or scene) – camera parameters for.
Reconstructing Building Interiors from Images Yasutaka Furukawa Brian Curless Steven M. Seitz University of Washington, Seattle, USA Richard Szeliski Microsoft.
Structure from motion.
Constructing immersive virtual space for HAI with photos Shingo Mori Yoshimasa Ohmoto Toyoaki Nishida Graduate School of Informatics Kyoto University GrC2011.
Exchanging Faces in Images SIGGRAPH ’04 Blanz V., Scherbaum K., Vetter T., Seidel HP. Speaker: Alvin Date: 21 July 2004.
Background Removal of Multiview Images by Learning Shape Priors Yu-Pao Tsai, Cheng-Hung Ko, Yi-Ping Hung, and Zen-Chung Shih.
GATE D Object Representations (GATE-540) Dr.Çağatay ÜNDEĞER Instructor Middle East Technical University, GameTechnologies & General Manager SimBT.
ADVISE: Advanced Digital Video Information Segmentation Engine
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.
Multiview stereo. Volumetric stereo Scene Volume V Input Images (Calibrated) Goal: Determine occupancy, “color” of points in V.
Multi-view stereo Many slides adapted from S. Seitz.
Constructing immersive virtual space for HAI with photos Shingo Mori Yoshimasa Ohmoto Toyoaki Nishida Graduate School of Informatics Kyoto University GrC2011.
Lecture 25: Multi-view stereo, continued
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
Manhattan-world Stereo Y. Furukawa, B. Curless, S. M. Seitz, and R. Szeliski 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.
Review: Binocular stereo If necessary, rectify the two stereo images to transform epipolar lines into scanlines For each pixel x in the first image Find.
Research & Innovation 1 An Industry Perspective on VVG Research Oliver Grau BBC Research & Innovation VVG SUMMER SCHOOL '07.
Computer vision: models, learning and inference
Reconstructing Relief Surfaces George Vogiatzis, Philip Torr, Steven Seitz and Roberto Cipolla BMVC 2004.
3D Surface Reconstruction from 2D Images (Survey)
Peter Sturm INRIA Grenoble – Rhône-Alpes (Institut National de Recherche en Informatique et Automatique) An overview of multi-view stereo and other topics.
Automatic Registration of Color Images to 3D Geometry Computer Graphics International 2009 Yunzhen Li and Kok-Lim Low School of Computing National University.
KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.
A General Framework for Tracking Multiple People from a Moving Camera
Xiaoguang Han Department of Computer Science Probation talk – D Human Reconstruction from Sparse Uncalibrated Views.
A Local Adaptive Approach for Dense Stereo Matching in Architectural Scene Reconstruction C. Stentoumis 1, L. Grammatikopoulos 2, I. Kalisperakis 2, E.
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/24/10.
Exploitation of 3D Video Technologies Takashi Matsuyama Graduate School of Informatics, Kyoto University 12 th International Conference on Informatics.
Interactive Graph Cuts for Optimal Boundary & Region Segmentation of Objects in N-D Images (Fri) Young Ki Baik, Computer Vision Lab.
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
A Non-local Cost Aggregation Method for Stereo Matching
Dynamic 3D Scene Analysis from a Moving Vehicle Young Ki Baik (CV Lab.) (Wed)
I 3D: Interactive Planar Reconstruction of Objects and Scenes Adarsh KowdleYao-Jen Chang Tsuhan Chen School of Electrical and Computer Engineering Cornell.
Visibility map (high low) Surveillance with Visual Tagging and Camera Placement J. Zhao and S.-C. Cheung — Center for Visualization and Virtual Environment,
Structured Face Hallucination Chih-Yuan Yang Sifei Liu Ming-Hsuan Yang Electrical Engineering and Computer Science 1.
Multi-View Stereo : A Parametric Study Number of images, their angular separation and the like.
Spring 2015 CSc 83020: 3D Photography Prof. Ioannis Stamos Mondays 4:15 – 6:15
Interactive Point-based Modeling of Complex Objects from Images Pierre Poulin (a,b) Marc Stamminger (a,c) François Duranleau (b) Marie-Claude Frasson (a)
Scene Reconstruction Seminar presented by Anton Jigalin Advanced Topics in Computer Vision ( )
Presenter : Jia-Hao Syu
Discontinuous Displacement Mapping for Volume Graphics, Volume Graphics 2006, July 30, Boston, MA Discontinuous Displacement Mapping for Volume Graphics.
875 Dynamic Scene Reconstruction
776 Computer Vision Jan-Michael Frahm & Enrique Dunn Spring 2013.
Journal of Visual Communication and Image Representation
Paper presentation topics 2. More on feature detection and descriptors 3. Shape and Matching 4. Indexing and Retrieval 5. More on 3D reconstruction 1.
Image-Based Rendering Geometry and light interaction may be difficult and expensive to model –Think of how hard radiosity is –Imagine the complexity of.
Speaker Min-Koo Kang March 26, 2013 Depth Enhancement Technique by Sensor Fusion: MRF-based approach.
1 Supporting a Volume Rendering Application on a Grid-Middleware For Streaming Data Liang Chen Gagan Agrawal Computer Science & Engineering Ohio State.
A Globally Optimal Algorithm for Robust TV-L 1 Range Image Integration Christopher Zach VRVis Research Center Thomas Pock, Horst Bischof.
Semi-Global Matching with self-adjusting penalties
Introduction to Graphics Modeling
Paper Presentation: Shape and Matching
Modeling the world with photos
Dingding Liu* Yingen Xiong† Linda Shapiro* Kari Pulli†
Radio Propagation Simulation Based on Automatic 3D Environment Reconstruction D. He A novel method to simulate radio propagation is presented. The method.
Parallel Integration of Video Modules
Single View Modeling of Free-Form Scenes
Presentation transcript:

Reconstructing Building Interiors from Images Yasutaka Furukawa Brian Curless Steven M. Seitz University of Washington, Seattle, USA 2011/01/16 蔡禹婷

Outline Introduction Goal Challenges System pipeline Algorithmic details (technical contribution) Experimental Results Conclusion and future work Reference

Outline Introduction Goal Challenges System pipeline Algorithmic details (technical contribution) Experimental Results Conclusion and future work Reference

Reconstruction and Visualization of Architectural Scenes ◦ Semi-automatic(Manual )  Google Earth & Virtual Earth  Façade : Building facade made for use as a real-time video game engine environment. Google Earth 4 Virtual Earth Aerial images ◦ Automatic  Ground-level images  Aerial images: A projected image which is "floating in air", and cannot be viewed normally.

Reconstruction and Visualization of Architectural Scenes Difficulty ◦ Little attention paid to indoor scenes  If you walk inside your home and take photographs, generating a compelling 3D reconstruction and visualization becomes much more difficult. Google Earth 4 Aerial images Virtual Earth ? ? ? ?

Outline Introduction Goal Challenges System pipeline Algorithmic details (technical contribution) Experimental Results Conclusion and future work Reference

Goal Fully automatic system for interiors / outdoors ◦ Reconstructs a simple 3D model from images ◦ Provides real-time interactive visualization

Outline Introduction Goal Challenges System pipeline Algorithmic details (technical contribution) Experimental Results Conclusion and future work Reference

Challenges Reconstruction ◦ Multi-view stereo (MVS) typically produces a dense model ◦ We want the model to be  Simple for real-time interactive visualization of a large scene (e.g., a whole house)  Accurate for high-quality image-based rendering  Simple mode is effective for compelling visualization

Challenges Indoor Reconstruction Texture-poor surfacesComplicated visibility Prevalence of thin structures (doors, walls, tables)

Outline Introduction Goal Challenges System pipeline Algorithmic details (technical contribution) Experimental Results Conclusion and future work Reference

System pipeline 3D reconstruction and visualization system for architectural scenes.

System pipeline Image-based SFM MVS MWS Merging

MVSImage-basedSFMMWSMerging System pipeline Image-based

System pipeline Structure-from-Motion Bundler by Noah Snavely Structure from Motion for unordered image collections WEB: MVSImage-basedSFMMWSMerging

System pipeline PMVS by Yasutaka Furukawa and Jean Ponce Patch-based Multi-View Stereo Software/ Multi-view Stereo MVSImage-basedSFMMWSMerging

System pipeline Manhattan-world Stereo MVSImage-basedSFMMWSMerging

System pipeline MVSImage-basedSFM Manhattan-world Stereo

System pipeline Manhattan-world Stereo Result MVSImage-basedSFMMWSMerging

System pipeline Axis-aligned depth map merging (Paper contribution) MVSImage-basedSFMMWSMerging

Outline Introduction Goal Challenges System pipeline Algorithmic details (technical contribution) Experimental Results Conclusion and future work Reference

Axis-aligned Depth-map Merging Basic framework is similar to volumetric MRF

Axis-aligned Depth-map Merging Basic framework is similar to volumetric MRF

Key Feature 1 - Penalty terms

Binary penalty Binary encodes smoothness & data

Key Feature 1 - Penalty terms Binary penalty Binary encodes smoothness & data Unary is often constant (inflation)

Key Feature 1 - Penalty terms Binary penalty Binary encodes smoothness & data Unary is often constant (inflation) Weak regularization at interesting places Focus on a dense model

Key Feature 1 - Penalty terms Binary penalty Binary encodes smoothness & data Unary is often constant (inflation) Weak regularization at interesting places Focus on a dense model We want a simple model

Key Feature 1 - Penalty terms Binary penalty Binary encodes smoothness & data Unary is often constant (inflation)

Key Feature 1 - Penalty terms Binary penalty Binary encodes smoothness & data Unary is often constant (inflation)

Key Feature 1 - Penalty terms Unary encodes data Binary penalty Binary encodes smoothness & data Unary is often constant (inflation)

Key Feature 1 - Penalty terms Binary is smoothness Unary encodes data Binary penalty Binary encodes smoothness & data Unary is often constant (inflation)

Key Feature 1 - Penalty terms Regularization becomes weak  Dense 3D model Regularization is data-independent  Simpler 3D model Binary penalty

Axis-aligned Depth-map Merging Align-voxel grid with the dominant axes

Axis-aligned Depth-map Merging Align-voxel grid with the dominant axes Data term (unary)

Axis-aligned Depth-map Merging Align voxel grid with the dominant axes Data term (unary)

Axis-aligned Depth-map Merging Align voxel grid with the dominant axes Data term (unary)

Axis-aligned Depth-map Merging Align voxel grid with the dominant axes Data term (unary) Smoothness (binary)

Axis-aligned Depth-map Merging Align voxel grid with the dominant axes Data term (unary) Smoothness (binary)

Axis-aligned Depth-map Merging Align voxel grid with the dominant axes Data term (unary) Smoothness (binary) Graph-cuts

Key Feature 2 – Regularization For large scenes, data info are not complete Typical volumetric MRFs bias to general minimal surface We bias to piece-wise planar axis-aligned for architectural scenes

Key Feature 2 – Regularization

Same energy (ambiguous)

Key Feature 2 – Regularization Same energy (ambiguous) Data penalty: 0

Key Feature 2 – Regularization Same energy (ambiguous) Data penalty: 0 Smoothness penalty:Data penalty: 0 Smoothness penalty: 24Data penalty: 0

Key Feature 2 – Regularization shrinkage

Key Feature 2 – Regularization Axis-aligned neighborhood + Potts model Ambiguous Break ties with the minimum-volume solution Piece-wise planar axis-aligned model Axis-aligned neighborhood + Potts model Ambiguous Break ties with the minimum-volume solution Piece-wise planar axis-aligned model

Key Feature 3 – Sub-voxel accuracy

Summary of Depth-map Merging For a simple and axis-aligned model ◦ Explicit regularization in binary ◦ Axis-aligned neighborhood system & minimum-volume solution For an accurate model ◦ Sub-voxel refinement

Outline Introduction Goal Challenges System pipeline Algorithmic details (technical contribution) Experimental Results Conclusion and future work Reference

Experimental results Model complexity control with parameter

Experimental results Qualitative comparisons with a state-of- the-art MVS approach on hall with the number of faces in parentheses.

Experimental results Effects of the sub-voxel refinement procedure.

Experimental results Effects of the minimum volume constraint.

Experimental results Effects of the grid pruning on running time.

Outline Introduction Goal Challenges System pipeline Algorithmic details (technical contribution) Experimental Results Conclusion and future work Reference

Conclusion & Future Work Conclusion ◦ Fully automated 3D reconstruction/visualization system for architectural scenes ◦ Novel depth-map merging to produce piece- wise planar axis-aligned model with sub-voxel accuracy Future work ◦ Relax Manhattan-world assumption ◦ Larger scenes (e.g., a whole building)

Outline Introduction Goal Challenges System pipeline Algorithmic details (technical contribution) Experimental Results Conclusion and future work Reference

Reference N. Cornelis, B. Leibe, K. Cornelis, and L. V. Gool. 3d urban scene modeling integrating recognition and reconstruction. IJCV, 78(2-3):121–141, July L. Zebedin, J. Bauer, K. Karner, and H. Bischof. Fusion of feature- and area-based information for urban buildings modeling from aerial imagery. In ECCV, 2008.