Review: Binocular stereo If necessary, rectify the two stereo images to transform epipolar lines into scanlines For each pixel x in the first image Find.

Slides:



Advertisements
Similar presentations
CSE473/573 – Stereo and Multiple View Geometry
Advertisements

Stereo matching Class 7 Read Chapter 7 of tutorial Tsukuba dataset.
Multiview Reconstruction. Why More Than 2 Views? BaselineBaseline – Too short – low accuracy – Too long – matching becomes hard.
Some books on linear algebra Linear Algebra, Serge Lang, 2004 Finite Dimensional Vector Spaces, Paul R. Halmos, 1947 Matrix Computation, Gene H. Golub,
875: Recent Advances in Geometric Computer Vision & Recognition
Gratuitous Picture US Naval Artillery Rangefinder from World War I (1918)!!
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.
MASKS © 2004 Invitation to 3D vision Lecture 7 Step-by-Step Model Buidling.
Lecture 8: Stereo.
Stereo.
MultiView Stereo Steve Seitz CSE590SS: Vision for Graphics.
Multiview Reconstruction. Why More Than 2 Views? BaselineBaseline – Too short – low accuracy – Too long – matching becomes hard.
Silhouettes in Multiview Stereo Ian Simon. Multiview Stereo Problem Input: – a collection of images of a rigid object (or scene) – camera parameters for.
Last Time Pinhole camera model, projection
Project 3 extension: Wednesday at noon Final project proposal extension: Friday at noon >consult with Steve, Rick, and/or Ian now! Project 2 artifact winners...
Lecture 12: Structure from motion CS6670: Computer Vision Noah Snavely.
Stereo. STEREOPSIS Reading: Chapter 11. The Stereopsis Problem: Fusion and Reconstruction Human Stereopsis and Random Dot Stereograms Cooperative Algorithms.
Computational Photography: Image-based Modeling Jinxiang Chai.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2005 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Project 2 winners Think about Project 3 Guest lecture on Monday: Aseem Announcements.
Direct Methods for Visual Scene Reconstruction Paper by Richard Szeliski & Sing Bing Kang Presented by Kristin Branson November 7, 2002.
Multiview stereo. Volumetric stereo Scene Volume V Input Images (Calibrated) Goal: Determine occupancy, “color” of points in V.
Multi-view stereo Many slides adapted from S. Seitz.
The plan for today Camera matrix
CS 223b 1 More on stereo and correspondence. CS 223b 2 =?f g Mostpopular For each window, match to closest window on epipolar line in other image. (slides.
3D from multiple views : Rendering and Image Processing Alexei Efros …with a lot of slides stolen from Steve Seitz and Jianbo Shi.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
Stereo and Structure from Motion
Project 1 grades out Announcements. Multiview stereo Readings S. M. Seitz and C. R. Dyer, Photorealistic Scene Reconstruction by Voxel Coloring, International.
Multiview Reconstruction. Why More Than 2 Views? BaselineBaseline – Too short – low accuracy – Too long – matching becomes hard.
May 2004Stereo1 Introduction to Computer Vision CS / ECE 181B Tuesday, May 11, 2004  Multiple view geometry and stereo  Handout #6 available (check with.
Announcements Project Update Extension: due Friday, April 20 Create web page with description, results Present your project in class (~10min/each) on Friday,
CSE473/573 – Stereo Correspondence
Lecture 25: Multi-view stereo, continued
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2006 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
Stereo matching Class 10 Read Chapter 7 Tsukuba dataset.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #15.
Reconstructing Relief Surfaces George Vogiatzis, Philip Torr, Steven Seitz and Roberto Cipolla BMVC 2004.
Some books on linear algebra Linear Algebra, Serge Lang, 2004 Finite Dimensional Vector Spaces, Paul R. Halmos, 1947 Matrix Computation, Gene H. Golub,
Structure from images. Calibration Review: Pinhole Camera.
3D Surface Reconstruction from 2D Images (Survey)
KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.
Recap from Monday Image Warping – Coordinate transforms – Linear transforms expressed in matrix form – Inverse transforms useful when synthesizing images.
Epipolar geometry Epipolar Plane Baseline Epipoles Epipolar Lines
Stereo Readings Szeliski, Chapter 11 (through 11.5) Single image stereogram, by Niklas EenNiklas Een.
Stereo Many slides adapted from Steve Seitz.
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image image 1image.
Computer Vision, Robert Pless
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
stereo Outline : Remind class of 3d geometry Introduction
776 Computer Vision Jan-Michael Frahm Spring 2012.
Solving for Stereo Correspondence Many slides drawn from Lana Lazebnik, UIUC.
875 Dynamic Scene Reconstruction
776 Computer Vision Jan-Michael Frahm & Enrique Dunn Spring 2013.
Paper presentation topics 2. More on feature detection and descriptors 3. Shape and Matching 4. Indexing and Retrieval 5. More on 3D reconstruction 1.
CSE 185 Introduction to Computer Vision Stereo 2.
제 5 장 스테레오.
CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.
STEREOPSIS The Stereopsis Problem: Fusion and Reconstruction
STEREOPSIS The Stereopsis Problem: Fusion and Reconstruction
EECS 274 Computer Vision Stereopsis.
Announcements Midterm due now Project 2 artifacts: vote today!
MSR Image based Reality Project
Chapter 11: Stereopsis Stereopsis: Fusing the pictures taken by two cameras and exploiting the difference (or disparity) between them to obtain the depth.
Announcements Project 3 out today (help session at end of class)
Stereo vision Many slides adapted from Steve Seitz.
Presentation transcript:

Review: Binocular stereo If necessary, rectify the two stereo images to transform epipolar lines into scanlines For each pixel x in the first image Find corresponding epipolar scanline in the right image Examine all pixels on the scanline and pick the best match x’ Compute disparity x-x’ and set depth(x) = B*f/(x-x’)

Multi-view stereo Many slides adapted from S. Seitz

What is stereo vision? Generic problem formulation: given several images of the same object or scene, compute a representation of its 3D shape

What is stereo vision? Generic problem formulation: given several images of the same object or scene, compute a representation of its 3D shape “Images of the same object or scene” Arbitrary number of images (from two to thousands) Arbitrary camera positions (camera network or video sequence) Calibration may be initially unknown “Representation of 3D shape” Depth maps Meshes Point clouds Patch clouds Volumetric models Layered models

The third view can be used for verification Beyond two-view stereo

Pick a reference image, and slide the corresponding window along the corresponding epipolar lines of all other images, using inverse depth relative to the first image as the search parameter M. Okutomi and T. Kanade, “A Multiple-Baseline Stereo System,” IEEE Trans. on Pattern Analysis and Machine Intelligence, 15(4): (1993).“A Multiple-Baseline Stereo System,” Multiple-baseline stereo

For larger baselines, must search larger area in second image 1/z width of a pixel width of a pixel 1/z pixel matching score

Multiple-baseline stereo Use the sum of SSD scores to rank matches

I1I2I10 Multiple-baseline stereo results M. Okutomi and T. Kanade, “A Multiple-Baseline Stereo System,” IEEE Trans. on Pattern Analysis and Machine Intelligence, 15(4): (1993).“A Multiple-Baseline Stereo System,”

Plane Sweep Stereo Choose a reference view Sweep family of planes at different depths with respect to the reference camera Each plane defines a homography warping each input image into the reference view reference camera input image R. Collins. A space-sweep approach to true multi-image matching. CVPR 1996.A space-sweep approach to true multi-image matching. input image

Plane Sweep Stereo For each depth plane For each pixel in the composite image stack, compute the variance For each pixel, select the depth that gives the lowest variance

Plane Sweep Stereo For each depth plane For each pixel in the composite image stack, compute the variance For each pixel, select the depth that gives the lowest variance Can be accelerated using graphics hardware R. Yang and M. Pollefeys. Multi-Resolution Real-Time Stereo on Commodity Graphics Hardware, CVPR 2003Multi-Resolution Real-Time Stereo on Commodity Graphics Hardware

Volumetric stereo In plane sweep stereo, the sampling of the scene depends on the reference view We can use a voxel volume to get a view- independent representation

Volumetric Stereo / Voxel Coloring Discretized Scene Volume Input Images (Calibrated) Goal: Assign RGB values to voxels in V photo-consistent with images

Photo-consistency All Scenes Photo-Consistent Scenes True Scene A photo-consistent scene is a scene that exactly reproduces your input images from the same camera viewpoints You can’t use your input cameras and images to tell the difference between a photo-consistent scene and the true scene

Space Carving Space Carving Algorithm Image 1 Image N …... Initialize to a volume V containing the true scene Repeat until convergence Choose a voxel on the outside of the volume Carve if not photo-consistent Project to visible input images K. N. Kutulakos and S. M. Seitz, A Theory of Shape by Space Carving, ICCV 1999A Theory of Shape by Space Carving

Which shape do you get? The Photo Hull is the UNION of all photo-consistent scenes in V It is a photo-consistent scene reconstruction Tightest possible bound on the true scene True Scene V Photo Hull V Source: S. Seitz

Space Carving Results: African Violet Input Image (1 of 45)Reconstruction Source: S. Seitz

Space Carving Results: Hand Input Image (1 of 100) Views of Reconstruction

Reconstruction from Silhouettes Binary Images The case of binary images: a voxel is photo- consistent if it lies inside the object’s silhouette in all views

Reconstruction from Silhouettes Binary Images Finding the silhouette-consistent shape (visual hull): Backproject each silhouette Intersect backprojected volumes The case of binary images: a voxel is photo- consistent if it lies inside the object’s silhouette in all views

Volume intersection B. Baumgart, Geometric Modeling for Computer Vision, Stanford Artificial Intelligence Laboratory, Memo no. AIM-249, Stanford University, October 1974.Geometric Modeling for Computer Vision

Photo-consistency vs. silhouette-consistency True ScenePhoto HullVisual Hull

Carved visual hulls The visual hull is a good starting point for optimizing photo-consistency Easy to compute Tight outer boundary of the object Parts of the visual hull (rims) already lie on the surface and are already photo-consistent Yasutaka Furukawa and Jean Ponce, Carved Visual Hulls for Image-Based Modeling, ECCV 2006.Carved Visual Hulls for Image-Based Modeling

Carved visual hulls 1.Compute visual hull 2.Use dynamic programming to find rims (photo-consistent parts of visual hull) 3.Carve the visual hull to optimize photo-consistency keeping the rims fixed Yasutaka Furukawa and Jean Ponce, Carved Visual Hulls for Image-Based Modeling, ECCV 2006.Carved Visual Hulls for Image-Based Modeling

From feature matching to dense stereo 1.Extract features 2.Get a sparse set of initial matches 3.Iteratively expand matches to nearby locations 4.Use visibility constraints to filter out false matches 5.Perform surface reconstruction Yasutaka Furukawa and Jean Ponce, Accurate, Dense, and Robust Multi-View Stereopsis, CVPR 2007.Accurate, Dense, and Robust Multi-View Stereopsis

From feature matching to dense stereo Yasutaka Furukawa and Jean Ponce, Accurate, Dense, and Robust Multi-View Stereopsis, CVPR 2007.Accurate, Dense, and Robust Multi-View Stereopsis

Stereo from community photo collections Up to now, we’ve always assumed that camera calibration is known For photos taken from the Internet, we need structure from motion techniques to reconstruct both camera positions and 3D points

Towards Internet-Scale Multi-View Stereo YouTube videoYouTube video, high-quality videohigh-quality video Yasutaka Furukawa, Brian Curless, Steven M. Seitz and Richard Szeliski, Towards Internet-scale Multi-view Stereo,CVPR 2010.Towards Internet-scale Multi-view Stereo

Fast stereo for Internet photo collections Start with a cluster of registered views Obtain a depth map for every view using plane sweeping stereo with normalized cross-correlation Frahm et al., “Building Rome on a Cloudless Day,” ECCV 2010.“Building Rome on a Cloudless Day,”

Plane sweeping stereo Need to register individual depth maps into a single 3D model Problem: depth maps are very noisy far near Frahm et al., “Building Rome on a Cloudless Day,” ECCV 2010.“Building Rome on a Cloudless Day,”

Robust stereo fusion using a heightmap Enforces vertical facades One continuous surface, no holes Fast to compute, low memory complexity David Gallup, Marc Pollefeys, Jan-Michael Frahm, “3D Reconstruction using an n-Layer Heightmap”, DAGM 2010

Results YouTube Video Frahm et al., “Building Rome on a Cloudless Day,” ECCV 2010.“Building Rome on a Cloudless Day,”

Kinect: Structured infrared light

Kinect Fusion YouTube Video Paper linkPaper link (ACM Symposium on User Interface Software and Technology, October 2011)