Stereo Matching Vision for Graphics CSE 590SS, Winter 2001 Richard Szeliski.

Slides:



Advertisements
Similar presentations
875: Recent Advances in Geometric Computer Vision & Recognition
Advertisements

Gratuitous Picture US Naval Artillery Rangefinder from World War I (1918)!!
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.
Recap from Previous Lecture Tone Mapping – Preserve local contrast or detail at the expense of large scale contrast. – Changing the brightness within.
Lecture 8: Stereo.
Stereo.
776 Computer Vision Jan-Michael Frahm, Enrique Dunn Spring 2012.
Last Time Pinhole camera model, projection
Stanford CS223B Computer Vision, Winter 2005 Lecture 6: Stereo 2 Sebastian Thrun, Stanford Rick Szeliski, Microsoft Hendrik Dahlkamp and Dan Morris, Stanford.
Computer Vision CSE576, Spring 2005 Richard Szeliski
CS6670: Computer Vision Noah Snavely Lecture 17: Stereo
Announcements Quiz Thursday Quiz Review Tomorrow: AV Williams 4424, 4pm. Practice Quiz handout.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2005 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Direct Methods for Visual Scene Reconstruction Paper by Richard Szeliski & Sing Bing Kang Presented by Kristin Branson November 7, 2002.
Multi-view stereo Many slides adapted from S. Seitz.
Stanford CS223B Computer Vision, Winter 2006 Lecture 6 Stereo II Professor Sebastian Thrun CAs: Dan Maynes-Aminzade, Mitul Saha, Greg Corrado Stereo.
Stereo Binocular Stereo Calibration (finish up) Next Time Motivation
Stereopsis Mark Twain at Pool Table", no date, UCR Museum of Photography.
The plan for today Camera matrix
CS 223b 1 More on stereo and correspondence. CS 223b 2 =?f g Mostpopular For each window, match to closest window on epipolar line in other image. (slides.
3D from multiple views : Rendering and Image Processing Alexei Efros …with a lot of slides stolen from Steve Seitz and Jianbo Shi.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
Introduction to Computer Vision CS223B, Winter 2005.
Stereo Matching Computer Vision CSE576, Spring 2005 Richard Szeliski.
Lec 21: Fundamental Matrix
CSE473/573 – Stereo Correspondence
Announcements PS3 Due Thursday PS4 Available today, due 4/17. Quiz 2 4/24.
Stereo Guest Lecture by Li Zhang
Project 1 artifact winners Project 2 questions Project 2 extra signup slots –Can take a second slot if you’d like Announcements.
Stereo Matching & Energy Minimization Vision for Graphics CSE 590SS, Winter 2001 Richard Szeliski.
Midterm went out on Tuesday (due next Tuesday) Project 3 out today Announcements.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2006 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
Stereo matching Class 10 Read Chapter 7 Tsukuba dataset.
Review: Binocular stereo If necessary, rectify the two stereo images to transform epipolar lines into scanlines For each pixel x in the first image Find.
3-D Scene u u’u’ Study the mathematical relations between corresponding image points. “Corresponding” means originated from the same 3D point. Objective.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #15.
Camera Calibration & Stereo Reconstruction Jinxiang Chai.
What have we leaned so far? Camera structure Eye structure Project 1: High Dynamic Range Imaging.
Announcements Project 1 artifact winners Project 2 questions
Structure from images. Calibration Review: Pinhole Camera.
Lecture 12 Stereo Reconstruction II Lecture 12 Stereo Reconstruction II Mata kuliah: T Computer Vision Tahun: 2010.
Recap from Monday Image Warping – Coordinate transforms – Linear transforms expressed in matrix form – Inverse transforms useful when synthesizing images.
Stereo Readings Szeliski, Chapter 11 (through 11.5) Single image stereogram, by Niklas EenNiklas Een.
Stereo Many slides adapted from Steve Seitz.
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image image 1image.
Computer Vision, Robert Pless
Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
Bahadir K. Gunturk1 Phase Correlation Bahadir K. Gunturk2 Phase Correlation Take cross correlation Take inverse Fourier transform  Location of the impulse.
Lecture 16: Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
Digital Image Processing
776 Computer Vision Jan-Michael Frahm Spring 2012.
Solving for Stereo Correspondence Many slides drawn from Lana Lazebnik, UIUC.
Project 2 due today Project 3 out today Announcements TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAA.
Correspondence and Stereopsis Original notes by W. Correa. Figures from [Forsyth & Ponce] and [Trucco & Verri]
Advanced Computer Vision Chapter 11 Stereo Correspondence Presented by: 蘇唯誠 指導教授 : 傅楸善 博士.
Energy minimization Another global approach to improve quality of correspondences Assumption: disparities vary (mostly) smoothly Minimize energy function:
Correspondence and Stereopsis. Introduction Disparity – Informally: difference between two pictures – Allows us to gain a strong sense of depth Stereopsis.
CSE 185 Introduction to Computer Vision Stereo 2.
Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.
Motion and Optical Flow
Jeremy Bolton, PhD Assistant Teaching Professor
Announcements Midterms graded (handed back at end of lecture)
Thanks to Richard Szeliski and George Bebis for the use of some slides
What have we learned so far?
Computer Vision CSE576, Spring 2005 Richard Szeliski
Chapter 11: Stereopsis Stereopsis: Fusing the pictures taken by two cameras and exploiting the difference (or disparity) between them to obtain the depth.
Stereo vision Many slides adapted from Steve Seitz.
Presentation transcript:

Stereo Matching Vision for Graphics CSE 590SS, Winter 2001 Richard Szeliski

1/31/2001Vision for Graphics2 Stereo Matching Given two or more images of the same scene or object, compute a representation of its shape What are some possible applications?

1/31/2001Vision for Graphics3 Face modeling From one stereo pair to a 3D head model [Frederic Deverney, INRIA]Frederic Deverney

1/31/2001Vision for Graphics4 Z-keying: mix live and synthetic Takeo Kanade, CMU (Stereo Machine)Stereo Machine

1/31/2001Vision for Graphics5 Virtualized Reality TM Takeo Kanade, CMU collect video from 50+ stream reconstruct 3D model sequences

1/31/2001Vision for Graphics6 Virtualized Reality TM Takeo Kanade, CMU generate new video steerable version used for SuperBowl XXV “eye vision” system

1/31/2001Vision for Graphics7 View Interpolation Given two images with correspondences, morph (warp and cross-dissolve) between them [Chen & Williams, SIGGRAPH’93] input depth image novel view [Matthies,Szeliski,Kanade’88]

1/31/2001Vision for Graphics8 More view interpolation Spline-based depth map inputdepth imagenovel view [Szeliski & Kang ‘95]

1/31/2001Vision for Graphics9 View Morphing Morph between pair of images using epipolar geometry [Seitz & Dyer, SIGGRAPH’96]

1/31/2001Vision for Graphics10 Additional applications Real-time people tracking (systems from Pt. Gray Research and SRI) “Gaze” correction for video conferencing [Ott,Lewis,Cox InterChi’93] Other ideas?

1/31/2001Vision for Graphics11 Stereo Matching Given two or more images of the same scene or object, compute a representation of its shape What are some possible representations? depth maps volumetric models 3D surface models planar (or offset) layers

1/31/2001Vision for Graphics12 Stereo Matching What are some possible algorithms? match “features” and interpolate match edges and interpolate match all pixels with windows (coarse-fine) use optimization: –iterative updating –dynamic programming –energy minimization (regularization, stochastic) –graph algorithms

1/31/2001Vision for Graphics13 Outline (remainder of talk) Image rectification Matching criteria Local algorithms (aggregation) iterative updating Optimization algorithms: energy (cost) formulation Markov Random Fields mean-field, stochastic, and graph algorithms

1/31/2001Vision for Graphics14 Stereo: epipolar geometry Match features along epipolar lines viewing ray epipolar plane epipolar line

1/31/2001Vision for Graphics15 Stereo: epipolar geometry for two images (or images with collinear camera centers), can find epipolar lines epipolar lines are the projection of the pencil of planes passing through the centers Rectification: warping the input images (perspective transformation) so that epipolar lines are horizontal

1/31/2001Vision for Graphics16 Rectification Project each image onto same plane, which is parallel to the epipole Resample lines (and shear/stretch) to place lines in correspondence, and minimize distortion [Zhang and Loop, MSR-TR-99-21]MSR-TR-99-21

1/31/2001Vision for Graphics17 Rectification

1/31/2001Vision for Graphics18 Rectification

1/31/2001Vision for Graphics19 Matching criteria Raw pixel values (correlation) Band-pass filtered images [Jones & Malik 92] “Corner” like features [Zhang, …] Edges [many people…] Gradients [Seitz 89; Scharstein 94] Rank statistics [Zabih & Woodfill 94]

1/31/2001Vision for Graphics20 Finding correspondences apply feature matching criterion (e.g., correlation or Lucas-Kanade) at all pixels simultaneously search only over epipolar lines (many fewer candidate positions)

1/31/2001Vision for Graphics21 Image registration (revisited) How do we determine correspondences? block matching or SSD (sum squared differences) d is the disparity (horizontal motion) How big should the neighborhood be?

1/31/2001Vision for Graphics22 Neighborhood size Smaller neighborhood: more details Larger neighborhood: fewer isolated mistakes w = 3w = 20

1/31/2001Vision for Graphics23 Stereo: certainty modeling Compute certainty map from correlations input depth map certainty map

1/31/2001Vision for Graphics24 Plane Sweep Stereo Sweep family of planes through volume each plane defines an image  composite homography virtual camera composite input image projective re-sampling of (X,Y,Z)  projective re-sampling of (X,Y,Z)

1/31/2001Vision for Graphics25 Plane Sweep Stereo For each depth plane compute composite (mosaic) image — mean compute error image — variance convert to confidence and aggregate spatially Select winning depth at each pixel

1/31/2001Vision for Graphics26 Plane sweep stereo Re-order (pixel / disparity) evaluation loops for every pixel,for every disparity for every disparity for every pixel compute cost compute cost

1/31/2001Vision for Graphics27 Stereo matching framework 1.For every disparity, compute raw matching costs Why use a robust function? occlusions, other outliers Can also use alternative match criteria

1/31/2001Vision for Graphics28 Stereo matching framework 2.Aggregate costs spatially Here, we are using a box filter (efficient moving average implementation) Can also use weighted average, [non-linear] diffusion…

1/31/2001Vision for Graphics29 Stereo matching framework 3.Choose winning disparity at each pixel Can interpolate to sub-pixel accuracy

1/31/2001Vision for Graphics30 Traditional Stereo Matching Advantages: gives detailed surface estimates fast algorithms based on moving averages sub-pixel disparity estimates and confidence Limitations: narrow baseline  noisy estimates fails in textureless areas gets confused near occlusion boundaries

1/31/2001Vision for Graphics31 Stereo with Non-Linear Diffusion Problem with traditional approach: gets confused near discontinuities New approach: use iterative (non-linear) aggregation to obtain better estimate provably equivalent to mean-field estimate of Markov Random Field

1/31/2001Vision for Graphics32 Linear diffusion Average energy with neighbors windowdiffusion

1/31/2001Vision for Graphics33 Linear diffusion Average energy with neighbors + starting value windowdiffusion

1/31/2001Vision for Graphics34 Non-linear diffusion Stopping criterion: only update (x,y) column is entropy goes down (distribution is more peaked)

1/31/2001Vision for Graphics35 Summary Applications Image rectification Matching criteria Local algorithms (aggregation) area-based; iterative updating Optimization algorithms: energy (cost) formulation Markov Random Fields mean-field; dynamic programming; stochastic; graph algorithms

1/31/2001Vision for Graphics36 More stereo…(next 2 lectures) Multi-image stereo Volumetric techniques Graph cuts Transparency Surfaces and level sets

1/31/2001Vision for Graphics37 Bibliography See the references in the readings… D. Scharstein and R. Szeliski. Stereo matching with nonlinear diffusion. International Journal of Computer Vision, 28(2): , July 1998 R. Szeliski. Stereo algorithms and representations for image-based rendering. In British Machine Vision Conference (BMVC'99), volume 2, pages , Nottingham, England, September R. Szeliski and R. Zabih. An experimental comparison of stereo algorithms. In International Workshop on Vision Algorithms, pages 1- 19, Kerkyra, Greece, September 1999.