Stereo Dan Kong.

Slides:



Advertisements
Similar presentations
Bayesian Belief Propagation
Advertisements

Stereo Vision Reading: Chapter 11
Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923.
Gratuitous Picture US Naval Artillery Rangefinder from World War I (1918)!!
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.
Lecture 8: Stereo.
Last Time Pinhole camera model, projection
Stanford CS223B Computer Vision, Winter 2005 Lecture 6: Stereo 2 Sebastian Thrun, Stanford Rick Szeliski, Microsoft Hendrik Dahlkamp and Dan Morris, Stanford.
Stanford CS223B Computer Vision, Winter 2007 Lecture 6 Advanced Stereo Professors Sebastian Thrun and Jana Košecká CAs: Vaibhav Vaish and David Stavens.
Contents Description of the big picture Theoretical background on this work The Algorithm Examples.
Computer Vision : CISC 4/689 Adaptation from: Prof. James M. Rehg, G.Tech.
Stanford CS223B Computer Vision, Winter 2006 Lecture 6 Stereo II Professor Sebastian Thrun CAs: Dan Maynes-Aminzade, Mitul Saha, Greg Corrado Stereo.
Stereopsis Mark Twain at Pool Table", no date, UCR Museum of Photography.
The plan for today Camera matrix
CS 223b 1 More on stereo and correspondence. CS 223b 2 =?f g Mostpopular For each window, match to closest window on epipolar line in other image. (slides.
Stereo Computation using Iterative Graph-Cuts
© 2006 by Davi GeigerComputer Vision April 2006 L1.1 Binocular Stereo Left Image Right Image.
COMP 290 Computer Vision - Spring Motion II - Estimation of Motion field / 3-D construction from motion Yongjik Kim.
CSE473/573 – Stereo Correspondence
Announcements PS3 Due Thursday PS4 Available today, due 4/17. Quiz 2 4/24.
Stereo Sebastian Thrun, Gary Bradski, Daniel Russakoff Stanford CS223B Computer Vision (with slides by James Rehg and.
COMP322/S2000/L271 Stereo Imaging Ref.V.S.Nalwa, A Guided Tour of Computer Vision, Addison Wesley, (ISBN ) Slides are adapted from CS641.
Stereo matching “Stereo matching” is the correspondence problem –For a point in Image #1, where is the corresponding point in Image #2? C1C1 C2C2 ? ? C1C1.
Stereo matching Class 10 Read Chapter 7 Tsukuba dataset.
3-D Scene u u’u’ Study the mathematical relations between corresponding image points. “Corresponding” means originated from the same 3D point. Objective.
John Morris Iolanthe returns to the Waitemata Harbour Stereo Vision The Correspondence Problem.
Lecture 11 Stereo Reconstruction I Lecture 11 Stereo Reconstruction I Mata kuliah: T Computer Vision Tahun: 2010.
Structure from images. Calibration Review: Pinhole Camera.
Fast Approximate Energy Minimization via Graph Cuts
What we didn’t have time for CS664 Lecture 26 Thursday 12/02/04 Some slides c/o Dan Huttenlocher, Stefano Soatto, Sebastian Thrun.
Lecture 12 Stereo Reconstruction II Lecture 12 Stereo Reconstruction II Mata kuliah: T Computer Vision Tahun: 2010.
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Stereo Vision Reading: Chapter 11 Stereo matching computes depth from two or more images Subproblems: –Calibrating camera positions. –Finding all corresponding.
CS654: Digital Image Analysis Lecture 8: Stereo Imaging.
Geometry 3: Stereo Reconstruction Introduction to Computer Vision Ronen Basri Weizmann Institute of Science.
Stereo Many slides adapted from Steve Seitz.
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image image 1image.
Computer Vision, Robert Pless
A Region Based Stereo Matching Algorithm Using Cooperative Optimization Zeng-Fu Wang, Zhi-Gang Zheng University of Science and Technology of China Computer.
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
Computer Vision Stereo Vision. Bahadir K. Gunturk2 Pinhole Camera.
Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
Bahadir K. Gunturk1 Phase Correlation Bahadir K. Gunturk2 Phase Correlation Take cross correlation Take inverse Fourier transform  Location of the impulse.
Lecture 16: Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
stereo Outline : Remind class of 3d geometry Introduction
Jochen Triesch, UC San Diego, 1 Stereo Outline: parallel camera axes convergent axes, epipolar geometry correspondence.
Segmentation- Based Stereo Michael Bleyer LVA Stereo Vision.
Solving for Stereo Correspondence Many slides drawn from Lana Lazebnik, UIUC.
Lecture 9 Feature Extraction and Motion Estimation Slides by: Michael Black Clark F. Olson Jean Ponce.
A global approach Finding correspondence between a pair of epipolar lines for all pixels simultaneously Local method: no guarantee we will have one to.
Correspondence and Stereopsis Original notes by W. Correa. Figures from [Forsyth & Ponce] and [Trucco & Verri]
John Morris Stereo Vision (continued) Iolanthe returns to the Waitemata Harbour.
Advanced Computer Vision Chapter 11 Stereo Correspondence Presented by: 蘇唯誠 指導教授 : 傅楸善 博士.
John Morris These slides were adapted from a set of lectures written by Mircea Nicolescu, University of Nevada at Reno Stereo Vision Iolanthe in the Bay.
Photoconsistency constraint C2 q C1 p l = 2 l = 3 Depth labels If this 3D point is visible in both cameras, pixels p and q should have similar intensities.
Stereo Matching Using Dynamic Programming
Computational Vision CSCI 363, Fall 2012 Lecture 17 Stereopsis II
數位三維視訊 楊 家 輝 Jar-Ferr Yang 電腦與通信工程研究所 電機工程學系 國立成功大學 Institute of Computer and Communication Engineering Department of Electrical Engineering National Cheng.
Energy minimization Another global approach to improve quality of correspondences Assumption: disparities vary (mostly) smoothly Minimize energy function:
Correspondence and Stereopsis. Introduction Disparity – Informally: difference between two pictures – Allows us to gain a strong sense of depth Stereopsis.
CSE 185 Introduction to Computer Vision Stereo 2.
Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.
Geometry 3: Stereo Reconstruction
EECS 274 Computer Vision Stereopsis.
What have we learned so far?
Computer Vision Stereo Vision.
Binocular Stereo Vision
Chapter 11: Stereopsis Stereopsis: Fusing the pictures taken by two cameras and exploiting the difference (or disparity) between them to obtain the depth.
Stereo vision Many slides adapted from Steve Seitz.
Presentation transcript:

Stereo Dan Kong

Stereo vision Triangulate on two images of the same scene point to recover depth. Camera calibration Finding all correspondence Computing depth or surfaces depth baseline left Right

Outline Basic stereo equations Constraints and assumption Windows-based matching Cooperative Stereo Dynamic programming Graph cut and Belief Propagation Segmentation-based method

Pinhole Camera Model Image plane Virtual Image

Basic Stereo Derivations

Basic Stereo Derivations disparity

Stereo Constraint Color constancy The color of any world points remains constant from image to image This assumption is true under Lambertian Model In practice, given photometric camera calibration and typical scenes, color constancy holds well enough for most stereo algorithms.

Stereo Constraint Epipolar geometry The epipolar geometry is the fundamental constraint in stereo. Rectification aligns epipolar lines with scanlines Epipolar plane Epipolar line for p Epipolar line for p’

Stereo Constraint Uniqueness and Continuity Proposed by Marr&Poggio. Each item from each image may be assigned at most one disparity value,” and the “disparity” varies smoothly almost everywhere.

Correspondence Using Window-based matching scanline Left Right SSD error disparity Left Right

Sum of Squared (Pixel) Differences Left Right

Image Normalization Even when the cameras are identical models, there can be differences in gain and sensitivity. The cameras do not see exactly the same surfaces, so their overall light levels can differ. For these reasons and more, it is a good idea to normalize the pixels in each window:

Images as Vectors Left Right “Unwrap” image to form vector, using raster scan order row 1 row 2 Each window is a vector in an m2 dimensional vector space. Normalization makes them unit length. row 3

Normalized Correlation

Results Using window-based Method Left Disparity Map Images courtesy of Point Grey Research Left Right

Stereo Results Left Disparity map

Problems with Window-based matching Disparity within the window must be constant. Bias the results towards frontal-parallel surfaces. Blur across depth discontinuities. Perform poorly in textureless regions. Erroneous results in occluded regions

Cooperative Stereo Algorithm Based on two basic assumption by Marr and Poggio: Uniqueness: at most a single unique match exists for each pixel. Continuous: disparity values are generally continuous, i.e., smooth within a local neighborhood.

Disparity Space Image (DSI) The 3D disparity space has dimensions row r column c and disparity d. Each element (r, c, d) of the disparity space projects to the pixel (r, c) in the left image and to the (r, c + d) in the right image DSI represents the confidence or likelihood of a particular match.

(r, c) slices for different d Illustration of DSI (r, c) slices for different d (c, d) slice for r = 151

Definition Match value assigned to element (r, c, d) at iteration n Initial values computed from SSD or NCC Inhibition area for element (r, c, d) Local support area for element (r, c, d)

Illustration of Inhibitory and Support Regions

Iterative Updating DSI 1 2 3 4

Explicit Detection of Occlusion Identify occlusions by examining the magnitude of the converged values in conjunction with the uniqueness constrain

Summary of Cooperative Stereo Prepare a 3D array, (r, c, d): (r, c) for each pixel in the reference image and d for the range of disparity. Set initial match values using a function of image intensities, such as normalized correlation or SSD. Iteratively update match values using (4) until the match values converge. For each pixel (r, c), find the element (r, c, d) with the maximum match value. If the maximum match value is higher than a threshold, output the disparity d, otherwise, declare a occlusion.

MRF Stereo Model Local Evidence function Compatibility function :Lx1 vector :LxL matrix

Disparity Optimization Joint probability of MRF: The disparity optimization step requires choosing an estimator for MMSE: estimate of the mean of the marginal distribution of MAP: the labeling of maximize the above joint probability (1)

Equivalence to Energy Minimization Taking the negative log of equation 1: In graph cut, equation 2 is expressed as: Maximizing the probability in equation 1is equivalent to minimizing energy in equation 3. (2) (3)

Stereo Matching Using Belief Propagation Belief propagation is an iterative inference algorithm that propagates messages in the Markov network Message node send to Message observed node send to Belief at node We simplify as , and as

Belief Propagation Algorithm Initialize messages as uniform distribution Iterative update messages for I = 1:T Compute belief at each node and output disparity

Illustration of BP

BP Results

Stereo As a Pixel-Labeling Problem Let P be a set of pixels, L be a label set. The goal is find a labeling f which minimize some energy. For stereo, the labels are disparities. The classic form of energy function is:

Energy Function: The energy function measures how appropriate a label is for the pixel given the observed data. In stereo, this term corresponds to the match cost or likelihood. The energy term encodes the prior or smoothness constraint. In stereo, the so called Potts model is used:

Two Energy Minimization Algorithm via Graph Cuts Swap algorithm

Two Energy Minimization Algorithm via Graph Cuts expansion algorithm

Moves

Graph Cuts Results Graph Cuts Belief Propagation

Ordering Constraint If an object a is left on an object b in the left image then object a will also appear to the left of object b in the right image Ordering constraint… …and its failure

Stereo Correspondences Left scanline Right scanline … Match intensities sequentially between two scanlines

Stereo Correspondences Left scanline Right scanline … Left occlusion Match Match Match Right occlusion

Search Over Correspondences Left Occluded Pixels Left scanline Right scanline Right occluded Pixels Three cases: Sequential – cost of match Left occluded – cost of no match Right occluded – cost of no match

Standard 3-move Dynamic Programming for Stereo Left Occluded Pixels Left scanline Start Dynamic programming yields the optimal path through grid. This is the best set of matches that satisfy the ordering constraint Right occluded Pixels Right scanline End

Dynamic Programming Efficient algorithm for solving sequential decision (optimal path) problems. 1 1 1 1 … 2 2 2 2 3 3 3 3 How many paths through this trellis?

Dynamic Programming 1 1 1 States: 2 2 2 3 3 3 Suppose cost can be decomposed into stages:

Dynamic Programming 1 1 1 2 2 2 3 3 3 Principle of Optimality for an n-stage assignment problem

Dynamic Programming 1 1 1 2 2 2 3 3 3

Stereo Matching with Dynamic Programming Pseudo-code describing how to calculate the optimal match

Stereo Matching with Dynamic Programming Pseudo-code describing how to reconstruct the optimal path

Results Local errors may be propagated along a scan-line and no inter scan-line consistency is enforced.

Assumption Behind Segmentation-based Stereo Depth discontinuity tend to correlate well with color edges Disparity variation within a segment is small Approximation the scene with piece-wise planar surfaces

Segmentation-based stereo Plane equation is fitted in each segment based on initial disparity estimation obtained SSD or Correlation Globe matching criteria: if a depth map is good, warping the reference image to the other view according to this depth will render an image that matches the real view Optimization by iterative neighborhood depth hypothesizing

Hypothesizing neighborhood depth Correct depth is propagated to reduce fattening effect:

Hypothesizing neighborhood depth Background depth is hypothesized for unmatched region:

Result

Another Result