CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.

Slides:



Advertisements
Similar presentations
The fundamental matrix F
Advertisements

Lecture 11: Two-view geometry
CSE473/573 – Stereo and Multiple View Geometry
Gratuitous Picture US Naval Artillery Rangefinder from World War I (1918)!!
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.
Recap from Previous Lecture Tone Mapping – Preserve local contrast or detail at the expense of large scale contrast. – Changing the brightness within.
Lecture 8: Stereo.
Stereo.
776 Computer Vision Jan-Michael Frahm, Enrique Dunn Spring 2012.
Dr. Hassan Foroosh Dept. of Computer Science UCF
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2005 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Epipolar geometry. (i)Correspondence geometry: Given an image point x in the first view, how does this constrain the position of the corresponding point.
Lecture 21: Multiple-view geometry and structure from motion
Stereopsis Mark Twain at Pool Table", no date, UCR Museum of Photography.
3D from multiple views : Rendering and Image Processing Alexei Efros …with a lot of slides stolen from Steve Seitz and Jianbo Shi.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
Stereo and Structure from Motion
Lecture 20: Two-view geometry CS6670: Computer Vision Noah Snavely.
May 2004Stereo1 Introduction to Computer Vision CS / ECE 181B Tuesday, May 11, 2004  Multiple view geometry and stereo  Handout #6 available (check with.
Lec 21: Fundamental Matrix
CSE473/573 – Stereo Correspondence
Announcements PS3 Due Thursday PS4 Available today, due 4/17. Quiz 2 4/24.
Stereo Guest Lecture by Li Zhang
Project 1 artifact winners Project 2 questions Project 2 extra signup slots –Can take a second slot if you’d like Announcements.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2006 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
3-D Scene u u’u’ Study the mathematical relations between corresponding image points. “Corresponding” means originated from the same 3D point. Objective.
Epipolar Geometry and Stereo Vision Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/12/11 Many slides adapted from Lana Lazebnik,
9 – Stereo Reconstruction
Stereo Readings Trucco & Verri, Chapter 7 –Read through 7.1, 7.2.1, 7.2.2, 7.3.1, 7.3.2, and 7.4, –The rest is optional. Single image stereogram,
Automatic Camera Calibration
Epipolar Geometry and Stereo Vision Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/05/15 Many slides adapted from Lana Lazebnik,
Computer vision: models, learning and inference
Camera Calibration & Stereo Reconstruction Jinxiang Chai.
Announcements Project 1 artifact winners Project 2 questions
Structure from images. Calibration Review: Pinhole Camera.
Lecture 12 Stereo Reconstruction II Lecture 12 Stereo Reconstruction II Mata kuliah: T Computer Vision Tahun: 2010.
1 Preview At least two views are required to access the depth of a scene point and in turn to reconstruct scene structure Multiple views can be obtained.
Recap from Monday Image Warping – Coordinate transforms – Linear transforms expressed in matrix form – Inverse transforms useful when synthesizing images.
Epipolar geometry Epipolar Plane Baseline Epipoles Epipolar Lines
Stereo Vision Reading: Chapter 11 Stereo matching computes depth from two or more images Subproblems: –Calibrating camera positions. –Finding all corresponding.
Stereo Readings Szeliski, Chapter 11 (through 11.5) Single image stereogram, by Niklas EenNiklas Een.
Stereo Many slides adapted from Steve Seitz.
Project 2 code & artifact due Friday Midterm out tomorrow (check your ), due next Fri Announcements TexPoint fonts used in EMF. Read the TexPoint.
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image image 1image.
Computer Vision, Robert Pless
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
Announcements Project 3 due Thursday by 11:59pm Demos on Friday; signup on CMS Prelim to be distributed in class Friday, due Wednesday by the beginning.
Geometry of Multiple Views
Lecture 16: Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
stereo Outline : Remind class of 3d geometry Introduction
776 Computer Vision Jan-Michael Frahm Spring 2012.
Solving for Stereo Correspondence Many slides drawn from Lana Lazebnik, UIUC.
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry Topics: Basics of projective geometry Points and hyperplanes in projective space Homography.
Project 2 due today Project 3 out today Announcements TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAA.
Lec 26: Fundamental Matrix CS4670 / 5670: Computer Vision Kavita Bala.
Correspondence and Stereopsis. Introduction Disparity – Informally: difference between two pictures – Allows us to gain a strong sense of depth Stereopsis.
CSE 185 Introduction to Computer Vision Stereo 2.
Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
Noah Snavely, Zhengqi Li
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry
Epipolar Geometry and Stereo Vision
제 5 장 스테레오.
Two-view geometry Computer Vision Spring 2018, Lecture 10
Epipolar geometry.
What have we learned so far?
Epipolar geometry continued
Multiple View Geometry for Robotics
Reconstruction.
Chapter 11: Stereopsis Stereopsis: Fusing the pictures taken by two cameras and exploiting the difference (or disparity) between them to obtain the depth.
Stereo vision Many slides adapted from Steve Seitz.
Presentation transcript:

CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo

Announcements PA 3 due on Monday 1pm Regrades etc. PA 4: Photometric stereo

Fundamental matrix result

Properties of the Fundamental Matrix is the epipolar line associated with and is rank 2 T Prove that F is rank 2. F

Estimating F If we don’t know K1, K2, R, or t, can we estimate F for two images? Yes, given enough correspondences

Estimating F – 8-point algorithm The fundamental matrix F is defined by for any pair of matches x and x’ in two images. Let x=(u,v,1)T and x’=(u’,v’,1)T, each match gives a linear equation

8-point algorithm In reality, instead of solving , we seek f to minimize , least eigenvector of .

8-point algorithm – Problem? F should have rank 2 To enforce that F is of rank 2, F is replaced by F’ that minimizes subject to the rank constraint. This is achieved by SVD. Let , where , let then is the solution. The problem is that the resulting often is ill-conditioned. In theory, should have one singular value equal to zero and the rest are non-zero. In practice, however, some of the non-zero singular values can become small relative to the larger ones. If more than eight corresponding points are used to construct , where the coordinates are only approximately correct, there may not be a well-defined singular value which can be identified as approximately zero. Consequently, the solution of the homogeneous linear system of equations may not be sufficiently accurate to be useful.

8-point algorithm Pros: it is linear, easy to implement and fast Cons: susceptible to noise Degenerate: if points are on same plane Normalized 8-point algorithm: Hartley Position origin at centroid of image points Rescale coordinates so that center to farthest point is sqrt (2)

What about more than two views? The geometry of three views is described by a 3 x 3 x 3 tensor called the trifocal tensor The geometry of four views is described by a 3 x 3 x 3 x 3 tensor called the quadrifocal tensor After this it starts to get complicated… Structure from motion

Stereo reconstruction pipeline Steps Compute Fundamental Matrix Calibrate cameras: K1, K2 Rectify images Compute correspondence (and hence disparity) Estimate depth

Stereo image rectification reproject image planes onto a common plane parallel to the line between optical centers pixel motion is horizontal after this transformation two homographies (3x3 transform), one for each input image reprojection C. Loop and Z. Zhang. Computing Rectifying Homographies for Stereo Vision. IEEE Conf. Computer Vision and Pattern Recognition, 1999.

Original stereo pair After rectification

Correspondence algorithms Algorithms may be classified into two types: Dense: compute a correspondence at every pixel Sparse: compute correspondences only for features

Example image pair – parallel cameras

First image

Second image

Dense correspondence algorithm epipolar line Search problem (geometric constraint): for each point in left image, corresponding point in right image lies on the epipolar line (1D ambiguity) Disambiguating assumption (photometric constraint): the intensity neighborhood of corresponding points are similar across images Measure similarity of neighborhood intensity by cross-correlation

Intensity profiles Clear correspondence, but also noise and ambiguity

Normalized Cross Correlation region A region B regions as vectors vector a vector b a b

Cross-correlation of neighborhood epipolar line translate so that mean is zero

left image band right image band 1 cross correlation 0.5 x

left image band right image band cross correlation x target region 1 0.5 x

fronto-parallel surface Why is cross-correlation not great? The neighborhood region does not have a “distinctive” spatial intensity distribution Foreshortening effects fronto-parallel surface imaged length the same slanting surface imaged lengths differ

Limitations of similarity constraint Occlusions, repetition Textureless surfaces Non-Lambertian surfaces, specularities

Other approaches to obtaining 3D structure

Active stereo with structured light Project “structured” light patterns onto the object simplifies the correspondence problem Allows us to use only one camera camera projector L. Zhang, B. Curless, and S. M. Seitz. Rapid Shape Acquisition Using Color Structured Light and Multi-pass Dynamic Programming. 3DPVT 2002

Active stereo with structured light L. Zhang, B. Curless, and S. M. Seitz. Rapid Shape Acquisition Using Color Structured Light and Multi-pass Dynamic Programming. 3DPVT 2002

Digital Michelangelo Project Laser scanning Digital Michelangelo Project http://graphics.stanford.edu/projects/mich/ Optical triangulation Project a single stripe of laser light Scan it across the surface of the object This is a very precise version of structured light scanning Source: S. Seitz

The Digital Michelangelo Project, Levoy et al. Laser scanned models The Digital Michelangelo Project, Levoy et al. Source: S. Seitz

The Digital Michelangelo Project, Levoy et al. Laser scanned models The Digital Michelangelo Project, Levoy et al. Source: S. Seitz

The Digital Michelangelo Project, Levoy et al. Source: S. Seitz

The Digital Michelangelo Project, Levoy et al. Source: S. Seitz

The Digital Michelangelo Project, Levoy et al. Source: S. Seitz

… which brings us to multi-view stereo Aligning range images A single range scan is not sufficient to describe a complex surface Need techniques to register multiple range images … which brings us to multi-view stereo

Microsoft Kinect

Road map What we’ve seen so far: What’s next: Low-level image processing: filtering, edge detecting, feature detection Geometry: image transformations, panoramas, single-view modeling Fundamental matrices What’s next: Photometric stereo (PA 4) Finishing up geometry: multi view stereo, structure from motion Recognition

Quiz