Presentation is loading. Please wait.

Presentation is loading. Please wait.

CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.

Similar presentations


Presentation on theme: "CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo."— Presentation transcript:

1 CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo

2 Announcements PA 3 due on Monday 1pm Regrades etc.
PA 4: Photometric stereo

3 Fundamental matrix result

4 Properties of the Fundamental Matrix
is the epipolar line associated with and is rank 2 T Prove that F is rank 2. F

5 Estimating F If we don’t know K1, K2, R, or t, can we estimate F for two images? Yes, given enough correspondences

6 Estimating F – 8-point algorithm
The fundamental matrix F is defined by for any pair of matches x and x’ in two images. Let x=(u,v,1)T and x’=(u’,v’,1)T, each match gives a linear equation

7 8-point algorithm In reality, instead of solving , we seek f to minimize , least eigenvector of

8 8-point algorithm – Problem?
F should have rank 2 To enforce that F is of rank 2, F is replaced by F’ that minimizes subject to the rank constraint. This is achieved by SVD. Let , where , let then is the solution. The problem is that the resulting often is ill-conditioned. In theory, should have one singular value equal to zero and the rest are non-zero. In practice, however, some of the non-zero singular values can become small relative to the larger ones. If more than eight corresponding points are used to construct , where the coordinates are only approximately correct, there may not be a well-defined singular value which can be identified as approximately zero. Consequently, the solution of the homogeneous linear system of equations may not be sufficiently accurate to be useful.

9 8-point algorithm Pros: it is linear, easy to implement and fast
Cons: susceptible to noise Degenerate: if points are on same plane Normalized 8-point algorithm: Hartley Position origin at centroid of image points Rescale coordinates so that center to farthest point is sqrt (2)

10 What about more than two views?
The geometry of three views is described by a 3 x 3 x 3 tensor called the trifocal tensor The geometry of four views is described by a x 3 x 3 x 3 tensor called the quadrifocal tensor After this it starts to get complicated… Structure from motion

11 Stereo reconstruction pipeline
Steps Compute Fundamental Matrix Calibrate cameras: K1, K2 Rectify images Compute correspondence (and hence disparity) Estimate depth

12 Stereo image rectification
reproject image planes onto a common plane parallel to the line between optical centers pixel motion is horizontal after this transformation two homographies (3x3 transform), one for each input image reprojection C. Loop and Z. Zhang. Computing Rectifying Homographies for Stereo Vision. IEEE Conf. Computer Vision and Pattern Recognition, 1999.

13 Original stereo pair After rectification

14 Correspondence algorithms
Algorithms may be classified into two types: Dense: compute a correspondence at every pixel Sparse: compute correspondences only for features

15 Example image pair – parallel cameras

16 First image

17 Second image

18 Dense correspondence algorithm
epipolar line Search problem (geometric constraint): for each point in left image, corresponding point in right image lies on the epipolar line (1D ambiguity) Disambiguating assumption (photometric constraint): the intensity neighborhood of corresponding points are similar across images Measure similarity of neighborhood intensity by cross-correlation

19 Intensity profiles Clear correspondence, but also noise and ambiguity

20 Normalized Cross Correlation
region A region B regions as vectors vector a vector b a b

21 Cross-correlation of neighborhood
epipolar line translate so that mean is zero

22 left image band right image band 1 cross correlation 0.5 x

23 left image band right image band cross correlation x target region 1
0.5 x

24 fronto-parallel surface
Why is cross-correlation not great? The neighborhood region does not have a “distinctive” spatial intensity distribution Foreshortening effects fronto-parallel surface imaged length the same slanting surface imaged lengths differ

25 Limitations of similarity constraint
Occlusions, repetition Textureless surfaces Non-Lambertian surfaces, specularities

26 Other approaches to obtaining 3D structure

27 Active stereo with structured light
Project “structured” light patterns onto the object simplifies the correspondence problem Allows us to use only one camera camera projector L. Zhang, B. Curless, and S. M. Seitz. Rapid Shape Acquisition Using Color Structured Light and Multi-pass Dynamic Programming. 3DPVT 2002

28 Active stereo with structured light
L. Zhang, B. Curless, and S. M. Seitz. Rapid Shape Acquisition Using Color Structured Light and Multi-pass Dynamic Programming. 3DPVT 2002

29 Digital Michelangelo Project
Laser scanning Digital Michelangelo Project Optical triangulation Project a single stripe of laser light Scan it across the surface of the object This is a very precise version of structured light scanning Source: S. Seitz

30 The Digital Michelangelo Project, Levoy et al.
Laser scanned models The Digital Michelangelo Project, Levoy et al. Source: S. Seitz

31 The Digital Michelangelo Project, Levoy et al.
Laser scanned models The Digital Michelangelo Project, Levoy et al. Source: S. Seitz

32 The Digital Michelangelo Project, Levoy et al.
Source: S. Seitz

33 The Digital Michelangelo Project, Levoy et al.
Source: S. Seitz

34 The Digital Michelangelo Project, Levoy et al.
Source: S. Seitz

35 … which brings us to multi-view stereo
Aligning range images A single range scan is not sufficient to describe a complex surface Need techniques to register multiple range images … which brings us to multi-view stereo

36 Microsoft Kinect

37 Road map What we’ve seen so far: What’s next:
Low-level image processing: filtering, edge detecting, feature detection Geometry: image transformations, panoramas, single-view modeling Fundamental matrices What’s next: Photometric stereo (PA 4) Finishing up geometry: multi view stereo, structure from motion Recognition

38 Quiz


Download ppt "CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo."

Similar presentations


Ads by Google