Brief Introduction to Geometry and Vision

Slides:



Advertisements
Similar presentations
The fundamental matrix F
Advertisements

Lecture 11: Two-view geometry
Geometry 2: A taste of projective geometry Introduction to Computer Vision Ronen Basri Weizmann Institute of Science.
Two-View Geometry CS Sastry and Yang
Two-view geometry.
Camera calibration and epipolar geometry
Structure from motion.
Computer Vision : CISC 4/689
Geometry of Images Pinhole camera, projection A taste of projective geometry Two view geometry:  Homography  Epipolar geometry, the essential matrix.
Epipolar geometry. (i)Correspondence geometry: Given an image point x in the first view, how does this constrain the position of the corresponding point.
Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.
Uncalibrated Geometry & Stratification Sastry and Yang
Previously Two view geometry: epipolar geometry Stereo vision: 3D reconstruction epipolar lines Baseline O O’ epipolar plane.
Computer Vision Structure from motion Marc Pollefeys COMP 256 Some slides and illustrations from J. Ponce, A. Zisserman, R. Hartley, Luc Van Gool, …
May 2004Stereo1 Introduction to Computer Vision CS / ECE 181B Tuesday, May 11, 2004  Multiple view geometry and stereo  Handout #6 available (check with.
Lec 21: Fundamental Matrix
CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.
Multiple View Geometry. THE GEOMETRY OF MULTIPLE VIEWS Reading: Chapter 10. Epipolar Geometry The Essential Matrix The Fundamental Matrix The Trifocal.
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry Topics: Basics of projective geometry Points and hyperplanes in projective space Homography.
3-D Scene u u’u’ Study the mathematical relations between corresponding image points. “Corresponding” means originated from the same 3D point. Objective.
Epipolar Geometry and Stereo Vision Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/12/11 Many slides adapted from Lana Lazebnik,
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry Topics: Basics of projective geometry Points and hyperplanes in projective space Homography.
Multi-view geometry. Multi-view geometry problems Structure: Given projections of the same 3D point in two or more images, compute the 3D coordinates.
Epipolar Geometry and Stereo Vision Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/05/15 Many slides adapted from Lana Lazebnik,
Computer vision: models, learning and inference
Lecture 11 Stereo Reconstruction I Lecture 11 Stereo Reconstruction I Mata kuliah: T Computer Vision Tahun: 2010.
Epipolar Geometry and Stereo Vision Computer Vision ECE 5554 Virginia Tech Devi Parikh 10/15/13 These slides have been borrowed from Derek Hoiem and Kristen.
Structure from images. Calibration Review: Pinhole Camera.
Multi-view geometry.
Recap from Monday Image Warping – Coordinate transforms – Linear transforms expressed in matrix form – Inverse transforms useful when synthesizing images.
Lecture 04 22/11/2011 Shai Avidan הבהרה : החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Geometry 3: Stereo Reconstruction Introduction to Computer Vision Ronen Basri Weizmann Institute of Science.
Metrology 1.Perspective distortion. 2.Depth is lost.
Multiview Geometry and Stereopsis. Inputs: two images of a scene (taken from 2 viewpoints). Output: Depth map. Inputs: multiple images of a scene. Output:
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
Announcements Project 3 due Thursday by 11:59pm Demos on Friday; signup on CMS Prelim to be distributed in class Friday, due Wednesday by the beginning.
EECS 274 Computer Vision Stereopsis.
Affine Structure from Motion
Two-view geometry. Epipolar Plane – plane containing baseline (1D family) Epipoles = intersections of baseline with image planes = projections of the.
EECS 274 Computer Vision Affine Structure from Motion.
Feature Matching. Feature Space Outlier Rejection.
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry Topics: Basics of projective geometry Points and hyperplanes in projective space Homography.
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry Topics: Basics of projective geometry Points and hyperplanes in projective space Homography.
776 Computer Vision Jan-Michael Frahm & Enrique Dunn Spring 2013.
MASKS © 2004 Invitation to 3D vision Uncalibrated Camera Chapter 6 Reconstruction from Two Uncalibrated Views Modified by L A Rønningen Oct 2008.
Reconstruction from Two Calibrated Views Two-View Geometry
Geometry Reconstruction March 22, Fundamental Matrix An important problem: Determine the epipolar geometry. That is, the correspondence between.
Uncalibrated reconstruction Calibration with a rig Uncalibrated epipolar geometry Ambiguities in image formation Stratified reconstruction Autocalibration.
Structure from motion Multi-view geometry Affine structure from motion Projective structure from motion Planches : –
MASKS © 2004 Invitation to 3D vision. MASKS © 2004 Invitation to 3D vision Lecture 1 Overview and Introduction.
Multiple View Geometry and Stereo. Overview Single camera geometry – Recap of Homogenous coordinates – Perspective projection model – Camera calibration.
Lec 26: Fundamental Matrix CS4670 / 5670: Computer Vision Kavita Bala.
CSE 185 Introduction to Computer Vision Stereo 2.
Multi-view geometry. Multi-view geometry problems Structure: Given projections of the same 3D point in two or more images, compute the 3D coordinates.
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry
Epipolar Geometry and Stereo Vision
CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.
Multiple-View Geometry for Image-Based Modeling (Course 42)
STEREOPSIS The Stereopsis Problem: Fusion and Reconstruction
Two-view geometry Computer Vision Spring 2018, Lecture 10
Epipolar geometry.
EECS 274 Computer Vision Stereopsis.
3D Photography: Epipolar geometry
Structure from motion Input: Output: (Tomasi and Kanade)
Multiway Cut for Stereo and Motion with Slanted Surfaces
Multiple View Geometry in Computer Vision
Multiple View Geometry for Robotics
Uncalibrated Geometry & Stratification
Chapter 11: Stereopsis Stereopsis: Fusing the pictures taken by two cameras and exploiting the difference (or disparity) between them to obtain the depth.
Structure from motion Input: Output: (Tomasi and Kanade)
Presentation transcript:

Brief Introduction to Geometry and Vision A.L. Yuille (UCLA)

Plan of Talk Four Topics: (I) Basic Projection. Perspective. Vanishing Points. (II) Camera Calibration. Stereopsis. Essential Matrix. Fundamental Matrix. (III) Structure from Motion. Rigid. Extension to Non-Rigid. (IV) Geometric Priors. Manhattan World.

Geometry of Projection. Most analysis is based on the Pinhole camera model. Real cameras have lens (W. Freeman’s lectures). See Szeliski’s book for corrections to the pinhole camera model.

Properties of Perspective Projection Straight lines project to straight lines. Parallel lines in space project to lines which converge at a vanishing point.

Perspective 1

Perspective 2

Vanishing Points 1.

Vanishing points 2.

Linear approximations: Weak Orthographic Perspective projection can often be approximated by scaled orthographic projection (e.g., if Z is constant). This is a linear operation. Parallel lines project to parallel lines (vanishing points at infinity). This is often a good approximation which is easy to use. Maths of weak orthographic projection.

Linear Projection 1

Two Cameras. Binocular Stereo. Estimate depth from two eyes/cameras by triangulation. Requires solving the correspondence problem between points in the two images. Correspondence problem is helped by the epipolar line constraint. Camera calibration needed.

Epipolar Lines: Points on one epipolar line can only be matched to points on the corresponding epipolar line. Epipolar lines depend on the camera parameters. If both cameras are parallel, then epipolar lines are horizontal. Geometric demonstration of epipolar line constraint.

Stereo Algorithms can exploit epipolar line constraints Simplest model: estimate the disparity d at each point (convert to depth by geometry). Matching unambiguous, despite epipolar line constraints. Regularize by smoothness (ordering), E.g., Marr and Poggio 1978. Arbib and Dev. 1977. Simple Energy function: (Boykov)

Exploit Epipolar Line constraint The epipolar line constraint reduces correspondence to a one-dimensional problem. Dynamic programming can be applied.

Results using Dynamic Programming

Half-Occlusions. Da Vinci’s stereopsis. Points are half-occluded: visible to one eye/camera but not to the other. This gives cues for the detection of boundaries. Geiger, Ladendorf, Yuille, 1992, Belhumeur and Mumford 1992. H. Ishikawa and D. Geiger. 1998 (across epipolar lines).

Camera Callibration Essential Matrix (Longuet-Higgins 1981). Fundamental Matrix (Q.T. Luong and O. D. Faugeras 1992, Hartley 1992). More calibration (Z. Zhang 2000). More reading on geometry: R. Hartley and A. Zisserman. Multiple View Geometry in computer vision. 2003. Y. Ma, S. Soatto, J. Kosecka, and S. Sastry. An Invitation to 3-D Vision. 2004.

Essential Matrix 1

Essential Matrix 2

Structure from Motion: Rigid Linear Projection. 3D structure can be estimated by linear algebra (Singular Value Decomposition). Camera parameters can also be estimated. This estimation is up to an ambiguity. Main paper: C. Tomasi and T. Kanade. 1991. But see also: L.L. Kontsevich, M.L. Kontsevich, A. Kh. Shen. “Two Algorithms fro Reconstruction Shapes}. Avtometriya. 1987.

Structure from Motion: Rigid. Linear projection. Set of images is rank 3. R. Basri and S. Ullman. Recognition by Linear Combinations of Models. 1991. Maths of SVD.

Structure from Motion: Rigid 1

Structure from Motion: Rigid 2

Structure from Motion: 3

Structure from Motion: Rigid 4

Structure from Motion: 5

Structure From Motion: Rigid.

Extension to Non-Rigid Motion This approach can be extended to a special class of non-rigid motion. The object can be expressed as a linear sum of basis functions. The sum varies over time. C. Bregler, A. Hertzmann, and H. Biermann. Recovering non-rigid 3D shape from image streams. CVPR. 2000. Theory clarified by: Y. Dai, H. Li, and M. He. A Simple Prior-free Method for Non Rigid Structure from Motion Factorization, in CVPR 2012 (ORAL). IEEE CVPR Best Paper Award-2012. (Code available). http://users.cecs.anu.edu.au/~hongdong/

Non-rigid motion

Manhattan World Many scenes, particularly man-made scenes, have a natural three- dimensional coordinate systems caused by the structure of the world.

Manhattan World: 2 Back to Ames Room: Non-Manhattan Edges:

Manhattan World 1

Manhattan World 2

Manhattan World 3

Manhattan World 4

Beyond Manhattan: There are parallel lines in the scene, but they are not orthogonal.

Manhattan World Manhattan World stereo: Piecewise planar surfaces with dominant Directions. Instead of assuming Piecewise smoothness. Video available: http://grail.cs.washington.edu/projects/manhattan/ Y. Fuukawa, B. Curless, S. Seitz, and R. Szeliski. 2009.

Manhattan World Grammar Website: http://www.youtube.com/watch?v=s0mhpKFv36g