Structure from Motion Paul Heckbert, Nov. 1999 15-869, Image-Based Modeling and Rendering.

Slides:



Advertisements
Similar presentations
The fundamental matrix F
Advertisements

Structure from motion Digital Visual Effects Yung-Yu Chuang with slides by Richard Szeliski, Steve Seitz, Zhengyou Zhang and Marc Pollefyes.
Structure from motion Digital Visual Effects, Spring 2005 Yung-Yu Chuang 2005/4/20 with slides by Richard Szeliski, Steve Seitz, Zhengyou Zhang and Marc.
MASKS © 2004 Invitation to 3D vision Lecture 7 Step-by-Step Model Buidling.
3D Reconstruction – Factorization Method Seong-Wook Joo KG-VISA 3/10/2004.
Computer vision: models, learning and inference
Two-View Geometry CS Sastry and Yang
N-view factorization and bundle adjustment CMPUT 613.
Camera calibration and epipolar geometry
Structure from motion.
Stanford CS223B Computer Vision, Winter 2005 Lecture 11: Structure From Motion 2 Sebastian Thrun, Stanford Rick Szeliski, Microsoft Hendrik Dahlkamp and.
Announcements Quiz Thursday Quiz Review Tomorrow: AV Williams 4424, 4pm. Practice Quiz handout.
Today Feature Tracking Structure from Motion on Monday (1/29)
Stanford CS223B Computer Vision, Winter 2007 Lecture 8 Structure From Motion Professors Sebastian Thrun and Jana Košecká CAs: Vaibhav Vaish and David Stavens.
Epipolar geometry. (i)Correspondence geometry: Given an image point x in the first view, how does this constrain the position of the corresponding point.
Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.
Uncalibrated Geometry & Stratification Sastry and Yang
Multiple View Geometry Marc Pollefeys University of North Carolina at Chapel Hill Modified by Philippos Mordohai.
Stanford CS223B Computer Vision, Winter 2006 Lecture 8 Structure From Motion Professor Sebastian Thrun CAs: Dan Maynes-Aminzade, Mitul Saha, Greg Corrado.
Multiple-view Reconstruction from Points and Lines
Uncalibrated Epipolar - Calibration
Structure From Motion Sebastian Thrun, Gary Bradski, Daniel Russakoff
Single View Metrology Class 3. 3D photography course schedule (tentative) LectureExercise Sept 26Introduction- Oct. 3Geometry & Camera modelCamera calibration.
© 2003 by Davi GeigerComputer Vision October 2003 L1.1 Structure-from-EgoMotion (based on notes from David Jacobs, CS-Maryland) Determining the 3-D structure.
Previously Two view geometry: epipolar geometry Stereo vision: 3D reconstruction epipolar lines Baseline O O’ epipolar plane.
Computer Vision Structure from motion Marc Pollefeys COMP 256 Some slides and illustrations from J. Ponce, A. Zisserman, R. Hartley, Luc Van Gool, …
Optical Flow Estimation
Multiple View Reconstruction Class 23 Multiple View Geometry Comp Marc Pollefeys.
CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.
May 2004Stereo1 Introduction to Computer Vision CS / ECE 181B Tuesday, May 11, 2004  Multiple view geometry and stereo  Handout #6 available (check with.
Matching Compare region of image to region of image. –We talked about this for stereo. –Important for motion. Epipolar constraint unknown. But motion small.
KLT tracker & triangulation Class 6 Read Shi and Tomasi’s paper on good features to track
Advanced Computer Vision Structure from Motion. Geometric structure-from-motion problem: using image matches to estimate: The 3D positions of the corresponding.
Camera parameters Extrinisic parameters define location and orientation of camera reference frame with respect to world frame Intrinsic parameters define.
CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.
Computer vision: models, learning and inference
Projective geometry of 2-space DLT alg HZ 4.1 Rectification HZ 2.7 Hierarchy of maps Invariants HZ 2.4 Projective transform HZ 2.3 Behaviour at infinity.
Geometric Models & Camera Calibration
Structure from Motion Computer Vision CS 143, Brown James Hays 11/18/11 Many slides adapted from Derek Hoiem, Lana Lazebnik, Silvio Saverese, Steve Seitz,
CSCE 643 Computer Vision: Structure from Motion
Robot Vision SS 2007 Matthias Rüther 1 ROBOT VISION Lesson 6a: Shape from Stereo, short summary Matthias Rüther Slides partial courtesy of Marc Pollefeys.
Parameter estimation. 2D homography Given a set of (x i,x i ’), compute H (x i ’=Hx i ) 3D to 2D camera projection Given a set of (X i,x i ), compute.
Last lecture Passive Stereo Spacetime Stereo Multiple View Stereo.
December 12 th, 2001C. Geyer/K. Daniilidis GRASP Laboratory Slide 1 Structure and Motion from Uncalibrated Catadioptric Views Christopher Geyer and Kostas.
Affine Structure from Motion
Feature Reconstruction Using Lucas-Kanade Feature Tracking and Tomasi-Kanade Factorization EE7740 Project I Dr. Gunturk.
3D Imaging Motion.
Raquel A. Romano 1 Scientific Computing Seminar May 12, 2004 Projective Geometry for Computer Vision Projective Geometry for Computer Vision Raquel A.
Last lecture Passive Stereo Spacetime Stereo.
EECS 274 Computer Vision Affine Structure from Motion.
Structure from motion Digital Visual Effects, Spring 2008 Yung-Yu Chuang 2008/4/22 with slides by Richard Szeliski, Steve Seitz, Zhengyou Zhang and Marc.
776 Computer Vision Jan-Michael Frahm & Enrique Dunn Spring 2013.
Lecture 9 Feature Extraction and Motion Estimation Slides by: Michael Black Clark F. Olson Jean Ponce.
Structure from Motion ECE 847: Digital Image Processing
Motion Estimation Today’s Readings Trucco & Verri, 8.3 – 8.4 (skip 8.3.3, read only top half of p. 199) Newton's method Wikpedia page
Structure from motion Multi-view geometry Affine structure from motion Projective structure from motion Planches : –
Structure from Motion. For now, static scene and moving cameraFor now, static scene and moving camera – Equivalently, rigidly moving scene and static.
Parameter estimation class 5 Multiple View Geometry CPSC 689 Slides modified from Marc Pollefeys’ Comp
Camera Calibration Course web page: vision.cis.udel.edu/cv March 24, 2003  Lecture 17.
Structure from motion Digital Visual Effects, Spring 2007 Yung-Yu Chuang 2007/4/24 with slides by Richard Szeliski, Steve Seitz, Zhengyou Zhang and Marc.
Reconstruction of a Scene with Multiple Linearly Moving Objects Mei Han and Takeo Kanade CISC 849.
Computer vision: models, learning and inference
CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.
Motion and Optical Flow
Structure from motion II
Parameter estimation class 5
Structure from motion Input: Output: (Tomasi and Kanade)
Estimating 2-view relationships
Uncalibrated Geometry & Stratification
Structure from motion Input: Output: (Tomasi and Kanade)
Presentation transcript:

Structure from Motion Paul Heckbert, Nov , Image-Based Modeling and Rendering

Approach Problem: Reconstruct scene geometry and camera motion from two or more images Typically, assume –static scene (model camera motion only) –diffuse, opaque surfaces (simplifies feature tracking) –orthographic projection, for starters, then generalize to projective Steps: –shoot f frames of video (or sequence of stills) –track n feature points from frame to frame –build large matrix of image feature coordinates –factor this matrix to extract motion and shape –build 3-D model by connecting the features with triangles

Image Features Good features are spots, corners, and other points where the image, regarded as a terrain, has high curvature. Good features are image windows that can be tracked well [Tomasi- Kanade 92] Pick good features using the gradient covariance matrix [Lucas-Kanade 81]: How many large eigenvalues does this matrix have? 2 - good feature (spot) 1 - poor (edge - aperture problem) 0 - poor (flat)

Orthographic Projection image point projection matrix scene point image offset Trick Choose scene origin to be centroid of 3D points Choose image origins to be centroid of 2D points Allows us to drop the camera translation:

Tomasi-Kanade Orthographic Method projection of n features in one image: projection of n features in f images W measurement M motion S shape

SVD Factorization Technique W is at most rank 3 (assuming no noise) We can use singular value decomposition to factor W: Tomasi-Kanade Orthographic Method S’ differs from S by a linear transformation A: Solve for A by enforcing constraints on M known solve for

Problems with Basic Orthographic Method, 1 Measurement matrix W might have many voids, since occlusion and noise cause features to appear and disappear. Solution: take linear combinations of rows & columns to “hallucinate” entries. Features can be mis-tracked: Solution: maintain tree of multiple hypotheses SVD is slow: O(fn·min(f,n)). Solution 1: solve bilinear equation by alternating between: freeze S and solve W=MS for M freeze M and solve W=MS for S This is faster than SVD. Solution 2: don’t compute full SVD, but partial one

Problems with Basic Orthographic Method, 2 Feature set too sparse, doesn’t yield a good surface model. Solution: Don’t track as a preprocess. Instead solve for correspondence as you solve for motion and shape. Orthographic projection is poor approximation for nearby objects. Solution: model perspective, but this is more complex. Projective factorization: [Poelman 95] refined orthographic solution using nonlinear optimization (Levenberg-Marquardt)

Triggs’ Projective Method [Triggs CVPR ‘96] generalized orthographic method by using homogeneous coordinates. Image coordinates u have a third, unknown DOF, projective depth. This yields a larger system of equations, but very similar approach. Steps: –track features –find fundamental matrices F between successive frames –use F’s to solve for projective depth –solve projective factorization equation W 3f×n = M 3f × 3 S 3 × n