3D Motion Estimation. 3D model construction Video Manipulation.

Slides:



Advertisements
Similar presentations
Single-view geometry Odilon Redon, Cyclops, 1914.
Advertisements

Epipolar Geometry.
3D reconstruction.
Two-View Geometry CS Sastry and Yang
Two-view geometry.
Camera calibration and epipolar geometry
Camera Models A camera is a mapping between the 3D world and a 2D image The principal camera of interest is central projection.
Structure from motion.
3D Computer Vision and Video Computing 3D Vision Topic 4 of Part II Visual Motion CSc I6716 Fall 2011 Cover Image/video credits: Rick Szeliski, MSR Zhigang.
Motion Analysis Slides are from RPI Registration Class.
Epipolar geometry. (i)Correspondence geometry: Given an image point x in the first view, how does this constrain the position of the corresponding point.
Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.
Uncalibrated Geometry & Stratification Sastry and Yang
Epipolar Geometry and the Fundamental Matrix F
CSc83029 – 3-D Computer Vision/ Ioannis Stamos 3-D Computational Vision CSc Optical Flow & Motion The Factorization Method.
Single-view geometry Odilon Redon, Cyclops, 1914.
Linearizing (assuming small (u,v)): Brightness Constancy Equation: The Brightness Constraint Where:),(),(yxJyxII t  Each pixel provides 1 equation in.
CS223b, Jana Kosecka Rigid Body Motion and Image Formation.
COMP 290 Computer Vision - Spring Motion II - Estimation of Motion field / 3-D construction from motion Yongjik Kim.
Camera parameters Extrinisic parameters define location and orientation of camera reference frame with respect to world frame Intrinsic parameters define.
Multiple View Geometry
Motion from normal flow. Optical flow difficulties The aperture problemDepth discontinuities.
3D Computer Vision and Video Computing 3D Vision Topic 8 of Part 2 Visual Motion (II) CSC I6716 Spring 2004 Zhigang Zhu, NAC 8/203A
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm Lecture #15.
Multi-view geometry. Multi-view geometry problems Structure: Given projections of the same 3D point in two or more images, compute the 3D coordinates.
776 Computer Vision Jan-Michael Frahm, Enrique Dunn Spring 2013.
Camera Geometry and Calibration Thanks to Martial Hebert.
Multi-view geometry.
1 Preview At least two views are required to access the depth of a scene point and in turn to reconstruct scene structure Multiple views can be obtained.
Linearizing (assuming small (u,v)): Brightness Constancy Equation: The Brightness Constraint Where:),(),(yxJyxII t  Each pixel provides 1 equation in.
The Brightness Constraint
Homogeneous Coordinates (Projective Space) Let be a point in Euclidean space Change to homogeneous coordinates: Defined up to scale: Can go back to non-homogeneous.
Course 12 Calibration. 1.Introduction In theoretic discussions, we have assumed: Camera is located at the origin of coordinate system of scene.
Computer Vision, Robert Pless Lecture 11 our goal is to understand the process of multi-camera vision. Last time, we studies the “Essential” and “Fundamental”
视觉的三维运动理解 刘允才 上海交通大学 2002 年 11 月 16 日 Understanding 3D Motion from Images Yuncai Liu Shanghai Jiao Tong University November 16, 2002.
Metrology 1.Perspective distortion. 2.Depth is lost.
1 Fundamentals of Robotics Linking perception to action 2. Motion of Rigid Bodies 南台科技大學電機工程系謝銘原.
Multiview Geometry and Stereopsis. Inputs: two images of a scene (taken from 2 viewpoints). Output: Depth map. Inputs: multiple images of a scene. Output:
Geometry of Multiple Views
Single-view geometry Odilon Redon, Cyclops, 1914.
3D Imaging Motion.
A Flexible New Technique for Camera Calibration Zhengyou Zhang Sung Huh CSPS 643 Individual Presentation 1 February 25,
Two-view geometry. Epipolar Plane – plane containing baseline (1D family) Epipoles = intersections of baseline with image planes = projections of the.
EECS 274 Computer Vision Affine Structure from Motion.
Computer vision: models, learning and inference M Ahad Multiple Cameras
Part II Plane Equation and Epipolar Geometry. Duality.
Linearizing (assuming small (u,v)): Brightness Constancy Equation: The Brightness Constraint Where:),(),(yxJyxII t  Each pixel provides 1 equation in.
Single-view geometry Odilon Redon, Cyclops, 1914.
Dynamic Perspective How a moving camera reveals scene depth and egomotion parameters Jitendra Malik UC Berkeley.
Linearizing (assuming small (u,v)): Brightness Constancy Equation: The Brightness Constraint Where:),(),(yxJyxII t  Each pixel provides 1 equation in.
Camera Calibration Course web page: vision.cis.udel.edu/cv March 24, 2003  Lecture 17.
Lec 26: Fundamental Matrix CS4670 / 5670: Computer Vision Kavita Bala.
11/25/03 3D Model Acquisition by Tracking 2D Wireframes Presenter: Jing Han Shiau M. Brown, T. Drummond and R. Cipolla Department of Engineering University.
Multi-view geometry. Multi-view geometry problems Structure: Given projections of the same 3D point in two or more images, compute the 3D coordinates.
CS682, Jana Kosecka Rigid Body Motion and Image Formation Jana Kosecka
Calibrating a single camera
3D Vision Topic 4 of Part II Visual Motion CSc I6716 Fall 2009
Homogeneous Coordinates (Projective Space)
The Brightness Constraint
3D Motion Estimation.
Epipolar geometry.
Structure from motion Input: Output: (Tomasi and Kanade)
The Brightness Constraint
The Brightness Constraint
Uncalibrated Geometry & Stratification
George Mason University
Course 7 Motion.
Multi-view geometry.
Single-view geometry Odilon Redon, Cyclops, 1914.
Structure from motion Input: Output: (Tomasi and Kanade)
Presentation transcript:

3D Motion Estimation

3D model construction

Video Manipulation

Visual Motion Allows us to compute useful properties of the 3D world, with very little knowledge. Example: Time to collision

Time to Collision f L v L DoDoDoDo l(t) An object of height L moves with constant velocity v: At time t=0 the object is at:At time t=0 the object is at: D(0) = D o D(0) = D o At time t it is atAt time t it is at D(t) = D o – vtD(t) = D o – vt It will crash with the camera at time:It will crash with the camera at time: D(  ) = D o – v  = 0 D(  ) = D o – v  = 0  = D o /v  = D o /v t=0t D(t)

Time to Collision f L v L DoDoDoDo l(t) t=0t D(t) The image of the object has size l(t): Taking derivative wrt time:

Time to Collision f L v L DoDoDoDo l(t) t=0t D(t) And their ratio is:

Time to Collision f L v L DoDoDoDo l(t) t=0t D(t) And time to collision: Can be directly measured from image Can be found, without knowing L or D o or v !!

Structure from Motion t,  r Z u

Passive Navigation and Structure from Motion

Image Flow due to Rigid Motion The velocity of a point with respect to the XYZ coordinate system is Scaling ambiguity: We can compute the translation only up to a scale factor (Kt, KZ) give the same flow as (t, Z).

Consider a 3D point P and its image: f P p Z z Using pinhole camera equation:

Let things move: f P V p v Z z The relative velocity of P wrt camera: Translation velocity Rotation angular velocity

3D Relative Velocity: f P V p v Z z The relative velocity of P wrt camera:

Motion Field: the velocity of p f P V p v Z z Taking derivative wrt time:

Motion Field: the velocity of p f P V p v Z z

f P V p v Z z

f P V p v Z z Translational component Scaling ambiguity (t and Z can only be derived up to a scale Factor)

Motion Field: the velocity of p f P V p v Z z Rotational component NOTE: The rotational component is independent of depth Z !

Translational flow field

Rotational flow field

Pure Translation What if t z  0 ? All motion field vectors are parallel to each other and inversely proportional to depth !

Pure Translation: Properties of the MF If t z  0 the MF is RADIAL with all vectors pointing towards (or away from) a single point p o. If t z = 0 the MF is PARALLEL. The length of the MF vectors is inversely proportional to depth Z. If t z  0 it is also directly proportional to the distance between p and p o.

Pure Translation: Properties of the MF p o is the vanishing point of the direction of translation. p o is the intersection of the ray parallel to the translation vector and the image plane.

Special Case: Moving Plane f P V p v Z z n Planar surfaces are common in man-made environments Question: How does the MF of a moving plane look like?

Special Case: Moving Plane Points on the plane must satisfy the equation describing the plane. Let n be the unit vector normal to the plane.n be the unit vector normal to the plane. d be the distance from the plane to the origin.d be the distance from the plane to the origin. NOTE: If the plane is moving wrt camera, n and d are functions of time.NOTE: If the plane is moving wrt camera, n and d are functions of time. Then: d P Z n O YXwhere

Special Case: Moving Plane d P Z n O Y X Using the pinhole projection equation: p Let be the image of P Using the plane equation: Solving for Z:

Special Case: Moving Plane And Plug in: d P Z n O YXp Now consider the MF equations:

Special Case: Moving Planed P Z n O YXp The MF equations become: where

Special Case: Moving Plane d P Z n O YXp MF equations: MF equations: Q: What is the significance of this? A: The MF vectors are given by low order (second) polynomials. Their coeffs. a 1 to a 8 (only 8 !) are functions of n, d, t and .Their coeffs. a 1 to a 8 (only 8 !) are functions of n, d, t and . The same coeffs. can be obtained with a different plane and relative velocity.The same coeffs. can be obtained with a different plane and relative velocity.

Moving Plane: Properties of the MF The MF of a planar surface is at any time a quadratic function in the image coordinates. A plane n T P=d moving with velocity V=-t-  £ P has the same MF than a plane with normal n’=t/|t|, distance d and moving with velocity V=|t|n –(  + n £ t/d) £ P

Classical Structure from Motion Established approach is the epipolar minimization: The “derotated flow” should be parallel to the translational flow. u u -u rot E E u tr t

The Translational Case (a least squares formulation) Substitute back

Step 2: Differentiate with respect to t x, t y, t z, set expression to zero. Equations are nonlinear in t x, t y, t z

Using a different norm First, differentiate integrand with respect to Z and set to zero

where

Differentiate g( t x, t y, t z ) with respect to t x, t y, t z and set to zero Solution is singular vector corresponding to smallest singular value

The Rotational Case In matrix form Let f=1

The General Case Minimization of epipolar distance or, in vector notation

Motion Parallax The difference in motion between two very close points does not depend on rotation. Can be used at depth discontinuities to obtain the direction of translation. FOE

Motion Parallax

Vectors perpendicular to translational component Vector component perpendicular to translational component is only due to rotation rotation can be estimated from it.

Motion Estimation Techniques Prazdny (1981), Burger Bhanu (1990), Nelson Aloimonos (1988), Heeger Jepson (1992): Decomposition of flow field into translational and rotational components. Translational flow field is radial (all vectors are emanating from (or pouring into) one point), rotational flow field is quadratic in image coordinates. Either search in the space of rotations: remainig flow field should be translational. Translational flow field is evaluated by minimizing deviation from radial field:. Or search in the space of directions of translation: Vectors perpendicular to translation are due to rotation only

Motion Estimation Techniques Longuet-Higgins Prazdny (1980), Waxman (1987): Parametric model for local surface patches (planes or quadrics) solve locally for motion parameters and structure, because flow is linear in the motion parameters (quadratic or higher order in the image coordinates)