Non-Euclidean Example: The Unit Sphere

Slides:



Advertisements
Similar presentations
Epipolar Geometry.
Advertisements

3D Geometry for Computer Graphics
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
3D reconstruction.
Medical Image Registration Kumar Rajamani. Registration Spatial transform that maps points from one image to corresponding points in another image.
MASKS © 2004 Invitation to 3D vision Lecture 7 Step-by-Step Model Buidling.
Inspired by the structure tensor which computes the second-order moment of image gradients for representing local image properties, and the Diffusion Tensor.
Mapping: Scaling Rotation Translation Warp
December 5, 2013Computer Vision Lecture 20: Hidden Markov Models/Depth 1 Stereo Vision Due to the limited resolution of images, increasing the baseline.
Structure from motion.
Dimensionality Reduction Chapter 3 (Duda et al.) – Section 3.8
Mean Shift A Robust Approach to Feature Space Analysis Kalyan Sunkavalli 04/29/2008 ES251R.
MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.
Epipolar geometry. (i)Correspondence geometry: Given an image point x in the first view, how does this constrain the position of the corresponding point.
Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.
Uncalibrated Geometry & Stratification Sastry and Yang
Multiple View Geometry Marc Pollefeys University of North Carolina at Chapel Hill Modified by Philippos Mordohai.
Feature extraction: Corners and blobs
© 2003 by Davi GeigerComputer Vision October 2003 L1.1 Structure-from-EgoMotion (based on notes from David Jacobs, CS-Maryland) Determining the 3-D structure.
Previously Two view geometry: epipolar geometry Stereo vision: 3D reconstruction epipolar lines Baseline O O’ epipolar plane.
A Global Geometric Framework for Nonlinear Dimensionality Reduction Joshua B. Tenenbaum, Vin de Silva, John C. Langford Presented by Napat Triroj.
CS223b, Jana Kosecka Rigid Body Motion and Image Formation.
Dorin Comaniciu Visvanathan Ramesh (Imaging & Visualization Dept., Siemens Corp. Res. Inc.) Peter Meer (Rutgers University) Real-Time Tracking of Non-Rigid.
Camera parameters Extrinisic parameters define location and orientation of camera reference frame with respect to world frame Intrinsic parameters define.
Computer vision: models, learning and inference
SVD(Singular Value Decomposition) and Its Applications
Computer vision.
Mean-shift and its application for object tracking
Epipolar geometry The fundamental matrix and the tensor
1 Preview At least two views are required to access the depth of a scene point and in turn to reconstruct scene structure Multiple views can be obtained.
Local invariant features Cordelia Schmid INRIA, Grenoble.
ALIGNMENT OF 3D ARTICULATE SHAPES. Articulated registration Input: Two or more 3d point clouds (possibly with connectivity information) of an articulated.
From Pixels to Features: Review of Part 1 COMP 4900C Winter 2008.
CSE 185 Introduction to Computer Vision Pattern Recognition 2.
December 4, 2014Computer Vision Lecture 22: Depth 1 Stereo Vision Comparing the similar triangles PMC l and p l LC l, we get: Similarly, for PNC r and.
CSCE 643 Computer Vision: Structure from Motion
CMIC Journal Club R. Woods (2003) Neuroimage Characterising Volume and Surface Deformations in an Atlas Framework Ged Ridgway.
Geometric Camera Models
Non-Euclidean Example: The Unit Sphere. Differential Geometry Formal mathematical theory Work with small ‘patches’ –the ‘patches’ look Euclidean Do calculus.
Affine Structure from Motion
Local invariant features Cordelia Schmid INRIA, Grenoble.
EECS 274 Computer Vision Affine Structure from Motion.
1 Chapter 2: Geometric Camera Models Objective: Formulate the geometrical relationships between image and scene measurements Scene: a 3-D function, g(x,y,z)
Uncalibrated reconstruction Calibration with a rig Uncalibrated epipolar geometry Ambiguities in image formation Stratified reconstruction Autocalibration.
Introduction to Scale Space and Deep Structure. Importance of Scale Painting by Dali Objects exist at certain ranges of scale. It is not known a priory.
Instructor: Mircea Nicolescu Lecture 7
Instructor: Mircea Nicolescu Lecture 10 CS 485 / 685 Computer Vision.
11/25/03 3D Model Acquisition by Tracking 2D Wireframes Presenter: Jing Han Shiau M. Brown, T. Drummond and R. Cipolla Department of Engineering University.
CS682, Jana Kosecka Rigid Body Motion and Image Formation Jana Kosecka
CSE 554 Lecture 8: Alignment
Spectral Methods for Dimensionality
Edge Detection using Mean Shift Smoothing
Deep Feedforward Networks
University of Ioannina
Motion Segmentation with Missing Data using PowerFactorization & GPCA
Unsupervised Riemannian Clustering of Probability Density Functions
René Vidal and Xiaodong Fan Center for Imaging Science
Segmentation of Dynamic Scenes
Two-view geometry Computer Vision Spring 2018, Lecture 10
Epipolar geometry.
Common Classification Tasks
CS Visual Recognition Projective Geometry Projective Geometry is a mathematical framework describing image formation by perspective camera. Under.
Outline Peter N. Belhumeur, Joao P. Hespanha, and David J. Kriegman, “Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection,”
Geometric Camera Models
3D reconstruction class 11
Uncalibrated Geometry & Stratification
Segmentation of Dynamical Scenes
Feature space tansformation methods
Single-view geometry Odilon Redon, Cyclops, 1914.
Back to equations of geometric transformations
Presentation transcript:

Non-Euclidean Example: The Unit Sphere

Differential Geometry Is the mathematical theory which… …works with small ‘patches’ looking Euclidean… …and do calculus over these patches.

Manifolds Open Sets Coordinate neighbourhood Compatible neighbourhoods Talk about analytic manifold

Tangent Space Tangent Vectors Tangent Space, Inner Product along D Norm: depends on varies smoothly

Geodesics and Metrics The shortest path between two points X, Y is the geodesic in the manifold. The length of the geodesic is the distance between the two points

Exponential and Logarithm Maps : Maps tangents to the manifold : Maps points on the manifold to Both maps are locally defined only

Gradient In euclidean space: direction of fastest increase On the manifold: tangent of fastest increase is a real valued function on the manifold The gradient at X is is unique satisfying directional derivative along D

Vector Space vs. Manifold Space X the “origin”, Y an other point X. Pennec, P. Fillard and N. Ayache , “A Riemannian Framework for Tensor Computing,” International. Journal of Computer Vision., 66(1), 41–66, 2006.

Sets of matrices, general linear group GL(n,R) Lie Groups Sets of matrices, general linear group GL(n,R) form a group under matrix multiplication are the simplest Riemannian manifolds Examples Rigid body transformations SE(n) (n+1)x(n+1) Rotations SO(n) nxn Affine motions A(n) (n+1)x(n+1) R t 0T 1 W. Rossman, “Lie Groups: An Introduction through Linear Groups,” Oxford University Press, 2003.

Therefore the representation is not unique! Grassmann Manifolds, . Each point on the Grassmann manifold, , represents a dimensional subspace of . Numerically, represented by an orthonormal basis (does not matter which one) matrix such that Therefore the representation is not unique! computation should account for this A. Edelman, T. A. Arias and S. T. Smith, “The Geometry of Algorithms with Orthogonality Constraints,” SIAM Journal on Matrix Analysis and Applications, 20(2), 303–353, 1998.

Equivalent to SO(3)xSO(3) Essential Manifold calibrated cameras Set of matrices with two equal and one zero singular value let the two equal singular values be 1 Equivalent to SO(3)xSO(3) two-time covering of the essential manifold Correction by Dubbelman(2012). Subbarao&Meer(2009) had two solutions from eight possible ones, and one still had to be eliminated S. Soatto, R. Frezza and P. Perona , “Recursive Estimation on the Essential Manifold,” 3rd Europan Conference on Computer Vision, Stockholm, Sweden, May 1994, vol.II, p.61-72.

Twisted pairs gives the same essential matrix Four different camera geometries, but the same essential matrix.

Symmetric Positive Define (SPD) Manifold contains symmetric positive definite matrices. e.g. diffusion tensor MRI has two different metrics Affine Invariant metric (precise) Log-Euclidean Metric (this is used in comp.vision) practically similar to the affine invariant metric computationally easier to work with V. Arsigny , P. Fillard, X. Pennec and N. Ayache , “Geometric Means in a Novel Vector Space Structure on Symmetric Positive-Definite Matrices,” SIAM Journal of Matrix Analysis and Applications, 29(1), 328–347, 2006.

Mean Shift for Euclidean Spaces The kernel density estimate Mean shift as normalized gradient of where The iteration

Gradient ascent on the kernel density on the closest mode. Original Mean Shift Gradient ascent on the kernel density on the closest mode. Equivalent to expectation-maximization for Gaussian kernels Mean shift is a nonparametric clustering D. Comaniciu and P. Meer , “Mean Shift: A Robust Approach Towards Feature Space Analysis,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, 603–619, 2002. D. Comaniciu, V. Ramesh and P. Meer , “Kernel-based Object Tracking,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.25, 564–577, 2003.

Mean Shift for Manifolds The kernel density estimate on the manifold Mean shift as normalized gradient of , tangent space The iteration

Mean Shift for Riemannian Manifolds Map locally the points to tangent space Compute the weighted average of the tangent vectors and obtain the mean shift vector Map the mean shift vector back to the manifold

Properties for Nonlinear Mean Shift Still the gradient ascent on the kernel density Nonlinear mean shift is convergent but depend on an upper limit on allowed bandwidth in the tangent space! nonlinear mean shift equivalent to EM for Gaussians kernels. Valid for the groups mentioned. All significant modes may come out in the manifold.

Clustering by nonlinear mean shift Motion Segmentation Hypothesis Generation matrix Lie groups Randomly pick N elemental subset of corresponding points. May have outlier elemental subsets too. Generate N parameter hypotheses R, t Clustering by nonlinear mean shift the clustering needs h(-s) Returns Number of dominant modes Positions of dominant modes

2D Affine Motion A(2) matrix 3x3 Total of 80 matches. Affine has 6 parameters. Three point pairs per elemental subset. 500 clustering hypotheses.

Camera Pose Estimation SE(3) 4x4 matrix Calibrated camera. Three point clouds, triangulated in 3D have different coordinate systems. With all three scenes present, segmentation in 2D can be used to establish 2D to 3D correspondences for the entire scene. 3D poses are estimated with SE(3) and h=0.1.

Results with all three scenes together M1 books on right M2 books on left M3 box in middle

Factorization G10,3 uncalibrated affine camera Grassmann. 5 frames n=2x5=10. planar move each object k=3. e.subset 3 points in all 5 frames. 257 corners. 1000 hypotheses. h = 0.07

Essential Matrix SO(3)xSO(3) Two motions. 100 point matches. 1000 hypotheses. h = 0.001 Second solution comes from rotation +-p around the y-axis, which maps into the same tangent vector.

Discontinuity Preserving Filtering An image is a mapping from a lattice in to data lying on a manifold Filtering: Run mean shift in the space 2D hs Mean shift iterations update the spatial and parameter values. The iteration from converges to set in the filtered image

Chromatic Noise Filtering G3,1 Chromaticity affects direction of the color vector but not the intensity. s = 0.2 original mean shift hs=11 hp=7 chromatic hs=11 hp=0.7

Chromatic Noise Filtering original mean shift hs=11 hp=10.5 chromatic hs=11 hp=0.3

Diffusion tensor Image R3xSym+3 Images encoded with 3x3 Symmetric Positive Definite (SPD) matrices at each pixel. Smoothing the DTI improves the result.

Group theory starts to be used as part of a deep learning method. Example: Lohit,Turaga: Learning Invariant Riemannian Geometric Representations Using Deep Nets. https://arxiv.org/pdf/1708.09 Grassmannian manifold multi-class classification on a unit hypersphere Exmp: Face to illumination subspace 28x28 images G784,d Orthonornal matrix U:784xd The grassmannian manifold is invariant to orthogonal transformations, like illuminations, which is in a low dim. subspace. The 90% of the variances are in the first d=5 principal components of a database. 250(200+50) 3D random faces. 33 illuminations from Yale database B, which had a total of 64 illuminations. Neural network has 3 convolution layers and 2 full connected. || Frechet (geometric) mean in the tangent space for the ground-truth subspace of the training set. || All the results are projected in the manifold.

gives RQ*, a new point of the manifold. d=5 d=4 T DG the mean subspace distance of principal angles obtain from Uavg Utest Returns the subspace of illumination with a point of the manifold R which spans the same subspace as the columns of U. The rotation Q* = min ||Uavg – RQ||F gives RQ*, a new point of the manifold. orthogonal Procrustes problem