Computer Vision Laboratory 1 Unrestricted Recognition of 3-D Objects Using Multi-Level Triplet Invariants Gösta Granlund and Anders Moe Computer Vision.

Slides:



Advertisements
Similar presentations
Epipolar Geometry.
Advertisements

Learning deformable models Yali Amit, University of Chicago Alain Trouvé, CMLA Cachan.
Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.
Alignment Visual Recognition “Straighten your paths” Isaiah.
3D Geometry for Computer Graphics
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Isoparametric Elements Element Stiffness Matrices
RGB-D object recognition and localization with clutter and occlusions Federico Tombari, Samuele Salti, Luigi Di Stefano Computer Vision Lab – University.
電腦視覺 Computer and Robot Vision I
November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
電腦視覺 Computer and Robot Vision I Chapter2: Binary Machine Vision: Thresholding and Segmentation Instructor: Shih-Shinh Huang 1.
Computer Vision Lecture 16: Region Representation
Mapping: Scaling Rotation Translation Warp
September 10, 2013Computer Vision Lecture 3: Binary Image Processing 1Thresholding Here, the right image is created from the left image by thresholding,
Image Indexing and Retrieval using Moment Invariants Imran Ahmad School of Computer Science University of Windsor – Canada.
Uncertainty Representation. Gaussian Distribution variance Standard deviation.
Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Data Structures For Image Analysis
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
Surface Reconstruction from 3D Volume Data. Problem Definition Construct polyhedral surfaces from regularly-sampled 3D digital volumes.
Motion Analysis Slides are from RPI Registration Class.
CS485/685 Computer Vision Prof. George Bebis
Computer Vision Laboratory 1 B-Spline Channels & Channel Smoothing Michael Felsberg Computer Vision Laboratory Linköping University SWEDEN.
Computer Vision Laboratory 1 Hiperlearn: A Channel Learning Architecture Gösta Granlund Computer Vision Laboratory Linköping University SWEDEN.
Regionalized Variables take on values according to spatial location. Given: Where: A “structural” coarse scale forcing or trend A random” Local spatial.
A unified statistical framework for sequence comparison and structure comparison Michael Levitt Mark Gerstein.
CS292 Computational Vision and Language Visual Features - Colour and Texture.
Learning the space of time warping functions for Activity Recognition Function-Space of an Activity Ashok Veeraraghavan Rama Chellappa Amit K. Roy-Chowdhury.
Comparison and Combination of Ear and Face Images in Appearance-Based Biometrics IEEE Trans on PAMI, VOL. 25, NO.9, 2003 Kyong Chang, Kevin W. Bowyer,
Information that lets you recognise a region.
October 7, 2010Neural Networks Lecture 10: Setting Backpropagation Parameters 1 Creating Data Representations On the other hand, sets of orthogonal vectors.
CSci 6971: Image Registration Lecture 5: Feature-Base Regisration January 27, 2004 Prof. Chuck Stewart, RPI Dr. Luis Ibanez, Kitware Prof. Chuck Stewart,
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry Topics: Basics of projective geometry Points and hyperplanes in projective space Homography.
Image Segmentation by Clustering using Moments by, Dhiraj Sakumalla.
Introduction --Classification Shape ContourRegion Structural Syntactic Graph Tree Model-driven Data-driven Perimeter Compactness Eccentricity.
Computer vision.
Machine Vision for Robots
October 14, 2014Computer Vision Lecture 11: Image Segmentation I 1Contours How should we represent contours? A good contour representation should meet.
Intelligent Vision Systems ENT 496 Object Shape Identification and Representation Hema C.R. Lecture 7.
September 23, 2014Computer Vision Lecture 5: Binary Image Processing 1 Binary Images Binary images are grayscale images with only two possible levels of.
September 5, 2013Computer Vision Lecture 2: Digital Images 1 Computer Vision A simple two-stage model of computer vision: Image processing Scene analysis.
Generalized Hough Transform
November 13, 2014Computer Vision Lecture 17: Object Recognition I 1 Today we will move on to… Object Recognition.
Course 9 Texture. Definition: Texture is repeating patterns of local variations in image intensity, which is too fine to be distinguished. Texture evokes.
September 17, 2013Computer Vision Lecture 5: Image Filtering 1ColorRGB HSI.
Image Compression – Fundamentals and Lossless Compression Techniques
December 9, 2014Computer Vision Lecture 23: Motion Analysis 1 Now we will talk about… Motion Analysis.
1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.
1 Self-Calibration and Neural Network Implementation of Photometric Stereo Yuji IWAHORI, Yumi WATANABE, Robert J. WOODHAM and Akira IWATA.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.
Yizhou Yu Texture-Mapping Real Scenes from Photographs Yizhou Yu Computer Science Division University of California at Berkeley Yizhou Yu Computer Science.
1 Overview representing region in 2 ways in terms of its external characteristics (its boundary)  focus on shape characteristics in terms of its internal.
Principal Component Analysis
Instructor: Mircea Nicolescu Lecture 5 CS 485 / 685 Computer Vision.
Digital Image Processing, 3rd ed. © 1992–2008 R. C. Gonzalez & R. E. Woods Gonzalez & Woods Chapter 11 Representation and.
Copyright ©2008, Thomson Engineering, a division of Thomson Learning Ltd.
Visual homing using PCA-SIFT
Lecture 07 13/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Recognizing Deformable Shapes
René Vidal and Xiaodong Fan Center for Imaging Science
Mean Shift Segmentation
Slope and Curvature Density Functions
Computer Vision Lecture 5: Binary Image Processing
Computer Vision Lecture 4: Color
Object Recognition Today we will move on to… April 12, 2018
Quantum Two.
Fourier Transform of Boundaries
Random Neural Network Texture Model
Presentation transcript:

Computer Vision Laboratory 1 Unrestricted Recognition of 3-D Objects Using Multi-Level Triplet Invariants Gösta Granlund and Anders Moe Computer Vision Laboratory Linköping University SWEDEN

Computer Vision Laboratory 2 By unrestricted, we imply that the recognition shall be done independently of object position, scale, orientation and pose, against a structured background. It shall not assume any preceding segmentation and allow a reasonable degree of occlusion.

Computer Vision Laboratory 3 Traditional approach

Computer Vision Laboratory 4 Object Representation

Computer Vision Laboratory 5 Object Parameters VarObject characterization ωObject class xHorisontal position of object yVertical position of object Horisontal pose angle of object θVertical pose angle of object ψOrientation of object in image plane sScale or size of object

Computer Vision Laboratory 6 Object Identification -- An Inverse Problem Implies a two-step process: 1.Postulation of a certain model 2.Performing measurements, and comparing these with a reference, under the assumption of the particular model

Computer Vision Laboratory 7 Requirements of Model Structure Models shall be fragmentable such that a certain model can be part of a more complex or higher order model. Due to this recursive character, we will simply denote them all models, be it parts or combinations. Learning of models shall proceed from lower levels to higher levels. Acquired lower level models shall be usable as parts of several different higher order models. A particular model is only acquired once, and its first occurrence is used as the representation of that model.

Computer Vision Laboratory 8 Invariance to illumination

Computer Vision Laboratory 9 Local versus global properties

Computer Vision Laboratory 10 Conflicting interpretations

Computer Vision Laboratory 11 Object Representation

Computer Vision Laboratory 12 Compact description of regions

Computer Vision Laboratory 13 Edges and lines

Computer Vision Laboratory 14 Curvature, corners I

Computer Vision Laboratory 15 Curvature, corners II

Computer Vision Laboratory 16 Triplet Models

Computer Vision Laboratory 17 Multi-level Triplet Models

Computer Vision Laboratory 18 Properties of Triplet Structure It allows a unique ordering of the feature points, which is implemented such that the triplet is ”right oriented”, i.e. that the angle α < π. The triplet structure allows us to define a scale invariant structure parameter The distance between the two feature points not connected by the triplet, must be shorter than the two other distances between feature points. The triplet can be brought into a ”normal orientation” by aligning leg to make

Computer Vision Laboratory 19 Invariance Properties The preceding properties together with the hierarchical arrangement of triplets make the following parameter variations trivial: Orientation in the image plane Scale Object position in x and y

Computer Vision Laboratory 20 Procedures Statistics Assumption of Structure Preselection Grouping

Computer Vision Laboratory 21 Examples of Grouping Rules Spatial grouping range: We expect primitives to increase in spatial size going towards higher levels. Object closure criteria: Tests for homogeneity such as similar density or color inside the triplets, to indicate parts of a common object or region. Symmetry

Computer Vision Laboratory 22 Channel Inform. Representation

Computer Vision Laboratory 23 Triplet Components A triplet can be characterized in a number of equivalent fashions. The components used are: : Point feature vectors, k=1,2,3, each one coded with h f channels. : Angle between triplet legs 1 and 2, coded with h a channels. : Relative length of triplet legs, coded with h g channels.

Computer Vision Laboratory 24 Triplet Vector

Computer Vision Laboratory 25 Mapping Onto Object States

Computer Vision Laboratory 26 Dynamic Binding Variables From the purely geometric and feature related entities, it is desired to map into variables that are object-related: They change as a consequence of manipulation of the object, which is essential They can be expected to be shared with, or at least coupled to, other primitives at the same level, or at a different level

Computer Vision Laboratory 27 Object States

Computer Vision Laboratory 28 Mapping from Features to Objects

Computer Vision Laboratory 29 Features for Higher Level Triplets

Computer Vision Laboratory 30 Mapping from Features to Objects

Computer Vision Laboratory 31 Consistency Check of Outputs

Computer Vision Laboratory 32 Removal of Multiple Models

Computer Vision Laboratory 33 Estimation of Object Orientation Orientation estimates of an object are obtained as the difference between the observed orientation of the higher level triplets, and their estimates of the original orientation at training.

Computer Vision Laboratory 34 Estimation of Scale Derived estimate of scale is divided by the length of the original triplet.

Computer Vision Laboratory 35 Object Training Setup

Computer Vision Laboratory 36 Object scanner

Computer Vision Laboratory 37 Training Sequence

Computer Vision Laboratory 38 Estim. Pose, Orientation and Scale

Computer Vision Laboratory 39 Average Error for 60° Rotated and 10% Rescaled Object in Test Set EstimateAverage Error Pose-x1.8° Pose-y1.8° Orientation6.3° Scale4.9%

Computer Vision Laboratory 40 Scaled and Occluded Object

Computer Vision Laboratory 41 Multiple, Occluded Objects

Computer Vision Laboratory 42 Estimates of Pose