CSCE 643 Computer Vision: Extractions of Image Features Jinxiang Chai.


Good Image Features What are we looking for? –Strong features –Invariant to changes (affine and perspective, occlusion, illumination, etc.)

Feature Extraction Why do we need to detect features? - Features correspond to important points in both the world and image spaces - Object detection/recognition - Solving the correspondence problem: locate an object in multiple images (e.g., in video), track its path, infer 3D structure, and recover object and camera motion

Outline Image Features - Corner detection - SIFT extraction

What are Corners? Point features

What are Corners? Point features Where two edges come together Where the image gradient has significant components in the x and y direction We will establish corners from the gradient rather than the edge images

Basic Ideas What are gradients along x and y directions?

Basic Ideas What are gradients along x and y directions?

Basic Ideas What are gradients along x and y directions? How to measure corners based on the gradient images?

Basic Ideas What are gradients along x and y directions? How to measure corners based on the gradient images? - two major axes in the local window!

How to Find Two Major Axes? Principal component analysis (PCA). The relative length of the two major axes depends on the ratio of the eigenvalues (λ1/λ2).

Corner Detection Algorithm 1. Compute the image gradients 2. Define a neighborhood size as the area of interest around each pixel, e.g. a 3x3 neighborhood

Corner Detection Algorithm (cont’d) 3. For each image pixel (i,j), construct the matrix C(i,j) from the gradient values in its neighborhood. C is similar to the covariance matrix of (Ix, Iy)T!
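Written out (a standard form of this matrix, restored here since the slide's equation image did not survive), C sums the products of the gradients over the neighborhood window W:

```latex
C(i,j) = \sum_{(x,y)\in W}
\begin{pmatrix}
I_x^2 & I_x I_y \\
I_x I_y & I_y^2
\end{pmatrix}
```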

Corner Detection Algorithm (cont’d) 4. For each matrix C(i,j), determine the two eigenvalues λ(i,j) = [λ1, λ2]. Simple case: the dominant gradient direction aligns with the x or y axis. If either λ1 or λ2 is close to zero, then this is not a corner.

Corner Detection Algorithm (cont’d) 4. For each matrix C(i,j), determine the two eigenvalues λ(i,j) = [λ1, λ2]. Simple case:
- Isolated pixels: λ1, λ2 = 0
- Interior region: small λ1 and small λ2
- Edge: large λ1 and small λ2
- Corner: large λ1 and large λ2

Corner Detection Algorithm (cont’d) 4. For each matrix C(i,j), determine the two eigenvalues λ(i,j) = [λ1, λ2]. General case: this is just a rotated version of the simple case on the last slide. If either λ1 or λ2 is close to zero, then this is not a corner. The test is invariant to 2D rotation.

Eigen-values and Corner - λ1 is large - λ2 is large

Eigen-values and Corner - λ1 is large - λ2 is small

Eigen-values and Corner - λ1 is small - λ2 is small

Corner Detection Algorithm (cont’d) 4. For each matrix C(i,j), determine the two eigenvalues λ(i,j) = [λ1, λ2]. 5. If both λ1 and λ2 are big, we have a corner (Harris also checks that the ratio of the λs is not too high). ISSUE: the set of corners obtained is a function of the threshold!
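Steps 1-5 can be sketched in NumPy as follows. This is a minimal, un-optimized illustration (the function name and threshold value are hypothetical), using the smaller eigenvalue of C as the corner score:

```python
import numpy as np

def harris_corners(image, window=3, threshold=10000.0):
    """Flag pixels where both eigenvalues of C are big (step 5)."""
    Iy, Ix = np.gradient(image.astype(float))     # step 1: image gradients
    Ixx, Iyy, Ixy = Ix * Ix, Iy * Iy, Ix * Iy
    r = window // 2                               # step 2: neighborhood size
    h, w = image.shape
    corners = []
    for i in range(r, h - r):
        for j in range(r, w - r):
            # step 3: build C from the neighborhood's gradient products
            sxx = Ixx[i - r:i + r + 1, j - r:j + r + 1].sum()
            syy = Iyy[i - r:i + r + 1, j - r:j + r + 1].sum()
            sxy = Ixy[i - r:i + r + 1, j - r:j + r + 1].sum()
            C = np.array([[sxx, sxy], [sxy, syy]])
            # step 4: smaller eigenvalue; step 5: both big => corner
            if np.linalg.eigvalsh(C)[0] > threshold:
                corners.append((i, j))
    return corners
```

On a synthetic bright square, this fires at the square's corners but not along its edges, where the smaller eigenvalue stays near zero.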

Image Gradients

Closeup of image orientation at each pixel

The Orientation Field Corners are detected where both λ1 and λ2 are big

Corner Detection Sample Results with thresholds 25,000, 10,000, and 5,000

Outline Image Features - Corner detection - SIFT extraction

Scale Invariant Feature Transform (SIFT) Choosing features that are invariant to image scaling and rotation Also, partially invariant to changes in illumination and 3D camera viewpoint

Motivation for SIFT Earlier Methods –Harris corner detector Sensitive to changes in image scale Finds locations in the image with large gradients in two directions –No method was fully affine invariant Although the SIFT approach is not fully affine invariant, it allows for considerable affine change SIFT also allows for changes in 3D viewpoint

Invariance Illumination Scale Rotation Affine

Readings David G. Lowe, "Object recognition from local scale-invariant features," ICCV 1999. David G. Lowe, "Distinctive image features from scale-invariant keypoints," International Journal of Computer Vision, 60, 2 (2004), pp. 91-110.

SIFT Algorithm Overview 1. Scale-space extrema detection 2. Keypoint localization 3. Orientation assignment 4. Generation of keypoint descriptors

Scale Space Different scales are appropriate for describing different objects in the image, and we may not know the correct scale/size ahead of time.

Scale space (Cont.) Looking for features (locations) that are stable (invariant) across all possible scale changes –use a continuous function of scale (scale space) Which scale-space kernel will we use? –The Gaussian Function

Scale-Space of Image L(x, y, σ) = G(x, y, σ) * I(x, y), where G(x, y, σ) is a variable-scale Gaussian and I(x, y) is the input image. To detect stable keypoint locations, find the scale-space extrema in the difference-of-Gaussian function D(x, y, σ) = (G(x, y, kσ) - G(x, y, σ)) * I(x, y). Look familiar? It is a bandpass filter!

Difference of Gaussian
1. A = Convolve image with vertical and horizontal 1D Gaussians, σ = sqrt(2)
2. B = Convolve A with vertical and horizontal 1D Gaussians, σ = sqrt(2)
3. DOG (Difference of Gaussian) = A - B
4. To handle different scales, downsample B with bilinear interpolation at a pixel spacing of 1.5 (a linear combination of 4 adjacent pixels) and repeat

Difference of Gaussian Pyramid: blur the input image to get A1, blur again to get B1, and take DOG1 = A1 - B1; downsample B1 to start the next level, giving A2, B2, DOG2 = A2 - B2, and then A3, B3, DOG3 = A3 - B3.
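The pyramid construction above can be sketched in NumPy. This is an illustrative sketch (the helper names are hypothetical): a separable 1D Gaussian blur with edge padding, and a simple bilinear resampling of B at pixel spacing 1.5:

```python
import numpy as np

def gaussian_blur(img, sigma):
    """Separable blur: 1D Gaussian along rows, then along columns."""
    radius = int(3 * sigma + 0.5)
    x = np.arange(-radius, radius + 1)
    k = np.exp(-x ** 2 / (2 * sigma ** 2))
    k /= k.sum()
    blur1d = lambda v: np.convolve(np.pad(v, radius, mode='edge'), k, mode='valid')
    out = np.apply_along_axis(blur1d, 1, img.astype(float))
    return np.apply_along_axis(blur1d, 0, out)

def dog_pyramid(image, levels=3, sigma=np.sqrt(2)):
    """A = blur(image), B = blur(A), DOG = A - B; then downsample B by 1.5."""
    pyramid, img = [], image.astype(float)
    for _ in range(levels):
        A = gaussian_blur(img, sigma)
        B = gaussian_blur(A, sigma)
        pyramid.append(A - B)
        # bilinear resampling of B with pixel spacing 1.5
        h, w = int(img.shape[0] / 1.5), int(img.shape[1] / 1.5)
        ys = np.minimum(np.arange(h) * 1.5, img.shape[0] - 1.001)
        xs = np.minimum(np.arange(w) * 1.5, img.shape[1] - 1.001)
        y0, x0 = ys.astype(int), xs.astype(int)
        fy, fx = (ys - y0)[:, None], (xs - x0)[None, :]
        img = ((1 - fy) * (1 - fx) * B[np.ix_(y0, x0)]
               + (1 - fy) * fx * B[np.ix_(y0, x0 + 1)]
               + fy * (1 - fx) * B[np.ix_(y0 + 1, x0)]
               + fy * fx * B[np.ix_(y0 + 1, x0 + 1)])
    return pyramid
```

A quick sanity check: a constant image has zero response at every DOG level, since blurring a constant leaves it unchanged.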

Other issues Initial smoothing ignores the highest spatial frequencies of the image - expand the input image by a factor of 2, using bilinear interpolation, prior to building the pyramid How do we do downsampling with bilinear interpolation?

Bilinear Filter Weighted sum of four neighboring pixels

Bilinear Filter Sampling at S(x,y), with (i,j), (i+1,j), (i,j+1), (i+1,j+1) the four neighboring pixels:
S(x,y) = a*b*S(i,j) + a*(1-b)*S(i+1,j) + (1-a)*b*S(i,j+1) + (1-a)*(1-b)*S(i+1,j+1)
To optimize the above, interpolate along one direction first:
S_i = S(i,j) + (1-a)*(S(i,j+1) - S(i,j))
S_j = S(i+1,j) + (1-a)*(S(i+1,j+1) - S(i+1,j))
S(x,y) = S_i + (1-b)*(S_j - S_i)

Pyramid Example: A1, B1, DOG1; A2, B2, DOG2; A3, B3, DOG3.

Feature Detection Find maxima and minima of scale space For each point on a DOG level: –Compare to 8 neighbors at same level –If max/min, identify corresponding point at pyramid level below –Determine if the corresponding point is max/min of its 8 neighbors –If so, repeat at pyramid level above Repeat for each DOG level Those that remain are key points
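The neighbor comparisons above amount to checking each pixel against its 26 neighbors across three adjacent DOG levels. A minimal sketch (hypothetical helper name), under the simplifying assumption that the three levels have been resampled to a common size:

```python
import numpy as np

def find_extrema(dog_below, dog, dog_above):
    """Keep pixels of `dog` that are strictly greater (or smaller) than all
    26 neighbors: 8 at the same level plus 9 in each adjacent level."""
    keys = []
    h, w = dog.shape
    for i in range(1, h - 1):
        for j in range(1, w - 1):
            cube = np.stack([dog_below[i-1:i+2, j-1:j+2],
                             dog[i-1:i+2, j-1:j+2],
                             dog_above[i-1:i+2, j-1:j+2]])
            neighbors = np.delete(cube.ravel(), 13)   # index 13 is the center
            v = dog[i, j]
            if v > neighbors.max() or v < neighbors.min():
                keys.append((i, j))
    return keys
```

A single bright pixel on an otherwise flat set of levels is detected as the only extremum.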

Identifying Max/Min DOG L-1 DOG L DOG L+1

Refining Key List: Illumination For all levels, use the "A" smoothed image to compute the gradient magnitude. Threshold gradient magnitudes: remove all key points with M_ij less than 0.1 times the maximum gradient value. Motivation: low-contrast points are generally less reliable than high-contrast ones as feature points.
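The gradient formulas whose equation images did not survive the transcript are, following Lowe (1999), simple pixel-difference approximations on the smoothed image A:

```latex
M_{ij} = \sqrt{(A_{i+1,j} - A_{ij})^2 + (A_{i,j+1} - A_{ij})^2}
\qquad
R_{ij} = \operatorname{atan2}\!\left(A_{ij} - A_{i+1,j},\; A_{i,j+1} - A_{ij}\right)
```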

SIFT Feature Orientation? We now have the location and scale of each SIFT feature. How can we obtain its orientation?

Assigning Canonical Orientation For each remaining key point: choose the surrounding N x N window at the DOG level at which it was detected.

Assigning Canonical Orientation For all levels, use the "A" (Gaussian smoothed) image to compute the gradient orientation and gradient magnitude.

Assigning Canonical Orientation The gradient magnitude is weighted by a 2D Gaussian: Gradient Magnitude * 2D Gaussian = Weighted Magnitude.

Assigning Canonical Orientation Accumulate the weighted magnitudes in a histogram based on gradient orientation. The histogram has 36 bins with 10° increments.

Assigning Canonical Orientation Identify the peak of the histogram and assign its orientation and sum of magnitudes to the key point.
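The orientation-histogram steps above can be sketched as follows (a hypothetical helper; `mag` and `ori` are the gradient magnitude and orientation maps, in degrees 0-360, of the window around the keypoint):

```python
import numpy as np

def dominant_orientation(mag, ori):
    """Peak of a 36-bin (10-degree) histogram of Gaussian-weighted magnitudes."""
    n = mag.shape[0]
    # 2D Gaussian weight centered on the window
    y, x = np.mgrid[0:n, 0:n] - (n - 1) / 2.0
    weight = np.exp(-(x ** 2 + y ** 2) / (2.0 * (n / 2.0) ** 2))
    hist = np.zeros(36)
    bins = (ori.astype(int) // 10) % 36
    np.add.at(hist, bins.ravel(), (mag * weight).ravel())
    return hist.argmax() * 10 + 5, hist   # bin center in degrees, full histogram
```

A window whose gradients all point at 45° lands entirely in bin 4 and yields that bin's center as the canonical orientation.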

Local Image Description Each SIFT key is assigned: –Location –Scale (the level at which it was detected) –Orientation (assigned in the previous canonical-orientation steps) Now: describe the local image region in a way that is invariant to the above transformations

SIFT key example

Local Image Description For each key point: 1. Identify the 8x8 neighborhood (from the DOG level at which it was detected) 2. Align the orientation to the x-axis

Local Image Description 3.Calculate gradient magnitude and orientation map 4.Weight by Gaussian

Local Image Description 5. Calculate a histogram of each 4x4 region, with 8 bins for gradient orientation; tally the weighted gradient magnitudes.

Local Image Description 6. This histogram array is the image descriptor. (The example here gives a vector of length 8 bins x 4 regions = 32; the recommended choice is a 128-vector, 8 bins x 16 regions, from a 16x16 neighborhood.)
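Steps 5-6 can be sketched as follows (an illustrative, simplified version with a hypothetical name: no Gaussian weighting or trilinear binning, just per-cell orientation histograms and a final normalization):

```python
import numpy as np

def sift_descriptor(mag, ori):
    """Split an N x N window into 4x4 cells; per cell, an 8-bin (45-degree)
    orientation histogram of gradient magnitudes. 8x8 gives 2x2 cells (32
    values); 16x16 gives 4x4 cells (the standard 128-vector)."""
    n = mag.shape[0]
    desc = []
    for ci in range(0, n, 4):
        for cj in range(0, n, 4):
            hist = np.zeros(8)
            cell_ori = ori[ci:ci + 4, cj:cj + 4]
            cell_mag = mag[ci:ci + 4, cj:cj + 4]
            np.add.at(hist, ((cell_ori.astype(int) // 45) % 8).ravel(),
                      cell_mag.ravel())
            desc.extend(hist)
    v = np.array(desc)
    return v / (np.linalg.norm(v) + 1e-12)   # normalize against illumination
```

For an 8x8 window with uniform gradients, all the mass falls into one bin per cell and the unit-normalized 32-vector has 0.5 in each of those four bins.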

Applications: Image Matching Find all key points in the source and target images –Each key point has a 2D location, scale, and orientation, as well as an invariant descriptor vector For each key point in the source image, search for corresponding SIFT features in the target image. Find the transformation between the two images using epipolar geometry constraints or an affine transformation.

Image matching via SIFT features Feature detection

Image matching via SIFT features Image matching via nearest-neighbor search: if the ratio of the closest distance to the 2nd-closest distance is greater than 0.8, reject the match as a false match. Remove outliers using epipolar line constraints.
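The nearest-neighbor search with the ratio test above can be sketched as (hypothetical function name; brute-force distances for clarity):

```python
import numpy as np

def ratio_test_matches(src_desc, tgt_desc, ratio=0.8):
    """Nearest-neighbor matching with the ratio test: accept a match only
    if the closest distance is less than `ratio` times the 2nd-closest."""
    matches = []
    for i, d in enumerate(src_desc):
        dists = np.linalg.norm(tgt_desc - d, axis=1)
        nearest, second = np.argsort(dists)[:2]
        if dists[nearest] < ratio * dists[second]:
            matches.append((i, int(nearest)))
    return matches
```

A source descriptor equidistant from two target descriptors fails the ratio test and is discarded, which is exactly the ambiguity the 0.8 threshold is meant to filter out.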

Image matching via SIFT features

Summary SIFT features are reasonably invariant to rotation, scaling, and illumination changes. We can use them for image matching and object recognition, among other things. Efficient on-line matching and recognition can be performed in real time.