Presentation is loading. Please wait.

Presentation is loading. Please wait.

Locating and Describing Interest Points

Similar presentations


Presentation on theme: "Locating and Describing Interest Points"— Presentation transcript:

1 Locating and Describing Interest Points
02/10/15 Locating and Describing Interest Points Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Acknowledgment: Many keypoint slides from Grauman&Leibe 2008 AAAI Tutorial

2 Overview of correspondence and alignment
Applications Basic approach: find corresponding points, estimate alignment Interest points: one very useful way to find corresponding points ESP: two people separately look at a picture and select the same points Or look at a local region and select the same single point Eye scan

3 Laplacian Pyramid clarification

4 This section: correspondence and alignment
Correspondence: matching points, patches, edges, or regions across images

5 This section: correspondence and alignment
Alignment: solving the transformation that makes two things match better T

6 Example: fitting an 2D shape template
Slide from Silvio Savarese

7 Example: fitting a 3D object model
As shown at the beginning, it interesting to visualize the geometrical relationship between canonical parts. Notice that: These 2 canonical parts share the same pose. All parts that share the same pose form a canonical pose. These are other examples of canonical poses Part belonging to different canonical poses are linked by a full homograhic transformation Here we see examples of canonical poses mapped into object instances %% This plane collect canonical parts that share the same pose. All this canonical parts are related by a pure translation constraint. - These canonical parts do not share the same pose and belong to different planes. This change of pose is described by the homographic transformation Aij. - Canonical parts that are not visible at the same time are not linked. Canonical parts that share the same pose forms a single-view submodel. %Canonical parts that share the same pose form a single view sub-model of the object class. Canonical parts that do not belong to the same plane, correspond to different %poses of the 3d object. The linkage stucture quantifies this relative change of pose through the homographic transformations. Canonical parts that are not visible at the same %time are not linked. Notice this representation is more flexible than a full 3d model yet, much richer than those where parts are linked by 'right to left', 'up to down‘ relationship, yet. . Also it is different from aspect graph: in that: aspect graphs are able to segment the object in stable regions across views Slide from Silvio Savarese

8 Example: Estimating an homographic transformation
Slide from Silvio Savarese

9 Example: estimating “fundamental matrix” that corresponds two views
Slide from Silvio Savarese

10 Example: tracking points
frame 0 frame 22 frame 49 Your problem 1 for HW 2!

11 HW 2 Interest point detection and tracking Detect trackable points
Track them across 50 frames In HW 3, you will use these tracked points for structure from motion frame 0 frame 22 frame 49

12 HW 2 Alignment of object edge images
Compute a transformation that aligns two edge maps

13 HW 2 Initial steps of object alignment
Derive basic equations for interest-point based alignment

14 This class: interest points
Note: “interest points” = “keypoints”, also sometimes called “features” Many applications tracking: which points are good to track? recognition: find patches likely to tell us something about object category 3D reconstruction: find correspondences across different views

15 Human eye movements Yarbus eye tracking
Your eyes don’t see everything at once, but they jump around. You see only about 2 degrees with high resolution Yarbus eye tracking

16 Human eye movements Study by Yarbus
Change blindness:

17 This class: interest points
original Suppose you have to click on some point, go away and come back after I deform the image, and click on the same points again. Which points would you choose? deformed

18 Overview of Keypoint Matching
1. Find a set of distinctive key- points A1 A2 A3 B1 B2 B3 2. Define a region around each keypoint 3. Extract and normalize the region content 4. Compute a local descriptor from the normalized region 5. Match local descriptors K. Grauman, B. Leibe

19 Goals for Keypoints Detect points that are repeatable and distinctive

20 Key trade-offs Detection Description More Repeatable More Points
Robust detection Precise localization Robust to occlusion Works with less texture More Distinctive More Flexible Description Minimize wrong matches Robust to expected variations Maximize correct matches

21 Choosing distinctive interest points
If you wanted to meet a friend would you say “Let’s meet on campus.” “Let’s meet on Green street.” “Let’s meet at Green and Wright.” Corner detection Or if you were in a secluded area: “Let’s meet in the Plains of Akbar.” “Let’s meet on the side of Mt. Doom.” “Let’s meet on top of Mt. Doom.” Blob (valley/peak) detection

22 Choosing interest points
Where would you tell your friend to meet you? Corners!

23 Choosing interest points
Where would you tell your friend to meet you? Blobs!

24 Many Existing Detectors Available
Hessian & Harris [Beaudet ‘78], [Harris ‘88] Laplacian, DoG [Lindeberg ‘98], [Lowe 1999] Harris-/Hessian-Laplace [Mikolajczyk & Schmid ‘01] Harris-/Hessian-Affine [Mikolajczyk & Schmid ‘04] EBR and IBR [Tuytelaars & Van Gool ‘04] MSER [Matas ‘02] Salient Regions [Kadir & Brady ‘01] Others… K. Grauman, B. Leibe

25 Harris Detector [Harris88]
Second moment matrix Intuition: Search for local neighborhoods where the image content has two main directions (eigenvectors). K. Grauman, B. Leibe

26 Harris Detector [Harris88]
Second moment matrix 1. Image derivatives Ix Iy (optionally, blur first) Ix2 Iy2 IxIy 2. Square of derivatives g(IxIy) 3. Gaussian filter g(sI) g(Ix2) g(Iy2) 4. Cornerness function – both eigenvalues are strong 5. Non-maxima suppression har

27 Harris Detector: Mathematics
Want large eigenvalues, and small ratio We know Leads to (k :empirical constant, k = ) Nice brief derivation on wikipedia

28 Explanation of Harris Criterion
From Grauman and Leibe

29 Harris Detector – Responses [Harris88]
Effect: A very precise corner detector.

30 Harris Detector - Responses [Harris88]

31 Hessian Detector [Beaudet78]
Hessian determinant Ixx Iyy Ixy Intuition: Search for strong curvature in two orthogonal directions K. Grauman, B. Leibe

32 Hessian Detector [Beaudet78]
Hessian determinant Ixx Iyy Ixy Find maxima of determinant In Matlab: K. Grauman, B. Leibe

33 Hessian Detector – Responses [Beaudet78]
Effect: Responses mainly on corners and strongly textured areas.

34 Hessian Detector – Responses [Beaudet78]

35 So far: can localize in x-y, but not scale

36 Automatic Scale Selection
How to find corresponding patch sizes? K. Grauman, B. Leibe

37 Automatic Scale Selection
Function responses for increasing scale (scale signature) K. Grauman, B. Leibe

38 Automatic Scale Selection
Function responses for increasing scale (scale signature) K. Grauman, B. Leibe

39 Automatic Scale Selection
Function responses for increasing scale (scale signature) K. Grauman, B. Leibe

40 Automatic Scale Selection
Function responses for increasing scale (scale signature) K. Grauman, B. Leibe

41 Automatic Scale Selection
Function responses for increasing scale (scale signature) K. Grauman, B. Leibe

42 Automatic Scale Selection
Function responses for increasing scale (scale signature) K. Grauman, B. Leibe

43 What Is A Useful Signature Function?
Difference-of-Gaussian = “blob” detector K. Grauman, B. Leibe

44 Difference-of-Gaussian (DoG)
= K. Grauman, B. Leibe

45 DoG – Efficient Computation
Computation in Gaussian scale pyramid s Original image Sampling with step s4 =2 K. Grauman, B. Leibe

46 Find local maxima in position-scale space of Difference-of-Gaussian
 List of (x, y, s) K. Grauman, B. Leibe

47 Results: Difference-of-Gaussian
K. Grauman, B. Leibe

48 Orientation Normalization
Compute orientation histogram Select dominant orientation Normalize: rotate to fixed orientation [Lowe, SIFT, 1999] 2 p T. Tuytelaars, B. Leibe

49 Harris-Laplace [Mikolajczyk ‘01]
Initialization: Multiscale Harris corner detection s4 s3 s2 s Computing Harris function Detecting local maxima

50 Harris-Laplace [Mikolajczyk ‘01]
Initialization: Multiscale Harris corner detection Scale selection based on Laplacian (same procedure with Hessian  Hessian-Laplace) Harris points Harris-Laplace points K. Grauman, B. Leibe

51 Maximally Stable Extremal Regions [Matas ‘02]
Based on Watershed segmentation algorithm Select regions that stay stable over a large parameter range K. Grauman, B. Leibe

52 Example Results: MSER K. Grauman, B. Leibe

53 Comparison Hessian Harris MSER LoG

54 Available at a web site near you…
For most local feature detectors, executables are available online: K. Grauman, B. Leibe

55 Local Descriptors The ideal descriptor should be
Robust Distinctive Compact Efficient Most available descriptors focus on edge/gradient information Capture texture information Color rarely used K. Grauman, B. Leibe

56 Local Descriptors: SIFT Descriptor
Histogram of oriented gradients Captures important texture information Robust to small translations / affine deformations [Lowe, ICCV 1999] K. Grauman, B. Leibe

57 Details of Lowe’s SIFT algorithm
Run DoG detector Find maxima in location/scale space Remove edge points Find all major orientations Bin orientations into 36 bin histogram Weight by gradient magnitude Weight by distance to center (Gaussian-weighted mean) Return orientations within 0.8 of peak Use parabola for better orientation fit For each (x,y,scale,orientation), create descriptor: Sample 16x16 gradient mag. and rel. orientation Bin 4x4 samples into 4x4 histograms Threshold values to max of 0.2, divide by L2 norm Final descriptor: 4x4x8 normalized histograms Lowe IJCV 2004

58 Matching SIFT Descriptors
Nearest neighbor (Euclidean distance) Threshold ratio of nearest to 2nd nearest descriptor Lowe IJCV 2004

59 SIFT Repeatability Lowe IJCV 2004

60 SIFT Repeatability

61 SIFT Repeatability Lowe IJCV 2004

62 SIFT Repeatability Lowe IJCV 2004

63 Local Descriptors: SURF
Fast approximation of SIFT idea Efficient computation by 2D box filters & integral images  6 times faster than SIFT Equivalent quality for object identification GPU implementation available Feature 200Hz (detector + descriptor, 640×480 img) Many other efficient descriptors are also available [Bay, ECCV’06], [Cornelis, CVGPU’08] K. Grauman, B. Leibe

64 Local Descriptors: Shape Context
Count the number of points inside each bin, e.g.: Count = 4 ... Count = 10 Log-polar binning: more precision for nearby points, more flexibility for farther points. Belongie & Malik, ICCV 2001 K. Grauman, B. Leibe

65 ~ Local Descriptors: Geometric Blur Compute edges at four orientations
Extract a patch in each channel ~ Apply spatially varying blur and sub-sample Example descriptor (Idealized signal) Berg & Malik, CVPR 2001 K. Grauman, B. Leibe

66 Choosing a detector What do you want it for?
Precise localization in x-y: Harris Good localization in scale: Difference of Gaussian Flexible region shape: MSER Best choice often application dependent Harris-/Hessian-Laplace/DoG work well for many natural categories MSER works well for buildings and printed things Why choose? Get more points with more detectors There have been extensive evaluations/comparisons [Mikolajczyk et al., IJCV’05, PAMI’05] All detectors/descriptors shown here work well

67 Comparison of Keypoint Detectors
Tuytelaars Mikolajczyk 2008

68 Choosing a descriptor Again, need not stick to one
For object instance recognition or stitching, SIFT or variant is a good choice

69 Things to remember Keypoint detection: repeatable and distinctive
Corners, blobs, stable regions Harris, DoG Descriptors: robust and selective spatial histograms of orientation SIFT

70 Next time Feature tracking


Download ppt "Locating and Describing Interest Points"

Similar presentations


Ads by Google