Efficient Algorithms for Matching Pedro Felzenszwalb Trevor Darrell Yann LeCun Alex Berg.

Efficient Algorithms for Matching
Pedro Felzenszwalb / Trevor Darrell / Yann LeCun / Alex Berg
Polynomial & exact / Multilinear & approximate / “Fast?” but very good / Happy when things work

First Criticism: Efficiently computing the wrong solution is not so useful…
First Response: Even if, say, an algorithm does not solve object recognition, it can still be a useful tool…

Why Matching?
Ideas hatched before me
– Statistical Pattern Theory (Ulf Grenander)
– Deformable Templates
– Fischler & Elschlager
– Etc., at least by the early 1970s
“Transform” and “appearance” parameters
Matching to estimate transform
– Searching over diffeomorphisms difficult
– Searching over discrete assignments easier?
Used to be continuous, now often discrete
Very general: Translation / Diffeomorphism / Assignment; Image / Features / “Parts” / etc.
(figure labels: MODEL, TRANSFORM, IMAGE)

Search for a Transformation
(figure labels: Model of Car, Image, ?)

Find Transformation Using Correspondence
(figure labels: Model of Car, Image)
Search through a discrete set of possible point correspondences.
The objective function should be close to the cost of the original model.
Use the discrete correspondences to obtain a continuous transformation if needed.
Sometimes…
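As a rough illustration of the last step on this slide (going from discrete correspondences to a continuous transformation), here is a minimal sketch that fits a 2D similarity transform by least squares. The choice of a similarity transform (rotation, uniform scale, translation) is an assumption made for brevity; the slides do not fix the transformation family, and the function names are hypothetical.

```python
def fit_similarity(model_pts, image_pts):
    """Least-squares fit of z -> a*z + b (a, b complex) mapping model to image points.

    Points are (x, y) pairs given in corresponding order. A complex `a`
    encodes rotation and uniform scale; `b` encodes translation.
    Needs at least two distinct model points.
    """
    p = [complex(x, y) for x, y in model_pts]
    q = [complex(x, y) for x, y in image_pts]
    n = len(p)
    p_mean = sum(p) / n
    q_mean = sum(q) / n
    # Closed-form minimiser of sum |a*p_i + b - q_i|^2.
    num = sum((qi - q_mean) * (pi - p_mean).conjugate() for pi, qi in zip(p, q))
    den = sum(abs(pi - p_mean) ** 2 for pi in p)
    a = num / den
    b = q_mean - a * p_mean
    return a, b


def apply_similarity(a, b, pt):
    """Apply the fitted transform to a single (x, y) point."""
    z = a * complex(*pt) + b
    return (z.real, z.imag)
```

Given matched point lists, `fit_similarity(model_pts, image_pts)` returns the transform parameters, and the residual between `apply_similarity(a, b, p)` and the matched image point gives one possible measure of fit quality.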

Find Transformation Using Correspondence
(figure labels: Model of Car, Image)
Why it works…
Sometimes we can measure consistency of model appearance locally.
Inspired by branch and bound: “If local appearance is inconsistent, any alignment with that appearance is bad.”
My preferred way of motivating local features…
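One way to read the branch-and-bound remark is as a pruning step: discard any candidate correspondence whose local appearance is already inconsistent, before any alignment is attempted. The sketch below assumes features are fixed-length descriptor vectors compared by Euclidean distance, with a hypothetical threshold `max_dist`; none of these specifics come from the slides.

```python
import math


def candidate_matches(model_descs, image_descs, max_dist):
    """For each model feature, list the image features that look similar enough.

    Returns a list where entry i holds the indices of image descriptors
    within `max_dist` of model descriptor i. Pairs outside the threshold
    are pruned and never considered by later alignment steps.
    """
    candidates = []
    for m in model_descs:
        keep = [j for j, d in enumerate(image_descs) if math.dist(m, d) <= max_dist]
        candidates.append(keep)
    return candidates
```

A model feature with an empty candidate list has no locally consistent match, which is already evidence against any alignment that needs it.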

Find Transformation Using Correspondence
(figure labels: Model of Car, Image)
Sometimes local appearance is not enough, so we model some version of spatial constraints.
Do not make the problem harder than it was…

Linear Assignment
e.g. Hungarian Algorithm
Just features, no geometry.
Individual feature matches provide most of the solution.
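To make the linear assignment step concrete, here is a sketch of the Hungarian algorithm in its standard O(n^2 m) potentials form, applied to a feature-matching cost matrix. The cost used (squared Euclidean distance between descriptors) and the function names are assumptions for illustration; the slides only name the algorithm.

```python
def hungarian(cost):
    """Minimum-cost assignment of each row to a distinct column.

    `cost` is an n x m list of lists with n <= m. Returns a list `assign`
    where assign[i] is the column matched to row i.
    """
    n, m = len(cost), len(cost[0])
    if n > m:
        raise ValueError("need at least as many columns as rows")
    INF = float("inf")
    u = [0.0] * (n + 1)          # row potentials (1-indexed)
    v = [0.0] * (m + 1)          # column potentials (1-indexed)
    p = [0] * (m + 1)            # p[j] = row currently matched to column j
    way = [0] * (m + 1)          # back-pointers along the augmenting path
    for i in range(1, n + 1):
        p[0] = i
        j0 = 0
        minv = [INF] * (m + 1)
        used = [False] * (m + 1)
        while True:
            used[j0] = True
            i0, delta, j1 = p[j0], INF, 0
            for j in range(1, m + 1):
                if not used[j]:
                    cur = cost[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j], way[j] = cur, j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            for j in range(m + 1):
                if used[j]:
                    u[p[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if p[j0] == 0:
                break
        # Flip matches along the augmenting path back to the virtual column 0.
        while j0:
            j1 = way[j0]
            p[j0] = p[j1]
            j0 = j1
    assign = [-1] * n
    for j in range(1, m + 1):
        if p[j]:
            assign[p[j] - 1] = j - 1
    return assign


def match_features(model_descs, image_descs):
    """Match model features to image features using appearance cost only."""
    cost = [[sum((a - b) ** 2 for a, b in zip(md, d)) for d in image_descs]
            for md in model_descs]
    return hungarian(cost)
```

When the image has more features than the model, the extra image features are simply left unmatched. No geometry enters the cost, which matches the "just features" setting of this slide.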

Quadratic Assignment (Adding Geometric Constraints)
Individual feature matches provide most of the solution.
Geometric consistency only has to clean things up a little.
In this case we formulate the matching as an Integer Quadratic Programming problem and look for an approximate solution…
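The sketch below shows one possible shape for such an objective and a simple way to look for an approximate solution. The slides do not say which approximation is used, so the method here (greedy initialisation from appearance costs followed by local search over reassignments and swaps) is a generic stand-in, and the pairwise term (difference in inter-point distances, weighted by a hypothetical `lam`) is an assumption.

```python
import math


def matching_cost(assign, unary, model_pts, image_pts, lam):
    """Appearance cost plus pairwise geometric distortion for an assignment.

    In terms of 0/1 indicator variables for each (model, image) pair, the
    first sum is linear and the second is quadratic.
    """
    cost = sum(unary[i][j] for i, j in enumerate(assign))
    for i in range(len(assign)):
        for k in range(i + 1, len(assign)):
            d_model = math.dist(model_pts[i], model_pts[k])
            d_image = math.dist(image_pts[assign[i]], image_pts[assign[k]])
            cost += lam * abs(d_model - d_image)
    return cost


def approximate_match(unary, model_pts, image_pts, lam=0.1):
    """Greedy start on appearance alone, then local search with geometry."""
    n, m = len(unary), len(unary[0])
    assign, taken = [], set()
    for i in range(n):  # greedy: cheapest unused image feature per model feature
        j = min((j for j in range(m) if j not in taken), key=lambda j: unary[i][j])
        assign.append(j)
        taken.add(j)
    best = matching_cost(assign, unary, model_pts, image_pts, lam)
    improved = True
    while improved:
        improved = False
        for i in range(n):
            for j in range(m):
                if j == assign[i]:
                    continue
                cand = list(assign)
                if j in cand:               # j is in use: swap the two matches
                    cand[cand.index(j)] = cand[i]
                cand[i] = j
                c = matching_cost(cand, unary, model_pts, image_pts, lam)
                if c < best - 1e-12:
                    assign, best, improved = cand, c, True
    return assign, best
```

In line with the slide, the appearance costs do most of the work in the greedy start, and the geometric term only has to repair the few matches that are locally plausible but spatially inconsistent. Local search gives no optimality guarantee.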

Second Meta-Comment: Even if a problem can have very difficult instances, the effective complexity of certain instances might be quite low. This can be quite difficult to verify formally.

Use Alignment to Compare
(figure label: Model of Car)
Given an alignment, evaluate the model.
Note: we might have been done already. Grauman et al., Zhang et al.
Actually do some alignment and check the quality of the fit.
Back to the alignment…

Humans can be very efficient
Simon Thorpe: animal or not in <= 150 ms
– “Feed forward” process
– Difficult to retrain (familiarization does not make a difference)
– Salient parts of images are actually processed more rapidly
– Support for some styles of current algorithms
– Neurophysiological evidence for some mid-level vision (illusory contours, figure ground, etc.), von der Heydt et al.
That’s all fine, but

What is an Object?
Apple yes, mist no
A rule of thumb is that objects have some definite spatial support…
Image/Scene? Some context models treat scenes or images as objects.
Face, eyes, nose, eye-lashes
We can build SPT (Statistical Pattern Theory) models for all of these…

Heuristics
Take advantage of the data
– Sometimes a single feature is enough
– For efficiency, this needs to be weighed against how often that feature is found
– Many (?) object recognition datasets allow easy discrimination between categories with only very simple features extracted from the whole image, e.g. Pascal and Caltech 101
– Segmentation or Figure/Ground: might as well see if there is an object there before trying to recognize it…

An Approach
1. Extract features from an image
2. Look features up in a large database
   Approximate Nearest Neighbor algorithms can make this sub-linear. Each entry tells us which hypotheses [Object, position, pose, ...] might be present.
3. Use a “short list” of these to check in more detail using a matching framework
   Simple matching can actually be indexed. Indyk et al., Grauman et al.
4. Finally, use a matching to align models and apply more expensive processing
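Steps 2 and 3 above can be sketched as a lookup-and-vote loop. The index below hashes each descriptor to a coarse grid cell, which is only a crude stand-in for a real approximate nearest neighbor structure; the cell width, the hypothesis labels, and all function names are assumptions for illustration.

```python
from collections import Counter, defaultdict


def cell(desc, width):
    """Quantise a descriptor to a grid cell, used as the hash key."""
    return tuple(int(x // width) for x in desc)


def build_index(entries, width=1.0):
    """entries: iterable of (descriptor, hypothesis) pairs from training data."""
    index = defaultdict(list)
    for desc, hyp in entries:
        index[cell(desc, width)].append(hyp)
    return index


def short_list(index, query_descs, width=1.0, k=2):
    """Vote for hypotheses hit by the query features; return the top k."""
    votes = Counter()
    for d in query_descs:
        # Constant-time bucket lookup per feature, independent of database size.
        for hyp in index.get(cell(d, width), []):
            votes[hyp] += 1
    return [hyp for hyp, _ in votes.most_common(k)]
```

Each hypothesis on the short list would then be passed to a more expensive matching step for detailed checking, as in steps 3 and 4. A plain grid misses near neighbors that fall just across a cell boundary, which is one reason practical systems use proper approximate nearest neighbor methods instead.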