Brief Review of Recognition + Context

Slides:

Advertisements

Similar presentations

Distinctive Image Features from Scale-Invariant Keypoints

Advertisements

Distinctive Image Features from Scale-Invariant Keypoints David Lowe.

Computer Vision for Human-Computer InteractionResearch Group, Universität Karlsruhe (TH) cv:hci Dr. Edgar Seemann 1 Computer Vision: Histograms of Oriented.

Lecture 31: Modern object recognition

Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.

Object Recognition & Model Based Tracking © Danica Kragic Tracking system.

Uncertainty Representation. Gaussian Distribution variance Standard deviation.

Lecture 6: Feature matching CS4670: Computer Vision Noah Snavely.

1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.

A Study of Approaches for Object Recognition

CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.

Distinctive Image Feature from Scale-Invariant KeyPoints

The Terrapins Computer Vision Laboratory University of Maryland.

CS4670: Computer Vision Kavita Bala Lecture 8: Scale invariance.

Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.

Object Recognition and Augmented Reality

Computer Vision: Summary and Discussion Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/30/2015.

Computer Vision: Summary and Discussion

Computer vision.

Keypoint-based Recognition Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/04/10.

Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.

EADS DS / SDC LTIS Page 1 7 th CNES/DLR Workshop on Information Extraction and Scene Understanding for Meter Resolution Image – 29/03/07 - Oberpfaffenhofen.

Computer Vision: Summary and Discussion Computer Vision ECE 5554 Virginia Tech Devi Parikh 11/21/

Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.

Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.

Face Recognition: An Introduction

Computer Vision: Summary and Discussion Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 05/03/2011.

Computer Vision 776 Jan-Michael Frahm 12/05/2011 Many slides from Derek Hoiem, James Hays.

Lecture 7: Features Part 2 CS4670/5670: Computer Vision Noah Snavely.

Pictorial Structures and Distance Transforms Computer Vision CS 543 / ECE 549 University of Illinois Ian Endres 03/31/11.

Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.

CSE 185 Introduction to Computer Vision Feature Matching.

Project 3 questions? Interest Points and Instance Recognition Computer Vision CS 143, Brown James Hays 10/21/11 Many slides from Kristen Grauman and.

776 Computer Vision Jan-Michael Frahm Spring 2012.

Image features and properties. Image content representation The simplest representation of an image pattern is to list image pixels, one after the other.

CSE 185 Introduction to Computer Vision Local Invariant Features.

Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.

Computer Vision: Summary and Discussion Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 05/01/2012.

Another Example: Circle Detection

SIFT Scale-Invariant Feature Transform David Lowe

Object DetectionI Ali Taalimi 01/08/2013.

CS262: Computer Vision Lect 09: SIFT Descriptors

Lecture 13: Feature Descriptors and Matching

Summary and Discussion

Local features: main components

University of Ioannina

Project 1: hybrid images

TP12 - Local features: detection and description

Computer Vision: Summary and Discussion

Lecture 25: Introduction to Recognition

Face Recognition and Feature Subspaces

LOCUS: Learning Object Classes with Unsupervised Segmentation

Paper Presentation: Shape and Matching

Lecture 14: Introduction to Recognition

Feature description and matching

Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT

Object detection as supervised classification

Lecture 25: Introduction to Recognition

Image Segmentation Techniques

Creating Data Representations

CS4670: Intro to Computer Vision

Grouping/Segmentation

CSE 185 Introduction to Computer Vision

Feature descriptors and matching

Lecture 5: Feature invariance

Presented by Xu Miao April 20, 2005

Lecture 5: Feature invariance

The “Margaret Thatcher Illusion”, by Peter Thompson

Presentation transcript:

Brief Review of Recognition + Context 04/01/10 Brief Review of Recognition + Context Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem

Object Instance Recognition Want to recognize the same or equivalent object instance, which may vary Slight deformations Change in lighting Occlusion Rotation, rescaling, translation, perspective = = =

Object Instance Recognition Template matching: faces Recognize by directly computing pixel distance of aligned faces Principal component analysis gives a subspace that preserves variance Linear Discriminant Analysis (LDA) or Fisher Linear Discriminants (FLD) gives a subspace that maximizes discrimination This could work for other kinds of aligned objects

Object Instance Recognition If object is not aligned, we need to perform geometric matching Find distinctive and repeatable keypoints E.g., Difference of Gaussian, Harris corners, or MSER regions Represent the appearance at these points (e.g., SIFT) Match pairs of keypoints Estimate transformation (e.g., rotation, scale, translation) from matched keypoints Hough voting Geometric refinement Clustering (visual words) and inverse document frequency enable fast search in large datasets B1 B2 B3 A1 A2 A3

Category recognition Instances across categories tend to vary in more challenging ways than a single instance across images

Image Categorization In training, a classifier is trained for a particular feature representation using labeled examples The features should generally capture local patterns but with loose spatial encoding For scene categorization, a reasonable choice is often Compute visual words (detect interest points, represent them with SIFT, and cluster) Compute a spatial pyramid of these visual words, composed of histograms at different spatial resolutions Train a linear SVM classifier or one with a Chi-squared kernel Training Labels Training Images Classifier Training Image Features Trained Classifier

Object Category Detection One difficulty of object category detection is that objects could appear at many scales or translations, and keypoint matching will be unreliable A simple way around this is to treat category detection as a series of image categorization tasks, breaking up the image into thousands of windows and applying a binary classifier to each Often, the object is classified using edge-based features whose positions are defined at fixed position in the sliding window Object or Background?

Object Category Detection Sliding windows might work well for rigid objects But some objects may be better thought of as spatial arrangements of parts

Object Category Detection Part-based models have three key components Part definition and appearance model Model of geometry or layout of parts Algorithm for efficient search ISM Model Parts are clustered detected keypoints Position of each part wrt object center/size is recorded Search is done through Hough voting / Mean-shift clustering combination Pictorial structures model Parts are rectangles detected in silhouette Layout is articulated model with tree-shaped graph Search through dynamic programming or probabilistic sampling

Region-based recognition Sometimes, we want to label image pixels or regions Basic approach: Segment the image into blocks, superpixels, or regions Represent each region with histograms of keypoints, color, texture, and position Classify each region (variety of classifiers used)

Context in Recognition Objects usually are surrounded by a scene that can provide context in the form of naerby objects, surfaces, scene category, geometry, etc.

Context provides clues for function What is this? These examples from Antonio Torralba

Context provides clues for function What is this? Now can you tell?

Sometimes context is the major component of recognition What is this?

Sometimes context is the major component of recognition What is this? Now can you tell?

More Low-Res What are these blobs?

More Low-Res The same pixels! (a car)

We will see more on context later…