Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce.

Slides:



Advertisements
Similar presentations
ISSUES IN 3D OBJECT RECOGNITION Jean Ponce Department of Computer Science and Beckman Institute University of Illinois at Urbana-Champaign Joint work with.
Advertisements

Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of.
TP14 - Local features: detection and description Computer Vision, FCUP, 2014 Miguel Coimbra Slides by Prof. Kristen Grauman.
1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.
CS4670 / 5670: Computer Vision Bag-of-words models Noah Snavely Object
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
In Search of Objects: 50 years of wondering : Learning-Based Methods in Vision A. Efros, CMU, Spring 2009.
Object Recognition. So what does object recognition involve?
Bag-of-features models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Statistical Recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Kristen Grauman.
Object Recognition Szeliski Chapter 14.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
1 Interest Operators Find “interesting” pieces of the image –e.g. corners, salient regions –Focus attention of algorithms –Speed up computation Many possible.
Lecture 28: Bag-of-words models
A Study of Approaches for Object Recognition
1 Interest Operator Lectures lecture topics –Interest points 1 (Linda) interest points, descriptors, Harris corners, correlation matching –Interest points.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Object recognition Jana Kosecka Slides from D. Lowe, D. Forsythe and J. Ponce book, ICCV 2005 Tutorial Fei-Fei Li, Rob Fergus and A. Torralba.
Bag-of-features models
Introduction to Object Recognition CS773C Machine Intelligence Advanced Applications Spring 2008: Object Recognition.
Object recognition Jana Kosecka Slides from D. Lowe, D. Forsythe and J. Ponce book, ICCV 2005 Tutorial Fei-Fei Li, Rob Fergus and A. Torralba.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Visual Object Recognition Rob Fergus Courant Institute, New York University
By Suren Manvelyan,
Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Understanding Faces Computational Photography
Face Detection and Recognition
Machine learning & category recognition Cordelia Schmid Jakob Verbeek.
The Beauty of Local Invariant Features
Review: Intro to recognition Recognition tasks Machine learning approach: training, testing, generalization Example classifiers Nearest neighbor Linear.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Advanced Computer Vision Devi Parikh Electrical and Computer Engineering.
Face Detection and Recognition Computational Photography Derek Hoiem, University of Illinois Lecture by Kevin Karsch 12/3/13 Chuck Close, self portrait.
Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.
Machine learning & category recognition Cordelia Schmid Jakob Verbeek.
Computational Photography Tamara Berg Features. Representing Images Keep all the pixels! Pros? Cons?
Recovering Surface Layout from a Single Image D. Hoiem, A.A. Efros, M. Hebert Robotics Institute, CMU Presenter: Derek Hoiem CS 598, Spring 2009 Jan 29,
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Why is computer vision difficult?
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
11/26/2015 Copyright G.D. Hager Class 2 - Schedule 1.Optical Illusions 2.Lecture on Object Recognition 3.Group Work 4.Sports Videos 5.Short Lecture on.
Project 3 Results.
Discussion of Pictorial Structures Pedro Felzenszwalb Daniel Huttenlocher Sicily Workshop September, 2006.
Grammars in computer vision
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15.
Project 3 questions? Interest Points and Instance Recognition Computer Vision CS 143, Brown James Hays 10/21/11 Many slides from Kristen Grauman and.
Local features: detection and description
776 Computer Vision Jan-Michael Frahm Spring 2012.
Using the Forest to see the Trees: A computational model relating features, objects and scenes Antonio Torralba CSAIL-MIT Joint work with Aude Oliva, Kevin.
High-Level Vision Object Recognition.
Introduction to Recognition CS4670/5670: Intro to Computer Vision Noah Snavely mountain building tree banner vendor people street lamp.
Face detection and recognition Many slides adapted from K. Grauman and D. Lowe.
Li Fei-Fei, Stanford Rob Fergus, NYU Antonio Torralba, MIT Recognizing and Learning Object Categories: Year 2009 ICCV 2009 Kyoto, Short Course, September.
776 Computer Vision Jan-Michael Frahm Spring 2012.
University of Ioannina
TP12 - Local features: detection and description
It’s a 3D World, After All Alyosha Efros CMU.
Lecture 25: Introduction to Recognition
Face Recognition and Feature Subspaces
Paper Presentation: Shape and Matching
Lecture 25: Introduction to Recognition
CS 1674: Intro to Computer Vision Scene Recognition
CSE 455 – Guest Lectures 3 lectures Contact Interest points 1
Brief Review of Recognition + Context
SIFT keypoint detection
Generic object recognition
Presentation transcript:

Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

How many visual object categories are there? Biederman 1987

OBJECTS ANIMALS INANIMATE PLANTS MAN-MADENATURAL VERTEBRATE ….. MAMMALS BIRDS GROUSEBOARTAPIR CAMERA

So what does object recognition involve?

Scene categorization outdoor city …

Image-level annotation: are there people? outdoor city …

Object detection: where are the people?

Image parsing mountain building tree banner vendor people street lamp

Variability : Camera position Illumination Shape parameters Within-class variations? Modeling variability

Within-class variations

Variability : Camera position Illumination  Alignment Roberts (1965); Lowe (1987); Faugeras & Hebert (1986); Grimson & Lozano-Perez (1986); Huttenlocher & Ullman (1987) Shape: assumed known

Recall: Alignment Alignment: fitting a model to a transformation between pairs of features (matches) in two images Find transformation T that minimizes T xixi xixi '

Recall: Origins of computer vision L. G. Roberts, Machine Perception of Three Dimensional Solids, Ph.D. thesis, MIT Department of Electrical Engineering, 1963.Machine Perception of Three Dimensional Solids

Alignment: Huttenlocher & Ullman (1987)

Variability Camera position Illumination Internal parameters Invariance to: Duda & Hart ( 1972); Weiss (1987); Mundy et al. ( ); Rothwell et al. (1992); Burns et al. (1993)

General 3D objects do not admit monocular viewpoint invariants (Burns et al., 1993) Projective invariants (Rothwell et al., 1992): Recall: invariant to similarity transformations computed from four points A B C D

ACRONYM (Brooks and Binford, 1981) Representing and recognizing object categories is harder... Binford (1971), Nevatia & Binford (1972), Marr & Nishihara (1978)

Recognition by components Geons (Biederman 1987) ???

Zisserman et al. (1995) Generalized cylinders Ponce et al. (1989) Forsyth (2000) General shape primitives?

Empirical models of image variability Appearance-based techniques Turk & Pentland (1991); Murase & Nayar (1995); etc.

Eigenfaces (Turk & Pentland, 1991)

Color Histograms Swain and Ballard, Color Indexing, IJCV 1991.Color Indexing

H. Murase and S. Nayar, Visual learning and recognition of 3-d objects from appearance, IJCV 1995 Appearance manifolds

Limitations of global appearance models Requires global registration of patterns Not robust to clutter, occlusion, geometric transformations

Sliding window approaches Turk and Pentland, 1991 Belhumeur, Hespanha, & Kriegman, 1997 Schneiderman & Kanade 2004 Viola and Jones, 2000 Schneiderman & Kanade, 2004 Argawal and Roth, 2002 Poggio et al. 1993

–Scale / orientation range to search over –Speed –Context Sliding window approaches

Lowe’02 Mahamud & Hebert’03 Local features Combining local appearance, spatial constraints, invariants, and classification techniques from machine learning. Schmid & Mohr’97

Local features for recognition of object instances

Lowe, et al. 1999, 2003 Mahamud and Hebert, 2000 Ferrari, Tuytelaars, and Van Gool, 2004 Rothganger, Lazebnik, and Ponce, 2004 Moreels and Perona, 2005 … Local features for recognition of object instances

Representing categories: Parts and Structure Weber, Welling & Perona (2000), Fergus, Perona & Zisserman (2003)

Parts-and-shape representation Model: –Object as a set of parts –Relative locations between parts –Appearance of part Figure from [Fischler & Elschlager 73]

Object Bag of ‘words’ Bag-of-features models

Objects as texture All of these are treated as being the same No distinction between foreground and background: scene recognition?

Timeline of recognition 1965-late 1980s: alignment, geometric primitives Early 1990s: invariants, appearance-based methods Mid-late 1990s: sliding window approaches Late 1990s: feature-based methods Early 2000s: parts-and-shape models 2003 – present: bags of features Present trends: combination of local and global methods, modeling context, emphasis on “image parsing”

Global scene context The “gist” of a scene: Oliva & Torralba (2001)

J. Hays and A. Efros, Scene Completion using Millions of Photographs, SIGGRAPH 2007Scene Completion using Millions of Photographs

Scene-level context for image parsing J. Tighe and S. Lazebnik, ECCV 2010 submission

D. Hoiem, A. Efros, and M. Herbert. Putting Objects in Perspective. CVPR 2006.Putting Objects in Perspective. Geometric context

What “works” today Reading license plates, zip codes, checks

What “works” today Reading license plates, zip codes, checks Fingerprint recognition

What “works” today Reading license plates, zip codes, checks Fingerprint recognition Face detection

What “works” today Reading license plates, zip codes, checks Fingerprint recognition Face detection Recognition of flat textured objects (CD covers, book covers, etc.)