Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15.

Slides:



Advertisements
Similar presentations
Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of.
Advertisements

Part 1: Bag-of-words models by Li Fei-Fei (Princeton)
1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.
16-721: Advanced Machine Perception Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: David Bradley 2216 NSH Web Page:
Student: Yao-Sheng Wang Advisor: Prof. Sheng-Jyh Wang ARTICULATED HUMAN DETECTION 1 Department of Electronics Engineering National Chiao Tung University.
Discriminative and generative methods for bags of features
Unsupervised Learning for Recognition Pietro Perona California Institute of Technology & Universita di Padova 11 th British Machine Vision Conference –
Object Recognition. So what does object recognition involve?
Bag-of-features models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Recognition: A machine learning approach
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
How many object categories are there? Biederman 1987.
Statistical Recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Kristen Grauman.
Object Recognition Szeliski Chapter 14.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Object Recognition. So what does object recognition involve?
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Object Recognition. So what does object recognition involve?
Object recognition Jana Kosecka Slides from D. Lowe, D. Forsythe and J. Ponce book, ICCV 2005 Tutorial Fei-Fei Li, Rob Fergus and A. Torralba.
16-721: Learning-based Methods in Vision Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: Tomasz Malisiewicz Smith Hall.
Bag-of-features models
Object recognition Jana Kosecka Slides from D. Lowe, D. Forsythe and J. Ponce book, ICCV 2005 Tutorial Fei-Fei Li, Rob Fergus and A. Torralba.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Agenda Introduction Bag-of-words model Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Visual Object Recognition Rob Fergus Courant Institute, New York University
16-721: Learning-based Methods in Vision Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: Jean-Francois Lalonde A521.
By Suren Manvelyan,
CS294‐43: Visual Object and Activity Recognition Prof. Trevor Darrell Spring 2009.
Face Recognition: An Introduction
Machine learning & category recognition Cordelia Schmid Jakob Verbeek.
The Beauty of Local Invariant Features
Crash Course on Machine Learning
Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe & Computer Vision Laboratory ETH.
Last part: datasets and object collections. CMU/MIT frontal facesvasc.ri.cmu.edu/idb/html/face/frontal_images cbcl.mit.edu/software-datasets/FaceData2.html.
Building local part models for category-level recognition C. Schmid, INRIA Grenoble Joint work with G. Dorko, S. Lazebnik, J. Ponce.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Computing & Information Sciences Kansas State University Wednesday, 03 Dec 2008CIS 530 / 730: Artificial Intelligence Lecture 38 of 42 Wednesday, 03 December.
Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.
Pedestrian Detection and Localization
Perceptual and Sensory Augmented Computing Discussion Session: Sliding Windows Sliding Windows – Silver Bullet or Evolutionary Deadend? Alyosha Efros,
Why is computer vision difficult?
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Putting Context into Vision Derek Hoiem September 15, 2004.
Bayesian Parameter Estimation Liad Serruya. Agenda Introduction Bayesian decision theory Scale-Invariant Learning Bayesian “One-Shot” Learning.
Handwritten digit recognition
11/26/2015 Copyright G.D. Hager Class 2 - Schedule 1.Optical Illusions 2.Lecture on Object Recognition 3.Group Work 4.Sports Videos 5.Short Lecture on.
Grammars in computer vision
A Brief Introduction on Face Detection Mei-Chen Yeh 04/06/2010 P. Viola and M. J. Jones, Robust Real-Time Face Detection, IJCV 2004.
Recognition Readings – C. Bishop, “Neural Networks for Pattern Recognition”, Oxford University Press, 1998, Chapter 1. – Szeliski, Chapter (eigenfaces)
Using the Forest to see the Trees: A computational model relating features, objects and scenes Antonio Torralba CSAIL-MIT Joint work with Aude Oliva, Kevin.
Introduction to Recognition CS4670/5670: Intro to Computer Vision Noah Snavely mountain building tree banner vendor people street lamp.
A Discriminatively Trained, Multiscale, Deformable Part Model Yeong-Jun Cho Computer Vision and Pattern Recognition,2008.
CS 558 Computer Vision John Oliensis. Today’s class What is vision What is computer vision How we can solve vision problems –Important tools –Overall.
Li Fei-Fei, Stanford Rob Fergus, NYU Antonio Torralba, MIT Recognizing and Learning Object Categories: Year 2009 ICCV 2009 Kyoto, Short Course, September.
Lecture 25: Introduction to Recognition
Lecture 14: Introduction to Recognition
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT
Object detection as supervised classification
Lecture 25: Introduction to Recognition
Recognizing and Learning Object categories
Lecture: Object Recognition
Generic object recognition
Presentation transcript:

Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15

Agenda Introduction Bag of words models Part-based models Discriminative methods Segmentation and recognition Conclusions

perceptible vision materialthing

Plato said… Ordinary objects are classified together if they `participate' in the same abstract Form, such as the Form of a Human or the Form of Quartz. Forms are proper subjects of philosophical investigation, for they have the highest degree of reality. Ordinary objects, such as humans, trees, and stones, have a lower degree of reality than the Forms. Fictions, shadows, and the like have a still lower degree of reality than ordinary objects and so are not proper subjects of philosophical enquiry.

Bruegel, 1564

How many object categories are there? Biederman 1987

So what does object recognition involve?

Verification: is that a bus?

Detection: are there cars?

Identification: is that a picture of Mao?

Object categorization sky building flag wall banner bus cars bus face street lamp

Scene and context categorization outdoor city traffic …

Challenges 1: view point variation Michelangelo

Challenges 2: illumination slide credit: S. Ullman

Challenges 3: occlusion Magritte, 1957

Challenges 4: scale

Challenges 5: deformation Xu, Beihong 1943

Challenges 6: background clutter Klimt, 1913

History: single object recognition

Lowe, et al. 1999, 2003 Mahamud and Herbert, 2000 Ferrari, Tuytelaars, and Van Gool, 2004 Rothganger, Lazebnik, and Ponce, 2004 Moreels and Perona, 2005 …

Challenges 7: intra-class variation

History: early object categorization

Turk and Pentland, 1991 Belhumeur et al Schneiderman et al Viola and Jones, 2000 Amit and Geman, 1999 LeCun et al Belongie and Malik, 2002 Schneiderman et al Argawal and Roth, 2002 Poggio et al. 1993

OBJECTS ANIMALS INANIMATE PLANTS MAN-MADENATURAL VERTEBRATE ….. MAMMALS BIRDS GROUSEBOARTAPIR CAMERA

Scenes, Objects, and Parts Features Parts Objects Scene E. Sudderth, A. Torralba, W. Freeman, A. Willsky. ICCV 2005.

Object categorization: the statistical viewpoint vs. Bayes rule: posterior ratio likelihood ratioprior ratio

Object categorization: the statistical viewpoint posterior ratio likelihood ratioprior ratio Discriminative methods model posterior Generative methods model likelihood and prior

Discriminative Direct modeling of Zebra Non-zebra Decision boundary

Model and Generative LowMiddle HighMiddle  Low

Three main issues Representation –How to represent an object category Learning –How to form the classifier, given training data Recognition –How the classifier is to be used on novel data

Representation –Generative / discriminative / hybrid

Representation –Generative / discriminative / hybrid –Appearance only or location and appearance

Representation –Generative / discriminative / hybrid –Appearance only or location and appearance –Invariances View point Illumination Occlusion Scale Deformation Clutter etc.

Representation –Generative / discriminative / hybrid –Appearance only or location and appearance –invariances –Part-based or global w/sub-window

Representation –Generative / discriminative / hybrid –Appearance only or location and appearance –invariances –Parts or global w/sub- window –Use set of features or each pixel in image

–Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning Learning

–Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –Methods of training: generative vs. discriminative Learning

–Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) –Level of supervision Manual segmentation; bounding box; image labels; noisy labels Learning Contains a motorbike

–Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) –Level of supervision Manual segmentation; bounding box; image labels; noisy labels –Batch/incremental (on category and image level; user-feedback ) Learning

–Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) –Level of supervision Manual segmentation; bounding box; image labels; noisy labels –Batch/incremental (on category and image level; user-feedback ) –Training images: Issue of overfitting Negative images for discriminative methods Priors Learning

–Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) –Level of supervision Manual segmentation; bounding box; image labels; noisy labels –Batch/incremental (on category and image level; user-feedback ) –Training images: Issue of overfitting Negative images for discriminative methods –Priors Learning

–Scale / orientation range to search over –Speed Recognition