Presentation is loading. Please wait.

Presentation is loading. Please wait.

Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15.

Similar presentations


Presentation on theme: "Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15."— Presentation transcript:

1 Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15

2 Agenda Introduction Bag of words models Part-based models Discriminative methods Segmentation and recognition Conclusions

3

4 perceptible vision materialthing

5

6

7 Plato said… Ordinary objects are classified together if they `participate' in the same abstract Form, such as the Form of a Human or the Form of Quartz. Forms are proper subjects of philosophical investigation, for they have the highest degree of reality. Ordinary objects, such as humans, trees, and stones, have a lower degree of reality than the Forms. Fictions, shadows, and the like have a still lower degree of reality than ordinary objects and so are not proper subjects of philosophical enquiry.

8 Bruegel, 1564

9 How many object categories are there? Biederman 1987

10 So what does object recognition involve?

11 Verification: is that a bus?

12 Detection: are there cars?

13 Identification: is that a picture of Mao?

14 Object categorization sky building flag wall banner bus cars bus face street lamp

15 Scene and context categorization outdoor city traffic …

16 Challenges 1: view point variation Michelangelo 1475-1564

17 Challenges 2: illumination slide credit: S. Ullman

18 Challenges 3: occlusion Magritte, 1957

19 Challenges 4: scale

20 Challenges 5: deformation Xu, Beihong 1943

21 Challenges 6: background clutter Klimt, 1913

22 History: single object recognition

23 Lowe, et al. 1999, 2003 Mahamud and Herbert, 2000 Ferrari, Tuytelaars, and Van Gool, 2004 Rothganger, Lazebnik, and Ponce, 2004 Moreels and Perona, 2005 …

24 Challenges 7: intra-class variation

25 History: early object categorization

26 Turk and Pentland, 1991 Belhumeur et al. 1997 Schneiderman et al. 2004 Viola and Jones, 2000 Amit and Geman, 1999 LeCun et al. 1998 Belongie and Malik, 2002 Schneiderman et al. 2004 Argawal and Roth, 2002 Poggio et al. 1993

27

28 OBJECTS ANIMALS INANIMATE PLANTS MAN-MADENATURAL VERTEBRATE ….. MAMMALS BIRDS GROUSEBOARTAPIR CAMERA

29

30 Scenes, Objects, and Parts Features Parts Objects Scene E. Sudderth, A. Torralba, W. Freeman, A. Willsky. ICCV 2005.

31 Object categorization: the statistical viewpoint vs. Bayes rule: posterior ratio likelihood ratioprior ratio

32 Object categorization: the statistical viewpoint posterior ratio likelihood ratioprior ratio Discriminative methods model posterior Generative methods model likelihood and prior

33 Discriminative Direct modeling of Zebra Non-zebra Decision boundary

34 Model and Generative LowMiddle HighMiddle  Low

35 Three main issues Representation –How to represent an object category Learning –How to form the classifier, given training data Recognition –How the classifier is to be used on novel data

36 Representation –Generative / discriminative / hybrid

37 Representation –Generative / discriminative / hybrid –Appearance only or location and appearance

38 Representation –Generative / discriminative / hybrid –Appearance only or location and appearance –Invariances View point Illumination Occlusion Scale Deformation Clutter etc.

39 Representation –Generative / discriminative / hybrid –Appearance only or location and appearance –invariances –Part-based or global w/sub-window

40 Representation –Generative / discriminative / hybrid –Appearance only or location and appearance –invariances –Parts or global w/sub- window –Use set of features or each pixel in image

41 –Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning Learning

42 –Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –Methods of training: generative vs. discriminative Learning

43 –Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) –Level of supervision Manual segmentation; bounding box; image labels; noisy labels Learning Contains a motorbike

44 –Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) –Level of supervision Manual segmentation; bounding box; image labels; noisy labels –Batch/incremental (on category and image level; user-feedback ) Learning

45 –Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) –Level of supervision Manual segmentation; bounding box; image labels; noisy labels –Batch/incremental (on category and image level; user-feedback ) –Training images: Issue of overfitting Negative images for discriminative methods Priors Learning

46 –Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) –What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) –Level of supervision Manual segmentation; bounding box; image labels; noisy labels –Batch/incremental (on category and image level; user-feedback ) –Training images: Issue of overfitting Negative images for discriminative methods –Priors Learning

47 –Scale / orientation range to search over –Speed Recognition


Download ppt "Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15."

Similar presentations


Ads by Google