Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.

Slides:



Advertisements
Similar presentations
Shape Matching and Object Recognition using Low Distortion Correspondence Alexander C. Berg, Tamara L. Berg, Jitendra Malik U.C. Berkeley.
Advertisements

Shape Context and Chamfer Matching in Cluttered Scenes
The Layout Consistent Random Field for detecting and segmenting occluded objects CVPR, June 2006 John Winn Jamie Shotton.
Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.
Ignas Budvytis*, Tae-Kyun Kim*, Roberto Cipolla * - indicates equal contribution Making a Shallow Network Deep: Growing a Tree from Decision Regions of.
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
A generic model to compose vision modules for holistic scene understanding Adarsh Kowdle *, Congcong Li *, Ashutosh Saxena, and Tsuhan Chen Cornell University,
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Part 4: Combined segmentation and recognition by Rob Fergus (MIT)
Learning to Combine Bottom-Up and Top-Down Segmentation Anat Levin and Yair Weiss School of CS&Eng, The Hebrew University of Jerusalem, Israel.
Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of.
- Recovering Human Body Configurations: Combining Segmentation and Recognition (CVPR’04) Greg Mori, Xiaofeng Ren, Alexei A. Efros and Jitendra Malik -
Ivan Laptev IRISA/INRIA, Rennes, France September 07, 2006 Boosted Histograms for Improved Object Detection.
LOCUS (Learning Object Classes with Unsupervised Segmentation) A variational approach to learning model- based segmentation. John Winn Microsoft Research.
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
Pedestrian Detection in Crowded Scenes Dhruv Batra ECE CMU.
Detecting Pedestrians by Learning Shapelet Features
Fast intersection kernel SVMs for Realtime Object Detection
Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton*, J. Winn†, C. Rother†, and A.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Multi-Class Object Recognition Using Shared SIFT Features
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Transferring information using Bayesian priors on object categories Li Fei-Fei 1, Rob Fergus 2, Pietro Perona 1 1 California Institute of Technology, 2.
The Layout Consistent Random Field for Recognizing and Segmenting Partially Occluded Objects By John Winn & Jamie Shotton CVPR 2006 presented by Tomasz.
Object Class Recognition by Unsupervised Scale-Invariant Learning R. Fergus, P. Perona, and A. Zisserman Presented By Jeff.
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
LOCUS Demo Stefan Zickler. Two “different” classes Class “Car Side Views” Class “Car Rears”
1 Outline Overview Integrating Vision Models CCM: Cascaded Classification Models Learning Spatial Context TAS: Things and Stuff Descriptive Querying of.
Unsupervised Category Modeling, Recognition and Segmentation Sinisa Todorovic and Narendra Ahuja.
What, Where & How Many? Combining Object Detectors and CRFs
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Unsupervised Learning of Categories from Sets of Partially Matching Image Features Kristen Grauman and Trevor Darrel CVPR 2006 Presented By Sovan Biswas.
Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.
Professor: S. J. Wang Student : Y. S. Wang
Recognition using Regions (Demo) Sudheendra V. Outline Generating multiple segmentations –Normalized cuts [Ren & Malik (2003)] Uniform regions –Watershed.
Building local part models for category-level recognition C. Schmid, INRIA Grenoble Joint work with G. Dorko, S. Lazebnik, J. Ponce.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
A Statistically Selected Part-Based Probabilistic Model for Object Recognition Zhipeng Zhao, Ahmed Elgammal Department of Computer Science, Rutgers, The.
Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.
Representations for object class recognition David Lowe Department of Computer Science University of British Columbia Vancouver, Canada Sept. 21, 2006.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
Layered Object Detection for Multi-Class Image Segmentation UC Irvine Yi Yang Sam Hallman Deva Ramanan Charless Fowlkes.
Associative Hierarchical CRFs for Object Class Image Segmentation
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Improved Object Detection
Recognition Using Visual Phrases
Jigsaws: joint appearance and shape clustering John Winn with Anitha Kannan and Carsten Rother Microsoft Research, Cambridge.
Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.
TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton ; University of Cambridge J. Jinn,
Object Recognition by Discriminative Combinations of Line Segments and Ellipses Alex Chia ^˚ Susanto Rahardja ^ Deepu Rajan ˚ Maylor Leung ˚ ^ Institute.
Holistic Scene Understanding Virginia Tech ECE /02/26 Stanislaw Antol.
Object Recognition by Parts
Object detection with deformable part-based models
LOCUS: Learning Object Classes with Unsupervised Segmentation
Object Recognition by Parts
Object detection as supervised classification
Object Recognition by Parts
Learning to Combine Bottom-Up and Top-Down Segmentation
Object Recognition by Parts
Liyuan Li, Jerry Kah Eng Hoe, Xinguo Yu, Li Dong, and Xinqi Chu
Object Recognition by Parts
Object Recognition with Interest Operators
Presentation transcript:

Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake

Recent Related Work Recognition Recognition Constellation model – e.g. Fergus et al. [CVPR 03] and Li et al. [ICCV 03] Constellation model – e.g. Fergus et al. [CVPR 03] and Li et al. [ICCV 03] Deformable shape models – Berg et al. [CVPR 05] Deformable shape models – Berg et al. [CVPR 05] Segmentation Segmentation Top-down/bottom-up segmentation – Borenstein et al. [POCV 04] Top-down/bottom-up segmentation – Borenstein et al. [POCV 04] Joint recognition and segmentation Joint recognition and segmentation Implicit Shape Model – Leibe et al. [CVPR 05] Implicit Shape Model – Leibe et al. [CVPR 05] LOCUS – Winn & Jojic [ICCV 05] LOCUS – Winn & Jojic [ICCV 05] Obj Cut – Kumar et al. [ICCV 05] Obj Cut – Kumar et al. [ICCV 05] Multi-Class Recognition and Segmentation Multi-Class Recognition and Segmentation Multiscale CRFs – He et al. [CVPR 04] Multiscale CRFs – He et al. [CVPR 04] pLSA-based models – e.g. Sivic et al. and Fergus et al. [ICCV 05] pLSA-based models – e.g. Sivic et al. and Fergus et al. [ICCV 05] Hierarchical field framework – Kumar & Hebert [ICCV 05] Hierarchical field framework – Kumar & Hebert [ICCV 05]

Contour-Based Learning Goal – single class categorical recognition Goal – single class categorical recognition learn to detect and localise objects learn to detect and localise objects “find the car, face or horse” “find the car, face or horse” How can we exploit object contour? How can we exploit object contour? Desired detection results Our contribution

Contour Features Features Features contour fragments contour fragments and their parameters and their parameters Local features not whole contour Local features not whole contour account for variability separately account for variability separately increase generalisation increase generalisation decrease training requirements decrease training requirements

Object Model p σ T Model is set of M features Model is set of M features star constellation star constellation Each feature Each feature contour fragment expected offsetmodel parametersclassifier parameters

Matching Features Canny Edge Detector Distance Transform Gaussian weighted oriented chamfer matching Gaussian weighted oriented chamfer matching aligns features to image aligns features to image

Matching Features Gaussian weighted oriented chamfer matching Gaussian weighted oriented chamfer matching aligns features to image aligns features to image Chamfer Matching feature match score at optimal position optimal position

confidence weighted weak learner Location Sensitive Classification Feature match scores make detection simple Feature match scores make detection simple Detection uses a boosted classification function K(c): Detection uses a boosted classification function K(c): M number of features FmFmFmFm feature m E canny edge map c object centroid match scorethresholded match score  m weak learner threshold amamamam weak learner confidence bmbmbmbm  0-1 indicator function

Evaluate K(c) for all c gives a classification map Evaluate K(c) for all c gives a classification map confidence as function of position confidence as function of position Globally thresholded local maxima give final detections Globally thresholded local maxima give final detections Object Detection test image classification map contours object no object

Learning System Detection Boosting Algorithm K(c)K(c) Segmented Training Data Test Data Object Detections Background Training Data

Training Data

Boot-Strapping Learn detector K 1 (c) Learn detector K 1 (c) segmented training data segmented training data Evaluate detector K 1 (c) on Evaluate detector K 1 (c) on unsegmented class images unsegmented class images locates object centroids locates object centroids background images background images locates clutter locates clutter +

Learning System Detection Boosting Algorithm K1(c) K1(c) Segmented Training Data Unsegmented Training Data Detection Object Detections Boosting Algorithm K 2 (c) Test Data Background Training Data

Building a Fragment Dictionary …… Masks (~10 images) Contour Fragments T n (~1000 fragments) ……

Training Examples Learn classifier K(c) by boosting from Learn classifier K(c) by boosting from feature vectors x feature vectors x target values y (object/background) target values y (object/background) Encourage ‘good’ classification map: Encourage ‘good’ classification map: Take training examples at: Take training examples at: object no object + -

Boosting as Feature Selection Feature vectors Feature vectors 1000 random fragments 50 discriminative fragments Fragment Selection 1. Fragment Selection Model Parameter Estimation 2. Model Parameter Estimation Select ,  for each feature Weak-Learner Estimation 3. Weak-Learner Estimation Select  a, b for each feature FkFkFkFk candidate feature (fragment T 2 T, parameters  2 , 2  ) N number of candidate features = |T| x |  | x |  | EiEiEiEi canny edge map I cjcjcjcj example centroid j in image i

Learning System Detection Boosting Algorithm K 1 (c) Segmented Training Data Unsegmented Training Data Detection Object Detections Boosting Algorithm K 2 (c) Test Data Background Training Data

Contour Experiments Datasets: Datasets: Weizmann Horses Weizmann Horses UIUC Cars UIUC Cars Caltech Faces Caltech Faces Caltech Motorbikes Caltech Motorbikes Caltech Background Caltech Background Each category evaluated in turn Each category evaluated in turn 10 segmented training images 10 segmented training images 40 unsegmented training images 40 unsegmented training images 50 background images 50 background images single scale evaluation single scale evaluation

Contour Results

Recall Precision equal error rates Recall Precision equal error rates Weizmann Horses: 92.1% Weizmann Horses: 92.1% UIUC Cars: 92.8% UIUC Cars: 92.8% Caltech Faces: 94.0% Caltech Faces: 94.0% Caltech Motorbikes: 92.4% Caltech Motorbikes: 92.4% HorsesCars

Contour Results Occlusion Performance (horses)Performance of K 1 vs. K 2 (faces) No. Segmented Training Images

Conclusions Contour is very powerful cue Contour is very powerful cue Boot-strapping improves results Boot-strapping improves results Future directions Future directions extend to multiple classes, scales, views extend to multiple classes, scales, views segmentation segmentation