A Statistically Selected Part-Based Probabilistic Model for Object Recognition Zhipeng Zhao, Ahmed Elgammal Department of Computer Science, Rutgers, The.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

A Bayesian Approach to Recognition Moshe Blank Ita Lifshitz Reverend Thomas Bayes
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
Detecting Faces in Images: A Survey
Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of.
Multi-Local Feature Manifolds for Object Detection Oscar Danielsson Stefan Carlsson Josephine Sullivan
Unsupervised Learning of Visual Object Categories Michael Pfeiffer
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.
Model: Parts and Structure. History of Idea Fischler & Elschlager 1973 Yuille ‘91 Brunelli & Poggio ‘93 Lades, v.d. Malsburg et al. ‘93 Cootes, Lanitis,
Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Object Localization using affine invariant substructure constraints Ishani Chakraborty and Ahmed Elgammal, Dept. of Computer Science, Rutgers University.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Lecture 28: Bag-of-words models
Generic Object Recognition -- by Yatharth Saraf A Project on.
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
Robust Real-time Object Detection by Paul Viola and Michael Jones ICCV 2001 Workshop on Statistical and Computation Theories of Vision Presentation by.
1 Interest Operator Lectures lecture topics –Interest points 1 (Linda) interest points, descriptors, Harris corners, correlation matching –Interest points.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of.
Object Class Recognition Using Discriminative Local Features Gyuri Dorko and Cordelia Schmid.
1 Integration of Background Modeling and Object Tracking Yu-Ting Chen, Chu-Song Chen, Yi-Ping Hung IEEE ICME, 2006.
Automatic Matching of Multi-View Images
Object Class Recognition by Unsupervised Scale-Invariant Learning R. Fergus, P. Perona, and A. Zisserman Presented By Jeff.
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Foundations of Computer Vision Rapid object / face detection using a Boosted Cascade of Simple features Presented by Christos Stoilas Rapid object / face.
Overview Introduction to local features
Exercise Session 10 – Image Categorization
Unsupervised Learning of Categories from Sets of Partially Matching Image Features Kristen Grauman and Trevor Darrel CVPR 2006 Presented By Sovan Biswas.
EADS DS / SDC LTIS Page 1 7 th CNES/DLR Workshop on Information Extraction and Scene Understanding for Meter Resolution Image – 29/03/07 - Oberpfaffenhofen.
SVCL Automatic detection of object based Region-of-Interest for image compression Sunhyoung Han.
Overview Harris interest points Comparing interest points (SSD, ZNCC, SIFT) Scale & affine invariant interest points Evaluation and comparison of different.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
Building local part models for category-level recognition C. Schmid, INRIA Grenoble Joint work with G. Dorko, S. Lazebnik, J. Ponce.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Learning a Fast Emulator of a Binary Decision Process Center for Machine Perception Czech Technical University, Prague ACCV 2007, Tokyo, Japan Jan Šochman.
Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.
Features-based Object Recognition P. Moreels, P. Perona California Institute of Technology.
In Defense of Nearest-Neighbor Based Image Classification Oren Boiman The Weizmann Institute of Science Rehovot, ISRAEL Eli Shechtman Adobe Systems Inc.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Bayesian Parameter Estimation Liad Serruya. Agenda Introduction Bayesian decision theory Scale-Invariant Learning Bayesian “One-Shot” Learning.
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
Michael Isard and Andrew Blake, IJCV 1998 Presented by Wen Li Department of Computer Science & Engineering Texas A&M University.
The Viola/Jones Face Detector A “paradigmatic” method for real-time object detection Training is slow, but detection is very fast Key ideas Integral images.
Overview Introduction to local features Harris interest points + SSD, ZNCC, SIFT Scale & affine invariant interest point detectors Evaluation and comparison.
Discussion of Pictorial Structures Pedro Felzenszwalb Daniel Huttenlocher Sicily Workshop September, 2006.
Discovering Objects and their Location in Images Josef Sivic 1, Bryan C. Russell 2, Alexei A. Efros 3, Andrew Zisserman 1 and William T. Freeman 2 Goal:
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.
Object Recognition by Parts
Nearest-neighbor matching to feature database
Video Google: Text Retrieval Approach to Object Matching in Videos
Object Recognition by Parts
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
A. Opelt, M. Fussenegger, A. Pinz, P. Auer
Finding Clusters within a Class to Improve Classification Accuracy
Nearest-neighbor matching to feature database
Object Recognition by Parts
Object Recognition by Parts
Unsupervised Learning of Models for Recognition
Liyuan Li, Jerry Kah Eng Hoe, Xinguo Yu, Li Dong, and Xinqi Chu
Video Google: Text Retrieval Approach to Object Matching in Videos
Object Recognition by Parts
Object Recognition with Interest Operators
Presentation transcript:

A Statistically Selected Part-Based Probabilistic Model for Object Recognition Zhipeng Zhao, Ahmed Elgammal Department of Computer Science, Rutgers, The State University of New Jersey, Piscataway, NJ, USA

Outline Background and motivations Statistical selection of image parts Parts based probabilistic model Experiments Conclusion

Generic Object Recognition Variations in scale, orientation and visibility Variability within Specificity Object of interest has to be recognized in context of multiple other objects and cluttered background

Recognition of Object Categories has recently gained a lot of attention in computer vision. 3D model based method traditionally handle specific object instance, e.g. with the goal of recovering object pose. Appearance template search-based methods, e.g., Schneiderman et al 00,Viola et al 02

Part-based object recognition Part-based methods [Fischler73, Lowe99, Fergus03, Fergus05]: Objects are modeled as constellation, a collection of parts or local features with distinctive appearance and spatial position. the recognition is based on inferring object class based on similarity in parts’ appearance and their spatial arrangement.

Bag-of-features: Zhang et al (1995), Mohr et al (1997), Lowe (1999), Matas et al (2002), Mikolajczyk et al(2003), Bray et al (2004). Ignore spatial arrangement of parts Very successful result But.. None of the features are completely affine invariant Correspondences based on very closely related features Not conducive for recognition of object categories

Part Structure Vidal-Naquet (2001), Zisserman et al (2003), Perona et al ( 2003), Schiele et al ( 2004), Triggs et al (2005), Serre et al (2005) Limitations: Combinatorial combination of parts and locations Number of parts considered are limited Huge chunk of image information is discarded

Part Based approaches for object recognition Phases in part based recognition: Part extraction [kadir01]: extract salient features from the image. Part selection [Dorko04]: select parts that best characterize the target object. Select parts by classification likelihood. Select parts by mutual information. Object Model [Fergus03, Fergus05]: construct a model of the target object using the selected parts. Object recognition [Fergus03, Fergus05]: having selected parts from a test image, perform its recognition using object model.

Discriminative Part Selection - Problem Description Given : A collection of parts from positive and negative images Find : Parts that consistently appear in the positive images but rarely in the negative images Requirements : Focus on part similarities across positive images Ignore similarity between parts within an image Avoid ubiquitous parts - those appear in all classes A collection of parts from the negative classes Parts from the positive class images

Our contributions Part selection: A statistical approaches for unsupervised selection of discriminative parts: Finds the parts that best discriminate between the positive and negative classes. Object model and recognition: A probabilistic Bayesian approach for recognition where object model does not need a reference part. We investigated PCA and 2D PCA for image part representation in our experiment and did a comparison.

Outline Background and motivations Statistical selection of image parts Parts based probabilistic model Experiments Conclusion

Statistical selection of image parts Given: image parts from positive training images. Find: parts that constantly appear in multiple instances of the positive evaluation images but only rarely appear in the negative evaluation images. We use each image part from training images to build classifier for positive and negative evaluation images. This classifier is built based on whether the image part appears in the evaluation image or not. A collection of parts from the negative classes Parts from the positive class images

Statistical selection of image parts This classifier is built based on whether the image part appears in the evaluation image or not. Similar to boosting in building week classifier, but we filter out uncharacteristic image parts and only keep the image parts on which better classifiers are built. These selected parts should be informative of the object and the model built from these parts is likely to achieve better recognition performance.

Statistical selection of image parts Build classifier: The appearance of image part v in evaluation image V can be implied by the Euclidean distance between v and the closest image part in V: We can use D(V,v) and a threshold t to classify evaluation data: Image V is positive if the distance between image part v and the image V is less than a threshold t, which implies v appears in V: Image V is negative if otherwise.

Statistical selection of image parts Measure the performance of the classifier: Performance is measure by the misclassified evaluation image: Select image parts: We select the image parts from which we can build a classifier with a classification error rate less than a threshold θ.

Experiments Image parts selection from the statistical methods:

Outline Background and motivations Statistical selection of image parts Parts based probabilistic model Experiments Conclusion

Parts based probabilistic model We model the probability of an object with its centroid, given the observed image parts from that image. If the probability is larger than a threshold, it indicates the presence of the object class in the image. Assuming independence between image parts v k and using Bayes’ rule, the probability model of object class O with its centroid C, given the m observed image parts (k=1,…,m), can be formulated as:

Parts based probabilistic model P(v k |O,C) can be modeled as mixture-Gaussians learnt from the selected image parts from the training data, which is clustered into n clusters A i, ( i =1..n ) according to their appearance and 2D offset of the centroid to them:

Parts based probabilistic model With further assumption of the independence between P(O) and P(C) and both part v k and the centroid C from one cluster following normal distribution, we can estimate the terms for P(v k |O,C) : Here and denote the sample mean for v k and C respectively. and denote the sample covariance for v k and C respectively. Other terms can be approximated using the statistics from each of the cluster A i.

Parts based probabilistic model Recognition: While performing recognition, the P(v k ) can be ignored so we have The recognition can be viewed as: for each of the image part v k, it will vote for the possible object centroid according the clusters it is close to: vkvk cluster i cluster j cluster k Weighted vote for centroid Test Image

Outline Background and motivations Statistical selection of image part Parts based probabilistic model Experiments Conclusion

Experiments Dataset: Dataset from Caltech database with four class objects: motorbikes, airplane, face, car rear end against background. trainevaluatetest positive negativepositivenegative Each class

Experiments Image part detection and representation: Use Kadir and Brady’s region based feature detector [Kadir01] for detecting informative image partes. Normalize and rescale image parts to 11×11 pixels and represent them as 121 dimension intensity vector. Both PCA and 2D PCA were applied on vectors to get a more compact 18 dimension intensity representation.

Experiments Experiment setting: Extract 100 image parts from each of the training image and evaluation image. Applied the statistical methods for removing the image parts from the background. Statistical method: build a simple classifier from each image part in training images and select the ones which led to a classifier with classification error rate less than 24%. Compute the probability of the centroid of a possible object in the image as the indicator of its presence. Use 2D offset between the image part and the centroid of the image as spatial information for the image part. Use k-means algorithm to cluster selected parts into 70 clusters and calculate the statistics for them.

Experiments Image parts selection from the statistical methods:

Experiments Objects detected in the image:

Experiments Equal ROC performance using different methods: datasetNo selection Statistical Method with 2D PCA Statistical Method with PCA [Fergus03][Opelt04] Airplane54.2%95.8%94.4%90.2%88.9% Motorbike67.8%93.7%94.9%92.5%92.2% Face62.7%97.3%98.4%96.4%93.5% Car(rear)65.6%98.0%96.7%90.3%n/a

Conclusion: We have presented a statistical method for selecting informative image parts for part-based object detection and class recognition. This method yields competitive recognition rates and surpasses the performance of many existing methods. It is a general method suitable for selecting a set of features in other application. Future work: Integrate information regarding the spatial arrangement between image parts; Develop the framework as a general method with various hyper-parameters automatically determined.

Main References [Dorko 04] G. Dorko and C. Schmid. Object Class Recognition Using Discriminative Local Features. Submitted to PAMI04. [Fergus 03] R. Fergus, P. Perona and A. Zisserman. Object class recognition by unsupervised scale-invariant learning. In CVPR(2) page , [Fergus 05] R. Fergus, P. Perona and A. Zisserman. A Sparse Object Category Model for Efficient Learning and Exhaustive Recogntion. [Fischler 73] M. Fischler and R.Elschlager. The representation and matching of pictorial structures, IEEE Transaction on Computer c-22(1): [Kadir 01] T. Kadir and M. Brady. Scale, saliency and image description. IJCV 2001 [Lowe 99] D. G. Lowe. Object recogntion from local scale-invariant features. In Proc. Of the International Conference on Computer Vision ICCV Corfu, pages , 1999 [Opelt 04] A. Opelt, M. Fussenegger, A. Pinz, P. Auer. Weak hypotheses and boosting for generic object detection and recognition. ECCV(2) 71-84, 2004 [Schneiderman 00] H.Schneiderman and T.Kanade. A statistical method 3d object detection applied to faces and cars pages 45-51, 2000 [Viola 02] P.Viola and M. Jones Robust real time object detection. International Journal of Computer Vision [Wolfson 97] H.J. Wolfson and I. Rigoutsos. Geometric hashing: An overview. IEEE Computational Science & Engineer, 4(4):10-21,/1997.