Deformable Part Model Presenter ： Liu Changyu Advisor ： Prof. Alex Hauptmann Interest ： Multimedia Analysis April 11 st, 2013.

Slides:

Advertisements

Similar presentations

Presenter: Duan Tran (Part of slides are from Pedro’s)

Advertisements

Jan-Michael Frahm, Enrique Dunn Spring 2013

DONG XU, MEMBER, IEEE, AND SHIH-FU CHANG, FELLOW, IEEE Video Event Recognition Using Kernel Methods with Multilevel Temporal Alignment.

Detecting Faces in Images: A Survey

Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights.

Lecture 31: Modern object recognition

LPP-HOG: A New Local Image Descriptor for Fast Human Detection Andy Qing Jun Wang and Ru Bo Zhang IEEE International Symposium.

Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.

Presenter: Hoang, Van Dung

Face Detection, Pose Estimation, and Landmark Localization in the Wild

Steerable Part Models Hamed Pirsiavash and Deva Ramanan

Intelligent Systems Lab. Recognizing Human actions from Still Images with Latent Poses Authors: Weilong Yang, Yang Wang, and Greg Mori Simon Fraser University,

Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA

More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.

Student: Yao-Sheng Wang Advisor: Prof. Sheng-Jyh Wang ARTICULATED HUMAN DETECTION 1 Department of Electronics Engineering National Chiao Tung University.

DISCRIMINATIVE DECORELATION FOR CLUSTERING AND CLASSIFICATION ECCV 12 Bharath Hariharan, Jitandra Malik, and Deva Ramanan.

Computer and Robot Vision I

Potential Projects RGBD gesture recognition with the Microsoft Kinect Person recognition by parts.

Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.

1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.

Object Recognizing We will discuss: Features Classifiers Example ‘winning’ system.

Spatial Pyramid Pooling in Deep Convolutional

Lecture 29: Recent work in recognition CS4670: Computer Vision Noah Snavely.

Global and Efficient Self-Similarity for Object Classification and Detection CVPR 2010 Thomas Deselaers and Vittorio Ferrari.

Generic object detection with deformable part-based models

Object Recognizing. Object Classes Individual Recognition.

Computer vision.

Shape-Based Human Detection and Segmentation via Hierarchical Part- Template Matching Zhe Lin, Member, IEEE Larry S. Davis, Fellow, IEEE IEEE TRANSACTIONS.

Object Bank Presenter ： Liu Changyu Advisor ： Prof. Alex Hauptmann Interest ： Multimedia Analysis April 4 th, 2013.

Object Recognizing. Recognition -- topics Features Classifiers Example ‘winning’ system.

“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)

Video Tracking Using Learned Hierarchical Features

Marco Pedersoli, Jordi Gonzàlez, Xu Hu, and Xavier Roca

Yao, B., and Fei-fei, L. IEEE Transactions on PAMI(2012)

Object Detection with Discriminatively Trained Part Based Models

Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.

BING: Binarized Normed Gradients for Objectness Estimation at 300fps

Pedestrian Detection and Localization

Object Recognition in Images Slides originally created by Bernd Heisele.

Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.

Locality-constrained Linear Coding for Image Classification

Face Detection Using Large Margin Classifiers Ming-Hsuan Yang Dan Roth Narendra Ahuja Presented by Kiang “Sean” Zhou Beckman Institute University of Illinois.

Students: Meera & Si Mentor: Afshin Dehghan WEEK 4: DEEP TRACKING.

Training and Evaluating of Object Bank Models Presenter ： Changyu Liu Advisor ： Prof. Alex Interest ： Multimedia Analysis May 16 th, 2013.

Histograms of Oriented Gradients for Human Detection(HOG)

CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.

Object Detection Overview Viola-Jones Dalal-Triggs Deformable models Deep learning.

Recognition Using Visual Phrases

Timo Ahonen, Abdenour Hadid, and Matti Pietikainen

Object Recognizing. Object Classes Individual Recognition.

More sliding window detection: Discriminative part-based models

Object Recognizing. Object Classes Individual Recognition.

A Discriminatively Trained, Multiscale, Deformable Part Model Yeong-Jun Cho Computer Vision and Pattern Recognition,2008.

Fine-grained Fine-grained Recognition( 细粒度分类 ) 沈志强.

Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,

WLD: A Robust Local Image Descriptor Jie Chen, Shiguang Shan, Chu He, Guoying Zhao, Matti Pietikäinen, Xilin Chen, Wen Gao 报告人：蒲薇榄.

Biologically Inspired Vision-based Indoor Localization Zhihao Li, Ming Yang

1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.

Object detection with deformable part-based models

References [1] - Y. LeCun, L. Bottou, Y. Bengio and P. Haffner, Gradient-Based Learning Applied to Document Recognition, Proceedings of the IEEE, 86(11): ,

Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.

Object detection as supervised classification

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

HOGgles Visualizing Object Detection Features

An HOG-LBP Human Detector with Partial Occlusion Handling

“The Truth About Cats And Dogs”

Outline Background Motivation Proposed Model Experimental Results

University of Central Florida

Heterogeneous convolutional neural networks for visual recognition

Presentation transcript:

Deformable Part Model Presenter ： Liu Changyu Advisor ： Prof. Alex Hauptmann Interest ： Multimedia Analysis April 11 st, 2013

CMU - Language Technologies Institute 2 Contents Introduction Model Learning Experiment Conclusion

CMU - Language Technologies Institute 3 1. Research Question 1) Object bank is just a image representation for high-level visual tasks, it should be used combing with detailed efficient traning method. 2) For difficult tasks, such as extending Object Bank to over 1000 objects and benchmarks of the PASCAL Challenge, it need new traning method to improve the average precision. 3) So we want to combine use the part model that proposed at CVPR in Introduction

CMU - Language Technologies Institute 4 2. What’s Deformable Part Model? Deformable Part is a discriminatively trained, multiscale model for image training that aim at making possible the effective use of more latent information such as hierarchical (grammar) models and models involving latent three dimensional pose. Introduction

CMU - Language Technologies Institute 5 Contents Introduction Model Learning Experiment Conclusion

CMU - Language Technologies Institute 6 Fig. 1 Deformable Part Model Model --- Deformable Part The deformable model include both a coarse global template covering an entire object and higher resolution part templates.The templates represent histogram of gradient features (b1) coarse template (b2)part templates(b3) spatial model (a) person detection Example

CMU - Language Technologies Institute 7 Fig.2 Pyramids of Deformable Part Model Model ---Deformable Part Fig.2 illustrates a placement of such a model in a HOG pyramid. The root filter location defines the detection window (the pixels inside the cells covered by the filter). The part filters are placed several levels down in the pyramid, so the HOG cells at that level have half the size of cells in the root filter level. (b)HOG feature pyramid (a) Image pyramid

CMU - Language Technologies Institute 8 Model ---Deformable Parts The score of a placement is given by the scores of each filter (the data term) plus a score of the placement of each part relative to the root (the spatial term), Where is the w × h × 9 × 4 weight vector are the features in a w×h subwindow of a HOG pyramid. gives the location of the i-th part relative to the root location. ai and bi are two dimensional vectors coefficients for measuring a score for each possible placement of the i-th part.

CMU - Language Technologies Institute 9 Contents Introduction Model Learning Experiment Conclusion

CMU - Language Technologies Institute 10 Learning Latent SVMs This model use Latent SVMs to have a classification. As: where is a vector of model parameters, needed to be learned first, according to: z is a set of latent values.

CMU - Language Technologies Institute 11 Contents Introduction Model Learning Experiment Conclusion

CMU - Language Technologies Institute 12 Experiment

CMU - Language Technologies Institute 13 We execute the matlab code as….. Experiment

CMU - Language Technologies Institute 14 Contents Introduction Model Algorithm Experiment Conclusion

CMU - Language Technologies Institute 15 1)Experiment has not completed yet, it needed more object models used for deformable part training. 2) Computation need to be speed up. Conclusion

CMU - Language Technologies Institute 16 Reference [1] P. Felzenszwalb, D. McAllester, D. Ramanan. A Discriminatively Trained, Multiscale, Deformable Part Model. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008 [2] P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan. Object Detection with Discriminatively Trained Part Based Models. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 32, No. 9, Sep [3] Level Image Representation for Scene Classification and Semantic Feature Sparsification. Proceedings of the Neural Information Processing Systems (NIPS), 2010.

CMU - Language Technologies Institute 17 Thank you!