Improved Object Detection

Slides:



Advertisements
Similar presentations
Weakly supervised learning of MRF models for image region labeling Jakob Verbeek LEAR team, INRIA Rhône-Alpes.
Advertisements

Combining Detectors for Human Hand Detection Antonio Hernández, Petia Radeva and Sergio Escalera Computer Vision Center, Universitat Autònoma de Barcelona,
Histograms of Oriented Gradients for Human Detection
Classification using intersection kernel SVMs is efficient Joint work with Subhransu Maji and Alex Berg Jitendra Malik UC Berkeley.
Ivan Laptev IRISA/INRIA, Rennes, France September 07, 2006 Boosted Histograms for Improved Object Detection.
Computer Vision for Human-Computer InteractionResearch Group, Universität Karlsruhe (TH) cv:hci Dr. Edgar Seemann 1 Computer Vision: Histograms of Oriented.
Lecture 31: Modern object recognition
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.
Detecting Pedestrians by Learning Shapelet Features
Fast intersection kernel SVMs for Realtime Object Detection
More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.
The Viola/Jones Face Detector (2001)
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Graz University of Technology, AUSTRIA Institute for Computer Graphics and Vision Fast Visual Object Identification and Categorization Michael Grabner,
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
Learning Spatial Context: Using stuff to find things Geremy Heitz Daphne Koller Stanford University October 13, 2008 ECCV 2008.
TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton*, J. Winn†, C. Rother†, and A.
Object Detection using Histograms of Oriented Gradients
Learning Spatial Context: Can stuff help us find things? Geremy Heitz Daphne Koller April 14, 2008 DAGS Stuff (n): Material defined by a homogeneous or.
Distinctive Image Features from Scale-Invariant Keypoints David G. Lowe – IJCV 2004 Brien Flewelling CPSC 643 Presentation 1.
Lecture 29: Recent work in recognition CS4670: Computer Vision Noah Snavely.
Programme 2pm Introduction –Andrew Zisserman, Chris Williams 2.10pm Overview of the challenge and results –Mark Everingham (Oxford) 2.40pm Session 1: The.
© 2013 IBM Corporation Efficient Multi-stage Image Classification for Mobile Sensing in Urban Environments Presented by Shashank Mujumdar IBM Research,
Face Detection CSE 576. Face detection State-of-the-art face detection demo (Courtesy Boris Babenko)Boris Babenko.
Matthew Brown University of British Columbia (prev.) Microsoft Research [ Collaborators: † Simon Winder, *Gang Hua, † Rick Szeliski † =MS Research, *=MS.
Generic object detection with deformable part-based models
EADS DS / SDC LTIS Page 1 7 th CNES/DLR Workshop on Information Extraction and Scene Understanding for Meter Resolution Image – 29/03/07 - Oberpfaffenhofen.
Recognition using Regions (Demo) Sudheendra V. Outline Generating multiple segmentations –Normalized cuts [Ren & Malik (2003)] Uniform regions –Watershed.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.
Face detection Slides adapted Grauman & Liebe’s tutorial
Visual Object Recognition
Object Detection with Discriminatively Trained Part Based Models
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
Pedestrian Detection and Localization
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.
Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/18/10.
Project 3 Results.
Object detection, deep learning, and R-CNNs
Histograms of Oriented Gradients for Human Detection(HOG)
Grouplet: A Structured Image Representation for Recognizing Human and Object Interactions Bangpeng Yao and Li Fei-Fei Computer Science Department, Stanford.
Kylie Gorman WEEK 1-2 REVIEW. CONVERTING AN IMAGE FROM RGB TO HSV AND DISPLAY CHANNELS.
CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.
Object Detection Overview Viola-Jones Dalal-Triggs Deformable models Deep learning.
Recognition Using Visual Phrases
FACE DETECTION : AMIT BHAMARE. WHAT IS FACE DETECTION ? Face detection is computer based technology which detect the face in digital image. Trivial task.
Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.
Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.
Presented by David Lee 3/20/2006
SUN Database: Large-scale Scene Recognition from Abbey to Zoo Jianxiong Xiao *James Haysy Krista A. Ehinger Aude Oliva Antonio Torralba Massachusetts Institute.
Object Recognition by Discriminative Combinations of Line Segments and Ellipses Alex Chia ^˚ Susanto Rahardja ^ Deepu Rajan ˚ Maylor Leung ˚ ^ Institute.
More sliding window detection: Discriminative part-based models
Week 4: 6/6 – 6/10 Jeffrey Loppert. This week.. Coded a Histogram of Oriented Gradients (HOG) Feature Extractor Extracted features from positive and negative.
Cascade for Fast Detection
Object detection with deformable part-based models
Presented by David Lee 3/20/2006
Presented by Minh Hoai Nguyen Date: 28 March 2007
Lit part of blue dress and shadowed part of white dress are the same color
Object detection as supervised classification
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
A Tutorial on HOG Human Detection
Finding Clusters within a Class to Improve Classification Accuracy
Learning to Detect Faces Rapidly and Robustly
“The Truth About Cats And Dogs”
Brief Review of Recognition + Context
Presentation transcript:

Improved Object Detection Boosted Histograms for Improved Object Detection 2. Interest points  - Motivation  - Detection of invariant interest points    -- Harris, Harris-Laplace, Harris-affine (Schmid, Mikolajczyk)    -- SIFT (Lowe)    -- Region-based (Matas / Tuytelaars, Van Gool)    -- Histogram-based (Kadir, Brady)  - Evaluation of stability  - Applications    -- Wide base-line matching    -- Object recognition    -- Video indexing Ivan Laptev IRISA/INRIA, Rennes, France December 08, 2006

Histograms for object recognition Remarkable success of recognition methods using histograms of local image measurements: [Swain & Ballard 1991] - Color histograms [Schiele & Crowley 1996] - Receptive field histograms [Lowe 1999] - localized orientation histograms (SIFT) [Schneiderman & Kanade 2000] - localized histograms of wavelet coef. [Leung & Malik 2001] - Texton histograms [Belongie et.al. 2002] - Shape context [Dalal & Triggs 2005] - Dense orientation histograms Likely explanation: Histograms are robust to image variations such as limited geometric transformations and object class variability.

Histograms: What vs. Where What to measure? Color [SB91] Gaussian derivatives [SC96] Wavelet coeff. [SK00] Textons [LM01] Gradient orientation [L99,DT05] Histograms Where to measure? Whole image [SB91,SC96] Pre-defined grid [SK00,BMP02,DT05] Key points [L99] No guarantee for optimal recognition Different regions may have different discriminative power Use of interest regions might be suboptimal: IR-detectors are designed for good localization/matching Given rough localization, IRs are not necessary best regions for recognition A B C D

Idea selected features boosting weak classifier    weak classifier Efficient discriminative classifier [Freund&Schapire’97] Good performance for face detection [Viola&Jones’01] AdaBoost: Haar features optimal threshold SVM, ANN 1-bin classifier Histogram features Fisher discriminant

Histogram features ~10^5 rectangle features     Histograms over 4 gradient orientations, 4 subdivisions for each rectangle ~10^5 rectangle features

+ Training data Crop and resize Perturb annotation Increase training set X 10    +

Training: Selected Features 0.999 correct classification 10^-5 false positives 376 of ~10^5 features selected

Object detection Scan and classify image windows at different positions and scales Conf.=5 Cluster detections in the space-scale space Assign cluster size to the detection confidence

PASCAL Visual Object Classes Challenge 2005 (VOC’05) motorbikes #217 / #220 bicycles #123 / #123 people #152 / #149 cars #320 / #341

Evaluation criteria Ground truth annotation Detection results: >50 % overlap of bounding box with GT one bounding box for each object confidence value for each detection Detection results: >50 % overlap of bounding box with GT one bounding box for each object confidence value for each detection Detection results: >50 % overlap of bounding box with GT one bounding box for each object confidence value for each detection Detection results: >50 % overlap of bounding box with GT one bounding box for each object confidence value for each detection Precision-Recall (PR) curve: Average Precision (AP) value:

Evaluation of detection PR-curves for the “Motorbike” validation dataset: FLD learner [Levi and Weiss, CVPR 2004] “Learning object detection from a small number of examples: The importance of good features” + 1-bin classifier

Results for VOC’05 Challenge Bicycles test1 People test1 Motorbikes test1 cars test1

Results for VOC’05 Challenge Average Precision values:

PASCAL Visual Object Classes Challenge 2006 (VOC’06)

Competition "comp3" (train on VOC data) Results for VOC’06 Challenge Competition "comp3" (train on VOC data) Class “bicycle" examples

Competition "comp3" (train on VOC data) Results for VOC’06 Challenge Competition "comp3" (train on VOC data) Class “cow" examples

Competition "comp3" (train on VOC data) Results for VOC’06 Challenge Competition "comp3" (train on VOC data) Class “horse" examples

Competition "comp3" (train on VOC data) Results for VOC’06 Challenge Competition "comp3" (train on VOC data) Class “motorbike"

Competition "comp3" (train on VOC data) Results for VOC’06 Challenge Competition "comp3" (train on VOC data) Class “person"

Results for VOC’06 Challenge Average Precision values:   bicycle bus car cat cow dog horse motorbike person sheep Cambridge 0.249 0.138 0.254 0.151 0.149 0.118 0.091 0.178 0.030 0.131 ENSMP - 0.398 0.159 INRIA_Douze 0.414 0.117 0.444 0.212 0.390 0.164 0.251 INRIA_Laptev 0.440 0.224 0.140 0.318 0.114 TUD 0.153 0.074 TKK 0.303 0.169 0.222 0.160 0.252 0.113 0.137 0.265 0.039 0.227

How “interesting” are boosted regions? Affine Harris regions Boosted regions Harris value is not a significant measure for boosted regions  “Interest regions” are not necessarily good for recognition?

Final Notes Open questions: All results are obtained with a single set of parameters Small number of training samples is sufficient Efficient detection: 10fps on 320x280 images Extension to texton/color histogram features is straightforward Open questions: Other free-shape regions better? How to find them? Better weak learner that takes advantage of histogram properties View transformations

Final Notes Open questions: All results are obtained with a single set of parameters Small number of training samples is sufficient Efficient detection: 10fps on 320x280 images Extension to texton/color histogram features is straightforward Open questions: Other free-shape regions better? How to find them? Better weak learner that takes advantage of histogram properties View transformations

Final Notes Open questions: All results are obtained with a single set of parameters Small number of training samples is sufficient Efficient detection: 10fps on 320x280 images Extension to texton/color histogram features is straightforward Open questions: Other free-shape regions better? How to find them? Better weak learner that takes advantage of histogram properties View transformations

Final Notes Open questions: All results are obtained with a single set of parameters Small number of training samples is sufficient Efficient detection: 10fps on 320x280 images Extension to texton/color histogram features is straightforward Open questions: Other free-shape regions better? How to find them? Better weak learner that takes advantage of histogram properties View transformations

Final Notes Open questions: All results are obtained with a single set of parameters Small number of training samples is sufficient Efficient detection: 10fps on 320x280 images Extension to texton/color histogram features is straightforward Open questions: Other free-shape regions better? How to find them? Better weak learner that takes advantage of histogram properties View transformations