Classification using intersection kernel SVMs is efficient

Slides:



Advertisements
Similar presentations
Max-Margin Additive Classifiers for Detection
Advertisements

Computer Vision Group UC Berkeley How should we combine high level and low level knowledge? Jitendra Malik UC Berkeley Recognition using regions is joint.
Introduction to Support Vector Machines (SVM)
Rich feature Hierarchies for Accurate object detection and semantic segmentation Ross Girshick, Jeff Donahue, Trevor Darrell, Jitandra Malik (UC Berkeley)
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Multiclass SVM and Applications in Object Classification
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
An Introduction of Support Vector Machine
Handwritten digit recognition

Classification using intersection kernel SVMs is efficient Joint work with Subhransu Maji and Alex Berg Jitendra Malik UC Berkeley.
Support Vector Machines
1 Lecture 5 Support Vector Machines Large-margin linear classifier Non-separable case The Kernel trick.
SVM—Support Vector Machines
Machine learning continued Image source:
Ivan Laptev IRISA/INRIA, Rennes, France September 07, 2006 Boosted Histograms for Improved Object Detection.
Lecture 31: Modern object recognition
LPP-HOG: A New Local Image Descriptor for Fast Human Detection Andy Qing Jun Wang and Ru Bo Zhang IEEE International Symposium.
Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.
Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.
Groups of Adjacent Contour Segments for Object Detection Vittorio Ferrari Loic Fevrier Frederic Jurie Cordelia Schmid.
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
Biased Normalized Cuts 1 Subhransu Maji and Jithndra Malik University of California, Berkeley IEEE Conference on Computer Vision and Pattern Recognition.
Detecting Pedestrians by Learning Shapelet Features
Fast intersection kernel SVMs for Realtime Object Detection
Discriminative and generative methods for bags of features
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Support Vector Machines (and Kernel Methods in general)
Support Vector Machines and Kernel Methods
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Prénom Nom Document Analysis: Linear Discrimination Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Learning to Detect Natural Image Boundaries Using Local Brightness, Color, and Texture Cues David R. Martin Charless C. Fowlkes Jitendra Malik.
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
Lecture 10: Support Vector Machines
Generic object detection with deformable part-based models
July 11, 2001Daniel Whiteson Support Vector Machines: Get more Higgs out of your data Daniel Whiteson UC Berkeley.
ADVANCED CLASSIFICATION TECHNIQUES David Kauchak CS 159 – Fall 2014.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 24 – Classifiers 1.
Handwritten digit recognition Jitendra Malik. Handwritten digit recognition (MNIST,USPS) LeCun’s Convolutional Neural Networks variations (0.8%, 0.6%
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
Classifiers Given a feature representation for images, how do we learn a model for distinguishing features from different classes? Zebra Non-zebra Decision.
Recognition II Ali Farhadi. We have talked about Nearest Neighbor Naïve Bayes Logistic Regression Boosting.
Distributed Representative Reading Group. Research Highlights 1Support vector machines can robustly decode semantic information from EEG and MEG 2Multivariate.
Handwritten digit recognition
Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.
Histograms of Oriented Gradients for Human Detection(HOG)
CS 1699: Intro to Computer Vision Support Vector Machines Prof. Adriana Kovashka University of Pittsburgh October 29, 2015.
Locally Linear Support Vector Machines Ľubor Ladický Philip H.S. Torr.
Hybrid Classifiers for Object Classification with a Rich Background M. Osadchy, D. Keren, and B. Fadida-Specktor, ECCV 2012 Computer Vision and Video Analysis.
In: Pattern Analysis and Machine Intelligence, IEEE Transactions on, Vol. 30, Nr. 1 (2008), p Group of Adjacent Contour Segments for Object Detection.
Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Non-separable SVM's, and non-linear classification using kernels Jakob Verbeek December 16, 2011 Course website:
Recent developments in object detection
Cascade for Fast Detection
Support Vector Machines
Object detection with deformable part-based models
Performance of Computer Vision
Lit part of blue dress and shadowed part of white dress are the same color
Digit Recognition using SVMS
Object detection as supervised classification
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Pawan Lingras and Cory Butz
A Tutorial on HOG Human Detection
Students: Meiling He Advisor: Prof. Brain Armstrong
An HOG-LBP Human Detector with Partial Occlusion Handling
Support Vector Machines
Support Vector Machines
Usman Roshan CS 675 Machine Learning
Presentation transcript:

Classification using intersection kernel SVMs is efficient Jitendra Malik UC Berkeley Joint work with Subhransu Maji and Alex Berg (CVPR’08)

Detection: Is this an X? Ask this question over and over again, varying position, scale, multiple categories…

Detection: Is this an X? Ask this question over and over again, varying position, scale, multiple categories…

Detection: Is this an X? This work Boosted decision trees + Very fast evaluation - Slow training (esp. multi-class) Linear SVM + Fast evaluation + Fast training - Low accuracy unless very good features Non-linear kernelized SVM + Better accuracy than linear . Medium training - Slow evaluation This work Ask this question over and over again, varying position, scale, multiple categories…

Support Vector Machines Linear Separators (aka. Perceptrons)

Support Vector Machines Other possible solutions

Support Vector Machines Which one is better? B1 or B2? How do you define better?

Support Vector Machines Find hyperplane maximizes the margin => B1 is better than B2

Kernel Support Vector Machines Inner Product in Hilbert Space Can Learn Non Linear Boundaries

Training Stage (+ examples) (- examples) Feature Representation Discriminative Classifier

Our Multiscale HOG-like feature Concatenate orientation histograms for each orange region. Differences from HOG: -- Hierarchy of regions -- Only performing L1 normalization once (at 16x16)

Comparison to HOG (Dalal & Triggs)

Comparison to HOG (Dalal & Triggs) Smaller Dimensional (1360 vs. 3780) Simple Implementation (Convolutions) Faster to compute + No non-local Normalization + No gaussian weighting + No color normalization

What is the Intersection Kernel? Histogram Intersection kernel between histograms a, b

What is the Intersection Kernel? Histogram Intersection kernel between histograms a, b K small -> a, b are different K large -> a, b are similar Intro. by Swain and Ballard 1991 to compare color histograms. Odone et al 2005 proved positive definiteness. Can be used directly as a kernel for an SVM. Compare to

linear SVM, Kernelized SVM, IKSVM Decision function is where: Linear: Non-linear Using Kernel Histogram Intersection Kernel

Kernelized SVMs slow to evaluate Decision function is where: Sum over all support vectors Kernel Evaluation Feature vector to evaluate Feature corresponding to a support vector l Arbitrary Kernel Histogram Intersection Kernel SVM with Kernel Cost: # Support Vectors x Cost of kernel comp. IKSVM Cost: # Support Vectors x # feature dimensions

The Trick Decision function is where: Just sort the support vector values in each coordinate, and pre-compute To evaluate, find position of in the sorted support vector values (cost: log #sv) look up values, multiply & add

The Trick #support vectors x #dimensions log( #support vectors ) x #dimensions Decision function is where: Just sort the support vector values in each coordinate, and pre-compute To evaluate, find position of in the sorted support vector values (cost: log #sv) look up values, multiply & add

The Trick 2 #support vectors x #dimensions log( #support vectors ) x #dimensions Decision function is where: For IK hi is piecewise linear, and quite smooth, blue plot. We can approximate with fewer uniformly spaced segments, red plot. Saves time & space!

The Trick 2 #support vectors x #dimensions log( #support vectors ) x #dimensions The Trick 2 constant x #dimensions Decision function is where: For IK hi is piecewise linear, and quite smooth, blue plot. We can approximate with fewer uniformly spaced segments, red plot. Saves time & space!

Timing Results reduced memory! Time to evaluate 10,000 feature vectors Linear SVM with our multi-scale Version of HOG features has worse classification perf. than Dalal & Triggs. IKSVM with our multi-scale version of HOG features beats Dalal & Triggs. Also for Daimler Chrysler data. Current Best on these datasets.

Distribution of support vector values and hi

Best Performance on Pedestrian Detection, Improve on Linear for Many Tasks Daimler Chrysler Pedestrians INRIA Pedestrians Caltech 101 with “simple features” Linear SVM 40% correct IKSVM 52% correct

Classification Errors

Results – ETHZ Dataset Dataset: Ferrari et al., ECCV 2006 255 images, over 5 classes training = half of positive images for a class + same number from the other classes (1/4 from each) testing = all other images large scale changes; extensive clutter

Results – ETHZ Dataset PAS* 65.0 89.3 72.3 80.6 64.7 76.7 Our 86.1 Beats many current techniques without any changes to our features/classification framework. Recall at 0.3 False Positive per Image Shape is an important cue (use Pb instead of OE) Method Applelogo Bottle Giraffe Mug Swan Avg PAS* 65.0 89.3 72.3 80.6 64.7 76.7 Our 86.1 81.0 62.1 78.0 100 81.4 *Ferarri et.al, IEEE PAMI - 08

Other kernels allow similar trick Decision function is where: IKSVM SVM hi are piece-wise linear, uniformly spaced piece-wise linear approx. is fast. hi not piece-wise linear, but we can still use an approximation for fast evaluation.

Conclusions Exactly evaluate IKSVM in O(n log m) as opposed to O(nm) Makes SV cascade or other ordering schemes irrelevant for intersection kernel Verified that IKSVM offers classification performance advantages over linear Approximate decision functions that decompose to a sum of functions for each coordinate (including Chi squared) Directly learn such classification functions (no SVM machinery) Generalized linear svm beats linear SVM in some applications often as good as more expensive RBF kernels Showed that relatively simple features with IKSVM beats Dalal & Triggs (linear SVM), leading to the state of the art in pedestrian detection. Applies to best Caltech 256, Pascal VOC 2007 methods.

Classification Using Intersection Kernel Support Vector Machines is efficient. Subhransu Maji and Alexander C. Berg and Jitendra Malik. Proceedings of CVPR 2008, Anchorage, Alaska, June 2008. Software and more results available at http://www.cs.berkeley.edu/~smaji/projects/fiksvm/