Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.

Slides:



Advertisements
Similar presentations
Latent SVMs for Human Detection with a Locally Affine Deformation Field Ľubor Ladický 1 Phil Torr 2 Andrew Zisserman 1 1 University of Oxford 2 Oxford.
Advertisements

Combining Detectors for Human Hand Detection Antonio Hernández, Petia Radeva and Sergio Escalera Computer Vision Center, Universitat Autònoma de Barcelona,
Jan-Michael Frahm, Enrique Dunn Spring 2013
Histograms of Oriented Gradients for Human Detection
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Appearance-based recognition & detection II Kristen Grauman UT-Austin Tuesday, Nov 11.
Ľubor Ladický1 Phil Torr2 Andrew Zisserman1
Machine learning continued Image source:
Recap: Advanced Feature Encoding Bag of Visual Words is only about counting the number of local descriptors assigned to each Voronoi region (0 th order.
Ivan Laptev IRISA/INRIA, Rennes, France September 07, 2006 Boosted Histograms for Improved Object Detection.
Computer Vision for Human-Computer InteractionResearch Group, Universität Karlsruhe (TH) cv:hci Dr. Edgar Seemann 1 Computer Vision: Histograms of Oriented.
Lecture 31: Modern object recognition
Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.
Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs CVPR 2005 Another Descriptor.
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/10/12.
Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.
Detecting Pedestrians by Learning Shapelet Features
More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.
DISCRIMINATIVE DECORELATION FOR CLUSTERING AND CLASSIFICATION ECCV 12 Bharath Hariharan, Jitandra Malik, and Deva Ramanan.
Discriminative and generative methods for bags of features
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
Object Detection using Histograms of Oriented Gradients
The SIFT (Scale Invariant Feature Transform) Detector and Descriptor
Lecture 29: Recent work in recognition CS4670: Computer Vision Noah Snavely.
Discriminative and generative methods for bags of features
Generic object detection with deformable part-based models
Object Recognizing. Recognition -- topics Features Classifiers Example ‘winning’ system.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 24 – Classifiers 1.
Visual Object Recognition
Object Detection with Discriminatively Trained Part Based Models
Pedestrian Detection and Localization
Representations for object class recognition David Lowe Department of Computer Science University of British Columbia Vancouver, Canada Sept. 21, 2006.
Classifiers Given a feature representation for images, how do we learn a model for distinguishing features from different classes? Zebra Non-zebra Decision.
Deformable Part Model Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 11 st, 2013.
Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.
Recognition II Ali Farhadi. We have talked about Nearest Neighbor Naïve Bayes Logistic Regression Boosting.
Lecture 27: Recognition Basics CS4670/5670: Computer Vision Kavita Bala Slides from Andrej Karpathy and Fei-Fei Li
Project 3 Results.
Object detection, deep learning, and R-CNNs
Histograms of Oriented Gradients for Human Detection(HOG)
Methods for classification and image representation
Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs CVPR 2005 Another Descriptor.
Sean M. Ficht.  Problem Definition  Previous Work  Methods & Theory  Results.
CS 1699: Intro to Computer Vision Support Vector Machines Prof. Adriana Kovashka University of Pittsburgh October 29, 2015.
CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.
Object Detection Overview Viola-Jones Dalal-Triggs Deformable models Deep learning.
Discriminative Sub-categorization Minh Hoai Nguyen, Andrew Zisserman University of Oxford 1.
Improved Object Detection
Object Recognizing. Object Classes Individual Recognition.
CS 2750: Machine Learning Support Vector Machines Prof. Adriana Kovashka University of Pittsburgh February 17, 2016.
More sliding window detection: Discriminative part-based models
Object Recognizing. Object Classes Individual Recognition.
Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.
Week 4: 6/6 – 6/10 Jeffrey Loppert. This week.. Coded a Histogram of Oriented Gradients (HOG) Feature Extractor Extracted features from positive and negative.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Cascade for Fast Detection
Object detection with deformable part-based models
Data Driven Attributes for Action Detection
Performance of Computer Vision
Lit part of blue dress and shadowed part of white dress are the same color
Object detection, deep learning, and R-CNNs
Object detection as supervised classification
A Tutorial on HOG Human Detection
HOGgles Visualizing Object Detection Features
An HOG-LBP Human Detector with Partial Occlusion Handling
CS 2750: Machine Learning Support Vector Machines
Brief Review of Recognition + Context
Presentation transcript:

Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely

Object detection: where are we? Incredible progress in the last ten years Better features, better models, better learning methods, better datasets Combination of science and hacks Credit: Flickr user neilalderney123neilalderney123

Vision Contests PASCAL VOC Challenge 20 categories Annual classification, detection, segmentation, … challenges

Machine learning for object detection What features do we use? – intensity, color, gradient information, … Which machine learning methods? – generative vs. discriminative – k-nearest neighbors, boosting, SVMs, … What hacks do we need to get things working?

Histogram of Oriented Gradients (HoG) [Dalal and Triggs, CVPR 2005] 10x10 cells 20x20 cells HoGify

Histogram of Oriented Gradients (HoG)

Like SIFT (Scale Invariant Feature Transform), but… – Sampled on a dense, regular grid – Gradients are contrast normalized in overlapping blocks 10x10 cells 20x20 cells HoGify [Dalal and Triggs, CVPR 2005]

Histogram of Oriented Gradients (HoG) First used for application of person detection [Dalal and Triggs, CVPR 2005] Cited since in thousands of computer vision papers

Linear classifiers Find linear function to separate positive and negative examples Which line is best? [slide credit: Kristin Grauman]

Support Vector Machines (SVMs) Discriminative classifier based on optimal separating line (for 2D case) Maximize the margin between the positive and negative training examples [slide credit: Kristin Grauman]

Support vector machines Want line that maximizes the margin. Margin Support vectors C. Burges, A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and Knowledge Discovery, 1998A Tutorial on Support Vector Machines for Pattern Recognition For support, vectors, wx+b=-1 wx+b=0 wx+b=1 [slide credit: Kristin Grauman]

Person detection, ca Represent each example with a single, fixed HoG template 2.Learn a single [linear] SVM as a detector (average gradient image, max positive weight, max negative weight)

Positive and negative examples + thousands more… + millions more…

HoG templates for person detection

Detection Run detector as a sliding window over an image, at a range of different scales Non-maxima suppression

Person detection with HoG & linear SVM [Dalal and Triggs, CVPR 2005]

Are we done? Single, rigid template usually not enough to represent a category – Many objects (e.g. humans) are articulated, or have parts that can vary in configuration – Many object categories look very different from different viewpoints, or from instance to instance

Difficulty of representing positive instances Discriminative methods have proven very powerful But linear SVM on HoG templates not sufficient? Alternatives: – Parts-based models [Felzenszwalb et al. CVPR 2008] – Latent SVMs [Felzenszwalb et al. CVPR 2008]

Parts-based models Felzenszwalb, et al., Discriminatively Trained Deformable Part Models,

Latent SVMs Rather than training a single linear SVM separating positive examples… … cluster positive examples into “components” and train a classifier for each (using all negative examples)

Two-component bicycle model “side” component “frontal” component

Six-component car model root filters (coarse) part filters (fine)deformation models side view frontal view

Six-component person model