Carl Vondrick, Aditya Khosla, Tomasz Malisiewicz, Antonio Torralba Massachusetts Institute of Technology

Slides:

Advertisements

Similar presentations

Semantic Contours from Inverse Detectors Bharath Hariharan et.al. (ICCV-11)

Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

Location Recognition Given: A query image A database of images with known locations Two types of approaches: Direct matching: directly match image features.

Aggregating local image descriptors into compact codes

Zhimin CaoThe Chinese University of Hong Kong Qi YinITCS, Tsinghua University Xiaoou TangShenzhen Institutes of Advanced Technology Chinese Academy of.

EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.

Computer Vision Lecture 18: Object Recognition II

Patch to the Future: Unsupervised Visual Prediction

From Virtual to Reality: Fast Adaptation of Virtual Object Detectors to Real Domains Baochen Sun and Kate Saenko UMass Lowell.

HOGgles: Visualizing Object Detection Features (to be appeared in ICCV 2013) Carl Vondrick Aditya Khosla Tomasz Malisiewicz and Antonio Torralba,MIT Presented.

Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.

Uncertainty Representation. Gaussian Distribution variance Standard deviation.

Knowing a Good HOG Filter When You See It: Efficient Selection of Filters for Detection Ejaz Ahmed 1, Gregory Shakhnarovich 2, and Subhransu Maji 3 1 University.

Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.

Fast intersection kernel SVMs for Realtime Object Detection

Problem: SVM training is expensive – Mining for hard negatives, bootstrapping Solution: LDA (Linear Discriminant Analysis). – Extremely fast training,

Landmark Classification in Large- scale Image Collections Yunpeng Li David J. Crandall Daniel P. Huttenlocher ICCV 2009.

Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.

HCI Final Project Robust Real Time Face Detection Paul Viola, Michael Jones, Robust Real-Time Face Detetion, International Journal of Computer Vision,

An opposition to Window- Scanning Approaches in Computer Vision Presented by Tomasz Malisiewicz March 6, 2006 Advanced The Robotics Institute.

Implementing a reliable neuro-classifier

Object Class Recognition Using Discriminative Local Features Gyuri Dorko and Cordelia Schmid.

5/30/2006EE 148, Spring Visual Categorization with Bags of Keypoints Gabriella Csurka Christopher R. Dance Lixin Fan Jutta Willamowski Cedric Bray.

COS 429 PS5: Finding Nemo. Exemplar -SVM Still a rigid template,but train a separate SVM for each positive instance For each category it can has exemplar.

Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.

Computer vision: models, learning and inference Chapter 6 Learning and Inference in Vision.

Wang, Z., et al. Presented by: Kayla Henneman October 27, 2014 WHO IS HERE: LOCATION AWARE FACE RECOGNITION.

Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.

CS55 Tianfan Xue Adviser: Bo Zhang, Jianmin Li.

SVCL Automatic detection of object based Region-of-Interest for image compression Sunhyoung Han.

Object Bank Presenter ： Liu Changyu Advisor ： Prof. Alex Hauptmann Interest ： Multimedia Analysis April 4 th, 2013.

General Tensor Discriminant Analysis and Gabor Features for Gait Recognition by D. Tao, X. Li, and J. Maybank, TPAMI 2007 Presented by Iulian Pruteanu.

BING: Binarized Normed Gradients for Objectness Estimation at 300fps

Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.

UNBIASED LOOK AT DATASET BIAS Antonio Torralba Massachusetts Institute of Technology Alexei A. Efros Carnegie Mellon University CVPR 2011.

1 Robust Endpoint Detection and Energy Normalization for Real-Time Speech and Speaker Recognition Qi Li, Senior Member, IEEE, Jinsong Zheng, Augustine.

Prediction of Influencers from Word Use Chan Shing Hei.

Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.

Project 3 Results.

Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.

Histograms of Oriented Gradients for Human Detection(HOG)

Data Mining, ICDM '08. Eighth IEEE International Conference on Duy-Dinh Le National Institute of Informatics Hitotsubashi, Chiyoda-ku Tokyo,

Recognition Using Visual Phrases

FACE DETECTION : AMIT BHAMARE. WHAT IS FACE DETECTION ? Face detection is computer based technology which detect the face in digital image. Trivial task.

Poselets: Body Part Detectors Trained Using 3D Human Pose Annotations ZUO ZHEN 27 SEP 2011.

Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.

Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.

A Discriminatively Trained, Multiscale, Deformable Part Model Yeong-Jun Cho Computer Vision and Pattern Recognition,2008.

Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,

1 Kernel Machines A relatively new learning methodology (1992) derived from statistical learning theory. Became famous when it gave accuracy comparable.

Zhaoxia Fu, Yan Han Measurement Volume 45, Issue 4, May 2012, Pages 650–655 Reporter: Jing-Siang, Chen.

Shape2Pose: Human Centric Shape Analysis CMPT888 Vladimir G. Kim Siddhartha Chaudhuri Leonidas Guibas Thomas Funkhouser Stanford University Princeton University.

Does one size really fit all? Evaluating classifiers in a Bag-of-Visual-Words classification Christian Hentschel, Harald Sack Hasso Plattner Institute.

Action-Grounded Push Affordance Bootstrapping of Unknown Objects

Data Driven Attributes for Action Detection

Lecture 25: Introduction to Recognition

Digit Recognition using SVMS

Object detection as supervised classification

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science

A Tutorial on HOG Human Detection

Students: Meiling He Advisor: Prof. Brain Armstrong

HOGgles Visualizing Object Detection Features

Lecture 25: Introduction to Recognition

Brief Review of Recognition + Context

Generally Discriminant Analysis

Lecture 6: Feature matching

THE ASSISTIVE SYSTEM SHIFALI KUMAR BISHWO GURUNG JAMES CHOU

Robust Feature Matching and Fast GMS Solution

Presentation transcript:

Carl Vondrick, Aditya Khosla, Tomasz Malisiewicz, Antonio Torralba Massachusetts Institute of Technology

 Introduction  Feature Visualization Algorithms  Evaluation of Visualizations  Conclusion

 Figure 1 shows a high scoring detection from an object detector with HOG features and a linear SVM classifier trained on PASCAL.

 Figure 2 shows the output from our visualization on the features for the false car detection.

 Figure 3 inverts more top detections on PASCAL for a few categories

 Figure 4 for a comparison between our visualization and HOG glyphs.

 We pose the feature visualization problem as one of feature inversion.  be an image and be the corresponding HOG feature descriptor.

 Baseline A: Exemplar LDA (ELDA) Consider the top detections for the exemplar object detector for a few images shown in Figure 5

 We use this simple observation to produce our first inversion baseline  Suppose we wish to invert HOG feature y.  We first train an exemplar LDA detector for this query

 The HOG inverse is then the average of the top K detections in RGB space

 Baseline B: Ridge Regression  We present a fast, parametric inversion baseline based off ridge regression.

 In order to invert a HOG feature y, we calculate the most likely image from the conditional Gaussian distribution

 Baseline C: Direct Optimization  We now provide a baseline that attempts to find images that, when we compute HOG on it, sufficiently match the original descriptor.  Let

 3.4. Algorithm D: Paired Dictionary Learning

 Inversion Benchmark

 Visualization Benchmark

 We believe visualizations can be a powerful tool for understanding object detection systems and advancing research in computer vision.  Since object detection researchers analyze HOG glyphs everyday and nearly every recent object detection paper includes HOG visualizations