Locality-constrained Linear Coding for Image Classification

Slides:



Advertisements
Similar presentations
Sparse Coding and Its Extensions for Visual Recognition
Advertisements

Improving the Fisher Kernel for Large-Scale Image Classification Florent Perronnin, Jorge Sanchez, and Thomas Mensink, ECCV 2010 VGG reading group, January.
Aggregating local image descriptors into compact codes
Automatic Image Annotation Using Group Sparsity
Visual Dictionaries George Papandreou CVPR 2014 Tutorial on BASIS
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Multi-layer Orthogonal Codebook for Image Classification Presented by Xia Li.
CS395: Visual Recognition Spatial Pyramid Matching Heath Vinicombe The University of Texas at Austin 21 st September 2012.
Patch-based Image Deconvolution via Joint Modeling of Sparse Priors Chao Jia and Brian L. Evans The University of Texas at Austin 12 Sep
1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Ziming Zhang *, Ze-Nian Li, Mark Drew School of Computing Science, Simon Fraser University, Vancouver, B.C., Canada {zza27, li, Learning.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
Nonlinear Unsupervised Feature Learning How Local Similarities Lead to Global Coding Amirreza Shaban.
Robust Object Tracking via Sparsity-based Collaborative Model
Object-centric spatial pooling for image classification Olga Russakovsky, Yuanqing Lin, Kai Yu, Li Fei-Fei ECCV 2012.
Large-Scale, Real-World Face Recognition in Movie Trailers Week 2-3 Alan Wright (Facial Recog. pictures taken from Enrique Gortez)
Empowering visual categorization with the GPU Present by 陳群元 我是強壯 !
Lecture 28: Bag-of-words models
Image Classification using Sparse Coding: Advanced Topics
Global and Efficient Self-Similarity for Object Classification and Detection CVPR 2010 Thomas Deselaers and Vittorio Ferrari.
Review: Intro to recognition Recognition tasks Machine learning approach: training, testing, generalization Example classifiers Nearest neighbor Linear.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
School of Electronic Information Engineering, Tianjin University Human Action Recognition by Learning Bases of Action Attributes and Parts Jia pingping.
Bag-of-Words based Image Classification Joost van de Weijer.
Project 2 SIFT Matching by Hierarchical K-means Quantization
A Thousand Words in a Scene P. Quelhas, F. Monay, J. Odobez, D. Gatica-Perez and T. Tuytelaars PAMI, Sept
Fast Approximate Energy Minimization via Graph Cuts
Nonparametric Part Transfer for Fine-grained Recognition Presenter Byungju Kim.
Svetlana Lazebnik, Cordelia Schmid, Jean Ponce
Group Sparse Coding Samy Bengio, Fernando Pereira, Yoram Singer, Dennis Strelow Google Mountain View, CA (NIPS2009) Presented by Miao Liu July
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
Deformable Part Model Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 11 st, 2013.
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.
IIIT Hyderabad Document Image Retrieval using Bag of Visual Words Model Ravi Shekhar CVIT, IIIT Hyderabad Advisor : Prof. C.V. Jawahar.
Bag-of-Words based Image Classification (week I) Joost van de Weijer.
Nonlinear Learning Using Local Coordinate Coding K. Yu, T. Zhang and Y. Gong, NIPS 2009 Improved Local Coordinate Coding Using Local Tangents K. Yu and.
Smooth Side-Match Classified Vector Quantizer with Variable Block Size IEEE Transaction on image processing, VOL. 10, NO. 5, MAY 2001 Department of Applied.
P ROBING THE L OCAL -F EATURE S PACE OF I NTEREST P OINTS Wei-Ting Lee, Hwann-Tzong Chen Department of Computer Science National Tsing Hua University,
Hierarchical Matching with Side Information for Image Classification
A Multiresolution Symbolic Representation of Time Series Vasileios Megalooikonomou Qiang Wang Guo Li Christos Faloutsos Presented by Rui Li.
Unsupervised Auxiliary Visual Words Discovery for Large-Scale Image Object Retrieval Yin-Hsi Kuo1,2, Hsuan-Tien Lin 1, Wen-Huang Cheng 2, Yi-Hsuan Yang.
Local Naïve Bayes Nearest Neighbor for image classification Scancho McCann David G.Lowe University of British Columbia 2012 CVPR WonJun Na.
Locally Linear Support Vector Machines Ľubor Ladický Philip H.S. Torr.
Regionlets for Generic Object Detection IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 37, NO. 10, OCTOBER 2015 Xiaoyu Wang, Ming.
From Dictionary of Visual Words to Subspaces: Locality-constrained Affine Subspace Coding (LASC) Peihua Li, Xiaoxiao Lu, Qilong Wang Presented by Peihua.
Jianchao Yang, John Wright, Thomas Huang, Yi Ma CVPR 2008 Image Super-Resolution as Sparse Representation of Raw Image Patches.
ItemBased Collaborative Filtering Recommendation Algorithms 1.
Facial Smile Detection Based on Deep Learning Features Authors: Kaihao Zhang, Yongzhen Huang, Hong Wu and Liang Wang Center for Research on Intelligent.
NICTA SML Seminar, May 26, 2011 Modeling spatial layout for image classification Jakob Verbeek 1 Joint work with Josip Krapac 1 & Frédéric Jurie 2 1: LEAR.
Deeply learned face representations are sparse, selective, and robust
Presented by David Lee 3/20/2006
Reporter :Liang, Min-Chen Date :2003/04/15
Learning Mid-Level Features For Recognition
Nonparametric Semantic Segmentation
ICCV Hierarchical Part Matching for Fine-Grained Image Classification
Digit Recognition using SVMS
Speaker: Lingxi Xie Authors: Lingxi Xie, Qi Tian, Bo Zhang
Mean transform , a tutorial
CVPR 2014 Orientational Pyramid Matching for Recognizing Indoor Scenes
Source :Journal of visual Communication and Image Representation
Multiple Feature Learning for Action Classification
Sparselet Models for Efficient Multiclass Object Detection
Outline Background Motivation Proposed Model Experimental Results
Dynamic embedding strategy of VQ-based information hiding approach
A Graph-Matching Kernel for Object Categorization
A Self-Reference Watermarking Scheme Based on Wet Paper Coding
Predictive Grayscale Image Coding Scheme Using VQ and BTC
Presentation transcript:

Locality-constrained Linear Coding for Image Classification Presenter : Han-Mu Park

Contents Introduction Coding methods Proposed method Locality-constrained Linear Coding for Image Classification, CVPR 2010 Contents Introduction Coding methods Proposed method Experimental results Conclusion References

Introduction Bag-of-Words (BoW) model Locality-constrained Linear Coding for Image Classification, CVPR 2010 Introduction Bag-of-Words (BoW) model An image is represented as a collection of visual words. Generally, to represent the collection, histogram of words form is used.

Introduction General Spatial Pyramid Matching frameworks Locality-constrained Linear Coding for Image Classification, CVPR 2010 Introduction General Spatial Pyramid Matching frameworks Feature extraction SIFT HOG etc Coding Vector Quantization Sparse coding Pooling Max pooling Sum pooling Spatial Pyramid Matching framework [J.Wang2010]

Introduction General Spatial Pyramid Matching frameworks Locality-constrained Linear Coding for Image Classification, CVPR 2010 Introduction General Spatial Pyramid Matching frameworks Spatial Pyramid Matching framework [J.Wang2010]

Coding methods Coding methods Vector quantization (VQ) Locality-constrained Linear Coding for Image Classification, CVPR 2010 Coding methods Coding methods Vector quantization (VQ) Sparce coding (SC) Locality-constrained Linear Coding (LLC) [J.Wang2010]

Coding methods Vector quantization (VQ) Hard quantization method Locality-constrained Linear Coding for Image Classification, CVPR 2010 Coding methods Vector quantization (VQ) Hard quantization method A set of 𝐷-dimensional local descriptors 𝑋= 𝑥 1 , 𝑥 2 , …,𝑥 𝑁 ∈ 𝑅 𝐷×𝑁 Codebook with 𝑀 entries 𝐵= 𝑏 1 , 𝑏 2 ,…, 𝑏 𝑀 ∈ 𝑅 𝐷×𝑀 Objective function 𝑎𝑟𝑔 min 𝐶 𝑖=1 𝑁 𝑥 𝑖 −𝐵 𝑐 𝑖 2 𝑠.𝑡. 𝑐 𝑖 𝑙 0 =1, 𝑐 𝑖 𝑙 1 =1, 𝑐 𝑖 ≽0, ∀𝑖 Where 𝐶= 𝑐 1 , 𝑐 2 ,…, 𝑐 𝑁 is the set of codes for X

Coding methods Sparse coding (SC) Soft quantization method Locality-constrained Linear Coding for Image Classification, CVPR 2010 Coding methods Sparse coding (SC) Soft quantization method Relaxed the cardinality constraint Objective function 𝑎𝑟𝑔 min 𝐶 𝑖=1 𝑁 𝑥 𝑖 −𝐵 𝑐 𝑖 2 +𝜆 𝑐 𝑖 𝑙 1 The roles of sparsity regularization term Because the codebook 𝐵 is usually over-complete 𝑀>𝐷 , it is necessary to ensure that the under-determined system has a unique solution. Sparsity allows the learned representation to capture salient patterns of local descriptors. The sparse coding can achieve much less quantization error than VQ.

Proposed method Locality-constrained Linear Coding (LLC) Locality-constrained Linear Coding for Image Classification, CVPR 2010 Proposed method Locality-constrained Linear Coding (LLC) Replaced the sparsity regularization term with new constraint. Objective function 𝑎𝑟𝑔 min 𝐶 𝑖=1 𝑁 𝑥 𝑖 −𝐵 𝑐 𝑖 2 + 𝜆 𝑑 𝑖 ⊙ 𝑐 𝑖 2 𝑠.𝑡. 1 𝑇 𝑐 𝑖 =1, ∀𝑖 ⊙ : the element-wise multiplication 𝑑 𝑖 ∈ 𝑅 𝑀 𝑑 𝑖 = exp 𝑑𝑖𝑠𝑡 𝑥 𝑖 ,𝐵 𝜎 Where 𝑑𝑖𝑠𝑡 𝑥 𝑖 ,𝐵 = 𝑑𝑖𝑠𝑡 𝑥 𝑖 , 𝑏 1 ,𝑑𝑖𝑠𝑡 𝑥 𝑖 , 𝑏 2 ,…,𝑑𝑖𝑠𝑡 𝑥 𝑖 , 𝑏 𝑀 𝑇

Proposed method Properties of LLC Better reconstruction Locality-constrained Linear Coding for Image Classification, CVPR 2010 Proposed method Properties of LLC Better reconstruction Because LLC represents each descriptor by using multiple weighted bases (codewords), it has less reconstruction error than VQ. Local smooth sparsity Because the regularization term of 𝑙 1 𝑛𝑜𝑟𝑚 in SC is not smooth, therefore, SC loses correlations between codes. Analytical solution The solution of LLC can be derived analytically by 𝑐 𝑖 = 1 𝐶 𝑖 +𝜆𝑑𝑖𝑎𝑔 𝑑 𝑐 𝑖 = 𝑐 𝑖 1 𝑇 𝑐 𝑖 Where 𝐶 𝑖 = 𝐵−1 𝑥 𝑖 𝑇 𝐵−1 𝑥 𝑖 𝑇 𝑇

Proposed method Approximated LLC for fast encoding Locality-constrained Linear Coding for Image Classification, CVPR 2010 Proposed method Approximated LLC for fast encoding The LLC selects the local bases for each descriptor to form a local coordinate system. To speedup the encoding process, authors used 𝐾 (𝐾<𝐷<𝑀) nearest neighbors of 𝑥 𝑖 as the local bases 𝐵 𝑖 , and solve a much smaller linear system to get the codes min 𝐶 𝑖=1 𝑁 𝑥 𝑖 − 𝑐 𝑖 𝐵 𝑖 2 𝑠.𝑡. 1 𝑇 𝑐 𝑖 =1, ∀𝑖 The reduced computation complexity 𝑂 𝑀 2 ⟶𝑂 𝑀+ 𝐾 2 , where 𝐾≪𝑀

Proposed method Codebook optimization Locality-constrained Linear Coding for Image Classification, CVPR 2010 Proposed method Codebook optimization To improve the accuracy, authors trained the codebook to optimize for LLC codes. The optimal codebook 𝐵 ∗ can be obtained by 𝑎𝑟𝑔 min 𝐶,𝐵 𝑖=1 𝑁 𝑥 𝑖 −𝐵 𝑐 𝑖 2 +𝜆 𝑑 𝑖 ⊙ 𝑐 𝑖 2 𝑠.𝑡. 1 𝑇 𝑐 𝑖 =1, ∀𝑖 𝑏 𝑗 2 ≤1, ∀𝑗 This can be solved by using Coordinate Descent method. However, because the number of training descriptors 𝑁 is usually very large, the huge memory space is needed to solve that problem.

Proposed method Incremental codebook optimization Locality-constrained Linear Coding for Image Classification, CVPR 2010 Proposed method Incremental codebook optimization First, initialize 𝐵 by using K-means clustering. Then loop through all the training descriptors to update 𝐵 incrementally. In each iteration, we take in a single (or a small set of) examples 𝑥 𝑖 , and solve original objective function to obtain the corresponding LLC codes. [J.Wang2010]

Proposed method Incremental codebook optimization Locality-constrained Linear Coding for Image Classification, CVPR 2010 Proposed method Incremental codebook optimization Then select bases 𝐵 𝑖 whose corresponding weights are larger than predefined threshold, and refit 𝑥 𝑖 without the locality constraint. The obtained code is used to update the basis in a gradient descent fashion. [J.Wang2010]

Experimental results Performance of codebook Locality-constrained Linear Coding for Image Classification, CVPR 2010 Experimental results Performance of codebook

Experimental results Performance under different neighbors Locality-constrained Linear Coding for Image Classification, CVPR 2010 Experimental results Performance under different neighbors

Experimental results Results using Pascal VOC 2007 Locality-constrained Linear Coding for Image Classification, CVPR 2010 Experimental results Results using Pascal VOC 2007

Conclusion Contribution Locality-constrained Linear Coding for Image Classification, CVPR 2010 Conclusion Contribution In this paper, the Locality-constrained Linear Coding (LCC) method is proposed Better reconstruction Local smooth sparsity Analytical solution For speedup, K-nearest neighbors algorithm is used. To optimize the accuracy, incremental codebook learning is proposed for LCC.

Locality-constrained Linear Coding for Image Classification, CVPR 2010 References [1] J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, Y. Gong, “Locality-constrained Linear Coding for Image Classification,” CVPR 2010.

Thank you!