Image Classification for Automatic Annotation

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Image Retrieval: Current Techniques, Promising Directions, and Open Issues Yong Rui, Thomas Huang and Shih-Fu Chang Published in the Journal of Visual.
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Data Mining Feature Selection. Data reduction: Obtain a reduced representation of the data set that is much smaller in volume but yet produces the same.
DIMENSIONALITY REDUCTION: FEATURE EXTRACTION & FEATURE SELECTION Principle Component Analysis.
Automatic Video Shot Detection from MPEG Bit Stream Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
São Paulo Advanced School of Computing (SP-ASC’10). São Paulo, Brazil, July 12-17, 2010 Looking at People Using Partial Least Squares William Robson Schwartz.
Discriminative and generative methods for bags of features
Content-based Video Indexing, Classification & Retrieval Presented by HOI, Chu Hong Nov. 27, 2002.
Event prediction CS 590v. Applications Video search Surveillance – Detecting suspicious activities – Illegally parked cars – Abandoned bags Intelligent.
Li-Jia Li Yongwhan Lim Li Fei-Fei Chong Wang David M. Blei B UILDING AND U SING A S EMANTIVISUAL I MAGE H IERARCHY CVPR, 2010.
Morris LeBlanc.  Why Image Retrieval is Hard?  Problems with Image Retrieval  Support Vector Machines  Active Learning  Image Processing ◦ Texture.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
MCS 2005 Round Table In the context of MCS, what do you believe to be true, even if you cannot yet prove it?
Multimedia Search and Retrieval Presented by: Reza Aghaee For Multimedia Course(CMPT820) Simon Fraser University March.2005 Shih-Fu Chang, Qian Huang,
1 Integrating User Feedback Log into Relevance Feedback by Coupled SVM for Content-Based Image Retrieval 9-April, 2005 Steven C. H. Hoi *, Michael R. Lyu.
Traditional Database Indexing Techniques for Video Database Indexing Jianping Fan Department of Computer Science University of North Carolina at Charlotte.
Presentation in IJCNN 2004 Biased Support Vector Machine for Relevance Feedback in Image Retrieval Hoi, Chu-Hong Steven Department of Computer Science.
Presented by Zeehasham Rasheed
CS292 Computational Vision and Language Visual Features - Colour and Texture.
Latent Semantic Analysis (LSA). Introduction to LSA Learning Model Uses Singular Value Decomposition (SVD) to simulate human learning of word and passage.
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
DVMM Lab, Columbia UniversityVideo Event Recognition Video Event Recognition: Multilevel Pyramid Matching Dong Xu and Shih-Fu Chang Digital Video and Multimedia.
Information Retrieval in Practice
MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.
Aardvark Anatomy of a Large-Scale Social Search Engine.
Exploiting Ontologies for Automatic Image Annotation M. Srikanth, J. Varner, M. Bowden, D. Moldovan Language Computer Corporation
Marcin Marszałek, Ivan Laptev, Cordelia Schmid Computer Vision and Pattern Recognition, CVPR Actions in Context.
A Comparison Between Bayesian Networks and Generalized Linear Models in the Indoor/Outdoor Scene Classification Problem.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
Experiments Test different parking lot images captured in different luminance conditions The test samples include 1300 available parking spaces and 1500.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
IBM QBIC: Query by Image and Video Content Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC 28223
A Two-level Pose Estimation Framework Using Majority Voting of Gabor Wavelets and Bunch Graph Analysis J. Wu, J. M. Pedersen, D. Putthividhya, D. Norgaard,
Ensemble Learning Spring 2009 Ben-Gurion University of the Negev.
Chapter 4: Pattern Recognition. Classification is a process that assigns a label to an object according to some representation of the object’s properties.
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
Hierarchical Classification
Challenges in Mining Large Image Datasets Jelena Tešić, B.S. Manjunath University of California, Santa Barbara
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
An Approximate Nearest Neighbor Retrieval Scheme for Computationally Intensive Distance Measures Pratyush Bhatt MS by Research(CVIT)
DDM Kirk. LSST-VAO discussion: Distributed Data Mining (DDM) Kirk Borne George Mason University March 24, 2011.
Data Mining, ICDM '08. Eighth IEEE International Conference on Duy-Dinh Le National Institute of Informatics Hitotsubashi, Chiyoda-ku Tokyo,
Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Yixin Chen and James Z. Wang The Pennsylvania State University
Data Mining and Decision Support
Multimedia Analytics Jianping Fan Department of Computer Science University of North Carolina at Charlotte.
Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC Relevance Feedback for Image Retrieval.
Next, this study employed SVM to classify the emotion label for each EEG segment. The basic idea is to project input data onto a higher dimensional feature.
1 A Methodology for automatic retrieval of similarly shaped machinable components Mark Ascher - Dept of ECE.
Mustafa Gokce Baydogan, George Runger and Eugene Tuv INFORMS Annual Meeting 2011, Charlotte A Bag-of-Features Framework for Time Series Classification.
Adaboost (Adaptive boosting) Jo Yeong-Jun Schapire, Robert E., and Yoram Singer. "Improved boosting algorithms using confidence- rated predictions."
Feature learning for multivariate time series classification Mustafa Gokce Baydogan * George Runger * Eugene Tuv † * Arizona State University † Intel Corporation.
Digital Video Library - Jacky Ma.
Automatic Video Shot Detection from MPEG Bit Stream
Supervised Time Series Pattern Discovery through Local Importance
Unsupervised Riemannian Clustering of Probability Density Functions
Semantic Video Classification
Personalized Social Image Recommendation
Machine Learning Basics
Outline Texture modeling - continued Filtering-based approaches.
Project Implementation for ITCS4122
Multimedia Information Retrieval
Objects as Attributes for Scene Classification
Presentation transcript:

Image Classification for Automatic Annotation Jianping Fan Department of Computer Science University of North Carolina at Charlotte http://www.cs.uncc.edu/~jfan

Feature Extraction Object-Based Approach Visual Features Input Image Salient Objects Color histogram, Tamura texture, Locations ……. Visual Features

Feature Extraction Image-Based Approach

Image Representation Images Salient Objects Color Color histogram Tamura Texture Wavelet Texture histogram Shape Interest Point set

Feature-Based Image Representation dimension Feature dimension Feature dimension Curse of Dimensionality

Feature-Based Image Representation Tree-Based Database Indexing

Feature-Based Image Representation Tree-Based Database Indexing Nearest Neighbor Search Overlapping between Different nodes What’s the solution?

Feature-Based Image Representation Where idea for tree-based indexing come from? Library: 12,000,000 books

Feature-Based Image Representation Books in Library Natural Sciences Social Sciences Computer Science Electrical Engineering Dancing Computer Languages Researches I get it! Database Multimedia Too easy! 11!

Feature-Based Image Representation What is the solution? Let's go back to pinciple! Concept Hierarchy or Ontology

Hierarchical Concept Organization Concept Ontology

Hierarchical Concept Organization Concept Ontology

6. Classifier Training for Atomic Image Concepts Beach Atomic Image Concept Different Patterns of Co-Appearances of Salient Objects Water & Sky Water & Sand Water, Sand, Sky, … Feature Subset 9 Feature Subset 1 Feature Subset 2 Feature Subsets

6. Classifier Training for Atomic Image Concepts Curse of Dimensionality # samples needed increase with # dimensions (generally exponentially) . Human labeling is expensive Some features are redundant Proposal Joint SVM boosting and feature selection

Subspace 1 Subspace 2 Subspace 3 … Subspace N Higher Level Classifier Boosting SVM Classifier Training PCA PCA PCA PCA High dimensional feature space Subspace 1 Subspace 2 Subspace 3 … Subspace N SVM SVM SVM SVM Low-level classifiers … Weak Classifier 1 Weak Classifier 2 Weak Classifier 3 Weak Classifier N Boosting for optimal combination Higher Level Classifier High-level classifier Less training samples due to dimension reduction Reuse training results on low-level concepts More selection opportunities compared to filter and wrapper

6. Classifier Training for Atomic Image Concepts Kernel-Based Data Warping Kernel Function:

6. Classifier Training for Atomic Image Concepts Kernel for Color Histogram Statistical Image Similarity Kernel

6. Classifier Training for Atomic Image Concepts Wavelet Filter Bank Kernel

6. Classifier Training for Atomic Image Concepts Wavelet Filter Bank Kernel

6. Classifier Training for Atomic Image Concepts Interest Point Matching Kernel

6. Classifier Training for Atomic Image Concepts Interest Point Matching Kernel

6. Classifier Training for Atomic Image Concepts Multiple Kernel Learning SVM Image Classifier

6. Classifier Training for Atomic Image Concepts Dual Problem Subject to:

6. Classifier Training for Atomic Image Concepts

6. Classifier Training for Atomic Image Concepts Some Results Beach Scene Garden Scene

High-Level Image Concept Modeling Inter-Concept Similarity Modeling Nature Scene Flower View Garden Beach Nature Scene: Larger Hypothesis Space & Large Variations of Visual Properties! Garden, Beach, Flower view: Different but share common visual properties!

7. Classifier Training for High-Level Image Concepts Challenging Problems Error Transmission Problems Training Cost Issue Knowledge Transferability and Task Relatedness Exploitation

7. Classifier Training for High-Level Image Concepts Error Transmission Problem The classifiers for low-level image concepts cannot recover the errors for the classifiers of high-level image concepts!

7. Classifier Training for High-Level Image Concepts Error Transmission Problem Outdoor Flower View Garden Beach Errors for the classifiers of atomic image concepts may be transmitted to the classifiers for the high-level image concepts!

7. Classifier Training for High-Level Image Concepts Training Cost Issue Multiple Hypotheses outdoor garden flower view beach Large Diversity of Contents

7. Classifier Training for High-Level Image Concepts Knowledge Transferability & Task Relatedness Exploitation Outdoor Flower View Garden Beach They are different but strongly related!

7. Classifier Training for High-Level Image Concepts Multi-Task Learning Which tasks are strongly related? How to quantify the task relatedness? How to integrate such task relatedness for training large-scale related image classifiers?

7. Classifier Training for High-Level Image Concepts Related Learning Tasks Nature Scene Flower View Garden Beach They are different but strongly related! Concept Ontology can provide a good environment for multi-task learning!

7. Classifier Training for High-Level Image Concepts Related Learning Tasks

7. Classifier Training for High-Level Image Concepts Relatedness Modelling outdoor garden flower view beach : Common Prediction Structure

7. Classifier Training for High-Level Image Concepts Joint Objective Function Subject to:

7. Classifier Training for High-Level Image Concepts Dual Problem Subject to:

7. Classifier Training for High-Level Image Concepts Biased Classifier Training Dual Problem Subject to:

7. Classifier Training for High-Level Image Concepts Common Prediction Structure Nature Scene Flower View Garden Beach Common Visual Properties

7. Classifier Training for High-Level Image Concepts Hierarchical Boosting

7. Classifier Training for High-Level Image Concepts Biased Classifier for Parent Node

7. Classifier Training for High-Level Image Concepts Hierarchical Boosting to Generate Classifier for Parent Node

7. Classifier Training for High-Level Image Concepts Performance Evaluation

7. Classifier Training for High-Level Image Concepts Performance Evaluation

7. Classifier Training for High-Level Image Concepts Advantages of Hierarchical Boosting Handling inter-concept similarity via multi-task learning Reducing training cost Enhancing discrimination power of the classifiers

8. Hierarchical Image Classification Overall Probability Parent Node Path 1 Path 2 Path C Children Node 1 Children Node 2 Children Node C

8. Hierarchical Image Classification Some Results

10. Query Result Evaluation Allow Users to See Global View!

10. Query Result Evaluation Allow Users to See Similarity Direction!

10. Query Result Evaluation Allow Users to Zoom into Images of Interest!

10. Query Result Evaluation: Red Flower Allow Users to Select Query Example Interactively!

10. Query Result Evaluation: Sunset Allow Users to Look for Particular Images!

11. Training Image Observation

11. Training Image Observation

11. Training Image Observation