Presentation in IJCNN 2004 Biased Support Vector Machine for Relevance Feedback in Image Retrieval Hoi, Chu-Hong Steven Department of Computer Science.

Slides:



Advertisements
Similar presentations
Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos.
Advertisements

Image Retrieval With Relevant Feedback Hayati Cam & Ozge Cavus IMAGE RETRIEVAL WITH RELEVANCE FEEDBACK Hayati CAM Ozge CAVUS.
Ranking Multimedia Databases via Relevance Feedback with History and Foresight Support / 12 I9 CHAIR OF COMPUTER SCIENCE 9 DATA MANAGEMENT AND EXPLORATION.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Search Engines Information Retrieval in Practice All slides ©Addison Wesley, 2008.
Multimedia Database Systems Relevance Feedback Department of Informatics Aristotle University of Thessaloniki Fall 2008.
Image Information Retrieval Shaw-Ming Yang IST 497E 12/05/02.
LPP-HOG: A New Local Image Descriptor for Fast Human Detection Andy Qing Jun Wang and Ru Bo Zhang IEEE International Symposium.
Detecting Cartoons a Case Study in Automatic Video-Genre Classification Tzvetanka Ianeva Arjen de Vries Hein Röhrig.
Discriminative and generative methods for bags of features
Learning similarity measure for natural image retrieval with relevance feedback Reporter: Francis 2005/3/3.
Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin Nov
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Fuzzy Support Vector Machines (FSVMs) Weijia Wang, Huanren Zhang, Vijendra Purohit, Aditi Gupta.
1 Content Based Image Retrieval Using MPEG-7 Dominant Color Descriptor Student: Mr. Ka-Man Wong Supervisor: Dr. Lai-Man Po MPhil Examination Department.
1 Learning on Relevance Feedback in Content-based Image Retrieval Hoi, Chu-Hong ( Steven ) Supervisor: Prof. Michael R. Lyu Venue:RM1027 Date: 11:00a.m.
Morris LeBlanc.  Why Image Retrieval is Hard?  Problems with Image Retrieval  Support Vector Machines  Active Learning  Image Processing ◦ Texture.
NCKU CSIE Visualization & Layout for Image Libraries Baback Moghaddam, Qi Tian IEEE Int’l Conf. on CVPR 2001 Speaker: 蘇琬婷.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
ACM Multimedia th Annual Conference, October , 2004
Expectation Maximization Method Effective Image Retrieval Based on Hidden Concept Discovery in Image Database By Sanket Korgaonkar Masters Computer Science.
Support Vector Machines Pattern Recognition Sergios Theodoridis Konstantinos Koutroumbas Second Edition A Tutorial on Support Vector Machines for Pattern.
Relevance Feedback based on Parameter Estimation of Target Distribution K. C. Sia and Irwin King Department of Computer Science & Engineering The Chinese.
SVM Active Learning with Application to Image Retrieval
1 Integrating User Feedback Log into Relevance Feedback by Coupled SVM for Content-Based Image Retrieval 9-April, 2005 Steven C. H. Hoi *, Michael R. Lyu.
A novel log-based relevance feedback technique in content- based image retrieval Reporter: Francis 2005/6/2.
Image Categorization by Learning and Reasoning with Regions Yixin Chen, University of New Orleans James Z. Wang, The Pennsylvania State University Published.
1 An Empirical Study on Large-Scale Content-Based Image Retrieval Group Meeting Presented by Wyman
Dept. of Computer Science & Engineering, CUHK Pseudo Relevance Feedback with Biased Support Vector Machine in Multimedia Retrieval Steven C.H. Hoi 14-Oct,
SVMs Finalized. Where we are Last time Support vector machines in grungy detail The SVM objective function and QP Today Last details on SVMs Putting it.
A Study of the Relationship between SVM and Gabriel Graph ZHANG Wan and Irwin King, Multimedia Information Processing Laboratory, Department of Computer.
Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin The Chinese.
SVMs, cont’d Intro to Bayesian learning. Quadratic programming Problems of the form Minimize: Subject to: are called “quadratic programming” problems.
Optimizing Learning with SVM Constraint for Content-based Image Retrieval* Steven C.H. Hoi 1th March, 2004 *Note: The copyright of the presentation material.
SIEVE—Search Images Effectively through Visual Elimination Ying Liu, Dengsheng Zhang and Guojun Lu Gippsland School of Info Tech,
Jinhui Tang †, Shuicheng Yan †, Richang Hong †, Guo-Jun Qi ‡, Tat-Seng Chua † † National University of Singapore ‡ University of Illinois at Urbana-Champaign.
Active Learning for Class Imbalance Problem
Wavelet-Based Multiresolution Matching for Content-Based Image Retrieval Presented by Tienwei Tsai Department of Computer Science and Engineering Tatung.
Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos Yiming Liu, Dong Xu, Ivor W. Tsang, Jiebo Luo Nanyang Technological.
Content-Based Image Retrieval
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL Presented by 2006/8.
IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.
Relevance Feedback in Image Retrieval Systems: A Survey Part II Lin Luo, Tao Huang, Chengcui Zhang School of Computer Science Florida International University.
Toward Consistent Evaluation of Relevance Feedback Approaches in Multimedia Retrieval Xiangyu Jin, James French, Jonathan Michel July, 2005.
Greedy is not Enough: An Efficient Batch Mode Active Learning Algorithm Chen, Yi-wen( 陳憶文 ) Graduate Institute of Computer Science & Information Engineering.
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
Competence Centre on Information Extraction and Image Understanding for Earth Observation 29th March 2007 Category - based Semantic Search Engine 1 Mihai.
1 A Compact Feature Representation and Image Indexing in Content- Based Image Retrieval A presentation by Gita Das PhD Candidate 29 Nov 2005 Supervisor:
Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.
Design of PCA and SVM based face recognition system for intelligent robots Department of Electrical Engineering, Southern Taiwan University, Tainan County,
Semi-Automatic Image Annotation Liu Wenyin, Susan Dumais, Yanfeng Sun, HongJiang Zhang, Mary Czerwinski and Brent Field Microsoft Research.
Image Classification for Automatic Annotation
MindReader: Querying databases through multiple examples Yoshiharu Ishikawa (Nara Institute of Science and Technology, Japan) Ravishankar Subramanya (Pittsburgh.
Consensus Group Stable Feature Selection
2004 謝俊瑋 NTU, CSIE, CMLab 1 A Rule-Based Video Annotation System Andres Dorado, Janko Calic, and Ebroul Izquierdo, Senior Member, IEEE.
A Distributed Multimedia Data Management over the Grid Kasturi Chatterjee Advisors for this Project: Dr. Shu-Ching Chen & Dr. Masoud Sadjadi Distributed.
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
A Genetic Algorithm-Based Approach to Content-Based Image Retrieval Bo-Yen Wang( 王博彥 )
Multimedia Analytics Jianping Fan Department of Computer Science University of North Carolina at Charlotte.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Attila Kiss, Tamás Németh, Szabolcs Sergyán, Zoltán Vámossy, László Csink Budapest Tech Recognition of a Moving Object in a Stereo Environment Using a.
Relevance Feedback in Image Retrieval System: A Survey Tao Huang Lin Luo Chengcui Zhang.
Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC Relevance Feedback for Image Retrieval.
Content-Based Image Retrieval Using Color Space Transformation and Wavelet Transform Presented by Tienwei Tsai Department of Information Management Chihlee.
The Chinese University of Hong Kong Learning Larger Margin Machine Locally and Globally Dept. of Computer Science and Engineering The Chinese University.
Efficient Image Classification on Vertically Decomposed Data
Image Segmentation Techniques
SVMs for Document Ranking
Presentation transcript:

Presentation in IJCNN 2004 Biased Support Vector Machine for Relevance Feedback in Image Retrieval Hoi, Chu-Hong Steven Department of Computer Science & Engineering The Chinese University of Hong Kong Shatin, Hong Kong Budapest, 25-29 July, 2004

Outline Background & Motivation Biased Support Vector Machine (Biased SVM) Relevance Feedback by Biased SVM Experimental Results Conclusions This is the outline of my presentation. First, I will briefly give the background & motivation of our work. Then, I will present and formulate a modified SVM algorithm, Biased SVM, for attacking the imbalance problem. Followed by the formulation, a relevance feedback scheme by Biased SVM is proposed. After that, experimental results will be presented and conclusions are given at the end.

Background Challenges in Content-based Image Retrieval (CBIR) Semantic gap, low-level features, high-level concepts Subjectivity of human being, … Relevance Feedback (RF) Refine retrieval results by incorporating users’ interactions A technique to narrow down the semantic gap, subjectivity Methods: heuristic weighting [Rui98, MARS99], optimization [MindReader98, Rui00], classification [MacArthur99], other learning techniques [Huang01], … Popular method proposed recently: Support Vector Machines (SVM) [Hong00, Chen01, Tong01, Zhang01] Content-based Image retrieval has been an important topic in visual information retrieval for a long days. However, some problems in CBIR are still challenging, include the semantic gap between low-level features and high-level honcepts, and also the subjectivity of human being. To overcome the challenges, relevance feedback has been proposed as a powerful technique to narrow down the semantic gap. There are many techniques proposed for relevance feedback including heuristic weighting schemes, optimization framework, classification, and many other machine learning techniques. One of the most popular technique engaged for relevance feedback recently is the Support Vector machine, which has excellent performance in many pattern recognition applications.

Motivation A case of regular SVM Imbalance dataset problem? Optimal separating hyperplane margin Here, we briefly show the motivation of our work. We first briefly introduce the basic idea of SVM. In regular binary SVM, there are two classes: positive and negative. The algorithm of SVM is to find the optimal separating hyperplane for separating the data with a maximum margin. There is no imbalance consideration in regular SVM. However, in real-world relevance feedback problem, the number of negative samples is often large than the positive. Positive samples will be overwhelmed by negative ones. The imbalance problem will degrade the learning performance. (Positives clustered, negative scattered) A case of regular SVM Imbalance dataset problem? # negative >> # positive Positive overwhelmed by negative

Motivation Limitation of regular SVMs for RF Our solution: Biased SVM Regular binary SVM Simply treat as a strict binary classification problem without imbalance consideration Regular 1-SVM Exclude negative information Our solution: Biased SVM A modified 1-SVM incorporating negative information with bias control From the above examples, we can see the limitations of regular SVMs techniques for relevance feedback as follows: Regular binary SVM simply regards the relevance feedback problem as a strict binary classification problem without imbalance consideration Regular one class SVM do not consider the negative information. (Do not notice the imbalance problem) To attack the challenge, we proposed a modified one class SVM method incorporating the negative information with bias control.

Biased SVM Problem formulation Training data: The objective function c Now we formulate the Biased support vector machine technique. Suppose we are given the training data: Here, x_i is vector, y_i the label of class, l is the number of training samples, The goal of Biased SVM learning is to find the optimal separating hypersphere which best describes the data. The objective function of Biased SVM is given as follow. Here, we introduce the bias control parameter – b – for imbalance consideration. C here is the centroid of the hypersphere, R here is the radius of the hyersphere ball. R

Biased SVM (cont.) Optimization by Lagrange multipliers Take the partial derivatives of L with respect to R,ξ,c, andρ: In order to solve the optimization problem, we introduce the Lagrange multipliers, We take the partial derivatives of L with respect to the parameters as follows:

Biased SVM (cont.) Dual problem (Quadratic Programming (QP) ) The decision function f (x)<0 f (x)>=0 Then, we can obtain the dual problem. This can be solved by typical quadratic programming technique. After solving the problem, we can acquire the decision function.

Relevance Feedback by Biased SVM One of differences with regular SVM Visual comparison Biased SVM: Regular SVM: Here, we show some differences between Biased SVM and regular SVM. One difference can be seen from the dual optimization function, in Biased SVM, we have a bias factor- b – to estimate the bias between the positive class and negative class, while in regular SVM, there is no imbalance consideration. One the other hand, we can also see the difference from visual comparison: e.g. in the figure, we can see that Biased SVM can better describe the data by incorporating the negative information.

Relevance Feedback by Biased SVM Obtained decision function Simplified evaluation function To formulate the relevance feedback algorithm with Biased SVM, we can derive the simplified evaluation function from original decision function. Here, the evaluation function are obtained by removing the constant values from the original decision function.

Experimental Results Datasets One synthetic dataset: 40-Cat, each contains 100 data points randomly generated by 7 Gaussian in a 40-dimensional space. Two real-world image datasets selected from COREL image CDs 20-Cat: 2,000 images 50-Cat: 5,000 images In the experiments, we evaluate the algorithms on both the synthetic dataset and real-world image datasets. One synthetic dataset: 40-Cat, each contains 100 data points randomly generated by 7 Gaussian in a 40-dimensional space. We pick two real-world datasets by selecting the images from COREL image CDs. One is 20 category dataset containing 2000 images, another is 50 category. Each category has a different semantic meaning. Means and covariance matrices of the Gaussians in each category are randomly generated in the range of [0,10].

Experimental Results (cont.) Image Representation Color Moment 9-dimension Edge Direction Histogram 18-dimension Canny detector 18 bins, each of 20 degrees Wavelet-based texture Daubechies-4 wavelet, 3-level DWT 9 subimages to generate the feature Image representation is an important step for CBIR. Three kinds of features: color, shape, and texture are engaged . For color feature, we use 9-dimensional color moment. For shape, we use 18-dimesion edge direction histogram in which Canny detector is used for edge detection and histogram are divided in 18bins each with 20degrees. For texture, we use 9-dimension wavelet texture, in which DB-4wavlet is used and 3-leve DWT decmposition is performed. 9 subimages are selected to describe the texture.

Experimental Results (cont.) Compared Schemes Relevance Feedback by regular nu-SVM Relevance Feedback with 1-SVM Relevance Feedback with Biased SVM Experimental Setup Metric: Average precision = #relevant / #returned Pick 10 instances, label pos. or neg. First iteration, 2 pos. and 8 neg. Same kernel and settings for compared schemes 200 relevance feedback simulation rounds are executed for each compare scheme. We compare there relevance feedback scheme as follows: The first case is by regular binary SVM (here nu-SVM is engaged) Second scheme is the regular 1-SVM technique. The third one is our Biased SVM. In the experimental evaluation, we employ average precision as the metric. And in each relevance feedback iteration, 10 image instances are picked for lablelling In first iteration, 2 positive and 8 negative samples are given. To enable objective evaluation, kernel and settings are same for the compared scheme.

Experimental Results (cont.) This shows the experimental results of systhetic dataset and real-world image datasets. We can see our proposed method outperform than regular approaches. And it is interested that better results are obtained in the real-world cases. We also see that one-class SVM can perform well initially, but degrade later without including negative information. Synthetic dataset 20-Cat COREL Images

Experimental Results (cont.) In the 50-category dataset, we also see the similar results. Our method have better performance than other two schemes. 50-Cat COREL Images

Experimental Results (cont.) This table show the average precision of three schemes after 10 feedback iterations. From the results, we can Biased SVM outperform than other two schemes both on the 20-cat and 50-cat datasets.

Conclusions Address the imbalance problem of relevance feedback in CBIR. Propose a modified SVM technique, i.e. Biased SVM, to attack the imbalance problem of relevance feedback problem in CBIR. Demonstrate effectiveness of the proposed scheme from experiments. In this presentation, we addressed the imbalance problem of relevance feedback in CBIR. We proposed a modified SVM technique, Biased SVM, to attack the imbalance problem in relevance feedback. We demonstrated the effectiveness of our proposed scheme from convinced CBIR experiments.

Budapest, Hungary, July, 2004 Thank you!

References [Rui98] Y. Rui, T. S. Huang, M. Ortega, and S. Mehrotra, “Relevance Feedback: A Power Tool in Interactive Content-Based Image Retrieval”, IEEE Tran Circuits and Systems for Video Technology, Vol 8 No 5, 1998, 644-655 [MARS99] K. Porkaew, S. Mehrotra, and M. Ortega, “Query Reformulation for Content Based Multimedia Retrieval in MARS”, IEEE Int’l Conf. Multimedia Computing and Systems (ICMCS’99), June, 1999 [MindReader98] Y. Ishikawa, R. Subramanya, and C. Faloutsos, “MindReader: Query databases through multiple examples”, 24th VLDB Conf. (New York), 1998 [Zhang01] L. Zhang, F. Lin, and B. Zhang, “SUPPORT VECTOR MACHINE LEARNING FOR IMAGE RETRIEVAL”, ICIP’2001, 2001 [Rui00] Y. Rui, T. S. Huang, “Optimizing learning in image retrieval”, CVPR’00, Hilton Head Island, SC, June 2000 [MacArthur99] S. MacArthur, C. Brodley, and C. Shyu, “Relevance Feedback Decision Trees in Content-Based Image Retrieval,” workshop CBAIVL, CVPR’00, June 12, 2000. [Tong01] S. Tong, and E. Chang, “Support vector machine active learning for image retrieval”, ACM MM’2001, 2001 [Chen01] Y. Chen, X. S. Zhou, T. S. Huang, “One-class SVM for Learning in Image Retrieval”, ICIP'2001, Thessaloniki, Greece, October 7-10, 2001 [Hong00] P. Hong, Q. Tian, T. S. Huang, "Incorporate Support Vector Machines to Content-Based Image Retrieval with Relevance Feedback", ICIP'2000, Vancouver, Sep 10-13, 2000.