ICIP 2004, Singapore, October 25-27 A Comparison of Continuous vs. Discrete Image Models for Probabilistic Image and Video Retrieval Arjen P. de Vries.

Slides:



Advertisements
Similar presentations
Image Retrieval With Relevant Feedback Hayati Cam & Ozge Cavus IMAGE RETRIEVAL WITH RELEVANCE FEEDBACK Hayati CAM Ozge CAVUS.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Information Retrieval and Organisation Chapter 12 Language Models for Information Retrieval Dell Zhang Birkbeck, University of London.
Metadata in Carrot II Current metadata –TF.IDF for both documents and collections –Full-text index –Metadata are transferred between different nodes Potential.
Language Models Naama Kraus (Modified by Amit Gross) Slides are based on Introduction to Information Retrieval Book by Manning, Raghavan and Schütze.
Chapter 5: Introduction to Information Retrieval
1 Language Models for TR (Lecture for CS410-CXZ Text Info Systems) Feb. 25, 2011 ChengXiang Zhai Department of Computer Science University of Illinois,
SEARCHING QUESTION AND ANSWER ARCHIVES Dr. Jiwoon Jeon Presented by CHARANYA VENKATESH KUMAR.
TRECVID 2004 Tzvetanka (‘Tzveta’) I. Ianeva Lioudmila (‘Mila’) Boldareva Thijs Westerveld Roberto Cornacchia Djoerd Hiemstra (the 1 and only) Arjen P.
Segmentation and Fitting Using Probabilistic Methods
K-means clustering Hongning Wang
Information Retrieval Visualization CPSC 533c Class Presentation Qixing Zheng March 22, 2004.
ICASSP, May Arjen P. de Vries Thijs Westerveld Tzvetanka I. Ianeva Combining Multiple Representations on the TRECVID Search Task.
IR Challenges and Language Modeling. IR Achievements Search engines  Meta-search  Cross-lingual search  Factoid question answering  Filtering Statistical.
Automatic Image Annotation and Retrieval using Cross-Media Relevance Models J. Jeon, V. Lavrenko and R. Manmathat Computer Science Department University.
Introduction to Information Retrieval Introduction to Information Retrieval Hinrich Schütze and Christina Lioma Lecture 12: Language Models for IR.
Introduction to Information Retrieval Introduction to Information Retrieval Hinrich Schütze and Christina Lioma Lecture 11: Probabilistic Information Retrieval.
Expectation Maximization Method Effective Image Retrieval Based on Hidden Concept Discovery in Image Database By Sanket Korgaonkar Masters Computer Science.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Probabilistic Information Retrieval.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Probabilistic Information Retrieval.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Dimensional reduction, PCA
Relevance Feedback based on Parameter Estimation of Target Distribution K. C. Sia and Irwin King Department of Computer Science & Engineering The Chinese.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
Switch to Top-down Top-down or move-to-nearest Partition documents into ‘k’ clusters Two variants “Hard” (0/1) assignment of documents to clusters “soft”
Vector Space Model CS 652 Information Extraction and Integration.
ICME 2004 Tzvetanka I. Ianeva Arjen P. de Vries Thijs Westerveld A Dynamic Probabilistic Multimedia Retrieval Model.
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
EE462 MLCV 1 Lecture 3-4 Clustering (1hr) Gaussian Mixture and EM (1hr) Tae-Kyun Kim.
Relevance Feedback Users learning how to modify queries Response list must have least some relevant documents Relevance feedback `correcting' the ranks.
1 Probabilistic Language-Model Based Document Retrieval.
Using Probabilistic Models for Multimedia Retrieval Arjen P. de Vries (Joint research with Thijs Westerveld) Centrum voor Wiskunde en Informatica.
Language Models for IR Debapriyo Majumdar Information Retrieval Indian Statistical Institute Kolkata Spring 2015 Credit for several slides to Jimmy Lin.
Topic Models in Text Processing IR Group Meeting Presented by Qiaozhu Mei.
04/30/13 Last class: summary, goggles, ices Discrete Structures (CS 173) Derek Hoiem, University of Illinois 1 Image: wordpress.com/2011/11/22/lig.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Autumn Web Information retrieval (Web IR) Handout #0: Introduction Ali Mohammad Zareh Bidoki ECE Department, Yazd University
Relevance Feedback Hongning Wang What we have learned so far Information Retrieval User results Query Rep Doc Rep (Index) Ranker.
Web Image Retrieval Re-Ranking with Relevance Model Wei-Hao Lin, Rong Jin, Alexander Hauptmann Language Technologies Institute School of Computer Science.
University of Malta CSA3080: Lecture 6 © Chris Staff 1 of 20 CSA3080: Adaptive Hypertext Systems I Dr. Christopher Staff Department.
Lecture 1: Overview of IR Maya Ramanath. Who hasn’t used Google? Why did Google return these results first ? Can we improve on it? Is this a good result.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
A Model for Learning the Semantics of Pictures V. Lavrenko, R. Manmatha, J. Jeon Center for Intelligent Information Retrieval Computer Science Department,
Chapter 23: Probabilistic Language Models April 13, 2004.
Effective Automatic Image Annotation Via A Coherent Language Model and Active Learning Rong Jin, Joyce Y. Chai Michigan State University Luo Si Carnegie.
Recuperação de Informação Cap. 01: Introdução 21 de Fevereiro de 1999 Berthier Ribeiro-Neto.
Relevance-Based Language Models Victor Lavrenko and W.Bruce Croft Department of Computer Science University of Massachusetts, Amherst, MA SIGIR 2001.
Language Modeling Putting a curve to the bag of words Courtesy of Chris Jordan.
Jen-Tzung Chien, Meng-Sung Wu Minimum Rank Error Language Modeling.
CpSc 881: Information Retrieval. 2 Using language models (LMs) for IR ❶ LM = language model ❷ We view the document as a generative model that generates.
Multimedia Retrieval. Outline Overview Indexing Multimedia Generative Models & MMIR –Probabilistic Retrieval –Language models, GMMs Experiments –Corel.
Relevance Feedback Hongning Wang
Multimedia Retrieval. Outline Overview Indexing Multimedia Generative Models & MMIR –Probabilistic Retrieval –Language models, GMMs Experiments –Corel.
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
A Study of Poisson Query Generation Model for Information Retrieval
Machine Learning Expectation Maximization and Gaussian Mixtures CSE 473 Chapter 20.3.
Language Modeling Again So are we smooth now? Courtesy of Chris Jordan.
Introduction to Information Retrieval Introduction to Information Retrieval Lecture 14: Language Models for IR.
(Note: a lot of input from Thijs Westerveld)
Lecture 13: Language Models for IR
Statistical Models for Automatic Speech Recognition
Language Models for Information Retrieval
Image Segmentation Techniques
Non-parametric Filters
John Lafferty, Chengxiang Zhai School of Computer Science
Language Model Approach to IR
CS590I: Information Retrieval
Non-parametric Filters
INF 141: Information Retrieval
Information Retrieval and Web Design
Presentation transcript:

ICIP 2004, Singapore, October A Comparison of Continuous vs. Discrete Image Models for Probabilistic Image and Video Retrieval Arjen P. de Vries and Thijs Westerveld

ICIP 2004, Singapore, October Theory

ICIP 2004, Singapore, October Generative Models… A statistical model for generating data –Probability distribution over samples in a given ‘language’ M P ( | M )= P ( | M ) P ( | M, ) © Victor Lavrenko, Aug aka ‘Language Modelling’

ICIP 2004, Singapore, October Basic question: –What is the likelihood that this document is relevant to this query? P(rel|I,Q) = P(I,Q|rel)P(rel) / P(I,Q) … in Information Retrieval P(I,Q|rel) = P(Q|I,rel)P(I|rel)

ICIP 2004, Singapore, October Retrieval (Query generation) Models P(Q|M 1 ) P(Q|M 4 ) P(Q|M 3 ) P(Q|M 2 ) Query Docs

ICIP 2004, Singapore, October ‘Language Modeling’ Not just ‘English’ But also, the language of –author –newspaper –text document –image Shakespeare or Dickens? Indeed the short and the long. Marry, ‘tis a noble Lepidus.

ICIP 2004, Singapore, October ‘Language Modeling’ Guardian or Times?Not just ‘English’ But also, the language of –author –newspaper –text document –image

ICIP 2004, Singapore, October ‘Language Modeling’ or ?Not just English! But also, the language of –author –newspaper –text document –image

ICIP 2004, Singapore, October The Fundamental Problem Usually, we don’t know the model M –But have a sample representative of that model First estimate a model from a sample Then compute the observation probability P ( | M ( ) ) M © Victor Lavrenko, Aug. 2002

ICIP 2004, Singapore, October Urn metaphor Unigram Language Models © Victor Lavrenko, Aug P( | ) ~ P ( | ) P ( | ) P ( | ) P ( | ) = 4/9 * 2/9 * 4/9 * 3/9

ICIP 2004, Singapore, October The Zero-frequency Problem Suppose some event not in our example –Model may assign zero probability to that event –And to any set of events involving the unseen event ?

ICIP 2004, Singapore, October Smoothing Idea: shift part of probability mass to unseen events Interpolation with background model –Reflects expected frequency of events –Plays role of IDF (inverse document freq.) – +(1- )

ICIP 2004, Singapore, October The IDF Role of Smoothing P(x| ) +(1- ) P(x|) P(x| ) = +1 (1- ) P(x|) –Ranking independent of

ICIP 2004, Singapore, October Practise

ICIP 2004, Singapore, October Pixel level: no semantics Pixel blocks/regions Image Retrieval

ICIP 2004, Singapore, October Modelling Images Compute local features –Eg., blueness and yellowness

ICIP 2004, Singapore, October 25-27

Discrete Model yellow blue

ICIP 2004, Singapore, October Discrete Model

ICIP 2004, Singapore, October Modelling Images blue yellow Histogram also models empty regions in the feature space Boundaries are hard

ICIP 2004, Singapore, October Continuous Model Build Gaussian Mixture model using expectation maximisation (EM) 2 Components –Centers, covariance –Random intialisation blue yellow

ICIP 2004, Singapore, October Continuous Model

ICIP 2004, Singapore, October Discrete vs. Continuous Discrete Model –Low indexing cost (binning) –Low retrieval cost (inverted file) –But… how to partition the indexing space? Continuous Model –High indexing cost (EM algorithm) –High retrieval cost (access all data) –But… less overfitting  better generalisation

ICIP 2004, Singapore, October Experiments TRECVID2003 search task –Discrete vs. Continuous –Regions vs. full Query examples –All examples vs. designated only Mean average precision

ICIP 2004, Singapore, October Results Continuous Model significantly better on almost all queries However, Discrete Model significantly better for small number of highly focused queries (e.g., flames, airplane taking off) –More analysis needed though

ICIP 2004, Singapore, October Conclusions Language modelling approach to IR also applicable to retrieval of other media Discrete vs. Continuous Model –Continuous Model almost always better –Unfortunately, Discrete Model far easier to implement efficiently

ICIP 2004, Singapore, October Future Work Improve Sampling Process –Better texture representation? –Overlapping, multi-scale image patches? Improve Discrete Model –Partitioning of feature space in grid cells Compare the performance of the two models in interactive setting with relevance feedback –Higher quality per iteration vs. many iterations