Query Relevance Feedback and Ontologies How to Make Queries Better.

Slides:



Advertisements
Similar presentations
Relevance Feedback User tells system whether returned/disseminated documents are relevant to query/information need or not Feedback: usually positive sometimes.
Advertisements

Chapter 5: Introduction to Information Retrieval
Introduction to Information Retrieval
Basic IR: Modeling Basic IR Task: Slightly more complex:
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
Introduction to Information Retrieval (Manning, Raghavan, Schutze) Chapter 6 Scoring term weighting and the vector space model.
Leveraging Your Taxonomy to Increase User Productivity MAIQuery and TM Navtree.
Text Operations: Preprocessing. Introduction Document preprocessing –to improve the precision of documents retrieved –lexical analysis, stopwords elimination,
Precision and Recall.
Web Search - Summer Term 2006 II. Information Retrieval (Basics Cont.)
IR Models: Overview, Boolean, and Vector
Evaluating Search Engine
Search Engines and Information Retrieval
Information Retrieval Ling573 NLP Systems and Applications April 26, 2011.
ISP 433/533 Week 2 IR Models.
Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.
© Tefko Saracevic, Rutgers University1 Search strategy & tactics Governed by effectiveness & feedback.
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
Search Strategies Online Search Techniques. Universal Search Techniques Precision- getting results that are relevant, “on topic.” Recall- getting all.
WMES3103 : INFORMATION RETRIEVAL
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) IR Queries.
Modeling Modern Information Retrieval
© Tefko Saracevic1 Search strategy & tactics Governed by effectiveness&feedback.
An investigation of query expansion terms Gheorghe Muresan Rutgers University, School of Communication, Information and Library Science 4 Huntington St.,
Information Retrieval
Recuperação de Informação. IR: representation, storage, organization of, and access to information items Emphasis is on the retrieval of information (not.
CS246 Basic Information Retrieval. Today’s Topic  Basic Information Retrieval (IR)  Bag of words assumption  Boolean Model  Inverted index  Vector-space.
Chapter 5: Information Retrieval and Web Search
Modeling (Chap. 2) Modern Information Retrieval Spring 2000.
Query Expansion.
Search Engines and Information Retrieval Chapter 1.
APPLICATIONS OF DATA MINING IN INFORMATION RETRIEVAL.
Analyzing and Evaluating Query Reformulation Strategies in Web Search Logs ReporterHsan-Yu Lin.
COMP423.  Query expansion  Two approaches ◦ Relevance feedback ◦ Thesaurus-based  Most Slides copied from ◦
1 Chapter 19: Information Retrieval Chapter 19: Information Retrieval Relevance Ranking Using Terms Relevance Using Hyperlinks Synonyms., Homonyms,
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Query Expansion By: Sean McGettrick. What is Query Expansion? Query Expansion is the term given when a search engine adding search terms to a user’s weighted.
1 Query Operations Relevance Feedback & Query Expansion.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Planning a search strategy.  A search strategy may be broadly defined as a conscious approach to decision making to solve a problem or achieve an objective.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
1 Automatic Classification of Bookmarked Web Pages Chris Staff Second Talk February 2007.
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
LIS618 lecture 3 Thomas Krichel Structure of talk Document Preprocessing Basic ingredients of query languages Retrieval performance evaluation.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
Query Expansion By: Sean McGettrick. What is Query Expansion? Query Expansion is the term given when a search engine adding search terms to a user’s weighted.
© 2004 Chris Staff CSAW’04 University of Malta of 15 Expanding Query Terms in Context Chris Staff and Robert Muscat Department of.
1 Automatic indexing Salton: When the assignment of content identifiers is carried out with the aid of modern computing equipment the operation becomes.
1 Latent Concepts and the Number Orthogonal Factors in Latent Semantic Analysis Georges Dupret
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Information Retrieval
Information Retrieval Transfer Cycle Dania Bilal IS 530 Fall 2007.
What Does the User Really Want ? Relevance, Precision and Recall.
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
Query expansion COMP423. Menu Query expansion Two approaches Relevance feedback Thesaurus-based Most Slides copied from
Knowledge and Information Retrieval Dr Nicholas Gibbins 32/4037.
University Of Seoul Ubiquitous Sensor Network Lab Query Dependent Pseudo-Relevance Feedback based on Wikipedia 전자전기컴퓨터공학 부 USN 연구실 G
Database System Concepts, 5th Ed. ©Sang Ho Lee Chapter 19: Information Retrieval.
Multimedia Information Retrieval
Modern Information Retrieval
Basic Information Retrieval
موضوع پروژه : بازیابی اطلاعات Information Retrieval
CSE 635 Multimedia Information Retrieval
Introduction to Information Retrieval
CS246: Information Retrieval
Semantic Similarity Methods in WordNet and their Application to Information Retrieval on the Web Yizhe Ge.
Retrieval Utilities Relevance feedback Clustering
Information Retrieval and Web Design
Chapter 19: Information Retrieval
Presentation transcript:

Query Relevance Feedback and Ontologies How to Make Queries Better

Overview Ranked Retrieval Relevance Feedback The Semantic Web and Ontologies

Typical Web Retrieval Process Link Following NeedKeyword Query More Like this

Ranked Retrieval How can we present the “best” item to the user first

What are we trying to do in IR Find the Document which is most similar to the query Ranking Interpretation –show the best most similar document first –then the next best most similar document –and so on

Bag of Words Model of Text Ignore the order of words in the document Just record whether a word appears in a document

Similarity Measures Cosine Formula Measures how like a document is to a query/document See Kowalski Chapter 7

Similarity as Ranking Use the Similarity Measure to rank the documents

Relevance Feedback More Like this done properly

Observation The user is probably in the best position to judge the relevance of a document Likewise the user is probably in the best position to judge which returned (highly ranked) documents are irrelevant

Retrieval Process NeedAnalytic Query More Like this No More Like This

Relevance Feedback in Nutshell Perform an initial retrieval Ask the user to indicate which documents are relevant/irrelevant –Add all terms from relevant documents –Remove all terms from irrelevant documents –requery

Variants Using Ranking and Weighting Pseudo relevance feedback –use terms from all (highly ranked) retrieved documents –Assumes highly ranked documents are a homogenous mass of relevant documents (Croft) very helpful if very few documents retrieved  perpetuates errors/misunderstandings from original query

Exercise What are advantages of positive feedback ? What are advantages of negative feedback ? Which is best ?

Relevance Feedback Conclusion Consistently proven an effective way to improve retrieval Biggest problem is getting users to engage in the interaction, especially if no highly relevant documents are in the initially retrieved set

Ontologies

The Semantic Web Introduced by Tim Berners Lee and others in 2001 – D2-1C70-84A9809EC588EF21http:// D2-1C70-84A9809EC588EF21 Essentially about allowing computers and people to share the same world Central to the communication is the notion of an Ontology

Ontology Definition To standardize semantic terms, many areas use specific ontologies, which are hierarchical taxonomies of terms describing certain knowledge topics (Baeza-Yates & Ribeiro-Neto, 1999, p143). Thesauri: Ontologies for Information Retrieval. Entities, Relations.

O example Car Drop head coupe Automobile Hot Hatch Engine Wheels Seat Parts Sort of Also Known as

Improving Recall and/or Precision If you get too few documents  Use more general terms in the query Use “automobile” instead of “drop head coupe”  Use an alternative term which is more common  Use “car” rather than “automobile”  If you get too many (overall) –Use a more specific term Use “hot hatch” rather than “car”

Issues How are thesauri different from Ontologies –Are we representing the world or words –Is Wordnet an ontology ? Are Ontologies meant to be –General –Universal –For a specific purpose ?

Thesauri Provide a map of a given field of knowledge: concepts, relations. Provide a standard vocabulary for consistent indexing. Assist users with locating terms for proper query formulation. Ensure only one term from a synonym set is used for indexing and searching: otherwise a searcher who uses one synonym and retrieves some useful documents may think the correct term has been used and the search has been exhaustive, without knowing that there are other useful documents under other synonyms. Provide classified hierarchies for broadening or narrowing a search if too many or too few documents are retrieved. Retrieval based on concepts rather than words (Baeza-Yates & Ribeiro-Neto, 1999).

WordNet Relations Examples are: Synonyms e.g. couch / sofa / lounge Antonyms e.g. love / hate Hypernyms (broader) e.g. cat / tabby Hyponyms (narrower) e.g. cat / animal Meronym (part-of) e.g. finger / hand Meronym (made-of) e.g. snowflake / snow

WordNet Demos See vancouver-webpages.com/wordnet See marimba.d.umn.edu/cgi- bin/similarity.cgi

Conclusions Ranked Retrieval –similarity matching Relevance Feedback –positive and negative feedback The Semantic Web and Ontologies