Personalizing XML Text Search in Piment Sihem Amer-Yahia AT&T Labs Research - USA Irini Fundulaki Bell Labs - USA Prateek Jain IIT-Kanpur - India Laks.

Slides:



Advertisements
Similar presentations
1 Evaluations in information retrieval. 2 Evaluations in information retrieval: summary The following gives an overview of approaches that are applied.
Advertisements

Processing XML Keyword Search by Constructing Effective Structured Queries Jianxin Li, Chengfei Liu, Rui Zhou and Bo Ning Swinburne University of Technology,
Efficient IR-Style Keyword Search over Relational Databases Vagelis Hristidis University of California, San Diego Luis Gravano Columbia University Yannis.
Computer Science 1000 Information Searching Permission to redistribute these slides is strictly prohibited without permission.
IS530 Lesson 12 Boolean vs. Statistical Retrieval Systems.
Reasoning and Identifying Relevant Matches for XML Keyword Search Yi Chen Ziyang Liu, Yi Chen Arizona State University.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
Maintenance Modifying the data –Add records –Delete records –Update records Modifying the design –Add fields into tables –Remove fields from a table –Change.
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
Intelligent Information Retrieval CS 336 –Lecture 2: Query Language Xiaoyan Li Spring 2006 Modified from Lisa Ballesteros’s slides.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) IR Queries.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
Flexible and Efficient XML Search with Complex Full-Text Predicates Sihem Amer-Yahia - AT&T Labs Research → Yahoo! Research Emiran Curtmola - University.
Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University
Search engines. The number of Internet hosts exceeded in in in in in
Microsoft ® Official Course Interacting with the Search Service Microsoft SharePoint 2013 SharePoint Practice.
Modern Information Retrieval Chapter 4 Query Languages.
COMP 630L Paper Presentation Presenter: Le Jianwei
WMES3103 : INFORMATION RETRIEVAL INDEXING AND SEARCHING.
Module 9 Designing an XML Strategy. Module 9: Designing an XML Strategy Designing XML Storage Designing a Data Conversion Strategy Designing an XML Query.
Chapter 5: Information Retrieval and Web Search
Overview of Search Engines
DAY 21: MICROSOFT ACCESS – CHAPTER 5 MICROSOFT ACCESS – CHAPTER 6 MICROSOFT ACCESS – CHAPTER 7 Akhila Kondai October 30, 2013.
1 IDAR 2007 Emiran Curtmola A Platform for Efficient Full-Text SEARCH on the Web.
H. Lundbeck A/S3-Oct-151 Assessing the effectiveness of your current search and retrieval function Anna G. Eslau, Information Specialist, H. Lundbeck A/S.
AnswerBus Question Answering System Zhiping Zheng School of Information, University of Michigan HLT 2002.
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
1 Evaluating top-k Queries over Web-Accessible Databases Paper By: Amelie Marian, Nicolas Bruno, Luis Gravano Presented By Bhushan Chaudhari University.
Querying Structured Text in an XML Database By Xuemei Luo.
SharePoint 2010 Search Architecture The Connector Framework Enhancing the Search User Interface Creating Custom Ranking Models.
User Experience Takes user input, displays results Search Engine Builds index, returns results Content Processing Retrieves content, prepares for indexing.
Ontologies and Lexical Semantic Networks, Their Editing and Browsing Pavel Smrž and Martin Povolný Faculty of Informatics,
NoteSearch - Find what you’re looking for. Prototype Team B.
The Internet 8th Edition Tutorial 4 Searching the Web.
Chapter 6: Information Retrieval and Web Search
Module 10 Administering and Configuring SharePoint Search.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
Q2Semantic: A Lightweight Keyword Interface to Semantic Search Haofen Wang 1, Kang Zhang 1, Qiaoling Liu 1, Thanh Tran 2, and Yong Yu 1 1 Apex Lab, Shanghai.
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
4 1 SEARCHING THE WEB Using Search Engines and Directories Effectively New Perspectives on THE INTERNET.
Templated Search over Relational Databases Date: 2015/01/15 Author: Anastasios Zouzias, Michail Vlachos, Vagelis Hristidis Source: ACM CIKM’14 Advisor:
Copyright © 2006 Pilothouse Consulting Inc. All rights reserved. Search Overview Search Features: WSS and Office Search Architecture Content Sources and.
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
CPT 499 Internet Skills for Educators Session Three Class Notes.
Data Integration Hanna Zhong Department of Computer Science University of Illinois, Urbana-Champaign 11/12/2009.
Crawling the Hidden Web Authors: Sriram Raghavan, Hector Garcia-Molina VLDB 2001 Speaker: Karthik Shekar 1.
Information Retrieval
Using OARE Search Engines. Environmental Index (EBSCO) Advanced Search.
What Does the User Really Want ? Relevance, Precision and Recall.
Overviews of the Library of Texas & ZLOT Project Dr. William E. Moen Principal Investigator.
Generating Query Substitutions Alicia Wood. What is the problem to be solved?
Multilingual Information Retrieval using GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of Kaohsiung.
Search and Retrieval: Query Languages Prof. Marti Hearst SIMS 202, Lecture 19.
Welcome to CPSC 534B: Information Integration Laks V.S. Lakshmanan Rm. 315.
1 CS 8803 AIAD (Spring 2008) Project Group#22 Ajay Choudhari, Avik Sinharoy, Min Zhang, Mohit Jain Smart Seek.
MICROSOFT ACCESS – CHAPTER 5 MICROSOFT ACCESS – CHAPTER 6 MICROSOFT ACCESS – CHAPTER 7 Sravanthi Lakkimsety Mar 14,2016.
General Architecture of Retrieval Systems 1Adrienn Skrop.
In this session, you will learn to: Create and manage views Implement a full-text search Implement batches Objectives.
Databases and Information Retrieval: Rethinking the Great Divide SIGMOD Panel 14 Jun 2005 Jayavel Shanmugasundaram Cornell University.
Text Search over XML Documents Jayavel Shanmugasundaram Cornell University.
Architecture Components
OUTLINE Basic ideas of traditional retrieval systems
SIS: A system for Personal Information Retrieval and Re-Use
Dagstuhl Seminar on Ranked XML Querying
Combining Keyword and Semantic Search for Best Effort Information Retrieval  Andrew Zitzelberger 1.
CS246: Information Retrieval
Information Retrieval and Web Design
CoXML: A Cooperative XML Query Answering System
Presentation transcript:

Personalizing XML Text Search in Piment Sihem Amer-Yahia AT&T Labs Research - USA Irini Fundulaki Bell Labs - USA Prateek Jain IIT-Kanpur - India Laks Lakshmanan University of British Columbia - Canada VLDB 2005 demonstration

Motivation XML Full-Text languages combine queries on structure and keywords such as Boolean, proximity distance, stemming, thesaurus, case sensitivity, … Key challenge in XML Full-Text search: match users‘ expectations and determine the most relevant query answers. Query personalization as a way to take user profiles into account to customize queries. PimenT enables query personalization by query rewriting and answer ranking as opposed to user-defined scoring functions.

System Architecture A profile in the User profile Repository is a set of user profile rules. User Profile Rules are of the form where condition and conclusion are XQuery Full-Text expressions and action is one of {add, remove, replace} Through the User Profile Provisioning Interface the users manage profiles (e.g., create new rules, delete or modify existing rules). The Query Customizer retrieves relevant profile rules from the user profile repository and rewrites the initial user query into a customized query. The customized query sent to GalaTex and answers returned ranked. Query Customizer User Profile Repository User Profile Provisioning Interface Customized Query Ranking Module PimenT Engine Ranked Answers User Query

User Profile Rules Add/Replace/Remove conditions on keywords If (//book[abstract ftcontains ’Query Evaluation’]) add //book[abstract ftcontains ’Query Evaluation’ with thesaurus ‘WordNet’ case insensitive] If (//book[abstract ftcontains ’Query Evaluation’ case insensitive]) replace //book[abstract ftcontains ’Query Evaluation’ with stemming uppercase] If (//book[abstract ftcontains ’Query Evaluation’ with stemming uppercase]) remove //book[abstract ftcontains ’Query Evaluation’ uppercase] Add/Replace/Remove structure If (//book[abstract ftcontains ’Query Evaluation’]) add //book[abstract ftcontains ’Query Evaluation’]/title If (//book[abstract ftcontains ’Query Evaluation’]/title) replace //book[abstract ftcontains ’Query Evaluation’]/author Drop FT Expressions If (//book[abstract ftcontains ’Query Evaluation’]/title) remove (//book[abstract ftcontains ’Query Evaluation’])

Query Customization Example User Query: //book[abstract ftcontains ‘Query Evaluation’] User Profile Rules –If (//book[abstract ftcontains ’Query Evaluation’] ) add 1.//book[abstract ftcontains ’Query Evaluation’ with thesaurus ‘WordNet’ case insensitive] 2.//book[abstract ftcontains ’Query Evaluation’ with stemming] 3.//book[abstract ftcontains ’Query Evaluation’ && ‘XML’ distance at most 5 words] Customized Query

Query Customization Example User Query: //book[abstract ftcontains ‘Query Evaluation’] User Profile Rules –If (//book[abstract ftcontains ’Query Evaluation’] ) add 1.//book[abstract ftcontains ’Query Evaluation’ with thesaurus ‘WordNet’ case insensitive] 2.//book[abstract ftcontains ’Query Evaluation’ with stemming] 3.//book[abstract ftcontains ’Query Evaluation’ && ‘XML’ distance at most 5 words] Customized Query 1.//book[abstract ftcontains ’Query Evaluation’ with thesaurus ‘WordNet’ case insensitive]

Query Customization Example User Query: //book[abstract ftcontains ‘Query Evaluation’] User Profile Rules –If (//book[abstract ftcontains ’Query Evaluation’] ) add 1.//book[abstract ftcontains ’Query Evaluation’ with thesaurus ‘WordNet’ case insensitive] 2.//book[abstract ftcontains ’Query Evaluation’ with stemming] 3.//book[abstract ftcontains ’Query Evaluation’ && ‘XML’ distance at most 5 words] Customized Query 1.//book[abstract ftcontains ’Query Evaluation’ with thesaurus ‘WordNet’ case insensitive] 2.//book[abstract ftcontains ’Query Evaluation’ with thesaurus ‘WordNet’ case insensitive with stemming]

Query Customization Example User Query: //book[abstract ftcontains ‘Query Evaluation’] User Profile Rules –If (//book[abstract ftcontains ’Query Evaluation’] ) add 1.//book[abstract ftcontains ’Query Evaluation’ with thesaurus ‘WordNet’ case insensitive] 2.//book[abstract ftcontains ’Query Evaluation’ with stemming] 3.//book[abstract ftcontains ’Query Evaluation’ && ‘XML’ distance at most 5 words] Customized Query 1.//book[abstract ftcontains ’Query Evaluation’ with thesaurus ‘WordNet’ case insensitive] 2.//book[abstract ftcontains ’Query Evaluation’ with thesaurus ‘WordNet’ case insensitive with stemming] 3.//book[abstract ftcontains (’Query Evaluation’ with thesaurus ‘WordNet’ case insensitive with stemming) && (‘XML’) distance at most 5 words]

Query Customization Example Scored User Query: //book[abstract ftcontains ‘Query Evaluation’] User Profile Rules –If (//book[abstract ftcontains ’Query Evaluation’] ) add //book[abstract ftcontains ’Query Evaluation’ && ’XML’ ] Scored Customized Query //book[abstract ftcontains ’Query Evaluation’ && ‘XML’] changes result ranking and not set of results.

Profile Enforcement Strategies 1.Strategy «Apply add, replace, remove» Find all relevant rules for the user query and then apply all add, replace and remove rules in that order 2.Strategy «Apply rules in definition order» Find all relevant rules for the user query and then apply rules in the order in which they are defined 3.Strategy «Customize(Find Relevant(Customize(..)))» Choose one relevant rule and customize the query and repeat this process until no customization is possible

Open Issues Formally define what user's expectation means (definition of a user model etc.) Interaction with the user to decide that a specific choice of profile rules and enforcement policies are relevant to the user and derive new user profiles. Which rule application strategy for which application?