Contextual IR
Naama Kraus
Slides are based on the papers:
– Searching with Context, Kraft, Chang, Maghoul, Kumar
– Context-Sensitive Query Auto-Completion, Bar-Yossef and Kraus

The Problem (recap)
User queries are an imperfect description of the user’s information needs
Examples:
– Ambiguous queries: jaguar
– General queries: haifa
– Terminology differences (synonyms) between user and corpus: stars - planets

Contextual IR
Leverage context to better understand the user’s information need
Context types
– Short-term context: current time and location, recent queries, recent page visits, current page viewed, recent tweets, recent e-mails …
– Long-term context (user profile/model): long-term search history, user interests, user demographics (gender, education …), e-mails, desktop files …
Today’s focus: short-term context

Example
Document retrieval – use context to disambiguate queries
[Figure: the query "jaguar" shown next to a recently viewed page]

Searching with Context Kraft, Chang, Maghoul, Kumar, WWW’06

Searching with Context
Goal: improve document retrieval
Capture the user’s recent context
– Piece of text
– Extract terms from a page a user is currently viewing, a file a user is currently editing …
Proposes three different methods
– Query rewriting (QR): add terms to the user’s original query
– Rank biasing (RB): re-rank results
– Iterative filtering meta-search (IFM): generate sub-queries and aggregate results

Query Rewriting
Send one simple query to a standard search engine
Append the top context terms to the original query
– AND semantics
– Parameter: how many terms to add
Example: query q, weighted context term vector (a b c d e)
– Terms are ranked by their weight
– Q_new = (q a b) when the parameter is 2
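
The rewriting step above can be sketched in a few lines of Python. This is only an illustration: the function name `rewrite_query`, the dictionary representation of the context vector, and the weights are assumptions, and the sketch assumes the engine treats space-separated terms as a conjunction (AND).

```python
def rewrite_query(query, context_weights, num_terms=2):
    """Append the num_terms highest-weighted context terms to the query."""
    # Rank context terms by descending weight (ties broken alphabetically).
    ranked = sorted(context_weights.items(), key=lambda kv: (-kv[1], kv[0]))
    top_terms = [term for term, _ in ranked[:num_terms]]
    # Assumption: whitespace-separated terms are ANDed by the search engine.
    return " ".join([query] + top_terms)


# Hypothetical weights for the context vector (a b c d e) from the slide.
context = {"a": 0.9, "b": 0.7, "c": 0.5, "d": 0.3, "e": 0.1}
print(rewrite_query("q", context, num_terms=2))  # q a b
```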

Rank-Biasing
Send a complex query that contains ranking instructions to the search engine
Does not change the original result set, only the ranking
Selection terms – original query terms
Optional terms – context terms
– Boost is a function of their weight
[Figure: new query definition, annotated with "must appear" terms and optional terms with a boost factor (influence on ranking)]
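
A minimal sketch of the rank-biasing idea, under stated assumptions: the `RankBiasedQuery` structure, the linear boost function, and the toy ranker are all hypothetical stand-ins, since the slide does not give the engine's actual query syntax or scoring. The point it illustrates is that selection terms decide which documents match, while optional terms only reorder them.

```python
from dataclasses import dataclass, field


@dataclass
class RankBiasedQuery:
    selection: list                                # terms that must appear
    optional: dict = field(default_factory=dict)   # context term -> boost


def build_rank_biased_query(query_terms, context_weights, scale=2.0):
    # Hypothetical boost function: boost grows linearly with the term weight.
    boosts = {term: scale * w for term, w in context_weights.items()}
    return RankBiasedQuery(selection=list(query_terms), optional=boosts)


def toy_rank(docs, rbq):
    """Toy stand-in for the engine: filter on selection terms, sort by boosts."""
    def tokens(doc):
        return set(doc.lower().split())

    matched = [d for d in docs if all(t in tokens(d) for t in rbq.selection)]

    def score(doc):
        return sum(b for t, b in rbq.optional.items() if t in tokens(doc))

    # Stable sort: documents with equal boost keep their original order.
    return sorted(matched, key=score, reverse=True)


docs = ["jaguar car dealer", "jaguar habitat in the jungle", "car rental"]
rbq = build_rank_biased_query(["jaguar"], {"jungle": 0.8, "habitat": 0.5})
print(toy_rank(docs, rbq))
```

With an empty context the same two documents are returned in their original order, which mirrors the claim that only the ranking changes.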

Iterative Filtering Meta-Search
Intuition: “explore” different ways to express an information need
Algorithm outline
– Generate sub-queries
– Send to search engine
– Aggregate results

Sub-query Generation
Use a query template
Example:
– Query q; context = (a, b, c)
– Sub-queries:
  q a, q b, q c
  q a b, q b c
  q a b c
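
The example above is consistent with a template that appends every contiguous window of the ranked context terms to the query (note that "q a c" does not appear). The sketch below assumes that template; the slide itself does not spell it out, and `generate_subqueries` is a hypothetical name.

```python
def generate_subqueries(query, context_terms):
    """Append every contiguous window of context terms to the query."""
    subqueries = []
    n = len(context_terms)
    for size in range(1, n + 1):             # window length 1..n
        for start in range(n - size + 1):    # slide the window
            window = context_terms[start:start + size]
            subqueries.append(" ".join([query] + window))
    return subqueries


print(generate_subqueries("q", ["a", "b", "c"]))
# ['q a', 'q b', 'q c', 'q a b', 'q b c', 'q a b c']
```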

Ranking and Filtering
Issue k sub-queries to a standard search engine (SE)
Obtain results
Challenge: how to combine, rank, and filter results?
Use rank aggregation techniques

Rank Averaging
A rank aggregation method (out of many…)
Given: k lists of top results
Assign a score to each position in the list
– E.g., 1 to first position, 2 to second position …
For each document, average its scores over the k lists
The final list is constructed using the average scores
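
A small sketch of rank averaging as described above. One detail is an assumption not covered by the slide: a document missing from a list is scored as if it sat one position past the end of that list. Lower average scores rank first, and each list is assumed to contain no duplicates.

```python
from collections import defaultdict


def rank_average(result_lists):
    """Order documents by their average position across all result lists."""
    docs = {doc for results in result_lists for doc in results}
    totals = defaultdict(float)
    for results in result_lists:
        positions = {doc: i for i, doc in enumerate(results, start=1)}
        missing = len(results) + 1   # assumed penalty for absent documents
        for doc in docs:
            totals[doc] += positions.get(doc, missing)
    k = len(result_lists)
    averages = {doc: total / k for doc, total in totals.items()}
    # Lower average position is better; ties are broken by document id.
    return sorted(averages, key=lambda d: (averages[d], d))


lists = [["d1", "d2", "d3"], ["d2", "d1", "d4"], ["d2", "d3", "d1"]]
print(rank_average(lists))  # ['d2', 'd1', 'd3', 'd4']
```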

Context-Sensitive Query Auto-Completion Z. Bar-Yossef and N. Kraus, WWW’11

Query Auto-Completion
An integral part of the user’s search experience
Use cases
Predict the user’s intended query
– Save her keystrokes
Assist a user in formulating her information need

Motivating Example
I am attending WWW 2011
I need some information about Hyderabad
[Figure: completion list for "hyderabad", with the labels "Current" and "Desired": hyderabad airport, hyderabad history, hyderabad maps, hyderabad india, hyderabad hotels, hyderabad www]

MostPopular Completion
MostPopular is not always good enough
User queries follow a power law distribution
– A heavy tail of unpopular queries
– MostPopular is likely to mis-predict when given a small number of keystrokes
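
To make the failure mode concrete, here is a minimal sketch of a MostPopular-style completer, assuming it simply ranks logged queries that start with the typed prefix by frequency. The query log, its counts, and the function name are made up for illustration.

```python
from collections import Counter


def most_popular(prefix, query_log, k=5):
    """Return the k most frequent logged queries that start with prefix."""
    counts = Counter(q for q in query_log if q.startswith(prefix))
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [q for q, _ in ranked[:k]]


# Hypothetical log: a few popular queries and one rare "tail" query.
log = (["hyatt"] * 4 + ["hyderabad airport"] * 3
       + ["hyderabad maps"] * 2 + ["hyderabad www"])

print(most_popular("hy", log, k=2))           # ['hyatt', 'hyderabad airport']
print(most_popular("hyderabad w", log, k=2))  # ['hyderabad www']
```

The tail query only surfaces once the user has typed almost all of it, which is the mis-prediction problem the slide describes.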

Nearest Completion
Idea: leverage recent query context
Intuition: the user’s intended query is similar to her context query
– Need a similarity measure between queries (refer to paper)
[Figure: context query "www 2011" with candidate completions hyderabad airport, hyderabad maps, hyderabad india, hydroxycut, hyperbola, hyundai, hyatt]

Nearest Completion: Framework
Offline
1. Expand completions
2. Index completions
Online
1. Expand context query
2. Search for similar completions
3. Return top k completions
[Figure: the context and the candidate completions feed a nearest neighbors search over the repository, which returns the top k context-related completions]
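
The offline and online steps can be sketched as follows. Everything concrete here is an assumption: the expansion table `EXPANSIONS` is a toy stand-in for the paper's query expansion, cosine similarity over term counts is one plausible similarity measure (the slide defers the real one to the paper), and a linear scan replaces a proper nearest-neighbor index.

```python
import math
from collections import Counter

# Hypothetical expansion table; real expansions would come from a corpus.
EXPANSIONS = {
    "www 2011": ["conference", "web", "hyderabad"],
    "hyderabad www": ["conference", "web"],
    "hyderabad airport": ["flights", "terminal"],
    "hyundai": ["cars", "dealer"],
}


def expand(query):
    """Represent a query as a bag of its own terms plus expansion terms."""
    return Counter(query.split() + EXPANSIONS.get(query, []))


def cosine(u, v):
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm = (math.sqrt(sum(x * x for x in u.values()))
            * math.sqrt(sum(x * x for x in v.values())))
    return dot / norm if norm else 0.0


def build_index(completions):
    """Offline: expand every candidate completion once and store it."""
    return {c: expand(c) for c in completions}


def nearest_completions(prefix, context_query, index, k=3):
    """Online: expand the context, then rank matching completions by similarity."""
    ctx = expand(context_query)
    candidates = [c for c in index if c.startswith(prefix)]
    return sorted(candidates, key=lambda c: (-cosine(ctx, index[c]), c))[:k]


index = build_index(["hyderabad airport", "hyderabad www", "hyundai"])
print(nearest_completions("h", "www 2011", index, k=2))
# ['hyderabad www', 'hyderabad airport']
```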

HybridCompletion
Problem: if context queries are irrelevant to the current query, NearestCompletion fails to predict the user’s query.
Solution: HybridCompletion, a combination of highly popular and highly context-similar completions
– Completions that are both popular and context-similar get promoted
hybscore(q) = c · Zsimscore(q) + (1 − c) · Zpopscore(q), c ∈ [0,1]
– Convex combination
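
A minimal sketch of the hybrid score. It assumes that the "Z" prefix denotes z-score standardization of the similarity and popularity scores over the candidate set, which the slide does not state explicitly; the example scores and function names are hypothetical.

```python
from statistics import mean, pstdev


def zscores(scores):
    """Standardize a {completion: score} map to zero mean and unit variance."""
    mu = mean(scores.values())
    sigma = pstdev(scores.values())
    return {q: (s - mu) / sigma if sigma else 0.0 for q, s in scores.items()}


def hybrid_completions(simscore, popscore, c=0.5, k=3):
    """Rank completions by c * Zsimscore + (1 - c) * Zpopscore."""
    if not 0.0 <= c <= 1.0:
        raise ValueError("c must lie in [0, 1]")
    zsim, zpop = zscores(simscore), zscores(popscore)
    hyb = {q: c * zsim[q] + (1 - c) * zpop[q] for q in simscore}
    return sorted(hyb, key=lambda q: (-hyb[q], q))[:k]


# Hypothetical scores for three candidate completions.
sim = {"hyderabad www": 0.9, "hyderabad airport": 0.2, "hyatt": 0.0}
pop = {"hyderabad www": 5, "hyderabad airport": 300, "hyatt": 400}

print(hybrid_completions(sim, pop, c=1.0))  # pure context similarity
print(hybrid_completions(sim, pop, c=0.0))  # pure popularity
```

Setting c = 1 reduces to ranking by context similarity alone and c = 0 to ranking by popularity alone; intermediate values trade the two off.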

MostPopular, Nearest, and Hybrid (1)

MostPopular, Nearest, and Hybrid (2)

Anecdotal Examples
Context: french flag; query: italian flag
– MostPopular: internet, im help, irs, ikea, internet explorer
– Nearest: italian flag, itunes and french, ireland, italy, irealand
– Hybrid: internet, italian flag, itunes and french, im help, irs
Context: neptune; query: uranus
– MostPopular: ups, usps, united airlines, usbank, used cars
– Nearest: uranus, uranas, university, university of chic…, ultrasound
– Hybrid: uranus, uranas, ups, united airlines, usps
Context: improving acer laptop battery; query: bank of america
– MostPopular: bank of america, bankofamerica, best buy, bed bath and b…
– Nearest: battery powered …, battery plus cha…
– Hybrid: bank of america, best buy, battery powered …