Context-Sensitive Query Auto-Completion AUTHORS:NAAMA KRAUS AND ZIV BAR-YOSSEF DATE OF PUBLICATION:NOVEMBER 2010 SPEAKER:RISHU GUPTA 1.

Slides:

Advertisements

Similar presentations

Introduction to Information Retrieval Introduction to Information Retrieval Lecture 7: Scoring and results assembly.

Advertisements

Answering Approximate Queries over Autonomous Web Databases Xiangfu Meng, Z. M. Ma, and Li Yan College of Information Science and Engineering, Northeastern.

Recommender Systems & Collaborative Filtering

Google News Personalization Scalable Online Collaborative Filtering

Date: 2013/1/17 Author: Yang Liu, Ruihua Song, Yu Chen, Jian-Yun Nie and Ji-Rong Wen Source: SIGIR12 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Adaptive.

Learning to Suggest: A Machine Learning Framework for Ranking Query Suggestions Date: 2013/02/18 Author: Umut Ozertem, Olivier Chapelle, Pinar Donmez,

ACM CIKM 2008, Oct , Napa Valley 1 Mining Term Association Patterns from Search Logs for Effective Query Reformulation Xuanhui Wang and ChengXiang.

Traditional IR models Jian-Yun Nie.

Improvements and extras Paul Thomas CSIRO. Overview of the lectures 1.Introduction to information retrieval (IR) 2.Ranked retrieval 3.Probabilistic retrieval.

A Two-Dimensional Click Model for Query Auto-Completion Yanen Li 1, Anlei Dong 2, Hongning Wang 1, Hongbo Deng 2, Yi Chang 2, ChengXiang Zhai 1 1 University.

Document Clustering Carl Staelin. Lecture 7Information Retrieval and Digital LibrariesPage 2 Motivation It is hard to rapidly understand a big bucket.

Spelling Correction for Search Engine Queries Bruno Martins, Mario J. Silva In Proceedings of EsTAL-04, España for Natural Language Processing Presenter:

Learning Outcomes Participants will be able to analyze assessments

Suleyman Cetintas 1, Monica Rogati 2, Luo Si 1, Yi Fang 1 Identifying Similar People in Professional Social Networks with Discriminative Probabilistic.

Davide Mottin, Senjuti Basu Roy, Alice Marascu, Yannis Velegrakis, Themis Palpanas, Gautam Das A Probabilistic Optimization Framework for the Empty-Answer.

Chapter 5: Introduction to Information Retrieval

Ziv Bar-YossefMaxim Gurevich Google and Technion Technion TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A AA.

Introduction to Information Retrieval

1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.

Contextual IR Naama Kraus Slides are based on the papers: Searching with Context, Kraft, Chang, Maghoul, Kumar Context-Sensitive Query Auto-Completion,

Catching the Drift: Learning Broad Matches from Clickthrough Data Sonal Gupta, Mikhail Bilenko, Matthew Richardson University of Texas at Austin, Microsoft.

Time-sensitive Personalized Query Auto-Completion

DOMAIN DEPENDENT QUERY REFORMULATION FOR WEB SEARCH Date : 2013/06/17 Author : Van Dang, Giridhar Kumaran, Adam Troy Source : CIKM’12 Advisor : Dr. Jia-Ling.

Toward Whole-Session Relevance: Exploring Intrinsic Diversity in Web Search Date: 2014/5/20 Author: Karthik Raman, Paul N. Bennett, Kevyn Collins-Thompson.

Searchable Web sites Recommendation Date : 2012/2/20 Source : WSDM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh Jia-ling 1.

A New Suffix Tree Similarity Measure for Document Clustering Hung Chim, Xiaotie Deng City University of Hong Kong WWW 2007 Session: Similarity Search April.

George Lee User Context-based Service Control Group

6/2/ An Automatic Personalized Context- Aware Event Notification System for Mobile Users George Lee User Context-based Service Control Group Network.

Evaluating Search Engine

Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan Susan T.Dumains Eric Horvitz MIT,CSAILMicrosoft Researcher Microsoft.

Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.

Time-dependent Similarity Measure of Queries Using Historical Click- through Data Qiankun Zhao*, Steven C. H. Hoi*, Tie-Yan Liu, et al. Presented by: Tie-Yan.

Seesaw Personalized Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.

Context-Aware Query Classification Huanhuan Cao 1, Derek Hao Hu 2, Dou Shen 3, Daxin Jiang 4, Jian-Tao Sun 4, Enhong Chen 1 and Qiang Yang 2 1 University.

1 Ranked Queries over sources with Boolean Query Interfaces without Ranking Support Vagelis Hristidis, Florida International University Yuheng Hu, Arizona.

Investigation of Web Query Refinement via Topic Analysis and Learning with Personalization Department of Systems Engineering & Engineering Management The.

1 An Empirical Study on Large-Scale Content-Based Image Retrieval Group Meeting Presented by Wyman

Leveraging Conceptual Lexicon ： Query Disambiguation using Proximity Information for Patent Retrieval Date : 2013/10/30 Author : Parvaz Mahdabi, Shima.

APPLYING EPSILON-DIFFERENTIAL PRIVATE QUERY LOG RELEASING SCHEME TO DOCUMENT RETRIEVAL Sicong Zhang, Hui Yang, Lisa Singh Georgetown University August.

Reyyan Yeniterzi Weakly-Supervised Discovery of Named Entities Using Web Search Queries Marius Pasca Google CIKM 2007.

PageRank for Product Image Search Kevin Jing (Googlc IncGVU, College of Computing, Georgia Institute of Technology) Shumeet Baluja (Google Inc.) WWW 2008.

Understanding and Predicting Graded Search Satisfaction Tang Yuk Yu 1.

A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.

UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.

CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.

Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.

Understanding and Predicting Personal Navigation Date : 2012/4/16 Source : WSDM 11 Speaker : Chiu, I- Chih Advisor : Dr. Koh Jia-ling 1.

Universit at Dortmund, LS VIII

윤언근 DataMining lab.  The Web has grown exponentially in size but this growth has not been isolated to good-quality pages.  spamming and.

Probabilistic Query Expansion Using Query Logs Hang Cui Tianjin University, China Ji-Rong Wen Microsoft Research Asia, China Jian-Yun Nie University of.

Recognition of spoken and spelled proper names Reporter : CHEN, TZAN HWEI Author :Michael Meyer, Hermann Hild.

Chapter 6: Information Retrieval and Web Search

Efficient Instant-Fuzzy Search with Proximity Ranking Authors: Inci Centidil, Jamshid Esmaelnezhad, Taewoo Kim, and Chen Li IDCE Conference 2014 Presented.

Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.

LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.

A Word Clustering Approach for Language Model-based Sentence Retrieval in Question Answering Systems Saeedeh Momtazi, Dietrich Klakow University of Saarland,Germany.

Post-Ranking query suggestion by diversifying search Chao Wang.

Bloom Cookies: Web Search Personalization without User Tracking Authors: Nitesh Mor, Oriana Riva, Suman Nath, and John Kubiatowicz Presented by Ben Summers.

Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:

Personalizing Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.

Navigation Aided Retrieval Shashank Pandit & Christopher Olston Carnegie Mellon & Yahoo.

© Prentice Hall1 DATA MINING Web Mining Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University Companion slides.

哈工大信息检索研究室 HITIR ’ s Update Summary at TAC2008 Extractive Content Selection Using Evolutionary Manifold-ranking and Spectral Clustering Reporter: Ph.d.

CS791 - Technologies of Google Spring A Webbased Kernel Function for Measuring the Similarity of Short Text Snippets By Mehran Sahami, Timothy.

PAIR project progress report Yi-Ting Chou Shui-Lung Chuang Xuanhui Wang.

Harnessing the Deep Web : Present and Future -Tushar Mhaskar Jayant Madhavan, Loredana Afanasiev, Lyublena Antova, Alon Halevy January 7,

User Modeling for Personal Assistant

Learning to Personalize Query Auto-Completion

Author: Kazunari Sugiyama, etc. (WWW2004)

INF 141: Information Retrieval

Presentation transcript:

Context-Sensitive Query Auto-Completion AUTHORS:NAAMA KRAUS AND ZIV BAR-YOSSEF DATE OF PUBLICATION:NOVEMBER 2010 SPEAKER:RISHU GUPTA 1

digital camera reviews digital camera buying guide digital camera with wifi digital camera deals digital camera world digital picture frame digital copy Motivating Example I want to buy a good Digital Camera Current Result Desired Result 2

Most Challenging Auto-Completion Scenario Challenge :Query Auto-Completion predicts the correct users query with only 12.8% probability. Goal :To predict the users intended query reliably when user has entered only one character. Advantages: Makes search experience faster Reduces load on servers in Instant Search 3

QAC Algorithms User enters the prefix x of Query q Returns a List of K Completions Hit occurs if c=q Need efficient data structure for faster lookup 4 Completion c of Top K Completion List QAC Algorithm should also work if c is semantically equal to q Ordered By Quality Score Hash Table or Trie

Context-Sensitive Auto- Completion How to Compensate for the lack of information ?? Observation: User searches within some context. User context reflects users intent. Context examples Recent queries Recently visited pages Recent Tweets etc….. Our focus – Recent queries Accessible by search engines 49% of searches are preceded by a different query in the same session For simplicity, in this presentation we focus on the most recent query 5

Recent Query Use Approaches Cluster Similar Queries (Use of Techniques like HMMs) Nearest Completion Algorithm (Assumption:Context relevant to the query) Generalize Most Popular Completion Algorithm None of these previous studies took the user input (prefix) into account in the prediction In 37% of the query pairs the former query has not occurred in the log before Problem with this approach ?? How to tackle this problem ??? 6

Nearest Completion:Measure of Similarity Challenge: Choosing similarity measure that is correlated and universally applicable Completions must be semantically related to the context query. Recommendation Based Query Expansion Represent queries and contexts as high- dimensional term-weighted vectors and resort to cosine similarity. Idea :rich representation of a query is constructed not from its search results, but rather from its recommendation tree. Recommendation Based Query Outputs list of recommendations which are reformulations of previous query. Problem occurs when none of the recommendation compatible with user query How to Overcome this challenge ?? 7

Evaluation EVALUATION METRIC MRR-Mean Reciprocal Rank A standard IR measure to evaluate a retrieval of a specific object at a high rank wMRR-Weighted MRR Weight sample pairs according toprediction difficulty (total # of candidate completions) EVALUATION FRAMEWORK Evaluation Set A random sample of (context, query) pairs from the AOL log Prediction Task Given context query and first character of intended query predict intended query at as high rank as possible 8

Analysis NearestCompletion Fails when the context is irrelevant (difficult to predict whether the context is relevant) MostPopularCompletion Fails when the intended query is not highly popular (long tail) Solution: HybridCompletion HybridCompletion: a combination of Most popular Completion and Nearest Completions Its MRR is 31.5% higher than that of MostPopularCompletion. 9

Most Popular VS Nearest Completion 10 Relevant Context:MRR of NearestCompletion (with depth-3 traversal) is higher in 48% than that of MostPopular-Completion. NearestCompletion becomes destructive, so its MRR is 19% lower than that of MostPopularCompletion.

How Hybrid Completion Works?? Produce Lists Produce top k completions of NearestCompletion Produce top k completions of MostPopularCompletion Standardize Hybrid Score is Convex Combination hybscore(q) = α · Zsimscore(q) + (1 α) · Zpopscore(q) 0 α 1 is a tunable parameter Prior probability that context is relevant 11

MostPopular, Nearest, and Hybrid (2) HybridCompletion is shown to be at least as good as NearestCompletio n when the context is relevant and almost as good as MostPopularCom pletion when the context is irrelevant.

Examples 13

Conclusion 14 Query Auto Completion HybridCompletion Algorithm Nearest Completion Algorithm MostPopularCompletion Algorithm Context Sensitive-Query Auto Completion Based on Popular Queries(AOL Query Log) Convex Combination of NearestCompletion and MostPopular Relevent Context:Based on Users Recent Queries Recommendation Based Algorithm: Rich Query Representatin

Future NearestCompletition: More effective session segmentation technique Predicting the first query in a session still remains an open problem Use of Other Context Resources like Recently Visited Web Pages or Search History Measure of Quality Evaluation should be more relaxed Rich query representation may be further fine tuned. 15