Ruirui Li, Ben Kao, Bin Bi, Reynold Cheng, Eric Lo Speaker: Ruirui Li 1 The University of Hong Kong.

Slides:



Advertisements
Similar presentations
Date: 2013/1/17 Author: Yang Liu, Ruihua Song, Yu Chen, Jian-Yun Nie and Ji-Rong Wen Source: SIGIR12 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Adaptive.
Advertisements

A Framework for Result Diversification
DQR : A Probabilistic Approach to Diversified Query recommendation Date: 2013/05/20 Author: Ruirui Li, Ben Kao, Bin Bi, Reynold Cheng, Eric Lo Source:
Search in Source Code Based on Identifying Popular Fragments Eduard Kuric and Mária Bieliková Faculty of Informatics and Information.
Diversity Maximization Under Matroid Constraints Date : 2013/11/06 Source : KDD’13 Authors : Zeinab Abbassi, Vahab S. Mirrokni, Mayur Thakur Advisor :
Diversified Retrieval as Structured Prediction Redundancy, Diversity, and Interdependent Document Relevance (IDR ’09) SIGIR 2009 Workshop Yisong Yue Cornell.
A Machine Learning Approach for Improved BM25 Retrieval
VisualRank: Applying PageRank to Large-Scale Image Search Yushi Jing, Member, IEEE, and Shumeet Baluja, Member, IEEE.
CUBELSI : AN EFFECTIVE AND EFFICIENT METHOD FOR SEARCHING RESOURCES IN SOCIAL TAGGING SYSTEMS Bin Bi, Sau Dan Lee, Ben Kao, Reynold Cheng The University.
WSCD INTRODUCTION  Query suggestion has often been described as the process of making a user query resemble more closely the documents it is expected.
Context-aware Query Suggestion by Mining Click-through and Session Data Authors: H. Cao et.al KDD 08 Presented by Shize Su 1.
CIKM’2008 Presentation Oct. 27, 2008 Napa, California
1 Entity Ranking Using Wikipedia as a Pivot (CIKM 10’) Rianne Kaptein, Pavel Serdyukov, Arjen de Vries, Jaap Kamps 2010/12/14 Yu-wen,Hsu.
Towards the Web of Concepts: Extracting Concepts from Large Datasets Aditya G. Parameswaran Stanford University Joint work with: Hector Garcia-Molina (Stanford)
Personalized Search Result Diversification via Structured Learning
Context-Aware Query Classification Huanhuan Cao 1, Derek Hao Hu 2, Dou Shen 3, Daxin Jiang 4, Jian-Tao Sun 4, Enhong Chen 1 and Qiang Yang 2 1 University.
J. Chen, O. R. Zaiane and R. Goebel An Unsupervised Approach to Cluster Web Search Results based on Word Sense Communities.
Affinity Rank Yi Liu, Benyu Zhang, Zheng Chen MSRA.
«Tag-based Social Interest Discovery» Proceedings of the 17th International World Wide Web Conference (WWW2008) Xin Li, Lei Guo, Yihong Zhao Yahoo! Inc.,
Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.
Tag Clouds Revisited Date : 2011/12/12 Source : CIKM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh. Jia-ling 1.
SEEKING STATEMENT-SUPPORTING TOP-K WITNESSES Date: 2012/03/12 Source: Steffen Metzger (CIKM’11) Speaker: Er-gang Liu Advisor: Dr. Jia-ling Koh 1.
1 Context-Aware Search Personalization with Concept Preference CIKM’11 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
A Comparative Study of Search Result Diversification Methods Wei Zheng and Hui Fang University of Delaware, Newark DE 19716, USA
1 On Querying Historical Evolving Graph Sequences Chenghui Ren $, Eric Lo *, Ben Kao $, Xinjie Zhu $, Reynold Cheng $ $ The University of Hong Kong $ {chren,
User Browsing Graph: Structure, Evolution and Application Yiqun Liu, Yijiang Jin, Min Zhang, Shaoping Ma, Liyun Ru State Key Lab of Intelligent Technology.
Understanding and Predicting Graded Search Satisfaction Tang Yuk Yu 1.
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Personalized Search Cheng Cheng (cc2999) Department of Computer Science Columbia University A Large Scale Evaluation and Analysis of Personalized Search.
Exploring Online Social Activities for Adaptive Search Personalization CIKM’10 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Querying Structured Text in an XML Database By Xuemei Luo.
Probabilistic Query Expansion Using Query Logs Hang Cui Tianjin University, China Ji-Rong Wen Microsoft Research Asia, China Jian-Yun Nie University of.
Intent Subtopic Mining for Web Search Diversification Aymeric Damien, Min Zhang, Yiqun Liu, Shaoping Ma State Key Laboratory of Intelligent Technology.
Contextual Ranking of Keywords Using Click Data Utku Irmak, Vadim von Brzeski, Reiner Kraft Yahoo! Inc ICDE 09’ Datamining session Summarized.
Probabilistic Models of Novel Document Rankings for Faceted Topic Retrieval Ben Cartrette and Praveen Chandar Dept. of Computer and Information Science.
Query Suggestion Naama Kraus Slides are based on the papers: Baeza-Yates, Hurtado, Mendoza, Improving search engines by query clustering Boldi, Bonchi,
Diversifying Search Result WSDM 2009 Intelligent Database Systems Lab. School of Computer Science & Engineering Seoul National University Center for E-Business.
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
Wikipedia as Sense Inventory to Improve Diversity in Web Search Results Celina SantamariaJulio GonzaloJavier Artiles nlp.uned.es UNED,c/Juan del Rosal,
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
Templated Search over Relational Databases Date: 2015/01/15 Author: Anastasios Zouzias, Michail Vlachos, Vagelis Hristidis Source: ACM CIKM’14 Advisor:
Marina Drosou, Evaggelia Pitoura Computer Science Department
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
Jiafeng Guo(ICT) Xueqi Cheng(ICT) Hua-Wei Shen(ICT) Gu Xu (MSRA) Speaker: Rui-Rui Li Supervisor: Prof. Ben Kao.
Institute of Computing Technology, Chinese Academy of Sciences 1 A Unified Framework of Recommending Diverse and Relevant Queries Speaker: Xiaofei Zhu.
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
Vector Space Models.
Diversifying Search Results Rakesh Agrawal, Sreenivas Gollapudi, Alan Halverson, Samuel Ieong Search Labs, Microsoft Research WSDM, February 10, 2009 TexPoint.
Diversifying Search Results Rakesh AgrawalSreenivas GollapudiSearch LabsMicrosoft Research Alan HalversonSamuel.
COLLABORATIVE SEARCH TECHNIQUES Submitted By: Shikha Singla MIT-872-2K11 M.Tech(2 nd Sem) Information Technology.
Enhancing Web Search by Promoting Multiple Search Engine Use Ryen W. W., Matthew R. Mikhail B. (Microsoft Research) Allison P. H (Rice University) SIGIR.
+ User-induced Links in Collaborative Tagging Systems Ching-man Au Yeung, Nicholas Gibbins, Nigel Shadbolt CIKM’09 Speaker: Nonhlanhla Shongwe 18 January.
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying, Eric Hsueh-Chan Lu, Wen-Ning Kuo and Vincent S. Tseng Institute of Computer Science.
Named Entity Recognition in Query Jiafeng Guo 1, Gu Xu 2, Xueqi Cheng 1,Hang Li 2 1 Institute of Computing Technology, CAS, China 2 Microsoft Research.
Date: 2012/08/21 Source: Zhong Zeng, Zhifeng Bao, Tok Wang Ling, Mong Li Lee (KEYS’12) Speaker: Er-Gang Liu Advisor: Dr. Jia-ling Koh 1.
Post-Ranking query suggestion by diversifying search Chao Wang.
More Than Relevance: High Utility Query Recommendation By Mining Users' Search Behaviors Xiaofei Zhu, Jiafeng Guo, Xueqi Cheng, Yanyan Lan Institute of.
Ranking of Database Query Results Nitesh Maan, Arujn Saraswat, Nishant Kapoor.
Query Suggestions in the Absence of Query Logs Sumit Bhatia, Debapriyo Majumdar,Prasenjit Mitra SIGIR’11, July 24–28, 2011, Beijing, China.
DivQ: Diversification for Keyword Search over Structured Databases Elena Demidova, Peter Fankhauser, Xuan Zhou and Wolfgang Nejfl L3S Research Center,
Divided Pretreatment to Targets and Intentions for Query Recommendation Reporter: Yangyang Kang /23.
Chapter. 3: Retrieval Evaluation 1/2/2016Dr. Almetwally Mostafa 1.
Unsupervised Streaming Feature Selection in Social Media
Personalizing Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
1 Learning to Impress in Sponsored Search Xin Supervisors: Prof. King and Prof. Lyu.
Finding similar items by leveraging social tag clouds Speaker: Po-Hsien Shih Advisor: Jia-Ling Koh Source: SAC 2012’ Date: October 4, 2012.
A Collaborative Quality Ranking Framework for Cloud Components
Improving Search Relevance for Short Queries in Community Question Answering Date: 2014/09/25 Author : Haocheng Wu, Wei Wu, Ming Zhou, Enhong Chen, Lei.
Mining Query Subtopics from Search Log Data
Presentation transcript:

Ruirui Li, Ben Kao, Bin Bi, Reynold Cheng, Eric Lo Speaker: Ruirui Li 1 The University of Hong Kong

Outline Motivation Problem Statement DQR Model Experiments & Evaluation 2

Motivation Massive information arose on the Internet trillion 8 billion ??? Numbers from Google Annul report in 2004 and its official blog in 2008 year Number of Indexed URLs by Google

Motivation User activities in the searching Process. 4 query Search results user Search intent : CIKM living place : CIKM 2012 Hotel : Maui Hotel clicks Lodging CIKM

Motivation 5 query Search results user Search intent : CIKM living place : CIKM 2012 Hotel : Maui Hotel clicks Mine Lodging CIKM

Motivation The effectiveness of IR depends on input queries. Users suffer: Translating human thoughts (search intent) into a concise set of keywords (query) is never straightforward. 6 Search results Search intent : CIKM living place : CIKM 2012 Hotel : Maui Hotel clicks Lodging CIKM

Motivation Input queries are short. Composed of only one or two terms. Number of terms in a query. 7

Motivation Short queries lead to two issues. Issue 1. Ambiguity: Example: query ``jaguar’’ Issue 2. Not specific enough: Example: query ``Disney’’ 8 Cat CartoonStorePark NFL TeamAutomobile Brand

Motivation Most traditional approaches focus on relevance. 1. The most relevant queries to the input query tend to be similar to each other. 2. This generates redundant and monotonic recommendations. 3. Such recommendations provide limit coverage of the recommendation space. 9

Motivation A recommender should provide queries that are not only relevant but also diversified. With diversified recommendations: 1. We can cover multiple potential search intents of the user. 2. The risk users won’t be satisfied is minimized. 3. Finally, users find desired targets in fewer recommendation cycles. 10

Problem statement 11 Output: a list of recommended queries Y. Input: a query q and an integer m. Query recommender m: Number of recommended queries Recommended queries Y Query q GOAL: At least one query in Y is relevant to the user’s search intent.

Problem statement 12 Output: a list of recommended queries Y. Input: a query q and an integer m. Query recommender m: Number of recommended queries Recommended queries Y Query q GOAL: At least one query in Y is relevant to the user’s search intent.

Problem statement 13 Query recommender m: Number of recommended queries Recommended queries Y Query q Five properties: 1. Relevance. 2. Redundancy-free. 3. Diversity. 4. Ranking. 5. Real time response.

DQR: framework 14 Offline: Redundancy-free issue. Mine query concepts from search log. Online: Diversity issue. Propose a probabilistic diversification model.

DQR: offline Mining query concepts. The same search intent can be expressed by different queries. Example: ``Microsoft Research Asia’’, ``MSRA’’, ``MS Research Beijing’’. A query concept is a set of queries which express the same or similar search intents. 15 Microsoft Research Asia MSRA MS Research Beijing

DQR: online 16

DQR: online Greedy strategy: 17 Concept selection Concept pool Input query: m. … …

DQR: diversification : query concept belongs. : query concepts already selected. : query concept to be selected. Objective function: Favor query concepts which are relevant to the input query. Penalize query concepts which are relevant to the query concepts already selected. 18

DQR: diversification Objective function: Estimation: 19

DQR: diversification Click set s: A set of clicked URLs. 20

DQR: diversification Objective function: Relevance: Diversity: 21

Experiments Datasets: Search log collected from search engine. AOL time period: 01 March, May, SOGOU time period: 01 June, June,

Baseline No golden standard for query commendation 23

Evaluation User study 12 users, 60 test queries 24

For a test query q and recommendations by a certain approach. Three relevance levels: Irrelevant (0 points) Partially relevant (1 point) Relevant (2 points) Evaluation 25 Recommendations

Evaluation Three performance Metrics: Relevance Diversity Ranking 26

Relevance 27 Results on AOL Query levelConcept level

Diversity Metric: Intent-Coverage It measures the unique search intents covered by the top m recommended queries. Since each intent represents a specified user search intent, higher Intent-Coverage indicates higher probability to satisfy different users. 28

For a test query q and recommendations by a certain approach. Three relevance levels: Irrelevant (0 points) Partially relevant (1 point) Relevant (2 points) Evaluation 29 Recommendations

Diversity Metric: Intent-Coverage 30 Results on AOL

Ranking Metric: Normalized Discounted Cumulative Gain (NDCG) 31 Results on AOL

Thanks! Questions Suggestions 32

Diversity ranking Metric: 33 Results on AOL Results on SOGOU

Motivation Diversification is highly needed by the use of mobile devices. One in Seven queries come from mobile devices. With limited space inch 17.0 inch 15.4 inch 13.3 inch Screen size is much smaller Numbers from Global mobile statistics 2012 ( mobiThinking )

DQR: clustering A Hawaii restaurant: Unlimited tables. Each table can hold unlimited customers. Customers arrives in a stream. Problem: whenever a customer arrives, assign him to a table. Properties: Familiar people together. Unfamiliar people apart. 35

DQR: clustering Customer stream 36 Compactness control:

DQR Extract representative query from query concept. Voting strategy: Compute a score for each query A score for each query q is therefore computed as: 37

Relevance 38 Results on SOGOU

CIKM Proc. of 2012 Int. Conf. on Information and Knowledge Management (CIKM’12), Maui, Hawaii, Oct. 2012, to appear 39