Advisor: Hsin-Hsi Chen Reporter: Chi-Hsin Yu Date: 2010.08.05.

Slides:



Advertisements
Similar presentations
Date: 2013/1/17 Author: Yang Liu, Ruihua Song, Yu Chen, Jian-Yun Nie and Ji-Rong Wen Source: SIGIR12 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Adaptive.
Advertisements

Learning to Suggest: A Machine Learning Framework for Ranking Query Suggestions Date: 2013/02/18 Author: Umut Ozertem, Olivier Chapelle, Pinar Donmez,
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Modelling Relevance and User Behaviour in Sponsored Search using Click-Data Adarsh Prasad, IIT Delhi Advisors: Dinesh Govindaraj SVN Vishwanathan* Group:
Query Chains: Learning to Rank from Implicit Feedback Paper Authors: Filip Radlinski Thorsten Joachims Presented By: Steven Carr.
SEARCHING QUESTION AND ANSWER ARCHIVES Dr. Jiwoon Jeon Presented by CHARANYA VENKATESH KUMAR.
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
Sequence Clustering and Labeling for Unsupervised Query Intent Discovery Speaker: Po-Hsien Shih Advisor: Jia-Ling Koh Source: WSDM’12 Date: 1 November,
WSCD INTRODUCTION  Query suggestion has often been described as the process of making a user query resemble more closely the documents it is expected.
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
1 Learning User Interaction Models for Predicting Web Search Result Preferences Eugene Agichtein Eric Brill Susan Dumais Robert Ragno Microsoft Research.
Context-aware Query Suggestion by Mining Click-through and Session Data Authors: H. Cao et.al KDD 08 Presented by Shize Su 1.
Evaluating Search Engine
Click Evidence Signals and Tasks Vishwa Vinay Microsoft Research, Cambridge.
Search Engines and Information Retrieval
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
1 CS 430 / INFO 430 Information Retrieval Lecture 8 Query Refinement: Relevance Feedback Information Filtering.
Time-dependent Similarity Measure of Queries Using Historical Click- through Data Qiankun Zhao*, Steven C. H. Hoi*, Tie-Yan Liu, et al. Presented by: Tie-Yan.
Presented by Li-Tal Mashiach Learning to Rank: A Machine Learning Approach to Static Ranking Algorithms for Large Data Sets Student Symposium.
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
Recall: Query Reformulation Approaches 1. Relevance feedback based vector model (Rocchio …) probabilistic model (Robertson & Sparck Jones, Croft…) 2. Cluster.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
J. Chen, O. R. Zaiane and R. Goebel An Unsupervised Approach to Cluster Web Search Results based on Word Sense Communities.
Modern Retrieval Evaluations Hongning Wang
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Web Usage Mining with Semantic Analysis Date: 2013/12/18 Author: Laura Hollink, Peter Mika, Roi Blanco Source: WWW’13 Advisor: Jia-Ling Koh Speaker: Pei-Hao.
Search Engines and Information Retrieval Chapter 1.
Page 1 WEB MINING by NINI P SURESH PROJECT CO-ORDINATOR Kavitha Murugeshan.
1 Context-Aware Search Personalization with Concept Preference CIKM’11 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Leveraging Conceptual Lexicon : Query Disambiguation using Proximity Information for Patent Retrieval Date : 2013/10/30 Author : Parvaz Mahdabi, Shima.
Gradual Adaption Model for Estimation of User Information Access Behavior J. Chen, R.Y. Shtykh and Q. Jin Graduate School of Human Sciences, Waseda University,
1 Cross-Lingual Query Suggestion Using Query Logs of Different Languages SIGIR 07.
A Survey for Interspeech Xavier Anguera Information Retrieval-based Dynamic TimeWarping.
Ramakrishnan Srikant Sugato Basu Ni Wang Daryl Pregibon 1.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Tag Data and Personalized Information Retrieval 1.
IJCAI 2003 Workshop on Learning Statistical Models from Relational Data First-Order Probabilistic Models for Information Extraction Advisor: Hsin-His Chen.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
Giorgos Giannopoulos (IMIS/”Athena” R.C and NTU Athens, Greece) Theodore Dalamagas (IMIS/”Athena” R.C., Greece) Timos Sellis (IMIS/”Athena” R.C and NTU.
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
Personalized Search Xiao Liu
Understanding User’s Query Intent with Wikipedia G 여 승 후.
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
Mining real world data Web data. World Wide Web Hypertext documents –Text –Links Web –billions of documents –authored by millions of diverse people –edited.
Jiafeng Guo(ICT) Xueqi Cheng(ICT) Hua-Wei Shen(ICT) Gu Xu (MSRA) Speaker: Rui-Rui Li Supervisor: Prof. Ben Kao.
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
Post-Ranking query suggestion by diversifying search Chao Wang.
More Than Relevance: High Utility Query Recommendation By Mining Users' Search Behaviors Xiaofei Zhu, Jiafeng Guo, Xueqi Cheng, Yanyan Lan Institute of.
Generating Query Substitutions Alicia Wood. What is the problem to be solved?
NTU & MSRA Ming-Feng Tsai
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Why Decision Engine Bing Demos Search Interaction model Data-driven Research Problems Q & A.
Learning to Rank: From Pairwise Approach to Listwise Approach Authors: Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li Presenter: Davidson Date:
Web Analytics Xuejiao Liu INF 385F: WIRED Fall 2004.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
1 Learning to Impress in Sponsored Search Xin Supervisors: Prof. King and Prof. Lyu.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Query Refinement and Relevance Feedback.
To Personalize or Not to Personalize: Modeling Queries with Variation in User Intent Presented by Jaime Teevan, Susan T. Dumais, Daniel J. Liebling Microsoft.
Usefulness of Quality Click- through Data for Training Craig Macdonald, ladh Ounis Department of Computing Science University of Glasgow, Scotland, UK.
1 Context-Aware Ranking in Web Search (SIGIR 10’) Biao Xiang, Daxin Jiang, Jian Pei, Xiaohui Sun, Enhong Chen, Hang Li 2010/10/26.
Data Mining: Concepts and Techniques Course Outline
Struggling and Success in Web Search
Date : 2013/1/10 Author : Lanbo Zhang, Yi Zhang, Yunfei Chen
Date: 2012/11/15 Author: Jin Young Kim, Kevyn Collins-Thompson,
Web Mining Research: A Survey
Learning to Rank with Ties
Presentation transcript:

Advisor: Hsin-Hsi Chen Reporter: Chi-Hsin Yu Date:

Introduction Topic A: Relevance Assignment Using Query Log to Assign Actual Relevance of Documents for User Communities Topic B: Knowledge Transfer (cloud wisdom in QL) Transfer Learning in Query Log Topic C: Query Understanding Semantic Relation of Query Terms in Query Log Discussions

Query Log From: Intent Based Clustering of Search Engine Query Log, 2009

Issues Perceived relevance v.s. Actual relevance Clicks Click bias in positions Relevance Query – document User intent/goal – queries – documents User community – queries – documents Cost Editorial judgments v.s. model predicted judgments

From: SIGIR 2010 Tutorials

Task (original) Assign relevance judgment for a q-d pair Actual Relevance From: Intent Based Clustering of Search Engine Query Log, 2009

Applications of the predicted relevance judgments (p r ) As meta-features As actual relevance Low cost Samples (Matrix) prpr Ranking Algorithms Performance in dataset (editorial judgment)

But A Dynamic Bayesian Network Click Model for Web Search Ranking (WWW2009, Track: Data Mining/Session: Click Models) Experiments Predicting click-through rate Predicted relevance as a ranking feature Learning a ranking function with predicted relevance From: (WWW2009, Track: Data Mining/Session: Click Models)

Task (Revised) Assign relevance judgment for a ((user community, q), d) pair Not q-d pair Pseudo-query: (user community, q) Models GA DBN: same as proposed click models in WWW09 papers Difficulties Pseudo-query generation (include user information) User clustering/classification Evaluation Joint training (as in the WWW09 papers) Application For detail analysis of personalized search because we can use predicted relevance to substitute the editorial judgment

Query log = cloud wisdom Task: Mining/leverage cloud wisdom in QL Use transfer learning Use QL to learn meta-features

Task Leverage useful structure/knowledge in QL to boost performance of existing datasets (human judgment) Algorithm SCL: structure correspondence learning Difficulties Selection of extended feature s in QL Evaluation As common IR evaluation metrics Expected results (planned experiments) Can improve performance when use whole training dataset Can improve performance when using small training dataset

SCL ACL 2005, EMNLP 2006 Domain Adaptation with Structural Correspondence Learning (EMNLP 2006) From: EMNLP 2006

From Google search suggestions Interpretation “machine learning wiki/amazon”  Concept + in site “machine learning stanford”  concept + in organization “machine learning tutorial/tool/ppt/journal”  concept + in topic/resources “machine learning kernel”  concept + topic

Compare to compound noun semantics Diarmuid ´O S´eaghdha, 2008 From: Diarmuid ´O S´eaghdha, 2008

Beyond static semantic relations Dynamic semantic relations recognition What is the patterns in the process of query reformulation? Is this useful to identify user goal in a session? Can we build new click model based on semantic relation? Pseudo-session A 1.Apple 2.Apple ipod 3.Apple ipod discount Pseudo-session A

Current works (incomplete) From: SIGIR 2010 Tutorials

Research plan Definition of semantic relations in QL Use Google query suggestions to study the types of semantic relations Segmentation of query terms Mapping segmented query terms to ontology Classification of semantic relation in QL Mining important statistics from QL Applications Ranking strategies based on SRs Click models based on SRs

Task Study of semantic relations of query terms Algorithm Query Segmentation, classification, statistics mining Difficulties Depends... Evaluation Depends... Expected results New problem in NLP and in IR