Search User Behavior: Expanding The Web Search Frontier

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Recommender Systems & Collaborative Filtering
Modelling Relevance and User Behaviour in Sponsored Search using Click-Data Adarsh Prasad, IIT Delhi Advisors: Dinesh Govindaraj SVN Vishwanathan* Group:
Optimizing search engines using clickthrough data
Hashtags as Milestones in Time Identifying the hashtags for meaningful events using Twitter search logs and Wikipedia data Stewart Whiting University of.
1.Accuracy of Agree/Disagree relation classification. 2.Accuracy of user opinion prediction. 1.Task extraction performance on Bing web search log with.
1 Learning User Interaction Models for Predicting Web Search Result Preferences Eugene Agichtein Eric Brill Susan Dumais Robert Ragno Microsoft Research.
Mining the Search Trails of Surfing Crowds: Identifying Relevant Websites from User Activity Data Misha Bilenko and Ryen White presented by Matt Richardson.
Context-aware Query Suggestion by Mining Click-through and Session Data Authors: H. Cao et.al KDD 08 Presented by Shize Su 1.
Search Engines and Information Retrieval
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
A Web of Concepts Dalvi, et al. Presented by Andrew Zitzelberger.
Time-dependent Similarity Measure of Queries Using Historical Click- through Data Qiankun Zhao*, Steven C. H. Hoi*, Tie-Yan Liu, et al. Presented by: Tie-Yan.
The PageRank Citation Ranking “Bringing Order to the Web”
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Sigir’99 Inside Internet Search Engines: Search Jan Pedersen and William Chang.
Web Archive Information Retrieval Miguel Costa, Daniel Gomes (speaker) Portuguese Web Archive.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Overview of Web Data Mining and Applications Part I
WebPage Summarization Using Clickthrough Data JianTao Sun & Yuchang Lu, TsingHua University, China Dou Shen & Qiang Yang, HK University of Science & Technology.
Search Engines and Information Retrieval Chapter 1.
Introduction to Information Retrieval Introduction to Information Retrieval BM25, BM25F, and User Behavior Chris Manning, Pandu Nayak and Prabhakar Raghavan.
PageRank for Product Image Search Kevin Jing (Googlc IncGVU, College of Computing, Georgia Institute of Technology) Shumeet Baluja (Google Inc.) WWW 2008.
1 Cross-Lingual Query Suggestion Using Query Logs of Different Languages SIGIR 07.
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.
Fan Guo 1, Chao Liu 2 and Yi-Min Wang 2 1 Carnegie Mellon University 2 Microsoft Research Feb 11, 2009.
1 Mining User Behavior Mining User Behavior Eugene Agichtein Mathematics & Computer Science Emory University.
Modeling User Interactions in Web Search and Social Media Eugene Agichtein Intelligent Information Access Lab Emory University.
Lecture 2 Jan 13, 2010 Social Search. What is Social Search? Social Information Access –a stream of research that explores methods for organizing users’
Center for E-Business Technology Seoul National University Seoul, Korea BrowseRank: letting the web users vote for page importance Yuting Liu, Bin Gao,
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
Search Engine Optimization 101 What is SEM? SEO? How can I use SEO on my blogs and/or my personal web space?
Question Answering over Implicitly Structured Web Content
Lecture 2 Jan 15, 2008 Social Search. What is Social Search? Social Information Access –a stream of research that explores methods for organizing users’
Personalizing Web Search using Long Term Browsing History Nicolaas Matthijs, Cambridge Filip Radlinski, Microsoft In Proceedings of WSDM
Web Search Module 6 INST 734 Doug Oard. Agenda The Web Crawling  Web search.
Jun Li, Peng Zhang, Yanan Cao, Ping Liu, Li Guo Chinese Academy of Sciences State Grid Energy Institute, China Efficient Behavior Targeting Using SVM Ensemble.
Qi Guo Emory University Ryen White, Susan Dumais, Jue Wang, Blake Anderson Microsoft Presented by Tetsuya Sakai, Microsoft Research.
IR, IE and QA over Social Media Social media (blogs, community QA, news aggregators)  Complementary to “traditional” news sources (Rathergate)  Grow.
Finding high-Quality contents in Social media BY : APARNA TODWAL GUIDED BY : PROF. M. WANJARI.
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Augmenting (personal) IR Readings Review Evaluation Papers returned & discussed Papers and Projects checkin time.
KAIST TS & IS Lab. CS710 Know your Neighbors: Web Spam Detection using the Web Topology SIGIR 2007, Carlos Castillo et al., Yahoo! 이 승 민.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Date: 2013/9/25 Author: Mikhail Ageev, Dmitry Lagun, Eugene Agichtein Source: SIGIR’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Improving Search Result.
The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.
Why Decision Engine Bing Demos Search Interaction model Data-driven Research Problems Q & A.
Learning to Rank: From Pairwise Approach to Listwise Approach Authors: Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li Presenter: Davidson Date:
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
UOS Personalized Search Zhang Tao 장도. Zhang Tao Data Mining Contents Overview 1 The Outride Approach 2 The outride Personalized Search System 3 Testing.
Navigation Aided Retrieval Shashank Pandit & Christopher Olston Carnegie Mellon & Yahoo.
IntroGoalCrowdPredictionWrap-up 1/26 Learning Debugging and Hacking the User Remco Chang Assistant Professor Tufts University.
Usefulness of Quality Click- through Data for Training Craig Macdonald, ladh Ounis Department of Computing Science University of Glasgow, Scotland, UK.
1 Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan, MIT Susan T. Dumais, Microsoft Eric Horvitz, Microsoft SIGIR 2005.
SEARCH AND CONTEXT Susan Dumais, Microsoft Research INFO 320.
User Modeling for Personal Assistant
Recommender Systems & Collaborative Filtering
A Contextual Computing approach towards Personalized Search
Augmenting (personal) IR
Content-Aware Click Modeling
Eugene Agichtein Mathematics & Computer Science Emory University
Mining Query Subtopics from Search Log Data
Evidence from Behavior
Web Information retrieval (Web IR)
Introduction to Information Retrieval
Michal Rosen-Zvi University of California, Irvine
CS246: Leveraging User Feedback
Learning to Rank with Ties
Discussion Class 9 Google.
Presentation transcript:

Search User Behavior: Expanding The Web Search Frontier Eugene Agichtein Mathematics & Computer Science Emory University

Web Search Ranking Millions of users interact with SEs daily Rank pages using hundreds of features: Content match e.g., page terms, anchor text, term weights Prior document quality e.g., web topology, spam features Millions of users interact with SEs daily

Mining Search User Behavior: “best bet” results for navigational queries [Agichtein & Zheng, KDD 2006] Result clicks are valuable, and there has been much work attempting to exploit click data for ranking and evaluation. For example navigational query bank of america, the clickthrough of users clearly prefer the first result. But we can do much better than clicks!

Web Search Ranking Revisited: Rich User Behavior Feature Space [Agichtein et al., SIGIR2006a, Agichtein et al., SIGIR 2006b, IEEE DEBull Dec. 2006] Observed and distributional features Aggregated over all interactions for each query and result pair Distributional features: deviations from the “expected” behavior Represent user interactions as vectors in user behavior space Presentation: what a user sees before a click Clickthrough: frequency and timing of clicks Browsing: what users do after a click Mine patterns in search behavior To predict user preferences for search results Incorporate behavior features into ranking Search abuse, query segmentation, …

One result: search ranking From [Agichtein, Brill, & Dumais, SIGIR 2006b] BM25 (keyword-based ranking) + user behavior is better than full model with hundreds of features – keyword, web structure, et al. Method P@1 Gain RN 0.632 RN+All 0.693 0.061(10%) BM25 0.525 BM25+All 0.687 0.162 (31%)

Sounds good, but… Some challenges: Next: User behavior “in the wild” is not reliable Difficult to access behavior features at runtime Aggregation, deviations, over streams required Interactions are sparse – what about the “tail” queries? Personalization? – multiply the problems by 1B! Next: Author and searcher understanding

Primary References http://www.mathcs.emory.edu/~eugene/ Improving Web Search Ranking by Incorporating User Behavior, E. Agichtein, E. Brill, and S. Dumais, in SIGIR 2006 Learning User Interaction Models for Predicting Web Search Result Preferences, E. Agichtein, E. Brill, S. Dumais, and R. Ragno, in SIGIR 2006 Identifying ”best bet” web search results by mining past user behavior, E. Agichtein and Z. Zheng, in KDD 2006 Web Information Extraction and User Modeling: Towards Closing the Gap, E. Agichtein, IEEE Data Engineering Bulletin, Dec. 2006 This and other work on Information Extraction and Text Mining: http://www.mathcs.emory.edu/~eugene/