Information Re-Retrieval: Repeat Queries in Yahoo’s Logs Jaime Teevan, Eytan Adar, Rosie Jones, Michael A. S. Potts SIGIR 2007.

Slides:



Advertisements
Similar presentations
Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London.
Advertisements

Struggling or Exploring? Disambiguating Long Search Sessions
SIGIR 2013 Recap September 25, 2013.
1.Accuracy of Agree/Disagree relation classification. 2.Accuracy of user opinion prediction. 1.Task extraction performance on Bing web search log with.
Implicit Queries for Vitor R. Carvalho (Joint work with Joshua Goodman, at Microsoft Research)
FindAll: A Local Search Engine for Mobile Phones Aruna Balasubramanian University of Washington.
Personalization and Search Jaime Teevan Microsoft Research.
Evaluating Search Engine
THE WEB CHANGES EVERYTHING Jaime Teevan, Microsoft Research.
Machine Learning Reading: Chapter 18, Agenda and Announcements Machine Learning assignment will go out on Thursday. Tutorial in class on tool for.
Why we search: Visualizing and predicting user behavior E Adar, D S Weld, B N Bershad, S Gribble Raju Balakrishnan.
Data Mining – Intro.
Introduction to . What is ? is electronic mail that allows users to send messages and files (pictures, documents, etc) to another person.
Query Log Analysis Naama Kraus Slides are based on the papers: Andrei Broder, A taxonomy of web search Ricardo Baeza-Yates, Graphs from Search Engine Queries.
Section 2: Finding and Refinding Jaime Teevan Microsoft Research 1.
Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan.
Information Re-Retrieval Repeat Queries in Yahoo’s Logs Jaime Teevan (MSR), Eytan Adar (UW), Rosie Jones and Mike Potts (Yahoo) Presented by Hugo Zaragoza.
SLOW SEARCH Jaime Teevan, Kevyn Collins-Thompson, Ryen White, Susan Dumais and Yubin Kim.
TwitterSearch : A Comparison of Microblog Search and Web Search
Jaime Teevan Microsoft Research Finding and Re-Finding Personal Information.
From Devices to People: Attribution of Search Activity in Multi-User Settings Ryen White, Ahmed Hassan, Adish Singla, Eric Horvitz Microsoft Research,
Analyzing and Evaluating Query Reformulation Strategies in Web Search Logs ReporterHsan-Yu Lin.
Facets of Personalization Jaime Teevan Microsoft Research (CLUES) with S. Dumais, E. Horvitz, D. Liebling, E. Adar, J. Elsas, R. Hughes.
Searching the Web Dr. Frank McCown Intro to Web Science Harding University This work is licensed under Creative Commons Attribution-NonCommercial 3.0Attribution-NonCommercial.
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.
Query Logs – Used everywhere and for everything Sai Vallurupalli.
1 Can People Collaborate to Improve the relevance of Search Results? Florian Eiteljörge June 11, 2013Florian Eiteljörge.
Improving Web Spam Classification using Rank-time Features September 25, 2008 TaeSeob,Yun KAIST DATABASE & MULTIMEDIA LAB.
Where’s the course website? Log in using your UTORid.
Search Yahoo! With Boolean Operators AND, OR, (), “”, NOT, Domain:
Hao Wu Nov Outline Introduction Related Work Experiment Methods Results Conclusions & Next Steps.
Understanding and Predicting Personal Navigation Date : 2012/4/16 Source : WSDM 11 Speaker : Chiu, I- Chih Advisor : Dr. Koh Jia-ling 1.
Enron Corpus: A New Dataset for Classification By Bryan Klimt and Yiming Yang CEAS 2004 Presented by Will Lee.
Query trends CS 349 Presentation December 2 nd, 2008 Catherine Grevet.
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Unsupervised Learning of Visual Sense Models for Polysemous Words Kate Saenko Trevor Darrell Deepak.
WHAT AND HOW CHILDREN SEARCH ON THE WEB Sergio Duarte Torres, Ingmar Weber.
A Statistical Comparison of Tag and Query Logs Mark J. Carman, Robert Gwadera, Fabio Crestani, and Mark Baillie SIGIR 2009 June 4, 2010 Hyunwoo Kim.
Understanding User Goals in Web Search University of Seoul Computer Science Database Lab. Min Mi-young.
 Who Uses Web Search for What? And How?. Contribution  Combine behavioral observation and demographic features of users  Provide important insight.
Understanding and Predicting Personal Navigation.
Post-Ranking query suggestion by diversifying search Chao Wang.
Learning User Behaviors for Advertisements Click Prediction Chieh-Jen Wang & Hsin-Hsi Chen National Taiwan University Taipei, Taiwan.
THE WEB CHANGES EVERYTHING Jaime Teevan, Microsoft
THE WEB CHANGES EVERYTHING Jaime Teevan, Microsoft
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
KMS & Collaborative Filtering Why CF in KMS? CF is the first type of application to leverage tacit knowledge People-centric view of data Preferences matter.
Personalizing Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
ASSOCIATIVE BROWSING Evaluating 1 Jin Y. Kim / W. Bruce Croft / David Smith by Simulation.
Helping People Find Information Better Jaime Teevan, MIT with Christine Alvarado, Mark Ackerman and David Karger.
Unsearch Leveraging Google Instant results. Google Instant Google Instant displays the result of your most probable search You receive the result before.
Jaime Teevan MIT, CSAIL The Re:Search Engine. “Pick a card, any card.”
Potential for Personalization Transactions on Computer-Human Interaction, 17(1), March 2010 Data Mining for Understanding User Needs Jaime Teevan, Susan.
SEARCH AND CONTEXT Susan Dumais, Microsoft Research INFO 320.
The Web Changes Everything How Dynamic Content Affects the Way People Find Online Jaime Teevan Microsoft Research (CLUES) with S. Dumais, D. Liebling,
Jaime Teevan MIT, CSAIL. HCI at MIT ♠HCI Seminar ♥Fridays at 1:30 pm ♥URL: ♥Announcement mailing list ♠Applying HCI to.
Simultaneous Support for Finding and Re-Finding
Fix yahoo error code 1032 Call Toll-free Number
Amazing The Re:Search Engine Jaime Teevan MIT, CSAIL.
SIS: A system for Personal Information Retrieval and Re-Use
Detecting Online Commercial Intention (OCI)
Mining Query Subtopics from Search Log Data
Consistency During Search Without Stagnation
Struggling and Success in Web Search
Transparent Contribution of Memory
/London.html Show the URL for the first location.
CS246: Leveraging User Feedback
Web Programming Assignment 4 - Extra Credit
Transparent Contribution of Memory
Presentation transcript:

Information Re-Retrieval: Repeat Queries in Yahoo’s Logs Jaime Teevan, Eytan Adar, Rosie Jones, Michael A. S. Potts SIGIR 2007

Motivation Re-finding information is a common activity of Web search What is the intention of re-finding information? What factors favor/indicate user’s re- finding of information?

Dataset 114 Yahoo users search trace over 1 year (Aug 2004 – July 2005) –115 queries / trace –Considered as repeat when separated > 30 minutes 119 volunteers in a controlled experiment –users are asked to repeat one query made 30 mins to 1 hour ago

Techniques used Normalizing query terms –Capitalization, stop words removal, duplicate words removal, extra white space, stemming –Word order (e.g. “new york department of state” and “department of state new york”) –Non-alphanumerics (e.g. “sub-urban” vs “sub urban”) –Word merge (e.g. “wal mart” vs “walmart”) –Domain (e.g. hotmail vs hotmail.com) –Words swap (e.g. “american embassy london” vs “american consulate london”) SVM classifier –Applied to predict whether a result will be clicked again

Discovery Navigation query is one major type of re-finding information –Bank, news, mail –.com,.edu,.net Rank changes affects re- finding

Discovery Memory fades –Control experiment 30% are mis-remembered (36/119) 27 out of 36 are equivalent after normalization –Yahoo Logs  Indicators of repeat click –# clicks in first query –# clicks in previous query –# unique clicks in previous query