Ryen W. White Microsoft Research Steven M. Drucker Microsoft Live Labs

Slides:



Advertisements
Similar presentations
1. XP 2 * The Web is a collection of files that reside on computers, called Web servers. * Web servers are connected to each other through the Internet.
Advertisements

Nonlinear & Neural Networks LAB. PART State Reduction ( )
©2011 1www.id-book.com Evaluation studies: From controlled to natural settings Chapter 14.
Fig. S1 Cytoplasm V c Periplasm V pp Microenvironment V e D pp AiAi A pp AeAe k IMOM * * * D.
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Beliefs & Biases in Web Search
Predicting User Interests from Contextual Information
TAX EVASION IN ARMENIA: AN EMPIRICAL STUDY Robert W. McGee Barry University Tatyana B. Maranjyan Moscow State University of Economics, Statistics and Informatics,
Cathy N. Hartman University of North Texas Libraries October 10, 1998 Cathy N. Hartman University of North Texas Libraries October 10, 1998.
Lexical Analysis Dragon Book: chapter 3.
CTER Orientation Tutorial The Use of the WebBoard.
Site Planning LA 222 King Abdulaziz University
Introduction Lesson 1 Microsoft Office 2010 and the Internet
Publish-Subscribe Approach to Social Annotation of News Top-k Publish-Subscribe for Social Annotation of News Joint work with: Maxim Gurevich (RelateIQ)
An Investigation of Digital Reference Interviews: A Dialogue Act Approach Bei Yu, Assistant Professor Keisuke Inoue, PhD Candidate.
Crystal Free USB with mTouch Demonstration
Results from a Mobile Finance Survey. 2 2 Second survey sponsored by CheckFree with fieldwork in April 2008; First survey completed in March ,007.
M obile U ser I nterface Survey March 11, 2011 Hyojin Song.
Struggling or Exploring? Disambiguating Long Search Sessions
1 ARIN – KR Practical 2 OWL & Protégé Suresh Manandhar; Dimitar Kazakov.
Temporal Query Log Profiling to Improve Web Search Ranking Alexander Kotov (UIUC) Pranam Kolari, Yi Chang (Yahoo!) Lei Duan (Microsoft)
Cookies, Sessions. Server Side Includes You can insert the content of one file into another file before the server executes it, with the require() function.
XP New Perspectives on Browser and Basics Tutorial 1 1 Browser and Basics Tutorial 1.
® Microsoft Office 2010 Browser and Basics.
descriptions, and instructions?
Compiler Construction LR(1) Rina Zviel-Girshin and Ohad Shacham School of Computer Science Tel-Aviv University.
1 A Systematic Review of Cross- vs. Within-Company Cost Estimation Studies Barbara Kitchenham Emilia Mendes Guilherme Travassos.
Student Interface for Online Testing Training Module Copyright © 2014 American Institutes for Research. All rights reserved.
Studies of the Onset & Persistence of Medical Concerns in Search Logs Ryen White and Eric Horvitz Microsoft Research, Redmond
Ryen W. White, Microsoft Research Jeff Huang, University of Washington.
Ryen White, Susan Dumais, Jaime Teevan Microsoft Research {ryenw, sdumais,
University of Kansas Department of Electrical Engineering and Computer Science Dr. Susan Gauch April 2005 I T T C Dr. Susan Gauch Personalized Search Based.
Browser and Basics Tutorial 1. Learn about Web browser software and Web pages The Web is a collection of files that reside on computers, called.
Consumers on the Web: Identification of usage patterns Consumers on the Web: Identification of usage patterns by Nina Koiso-Kanttila
© 2004 Keynote Systems Customer Experience Management (CEM) Bonny Brown, Ph.D. Director, Research & Public Services.
1.Learning the Terms Learning the TermsLearning the Terms 2.Accessing the Internet from a PC Accessing the Internet from a PCAccessing the Internet from.
Information Re-Retrieval Repeat Queries in Yahoo’s Logs Jaime Teevan (MSR), Eytan Adar (UW), Rosie Jones and Mike Potts (Yahoo) Presented by Hugo Zaragoza.
TwitterSearch : A Comparison of Microblog Search and Web Search
Ryen W. White, Mikhail Bilenko, Silviu Cucerzan Microsoft Research, Redmond, USA {ryenw, mbilenko,
The internet is a worldwide public network of computers on which people can join and use multiple services such as sharing of information. OR The Internet,
From Devices to People: Attribution of Search Activity in Multi-User Settings Ryen White, Ahmed Hassan, Adish Singla, Eric Horvitz Microsoft Research,
Evaluation of digital collections' user interfaces Radovan Vrana Faculty of Humanities and Social Sciences Zagreb, Croatia
Personalization of the Digital Library Experience: Progress and Prospects Nicholas J. Belkin Rutgers University, USA
Topics and Transitions: Investigation of User Search Behavior Xuehua Shen, Susan Dumais, Eric Horvitz.
XP New Perspectives on Browser and Basics Tutorial 1 1 Browser and Basics Tutorial 1.
Springerlink.com Introduction to SpringerLink springerlink.com.
Understanding and Predicting Graded Search Satisfaction Tang Yuk Yu 1.
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.
Hao Wu Nov Outline Introduction Related Work Experiment Methods Results Conclusions & Next Steps.
Understanding and Predicting Personal Navigation Date : 2012/4/16 Source : WSDM 11 Speaker : Chiu, I- Chih Advisor : Dr. Koh Jia-ling 1.
Log files presented to : Sir Adnan presented by: SHAH RUKH.
Ryen W. White, Dan Morris Microsoft Research, Redmond, USA {ryenw,
Analysis of Topic Dynamics in Web Search Xuehua Shen (University of Illinois) Susan Dumais (Microsoft Research) Eric Horvitz (Microsoft Research) WWW 2005.
Meet the web: First impressions How big is the web and how do you measure it? How many people use the web? How many use search engines? What is the shape.
Adish Singla, Microsoft Bing Ryen W. White, Microsoft Research Jeff Huang, University of Washington.
CPT 499 Internet Skills for Educators Session Three Class Notes.
XP Browser and Basics COM111 Introduction to Computer Applications.
Living Online Lesson 3 Using the Internet IC3 Basics Internet and Computing Core Certification Ambrose, Bergerud, Buscge, Morrison, Wells-Pusins.
Xinyu Xing, Wei Meng, Dan Doozan, Georgia Institute of Technology Alex C. Snoeren, UC San Diego Nick Feamster, and Wenke Lee, Georgia Institute of Technology.
Post-Ranking query suggestion by diversifying search Chao Wang.
The World Wide Web. What is the worldwide web? The content of the worldwide web is held on individual pages which are gathered together to form websites.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
Microsoft Office 2008 for Mac – Illustrated Unit D: Getting Started with Safari.
Assessing the Scenic Route: Measuring the Value of Search Trails in Web Logs Ryen W. White1 Jeff Huang2 1Microsoft Research 1University of Washington.
Search Engine Optimisation
Exploring Microsoft Office 2013 PowerPoint Comprehensive
Instructor Name Instructor Title Library Name
Planning and Storyboarding a Web Site
Presentation transcript:

Ryen W. White Microsoft Research Steven M. Drucker Microsoft Live Labs

Jack searches for digital cameras. He knows what he wants (hes did this before) and goes straight to a particular page. Jill searches for digital cameras. She is unsure of what shes looking for, and wants to explore the options. Both type digital cameras into a search engine…

Same interface support for Jack and Jill regardless of prior experience or task No support for decisions beyond this page Jack sees: Jill sees:

Search engines adopt a one-size-fits-all approach to interface design Users benefit from familiarity Cost to user-interface designers minimized Limited support for next steps Important to understand what users are doing beyond the result page, and in what ways one-size-fits-all can be enhanced

Approx consenting users Instrumented client-side logging of URLs visited, timestamps, referral information, etc. 20 weeks (Dec 05 – April 06) Analysis focused on: Interaction patterns (e.g., SBBBSBSbBbBBB) Features of interaction (e.g., time spent) Domains visited

Our analysis based on browse trails Ordered series of page views from opening Internet Explorer until closing browser Example trail as Web Behavior Graph: S2 S3 S4 X S7 S8 S9 S7 S10 S1 S5 S1 hotmail.com S2 S6 Non-result page Search engine result page

Search trails situated within browse trails: Initiated with a query to top-5 search engine Can contain multiple queries Terminate with: Session timeout Visit homepage Type URL Check Web-based or logon to online service S2 S3 S4 X S7 S8 S9 S7 S10 S1 S5 S1 hotmail.com S2 S6

< 30% of interaction with search engines > 70% of interaction is forward motion Takeaway: Post SE interaction important Search engine Interactions beyond the search engine

We studied all search interactions (w/ search engine and post-engine) to better understand: User Interaction Variability Extent of differences within and between users Query Interaction Variability Extent of differences within and between queries

Differences in: Interaction patterns Features of the interaction Domains visited Within each user How consistent is user X? Between all users How consistent are all users together?

1. Represent all users trails as strings 2. For each user compute Edit Distance from each trail to every other trail S2 S3 S4 X S7 S8 S9 S7 S10 S1 S5 S1 SBBBSbS S =search B =browse b = back S6 S2

3. Average Edit Distance from each trail to other trails, e.g., 4. Trail with smallest avg. distance most representative of user interaction patterns SSBB SBBS BBBB Trail 1 Trail 2 Trail 3 S S b S bBb S ED(1,2) = 4 ED(2,1) = 4 ED(3,1) = 4 ED(1,3) = 4 ED(2,3) = 5 ED(3,2) = 5 Average 4 4.5

5. Avg. Edit Distance of representative trail: Low = user interaction patterns consistent High = user interaction patterns variable Interaction variance Navigators Explorers # users Average = 20.1 Median = Boundaries fuzzy

Consistent patterns (most trails same), e.g., Most users interact like this sometimes – Navigators interact like this most of the time Few deviations or regressions Tackled problems sequentially More likely to revisit domains Few deviations/regressions Searched sequentially Likely to revisit domains Cleary defined subtasks e.g., 1. Comparison 2. Review S1 S2 S3 S4 S2 S5 S6 S7 S9S8 Sub-trail 2: Review Sub-trail 1: Compare dpreview.com amazon.com digital cameras amazon Sub-trail 1: Compare

Variable patterns (most trails different), e.g., Almost all of their trails different Explorers: Trails branched frequently Submitted many queries Visited many new domains canon.com amazon.com S1 S3 S4 S3 S5 dpreview.com howstuffworks.com S2 S6 S5 S8 S6 S9 S1 S10S11 S10 S12 S13 S14 amazon pmai.org digitalcamera-hq.com digital cameras digital camera canon canon lenses S7 S6 canon lenses

Studied features of the trails Time spent, Num. queries, Num. steps, Branchiness, Num. revisits, Avg. branch len. Factor analysis revealed three factors that captured 80.6% of variance between users Forward and backward motion (52.5%) Branchiness (i.e., how many sub-trails?) (17.4%) Time (10.7%) Factors can be used to differentiate users

Proportion of domains visited that were unique, computed as: Num unique domains / Num of domains 17% had variance of.1 or less Most of the domains visited were revisits 2% had variance of.9 or more Most of the domains visited were unique Roughly same users at extremities as with interaction variance ( 86% overlap)

Navigators and Explorers extreme cases All users exhibit extreme behavior at times Learn from Navigators and Explorers Decide what interface support they need Offer this support as optional functionality to all users in a search toolkit Default search interface does not change More on this later…

Focus on queries rather than users If interaction variable we may need: Tailored search interfaces for different queries Query segmentation and tailored ranking 385 queries with sufficient interaction data Submitted at least 15 times by at least 15 unique participants Distribution of informational / navigational matched that of much larger query logs

Same analysis as earlier, but with queries Low variance (based on ED): Queries generally navigational (e.g., msn) High variance: Undirected, exploratory searches Searches where peoples tastes differ (e.g., travel, art) Nav. and Explor. query behavior similar to Nav. and Explor. user behavior

Teleportation They follow short directed search trails Jump users direct to targets, offer shortcuts Personal Search Histories They conduct the same search repeatedly Present previous searches on search engine Interaction Hubs They rely on important pages within domains Surface these domains as branching points

Guided Tours and Domain Indices They visit multiple domains Offer list of must see domains for query topic Predictive Retrieval They want serendipity Automatically retrieve novel information Support for Rapid Revisitation They use back and visit previous pages a lot Mechanisms to return them to branching points

Conducted a longitudinal study of Web search behavior involving 2500 users Found differences in interaction flow within and between users and within and between queries Identified two types of user with extremely consistent / variable interaction patterns Learned how to support these users that can be used to help everyone