Understanding Web Browsing Behaviors through Weibull Analysis of Dwell Time Chao Liu, Ryen White, Susan Dumais Microsoft Research at Redmond.

Slides:

Advertisements

Similar presentations

A Comparison of Implicit and Explicit Links for Web Page Classification Dou Shen 1 Jian-Tao Sun 2 Qiang Yang 1 Zheng Chen 2 1 Department of Computer Science.

Advertisements

Struggling or Exploring? Disambiguating Long Search Sessions

Temporal Query Log Profiling to Improve Web Search Ranking Alexander Kotov (UIUC) Pranam Kolari, Yi Chang (Yahoo!) Lei Duan (Microsoft)

1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.

Modelling Relevance and User Behaviour in Sponsored Search using Click-Data Adarsh Prasad, IIT Delhi Advisors: Dinesh Govindaraj SVN Vishwanathan* Group:

PHP Meetup - SEO 2/12/2009. Where to Focus? Ensuring the findability of content Ensuring content is well understood by search engines Maximizing the importance.

A Machine Learning Approach for Improved BM25 Retrieval

Query Chains: Learning to Rank from Implicit Feedback Paper Authors: Filip Radlinski Thorsten Joachims Presented By: Steven Carr.

Overview of Collaborative Information Retrieval (CIR) at FIRE 2012 Debasis Ganguly, Johannes Leveling, Gareth Jones School of Computing, CNGL, Dublin City.

1 Learning User Interaction Models for Predicting Web Search Result Preferences Eugene Agichtein Eric Brill Susan Dumais Robert Ragno Microsoft Research.

Mining the Search Trails of Surfing Crowds: Identifying Relevant Websites from User Activity Data Misha Bilenko and Ryen White presented by Matt Richardson.

Ao-Jan Su † Y. Charlie Hu ‡ Aleksandar Kuzmanovic † Cheng-Kok Koh ‡ † Northwestern University ‡ Purdue University How to Improve Your Google Ranking: Myths.

Click Evidence Signals and Tasks Vishwa Vinay Microsoft Research, Cambridge.

Time-dependent Similarity Measure of Queries Using Historical Click- through Data Qiankun Zhao*, Steven C. H. Hoi*, Tie-Yan Liu, et al. Presented by: Tie-Yan.

Presented by Li-Tal Mashiach Learning to Rank: A Machine Learning Approach to Static Ranking Algorithms for Large Data Sets Student Symposium.

Context-Aware Query Classification Huanhuan Cao 1, Derek Hao Hu 2, Dou Shen 3, Daxin Jiang 4, Jian-Tao Sun 4, Enhong Chen 1 and Qiang Yang 2 1 University.

Online Learning for Web Query Generation: Finding Documents Matching a Minority Concept on the Web Rayid Ghani Accenture Technology Labs, USA Rosie Jones.

Warranty Forecasting of Electronic Boards using Short- term Field Data Mustafa Altun, PhD Assistant Professor Istanbul Technical University

Survival Analysis for Risk-Ranking of ESP System Performance Teddy Petrou, Rice University August 17, 2005.

Supply Chain Management (SCM) Forecasting 3

Cohort Modeling for Enhanced Personalized Search Jinyun YanWei ChuRyen White Rutgers University Microsoft BingMicrosoft Research.

Finding Advertising Keywords on Web Pages Scott Wen-tau YihJoshua Goodman Microsoft Research Vitor R. Carvalho Carnegie Mellon University.

Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα

Modern Retrieval Evaluations Hongning Wang

Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.

Title Extraction from Bodies of HTML Documents and its Application to Web Page Retrieval Microsoft Research Asia Yunhua Hu, Guomao Xin, Ruihua Song, Guoping.

Topics and Transitions: Investigation of User Search Behavior Xuehua Shen, Susan Dumais, Eric Horvitz.

A Comparative Study of Search Result Diversification Methods Wei Zheng and Hui Fang University of Delaware, Newark DE 19716, USA

 An important problem in sponsored search advertising is keyword generation, which bridges the gap between the keywords bidded by advertisers and queried.

User Browsing Graph: Structure, Evolution and Application Yiqun Liu, Yijiang Jin, Min Zhang, Shaoping Ma, Liyun Ru State Key Lab of Intelligent Technology.

Predicting Content Change On The Web BY : HITESH SONPURE GUIDED BY : PROF. M. WANJARI.

A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.

Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.

Ramakrishnan Srikant Sugato Basu Ni Wang Daryl Pregibon 1.

Fan Guo 1, Chao Liu 2 and Yi-Min Wang 2 1 Carnegie Mellon University 2 Microsoft Research Feb 11, 2009.

PERSONALIZED SEARCH Ram Nithin Baalay. Personalized Search? Search Engine: A Vital Need Next level of Intelligent Information Retrieval. Retrieval of.

1 Mining User Behavior Mining User Behavior Eugene Agichtein Mathematics & Computer Science Emory University.

UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.

CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.

Hao Wu Nov Outline Introduction Related Work Experiment Methods Results Conclusions & Next Steps.

Mining the Web to Create Minority Language Corpora Rayid Ghani Accenture Technology Labs - Research Rosie Jones Carnegie Mellon University Dunja Mladenic.

Implicit Acquisition of Context for Personalization of Information Retrieval Systems Chang Liu, Nicholas J. Belkin School of Communication and Information.

Presenter: Lung-Hao Lee ( 李龍豪 ) January 7, 309.

Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.

--He Xiangnan PhD student Importance Estimation of User-generated Data.

Qi Guo Emory University Ryen White, Susan Dumais, Jue Wang, Blake Anderson Microsoft Presented by Tetsuya Sakai, Microsoft Research.

Algorithmic Detection of Semantic Similarity WWW 2005.

Jiafeng Guo(ICT) Xueqi Cheng(ICT) Hua-Wei Shen(ICT) Gu Xu (MSRA) Speaker: Rui-Rui Li Supervisor: Prof. Ben Kao.

Adish Singla, Microsoft Bing Ryen W. White, Microsoft Research Jeff Huang, University of Washington.

Social Tag Prediction Paul Heymann, Daniel Ramage, and Hector Garcia- Molina Stanford University SIGIR 2008.

Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.

COLLABORATIVE SEARCH TECHNIQUES Submitted By: Shikha Singla MIT-872-2K11 M.Tech(2 nd Sem) Information Technology.

Context-Aware Query Classification Huanhuan Cao, Derek Hao Hu, Dou Shen, Daxin Jiang, Jian-Tao Sun, Enhong Chen, Qiang Yang Microsoft Research Asia SIGIR.

26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.

Divided Pretreatment to Targets and Intentions for Query Recommendation Reporter: Yangyang Kang /23.

Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:

Date: 2013/9/25 Author: Mikhail Ageev, Dmitry Lagun, Eugene Agichtein Source: SIGIR’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Improving Search Result.

A Framework to Predict the Quality of Answers with Non-Textual Features Jiwoon Jeon, W. Bruce Croft(University of Massachusetts-Amherst) Joon Ho Lee (Soongsil.

1 Random Walks on the Click Graph Nick Craswell and Martin Szummer Microsoft Research Cambridge SIGIR 2007.

A Framework for Detection and Measurement of Phishing Attacks Reporter: Li, Fong Ruei National Taiwan University of Science and Technology 2/25/2016 Slide.

Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.

To Personalize or Not to Personalize: Modeling Queries with Variation in User Intent Presented by Jaime Teevan, Susan T. Dumais, Daniel J. Liebling Microsoft.

Usefulness of Quality Click- through Data for Training Craig Macdonald, ladh Ounis Department of Computing Science University of Glasgow, Scotland, UK.

Opinion spam and Analysis 소프트웨어공학 연구실 G 최효린 1 / 35.

SEARCH AND CONTEXT Susan Dumais, Microsoft Research INFO 320.

Search User Behavior: Expanding The Web Search Frontier

Topics and Transitions: Investigation of User Search Behavior

Evidence from Behavior

Efficient Multiple-Click Models in Web Search

Interactive Information Retrieval

Presentation transcript:

Understanding Web Browsing Behaviors through Weibull Analysis of Dwell Time Chao Liu, Ryen White, Susan Dumais Microsoft Research at Redmond

Dwell Time as User Implicit Feedbacks  The most significant indicator of document relevance besides clickthroughs [Kelly and Belkin, SIGIR’01, SIGIR’04]  Leveraged in various applications  Learning to rank [Agichtein et al., SIGIR’06]  Query expansion [Buscher et al., SIGIR’09]  BrowseRank, assuming an exponential dist. [Liu et al., SIGIR’08]  …

Questions Addressed in this Study  Questions:  How do we model the dwell time distribution Pr(t|d)?  What does Pr(t|d) tell us about user browsing behaviors?  How is the distribution related to page-level features, and can we predict the distribution based on page-level features?  Takeaways  We propose to model Pr(t|d) using Weibull distributions  The fitted Weibull distribution exhibits a strong negative aging effect, which indicates a “screen-and-glean” browsing behavior  We can predict Pr(t|d) based on page features, which effectively extends the application of dwell time to scenarios where dwell time data is not available

Outline  A Primer on Weibull Analysis  Weibull distribution and analysis  Hazard function and aging effects  Weibull Analysis on Dwell Time  Goodness-of-Fit  Screen-and-glean browsing pattern  Screening by categories  Predicting Dwell Time Distribution  Prediction performance  Feature importance  Conclusions

Weibull Analysis  Weibull analysis is a method for modeling positive data sets, such as time-to-failure data  Predicting product life,  Comparing reliability of competing product designs  Establishing warranty policies or proactively managing spare parts inventories  Success beyond reliability engineering  Survival analysis, weather forecasting, fading channels in wireless communication, the length of labor strikes, AIDS mortality and earthquake probabilities, etc.  Unfortunately, no prior Weibull analysis on Web data although Web abounds with temporal data  Page dwell time, session length, time-to-first-click, etc

Weibull Distribution  2-parameter Weibull distribution  λ : scale parameter  k: shape parameter  Exponential dist. when k = 1

Weibull Analysis  Hazard function at time x  Instantaneous failure rate (or hazard rate) at time x  Amount of risk associated with an x-survivor at time x  Hazard function for Weibull distributions

Aging Effects from Hazard Function  k = 1: No aging  Constant failure rate  Exponential distribution  0<k<1: Negative aging  Decreasing failure rate  An initial screening has to be passed in order to survive longer  Smaller k means harsher screening  k > 1: Positive aging  Increasing failure rate  Little to no screening at the beginning but life becomes tougher as time goes by

Weibull Analysis on Dwell Time and Beyond  Web abounds with temporal data  Time to first click, session length, eye fixation, …  Weibull analysis is way beyond hazard functions  Failure forecasting, corrective actions, … Reliability Analysis Dwell Time AnalysisClick Analysis… Datatime-to-failureTime-to-abandonTime-to-first-click… HazardFailure rateAbandon rateClick rate… E(t|t>t 0 )Mean residual lifeMean residual time on page How soon to click… ……………

Outline  A Primer on Weibull Analysis  Weibull distribution and analysis  Hazard function and aging effects  Weibull Analysis on Dwell Time  Goodness-of-Fit  Screen-and-glean browsing pattern  Screening by categories  Predicting Dwell Time Distribution  Prediction performance  Feature importance  Conclusions

Goodness-of-Fit Comparison  Dwell time collected for 205,873 pages (URLs) in English (US) market, each of which has a minimum of 10k dwell times  Comparison on Goodness-of-Fit (GoF)  Dwell times for each page are split into training (80%) and testing (20%)  Model fitting on training and evaluated on testing  Metrics: Log-likelihood and Kolmogorov–Smirnov distance

Fitting λ and k Strong Negative Aging What’s the initial screening? Screen-and-glean browsing pattern?

P( k |Category): Aging Effect w.r.t. Categories Screening is harsher for less-entertaining topics

Outline  A Primer on Weibull Analysis  Weibull distribution and analysis  Hazard function and aging effects  Weibull Analysis on Dwell Time  Goodness-of-Fit  Screen-and-glean browsing pattern  Screening by categories  Predicting Dwell Time Distribution  Prediction performance  Feature importance  Conclusions

Dwell Time Prediction from Page Features  Why predicting dwell time?  Extend dwell time to pages with less or no dwell time  Enable third parties to leverage dwell time even if they don’t have access to real dwell time data  Gain insights into what elements affect dwell time  Why using only page-level features?  Users decide how long to stay with a page based on the experience and perception, rather than PageRank for example  Advanced features like PageRank and inlink counts may not be available to all parties

Experiment Setup  5000 randomly sampled pages with fitted λ and k as the target values  Pages are crawled using a dynamic crawler, which parses the html, executes all dynamic components (e.g., redirections, flashes, javascripts, etc), and finally renders the page  “login” pages are removed as they are likely due to time-out redirection  4771 pages left  Page-level features  HtmlTag: frequencies of 93 Html tags  Content: frequencies of top-1000 terms  Dynamic: statistics from dynamic crawling  Regressor: Multiple Additive Regression Tree (MART)  Effectiveness and feature interpretability

Baseline returns the mean λ and k Prediction Results  Comparisons with various feature configurations  Prediction outperforms the baseline  HtmlTag and Dynamic are similar effectively when separated, and complementary to each other when combined  Content > HtmlTag+Dynamic  Content+Dynamic the best: Dynamic captures what users experience after clicks whereas Content shows what users would see in the end

Important Features

Outline  A Primer on Weibull Analysis  Weibull distribution and analysis  Hazard function and aging effects  Weibull Analysis on Dwell Time  Goodness-of-Fit  Screen-and-glean browsing pattern  Screening by categories  Predicting Dwell Time Distribution  Prediction performance  Feature importance  Conclusions

Conclusions  The first Weibull analysis on Web dwell time  Draws an analogy between dwell time and lifetime  Opens the door to Weibull analysis for temporal implicit feedbacks  Dwell time exhibits a strong negative aging effect, which hints a prevalent “screen and glean” browsing pattern  Harsher screening for less-entertaining topics  Feasible to predict dwell time based on page-level features  Extending applicability to less-visited pages and parties without dwell time data  Future work  Improving prediction accuracy through better feature engineering  Weibull analysis for IR

Acknowledgments  Yutaka Suzue  Krysta Svore  Qiang Wu  Wen-tau Yih  Xiaoxin Yin  Alice Zheng

Q&A Thank You!