Discovering and Using Groups to Improve Personalized Search Jaime Teevan, Merrie Morris, Steve Bush Microsoft Research.

Slides:



Advertisements
Similar presentations
CWS: A Comparative Web Search System Jian-Tao Sun, Xuanhui Wang, § Dou Shen Hua-Jun Zeng, Zheng Chen Microsoft Research Asia University of Illinois at.
Advertisements

Surrey Libraries Computer Learning Centres January 2012 Internet Searching Teaching Script Totally New to Computers Internet Searching.
Introduction Lesson 1 Microsoft Office 2010 and the Internet
A Domain Level Personalization Technique A. Campi, M. Mazuran, S. Ronchi.
bellevue User profile: Content, interaction history.
Personalization and Search Jaime Teevan Microsoft Research.
Digital Firm (II) Soetam Rizky. Before we start……… Digital firm ? B2B, B2C, C2C ? E-Commerce advantages ? Prospectus of digital firm ?
PRODUCT FOCUS 4/14/14 – 4/25/14 INTRODUCTION Our Product Focus for the next two weeks is Microsoft Office 365. Office 365 is Microsoft’s most successful.
1 Learning User Interaction Models for Predicting Web Search Result Preferences Eugene Agichtein Eric Brill Susan Dumais Robert Ragno Microsoft Research.
Recommender Systems Aalap Kohojkar Yang Liu Zhan Shi March 31, 2008.
Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan Susan T.Dumains Eric Horvitz MIT,CSAILMicrosoft Researcher Microsoft.
Learning to Rank: New Techniques and Applications Martin Szummer Microsoft Research Cambridge, UK.
Lecture 14: Collaborative Filtering Based on Breese, J., Heckerman, D., and Kadie, C. (1998). Empirical analysis of predictive algorithms for collaborative.
Seesaw Personalized Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
Ryen W. White, Microsoft Research Jeff Huang, University of Washington.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Business-class messaging from [PROVIDER NAME] Business-Class Messaging built on Microsoft Exchange [PRODUCT NAME] A Business-Class Messaging Solution from.
Cohort Modeling for Enhanced Personalized Search Jinyun YanWei ChuRyen White Rutgers University Microsoft BingMicrosoft Research.
Educational Data Mining and DataShop John Stamper Carnegie Mellon University 1 9/12/2012 PSLC Corporate Partner Meeting 2012.
 Digital marketing: Uses digital media to develop communications and exchanges with customers  Electronic media (E-marketing): Refers to the strategic.
Section 2: Finding and Refinding Jaime Teevan Microsoft Research 1.
Finding and Re-Finding Through Personalization Jaime Teevan MIT, CSAIL David Karger (advisor), Mark Ackerman, Sue Dumais, Rob Miller (committee), Eytan.
Navigating the ASAE & The Center website: Essential resources for CEOs (select the arrow key to advance the presentation)
©2013 Microsoft Corporation. SharePoint Innovations LLC. All rights reserved. M ODULE 13 – S HARE P OINT 2010 O VERVIEW OF S HARE P OINT 2013.
Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.
Facets of Personalization Jaime Teevan Microsoft Research (CLUES) with S. Dumais, E. Horvitz, D. Liebling, E. Adar, J. Elsas, R. Hughes.
INTRODUCTION TO THE STATE OF MICHIGAN’S SHAREPOINT ENVIRONMENT.
User Browsing Graph: Structure, Evolution and Application Yiqun Liu, Yijiang Jin, Min Zhang, Shaoping Ma, Liyun Ru State Key Lab of Intelligent Technology.
Improving Web Search Ranking by Incorporating User Behavior Information Eugene Agichtein Eric Brill Susan Dumais Microsoft Research.
Tag Data and Personalized Information Retrieval 1.
Understanding Query Ambiguity Jaime Teevan, Susan Dumais, Dan Liebling Microsoft Research.
Implicit Acquisition of Context for Personalization of Information Retrieval Systems Chang Liu, Nicholas J. Belkin School of Communication and Information.
Enron Corpus: A New Dataset for Classification By Bryan Klimt and Yiming Yang CEAS 2004 Presented by Will Lee.
Wang-Chien Lee i Pervasive Data Access ( i PDA) Group Pennsylvania State University Mining Social Network Big Data Intelligent.
Google News Personalization: Scalable Online Collaborative Filtering
Giorgos Giannopoulos (IMIS/”Athena” R.C and NTU Athens, Greece) Theodore Dalamagas (IMIS/”Athena” R.C., Greece) Timos Sellis (IMIS/”Athena” R.C and NTU.
Personalized Search Xiao Liu
Recommending Twitter Users to Follow Using Content and Collaborative Filtering Approaches John HannonJohn Hannon, Mike Bennett, Barry SmythBarry Smyth.
The 5 W’s of Collaborative Search Jaime Teevan and Merrie Morris Microsoft Research.
Personalizing Search Jaime Teevan, MIT Susan T. Dumais, MSR and Eric Horvitz, MSR.
Personalizing Web Search using Long Term Browsing History Nicolaas Matthijs, Cambridge Filip Radlinski, Microsoft In Proceedings of WSDM
Personalization with user’s local data Personalizing Search via Automated Analysis of Interests and Activities 1 Sungjick Lee Department of Electrical.
COLLABORATIVE SEARCH TECHNIQUES Submitted By: Shikha Singla MIT-872-2K11 M.Tech(2 nd Sem) Information Technology.
Pairwise Preference Regression for Cold-start Recommendation Speaker: Yuanshuai Sun
Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.
Supporting Knowledge Discovery: Next Generation of Search Engines Qiaozhu Mei 04/21/2005.
Post-Ranking query suggestion by diversifying search Chao Wang.
Cs Future Direction : Collaborative Filtering Motivating Observations:  Relevance Feedback is useful, but expensive a)Humans don’t often have time.
Information Design Trends Unit Five: Delivery Channels Lecture 2: Portals and Personalization Part 2.
Adaptive Faceted Browsing in Job Offers Danielle H. Lee
Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.
Personalizing Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
To Personalize or Not to Personalize: Modeling Queries with Variation in User Intent Presented by Jaime Teevan, Susan T. Dumais, Daniel J. Liebling Microsoft.
Introduction Web analysis includes the study of users’ behavior on the web Traffic analysis – Usage analysis Behavior at particular website or across.
1 Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan, MIT Susan T. Dumais, Microsoft Eric Horvitz, Microsoft SIGIR 2005.
1 DATA-DRIVEN SOLUTIONS. 2 KEYWORD-LEVEL SEARCH RETARGETING TARGET USERS BASED ON THEIR RECENT SEARCH HISTORY AND SEARCH QUERIES. A user performs a search.
HRM 326 TUTOR The power of possibility/hrm326tutordotcom.
Seesaw Personalized Web Search Jaime Teevan, MIT with Susan T. Dumais and Eric Horvitz, MSR.
Potential for Personalization Transactions on Computer-Human Interaction, 17(1), March 2010 Data Mining for Understanding User Needs Jaime Teevan, Susan.
Collaborative Filtering With Decoupled Models for Preferences and Ratings Rong Jin 1, Luo Si 1, ChengXiang Zhai 2 and Jamie Callan 1 Language Technology.
CIS 170 MART Teaching Effectively/cis170mart.com FOR MORE CLASSES VISIT HRM 326 MART Inspiring Minds/hrm326mart.com FOR MORE CLASSES.
SEARCH AND CONTEXT Susan Dumais, Microsoft Research INFO 320.
Planning Engagement Kickoff
Recommender Systems & Collaborative Filtering
Office 365 FastTrack Planning Engagement Kickoff
Simultaneous Support for Finding and Re-Finding
Presented by Web Services Propeller Studio
Search Engine comparison
Presented by Web Services Propeller Studio
Author: Kazunari Sugiyama, etc. (WWW2004)
Presentation transcript:

Discovering and Using Groups to Improve Personalized Search Jaime Teevan, Merrie Morris, Steve Bush Microsoft Research

Diego Velasquez, Las Lanzas

People Express Things Differently Differences can be a challenge for Web search – Picture of a man handing over a key. – Oil painting of the surrender of Breda.

People Express Things Differently Differences can be a challenge for Web search – Picture of a man handing over a key. – Oil painting of the surrender of Breda. Personalization – Closes the gap using more about the person Groupization – Closes the gap using more about the group

How to Take Advantage of Groups? Who do we share interests with? Do we talk about things similarly? What algorithms should we use?

Related Work Personalization – Implicit information valuable [Dou et al. 2007; Shen et al. 2005] – More data = better performance [Teevan et al. 2005] Collaborative filtering & recommender systems – Identify related groups Browsed pages [Almeida & Almeida 2004; Sugiyama et al. 2005] Queries [Freyne & Smyth 2006; Lee 2005] Location [Mei & Church 2008], company [Smyth 2007], etc. – Use group data to fill in missing personal data Typically data based on user behavior

Who do we share interests with? – Similarity in query selection – Similarity in what is considered relevant Do we talk about things similarly? – Similarity in user profile What algorithms should we use? – Groupize results using groups of user profiles – Evaluate using groups’ relevance judgments Who do we share interests with? – Similarity in query selection – Similarity in what is considered relevant Do we talk about things similarly? – Similarity in user profile What algorithms should we use? – Groupize results using groups of user profiles – Evaluate using groups’ relevance judgments Who do we share interests with? – Similarity in query selection – Similarity in what is considered relevant Do we talk about things similarly? – Similarity in user profile What algorithms should we use? – Groupize results using groups of user profiles – Evaluate using groups’ relevance judgments Who do we share interests with? – Similarity in query selection – Similarity in what is considered relevant Do we talk about things similarly? – Similarity in user profile What algorithms should we use? – Groupize results using groups of user profiles – Evaluate using groups’ relevance judgments How We Answered the Questions

Interested in Many Group Types Group longevity – Task-based – Trait-based Group identification – Explicit – Implicit Task-basedTrait-based Longevity Identification ImplicitExplicit Task Age Gender Job team Job role LocationInterest group Relevance judgments Query selectionDesktop content

People Studied Trait-based dataset 110 people – Work – Interests – Demographics Microsoft employees Task-based dataset 10 groups x 3 (= 30) Know each other Have common task – “Find economic pros and cons of telecommuting” – “Search for information about companies offering learning services to corporate customers”

Queries Studied Trait-based dataset Challenge – Overlapping queries – Natural motivation Queries picked from 12 – Work c# delegates, live meeting – Interests bread recipes, toilet train dog Task-based dataset Common task – Telecommuting v. office pros and cons of working in an office social comparison telecommuting versus office telecommuting working at home cost benefit

Data Collected Queries evaluated Explicit relevance judgments – results – Personal relevance Highly relevant Relevant Not relevant User profile: Desktop index

Answering the Questions Who do we share interests with? Do we talk about things similarly? What algorithms should we use?

Who do we share interests with? Variation in query selection – Work groups selected similar work queries – Social groups selected similar social queries Variation in relevance judgments – Judgments varied greatly (κ=0.08) – Task-based groups most similar – Similar for one query ≠ similar for another

In task groupNot in groupDifference % In task groupNot in groupDifference All queries % Group queries % Do we talk about things similarly? Group profile similarity – Members more similar to each other than others – Most similar for aspects related to the group Clustering profiles recreates groups Index similarity ≠ judgment similarity – Correlation coefficient of 0.09

What algorithms should we use? Calculate personalized score for each member – Content: User profile as relevance feedback – Behavior: Previously visited URLs and domains – [Teevan et al. 2005] Sum personalized scores across group Produces same ranking for all members (r i +0.5)(N-n i -R+r i +0.5) (n i -r i +0.5)(R-r i +0.5) tf i log Σ terms i

Performance: Task-Based Groups Personalization improves on Web Groupization gains +5% Web Personalized Groupized

Performance: Task-Based Groups Personalization improves on Web Groupization gains +5% Split by query type – On-task v. off-task – Groupization the same as personalization for off-task queries – 11% improvement for on-task queries Off-task queriesOn-task queries Web Personalized Groupized

Performance: Trait-Based Groups Groupization Personalization InterestsWork

Performance: Trait-Based Groups Groupization Personalization Work queries Interest queries InterestsWork

Performance: Trait-Based Groups Groupization Personalization Work queries Interest queries InterestsWork

What We Learned Who do we share interests with? – Depends on the task Do we talk about things similarly? – Variation in profiles even with similar judgments What algorithms should we use? – Groupization can take advantage of variation for group-related tasks

Thank you. Jaime Teevan, Merrie Morris, Steve Bush Microsoft Research

Groupization Performance

Related Work: Collaborative Search People collaborate on search – Students [Twidale et al. 1997], professionals [Morris 2008] – Tasks: Travel, shopping, research, school work Systems to support collaborative search – SearchTogether [Morris & Horvitz 2007] – Cerchiamo [Pickens et al. 2008] – CoSearch [Amershi & Morris 2008] – People form explicit task-based groups

Related Work: Algorithms Personalization – Implicit information valuable [Dou et al. 2007; Shen et al. 2005] – More data = better performance [Teevan et al. 2005] Collaborative filtering & recommender systems – Identify related groups Browsed pages [Almeida & Almeida 2004; Sugiyama et al. 2005] Queries [Freyne & Smyth 2006; Lee 2005] Location [Mei & Church 2008], company [Smyth 2007], etc. – Use group data to fill in missing personal data Typically data based on user behavior

Identifying Groups Explicitly – Tasks: Tools for collaboration [Morris & Horvitz 2007] – Traits: Profiles Implicitly – Interests: Sites visited, queries – Tasks: Query – Location: IP address [Mei & Church 2008] – Gender: Queries [Jones et al. 2007] – Interesting area to explore: Social networks