CMPT 884, SFU, Martin Ester, 1-09 104 Recommender Systems Martin Ester Simon Fraser University School of Computing Science CMPT 884 Spring 2009.

Slides:



Advertisements
Similar presentations
Recommender System A Brief Survey.
Advertisements

Recommender Systems & Collaborative Filtering
Google News Personalization: Scalable Online Collaborative Filtering
Item Based Collaborative Filtering Recommendation Algorithms
A Graph-based Recommender System Zan Huang, Wingyan Chung, Thian-Huat Ong, Hsinchun Chen Artificial Intelligence Lab The University of Arizona 07/15/2002.
Collaborative Filtering Sue Yeon Syn September 21, 2005.
Jeff Howbert Introduction to Machine Learning Winter Collaborative Filtering Nearest Neighbor Approach.
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
COLLABORATIVE FILTERING Mustafa Cavdar Neslihan Bulut.
A. Darwiche Learning in Bayesian Networks. A. Darwiche Known Structure Complete Data Known Structure Incomplete Data Unknown Structure Complete Data Unknown.
Learning to Recommend Hao Ma Supervisors: Prof. Irwin King and Prof. Michael R. Lyu Dept. of Computer Science & Engineering The Chinese University of Hong.
1 Combining Provenance with Trust in Social Networks for Semantic Web Content Filtering Jennifer Golbeck University of Maryland, College Park May, 2006.
Intro to RecSys and CCF Brian Ackerman 1. Roadmap Introduction to Recommender Systems & Collaborative Filtering Collaborative Competitive Filtering 2.
2. Introduction Multiple Multiplicative Factor Model For Collaborative Filtering Benjamin Marlin University of Toronto. Department of Computer Science.
SFU, CMPT 741, Fall 2009, Martin Ester 418 Outlook Outline Trends in KDD research Graph mining and social network analysis Recommender systems Information.
Item-based Collaborative Filtering Idea: a user is likely to have the same opinion for similar items [if I like Canon cameras, I might also like Canon.
Mohsen Jamali, Martin Ester Simon Fraser University Vancouver, Canada ACM RecSys 2010.
Using a Trust Network To Improve Top-N Recommendation
EigenTaste: A Constant Time Collaborative Filtering Algorithm Ken Goldberg Students: Theresa Roeder, Dhruv Gupta, Chris Perkins Industrial Engineering.
TrustWalker: A Random Walk Model for Combining Trust-based and Item-based Recommendation Mohsen Jamali & Martin Ester Simon Fraser University, Vancouver,
1 Collaborative Filtering and Pagerank in a Network Qiang Yang HKUST Thanks: Sonny Chee.
Top-N Recommendation Algorithm Based on Item-Graph
Recommendations via Collaborative Filtering. Recommendations Relevant for movies, restaurants, hotels…. Recommendation Systems is a very hot topic in.
Computing Trust in Social Networks
Customizable Bayesian Collaborative Filtering Denver Dash Big Data Reading Group 11/19/2007.
Collaborative Ordinal Regression Shipeng Yu Joint work with Kai Yu, Volker Tresp and Hans-Peter Kriegel University of Munich, Germany Siemens Corporate.
CS 277: Data Mining Recommender Systems
Chapter 12 (Section 12.4) : Recommender Systems Second edition of the book, coming soon.
Item-based Collaborative Filtering Recommendation Algorithms
Performance of Recommender Algorithms on Top-N Recommendation Tasks
Recommender Systems. >1,000,000,000 Finding Trusted Information How many cows in Texas?
Mark Levene, An Introduction to Search Engines and Web Navigation © Pearson Education Limited 2005 Slide 9.1 Chapter 9 : Social Networks What is a social.
Distributed Networks & Systems Lab. Introduction Collaborative filtering Characteristics and challenges Memory-based CF Model-based CF Hybrid CF Recent.
Item Based Collaborative Filtering Recommendation Algorithms Badrul Sarwar, George Karpis, Joseph KonStan, John Riedl (UMN) p.s.: slides adapted from:
Collaborative Filtering: A Tutorial (abridged version of tutorial from my Web page, given at Dimacs W/S in 2003?) William W. Cohen Machine Learning Dept.
LOGO Recommendation Algorithms Lecturer: Dr. Bo Yuan
1 Recommender Systems and Collaborative Filtering Jon Herlocker Assistant Professor School of Electrical Engineering and Computer Science Oregon State.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
Google News Personalization: Scalable Online Collaborative Filtering
Recommender Systems David M. Pennock NEC Research Institute contributions: John Riedl, GroupLens University of Minnesota.
1 Recommender Systems Collaborative Filtering & Content-Based Recommending.
1 Social Networks and Collaborative Filtering Qiang Yang HKUST Thanks: Sonny Chee.
EigenRank: A Ranking-Oriented Approach to Collaborative Filtering IDS Lab. Seminar Spring 2009 강 민 석강 민 석 May 21 st, 2009 Nathan.
Collaborative Filtering  Introduction  Search or Content based Method  User-Based Collaborative Filtering  Item-to-Item Collaborative Filtering  Using.
Badrul M. Sarwar, George Karypis, Joseph A. Konstan, and John T. Riedl
The Effect of Dimensionality Reduction in Recommendation Systems
1 Computing Trust in Social Networks Huy Nguyen Lab seminar April 15, 2011.
1 Collaborative Filtering & Content-Based Recommending CS 290N. T. Yang Slides based on R. Mooney at UT Austin.
Recommender Systems Debapriyo Majumdar Information Retrieval – Spring 2015 Indian Statistical Institute Kolkata Credits to Bing Liu (UIC) and Angshul Majumdar.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
Recommender Systems. Recommender Systems (RSs) n RSs are software tools providing suggestions for items to be of use to users, such as what items to buy,
Announcements Milestone 2 – autolab disk quota issues: – classifier implementation issues: Almost any Weka classifier can be used on our datasets … with.
Recommender Systems with Social Regularization Hao Ma, Dengyong Zhou, Chao Liu Microsoft Research Michael R. Lyu The Chinese University of Hong Kong Irwin.
Collaborative Filtering via Euclidean Embedding M. Khoshneshin and W. Street Proc. of ACM RecSys, pp , 2010.
Online Evolutionary Collaborative Filtering RECSYS 2010 Intelligent Database Systems Lab. School of Computer Science & Engineering Seoul National University.
User Modeling and Recommender Systems: recommendation algorithms
Item-Based Collaborative Filtering Recommendation Algorithms Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl GroupLens Research Group/ Army.
Learning in Bayesian Networks. Known Structure Complete Data Known Structure Incomplete Data Unknown Structure Complete Data Unknown Structure Incomplete.
Presented By: Madiha Saleem Sunniya Rizvi.  Collaborative filtering is a technique used by recommender systems to combine different users' opinions and.
Collaborative Filtering - Pooja Hegde. The Problem : OVERLOAD Too much stuff!!!! Too many books! Too many journals! Too many movies! Too much content!
Analysis of massive data sets Prof. dr. sc. Siniša Srbljić Doc. dr. sc. Dejan Škvorc Doc. dr. sc. Ante Đerek Faculty of Electrical Engineering and Computing.
Hao Ma, Dengyong Zhou, Chao Liu Microsoft Research Michael R. Lyu
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
Trust-aware Recommender Systems
Recommender Systems & Collaborative Filtering
Methods and Metrics for Cold-Start Recommendations
Adopted from Bin UIC Recommender Systems Adopted from Bin UIC.
Collaborative Filtering Nearest Neighbor Approach
ITEM BASED COLLABORATIVE FILTERING RECOMMENDATION ALGORITHEMS
Recommendation Systems
Presentation transcript:

CMPT 884, SFU, Martin Ester, Recommender Systems Martin Ester Simon Fraser University School of Computing Science CMPT 884 Spring 2009

CMPT 884, SFU, Martin Ester, Recommender Systems Outline Introduction motivation, applications, issues Collaborative filtering user-based, item-based, challenges Trust-based recommendation deterministic, random walks, challenges Model-based recommendation  [Konstan 2008] [Cohen 2002]

CMPT 884, SFU, Martin Ester, Recommender Systems Introduction search engine users just type in a few keywords search engine overwhelms user with a flood of results ranking mechanism based on similarity between query keywords and web pages and on prestige of pages search engine‘s answers do not take into account user feedback and users‘ preferences  Information needs more complex than keywords or topics: quality and taste

CMPT 884, SFU, Martin Ester, Recommender Systems Introduction Users are not willing to spend a lot of time to specify their personal information needs Recommender systems automatically identify relevant information or products relevant for a given user, learning from available data Data can be transactions of all users / customers of a website or profile of an individual user  users who bought this book also bought... (Amazon.com)

CMPT 884, SFU, Martin Ester, Recommender Systems Personalization Level Generic everyone receives same recommendations Demographic matches a demographic group Personalized matches an individual, everybody gets different recommendations Ephemeral matches current activity Persistent matches long-term interests

CMPT 884, SFU, Martin Ester, Recommender Systems Types of Systems Filtering interfaces filters, clipping services Recommendation interfaces suggestion lists, “top-n,” offers and promotions Prediction interfaces evaluate candidates, predicted ratings

CMPT 884, SFU, Martin Ester, Recommender Systems Collaborative Filtering Main idea users rate items users are correlated with other users personal predictions for unrated items Nearest-Neighbor Approach find people with history of agreement aggregate their ratings to predict rating of user assume stable tastes  employs data about the target user and other users

CMPT 884, SFU, Martin Ester, Recommender Systems Targetuser Aggregator Prediction

CMPT 884, SFU, Martin Ester, Recommender Systems Collaborative Filtering

CMPT 884, SFU, Martin Ester, Recommender Systems Collaborative Filtering Recommendation task 1 Predicting the rating on a target item for a given user Predicting John’s rating on Star Wars Movie movie1 ?? Recommender

CMPT 884, SFU, Martin Ester, Recommender Systems Collaborative Filtering Recommendation task 2 Recommending a list of items to a given user Recommending a list of movies to John for watching List of Top Movies ?? Recommender Movie 1Movie 2Movie 3

CMPT 884, SFU, Martin Ester, Recommender Systems Applications Movie recommendations Book recommendations Recommendation of friends

CMPT 884, SFU, Martin Ester, Recommender Systems Privacy and Trustworthiness Who knows what about me? – personal information revealed – identity Is the recommendation honest? – biases built-in by operator e.g. want to sell „old hats“ or prefers ads with higher bids Vulnerability to external manipulation (fraud) - insert fraudulent user profiles which rate my product highly

CMPT 884, SFU, Martin Ester, Collaborative Filtering Introduction Rating Matrix Users Items Ratings What is Joe’s rating of Blimp and of RockyXV? Similar user

CMPT 884, SFU, Martin Ester, Collaborative Filtering Example

CMPT 884, SFU, Martin Ester, Collaborative Filtering Definitions v i,j : vote of user i on item j I i = items for which user i has voted mean vote of user i is predicted vote for active user a on target item j is weighted sum of votes on j by n “similar” users normalizer weights of n similar users

CMPT 884, SFU, Martin Ester, Collaborative Filtering Definitions K-nearest neighbor Pearson correlation coefficient Cosine distance

CMPT 884, SFU, Martin Ester, Collaborative Filtering Evaluation [Herlocker 2004] split users into train/test sets for each user a in the test set: - split a’s votes into observed (I) and to-predict (P) - measure average absolute deviation between predicted and actual votes in P - alternatively, measure the squared deviation predicted and actual votes in P average error measure over all test users  MAE or RMSE

CMPT 884, SFU, Martin Ester, Collaborative Filtering Evaluation There is a trade-off between precision and recall Measure also the recall / coverage, i.e. the percentage of ( a, i ) pairs for which method can make a recommendation F-measure considers both precision and recall Max squared error

CMPT 884, SFU, Martin Ester, Collaborative Filtering Evaluation so far, only comparison against ground truth in industry, want to measure the business profit user surveys in an online system measure click through rates measure add-on sales

CMPT 884, SFU, Martin Ester, Collaborative Filtering Challenges user item rating matrix is very sparse typically 99% of the entries unknown  dimensionality reduction  item-item based CF cannot make (accurate) recommendations for cold start users users who have recently joined the system and have rated only very few items (typically, 50% of users)  trust-based recommendation

CMPT 884, SFU, Martin Ester, Collaborative Filtering Challenges the larger the user community - the more variance among the ratings - the more the ratings converge to the mean value  cluster users and use only the corresponding cluster to make a recommendation cannot compute the confidence of a recommendation system does not know its limits  probabilistic methods vulnerable to fraud copy a user profile and become the most similar user  trust-based recommendation

CMPT 884, SFU, Martin Ester, Collaborative Filtering Challenges need to explain recommendations how to reward serendipity in the evaluation? recommendations should not all be of the same kind how to evaluate a set of recommendations? how to produce the best sequence of recommendations?

CMPT 884, SFU, Martin Ester, Collaborative Filtering

CMPT 884, SFU, Martin Ester, Collaborative Filtering  leads to a denser rating, lower-dimensional matrix  can alternatively use Singular Value Decomposition (SVD) or Latent Semantic Indexing (LSI)

CMPT 884, SFU, Martin Ester, Collaborative Filtering Item-Item Collaborative Filtering [Sarwar et al 2001] Many applications have many more users (customers) than items (products) Many customers have no similar customers Most products have similar products Make recommendation by considering ratings of active user for similar products

CMPT 884, SFU, Martin Ester, Collaborative Filtering Item-Item Collaborative Filtering Aggregator Prediction

CMPT 884, SFU, Martin Ester, Collaborative Filtering Explanations Simple visual representations of neighbors ratings Statement of strong previous performance “MovieLens has predicted correctly 80% of the time for you”

CMPT 884, SFU, Martin Ester, Collaborative Filtering Explanations Complex representations are not accepted by users, e.g. - more than one dimension - any use of statistical terminology such as correlation, variance, etc.

CMPT 884, SFU, Martin Ester, Trust-based Recommendation Introduction Users tend to trust ratings given by their trusted friends Trust is propagated in the social network Trust is transitive (to a certain degree) and asymmetric Use neighborhood of (directly or indirectly) trusted friends to find reliable ratings and make a recommendation Can make recommendations for cold start users as long as they are somehow connected to the network More robust to fraud

CMPT 884, SFU, Martin Ester, Trust-based Recommendation Introduction

CMPT 884, SFU, Martin Ester, Trust-based Recommendation Definitions r i,j : rating of user i for item j Trust network : graph G = (U,T) where U is a set of nodes (users) and T is a set of edges (trust relationships) Edges can be weighted, but typically they are not Trust relationships can be explicitly stated by users (e.g., Epinions.com) or be implicitly derived from observed interactions between users (e.g., MSN network)

CMPT 884, SFU, Martin Ester, Trust-based Recommendation Definitions for users i and j which are connected via T, the indirect trust between i and j is defined via some trust model, based on the direct trust values raters : all users that have rated target item i trusted raters: all raters that are trusted by active user u (to a certain degree)

CMPT 884, SFU, Martin Ester, Trust-based Recommendation Definitions and f is a function comuting the trust model recommendation by aggregating the ratings of k trusted raters u

CMPT 884, SFU, Martin Ester, Trust-based Recommendation Issues How to compute the indirect trust? How many of the trusted raters to consider? Which ones? If using too few, the prediction is not based on a significant number or rates. If using too many, these raters may only be weakly trusted. In a large trust network, need to consider also the efficiency of exploring the trust network.

CMPT 884, SFU, Martin Ester, Trust-based Recommendation TidalTrust [Golbeck 2005] most accurate information will come from the highest trusted neighbors in principle, each node should consider only its neighbors with highest trust rating but different nodes have different max trust among their neighbors, which would lead to different levels of trust in different parts of the network max: largest trust value such that a path can be found from source to sink with all t ij >= max define indirect trust recursively

CMPT 884, SFU, Martin Ester, Trust-based Recommendation MoleTrust [Massa et al 2007] trust model similar to TidalTrust major difference in the set of trusted raters considered both, TidalTrust and MoleTrust perform a breadth-first search of the trust network TidalTrust considers all raters at the minimum depth (shortest path distance from the active user) MoleTrust considers all raters up to a specified maximum depth

CMPT 884, SFU, Martin Ester, Trust-based Recommendation Discussion TidalTrust is likely to find only very few raters MoleTrust may consider too many raters TidalTrust ignores the actual ratings and their distribution MoleTrust even ignores the actual distribution of the raters  maximum depth independent of a and i

CMPT 884, SFU, Martin Ester, Trust-based Recommendation Random Walks [Andersen et al 2008] perform a random walk in the trust network starting from user a if current user u has rating for item i, return it otherwise, choose a trusted neighbor v randomly with probability proportional to t u,v and go to v terminate as soon as rating found or some specified maxdepth reached repeat random walks until the average aggregated rating converges use the aggregated rating as recommendation  termination depends on distribution of raters and ratings

CMPT 884, SFU, Martin Ester, Trust-based Recommendation Experimental Evaluation Epinions dataset products rated on a scale of [1.. 5] explicit trust network (binary)  epinions.com Distinguish cold start users and all users Comparison of various CF and trust-based methods Item based 0 /.4 /.8: considers only items with similarity at least 0 /.4 /.8 Random Walk 1 / 6: considers trusted raters up to depth 1 / 6

CMPT 884, SFU, Martin Ester, Trust-based Recommendation Experimental Evaluation all trust-based methods greatly improve the coverage of CF methods they also have very competitive RMSE

CMPT 884, SFU, Martin Ester, Trust-based Recommendation Experimental Evaluation all methods perform much better on all users than on cold start users only the gain of trust-based methods is not so significant

CMPT 884, SFU, Martin Ester, Model-based Recommendation Introduction [Cohen 2002] so far: memory-based methods CF, trust-based recommendation no training of a model model-based approaches to CF: 1) CF as density estimation 2) CF and content-based recommendation as classification

CMPT 884, SFU, Martin Ester, Model-based Recommendation CF as Density Estimation [Horvitz et al 1998] estimate Pr(R ij =k) for each user i, movie j, and rating k use all available data to build model for this estimator R ij AirplaneMatrixRoom with a View... Hidalgo Joe Carol 8?9…?... Kumar 93?…6

CMPT 884, SFU, Martin Ester, Model-based Recommendation CF as Density Estimation a simple model  same model for all users

CMPT 884, SFU, Martin Ester, Model-based Recommendation CF as Density Estimation a more complex model group users into M “clusters”: c(1),..., c(M)  same model for all users within a group estimate by counts

CMPT 884, SFU, Martin Ester, Model-based Recommendation CF as Density Estimation group users into clusters using Expectation-Maximization: - randomly initialize Pr(R m,j =k) for each m i.e., initialize the clusters differently somehow - E-Step: estimate Pr(user i in cluster m ) for each i,m - M-Step: find maximum likelihood (ML) estimator for R ij within each cluster m use ratio of #(users i in cluster m with rating R ij =k) to #(user i in cluster m ), weighted by Pr( i in m) from E-step - repeat E-step, M-step until convergence

CMPT 884, SFU, Martin Ester, Model-based Recommendation CF as Classification [Basu et al, 1998] Classification task: map (user,movie) pair into {likes,dislikes} Training data : known likes/dislikes, test data : active users Features : any properties of user/movie pair AirplaneMatrixRoom with a View... Hidalgo comedyactionromance... action Joe 27,M,70k 1101 Carol 53,F,20k Kumar 25,M,22k 1001 UaUa 48,M,81k 01???

CMPT 884, SFU, Martin Ester, Model-based Recommendation CF as Classification e.g., moviesLikedByUser(Joe) = {Airplane,Matrix,...,Hidalgo} age(Joe)=27, income(Joe)=70k, genre(Matrix)=action, director(Matrix) =.. AirplaneMatrixRoom with a View... Hidalgo comedyactionromance... action Joe 27,M,70k 1101 Carol 53,F,20k Kumar 25,M,22k 1001 UaUa 48,M,81k 01???

CMPT 884, SFU, Martin Ester, Model-based Recommendation CF as Classification AirplaneMatrixRoom with a View... Hidalgo comedyactionromance... action Joe 27,M,70k 1101 Carol 53,F,20k Kumar 25,M,22k 1001 UaUa 48,M,81k 01??? genre={romance}, age=48, sex=male, income=81k, usersWhoLikedMovie={Carol}, moviesLikedByUser={Matrix,Airplane},...

CMPT 884, SFU, Martin Ester, Model-based Recommendation CF as Classification AirplaneMatrixRoom with a View... Hidalgo comedyactionromance... action Joe 27,M,70k 1101 Carol 53,F,20k Kumar 25,M,22k 1001 UaUa 48,M,81k 01??? genre={action}, age=48, sex=male, income=81k, usersWhoLikedMovie = {Joe,Kumar}, moviesLikedByUser={Matrix,Airplane},...

CMPT 884, SFU, Martin Ester, Model-based Recommendation CF as Classification Classification algorithm RIPPER (rule learner) Sample classification rules if NakedGun33/13 moviesLikedByUser(U) and Joe usersWhoLikedMovie(M) and genre(M)=comedy then likes(U,M) if age(U)>12 and age(U)<17 and HolyGrail moviesLikedByUser(U) and director(M) =MelBrooks then likes(U,M) if Ishtar moviesLikedByUser(U) then likes(U,M)

CMPT 884, SFU, Martin Ester, Model-based Recommendation CF as Classification features - collaborative: UsersWhoLikedMovie, UsersWhoDislikedMovie, MoviesLikedByUser - content: Actors, Directors, Genre, MPAA rating,... - hybrid: ComediesLikedByUser, DramasLikedByUser, UsersWhoLikedFewDramas,... predict liked(U,M) for the M in top quartile of U’s ranking for different feature sets evaluate recall and precision w.r.t. actual (U,M) pairs

CMPT 884, SFU, Martin Ester, Model-based Recommendation CF as Classification precision at same level of recall (about 33%) RIPPER with collaborative features only performs worse than memory-based CF by about 5 pts precision (73% vs. 78%) RIPPER with hybrid features performs better than memory- based CF by about 5 pts precision (83% vs. 78%)

CMPT 884, SFU, Martin Ester, Recommender Systems References R. Andersen, C. Borgs, J. Chayes, U. Feige, A. Flaxman, A. Kalai, V. Mirrokni, and M. Tennenholtz: Trust-based recommendation systems: an axiomatic approach, WWW 2008 Chumki Basu, Haym Hirsh, and William W. Cohen: Recommendation as Classification: Using Social and Content-Based Information in Recommendation, AAAI 1998 William Cohen: Collaborative Filtering, Tutorial DIMACS Workshop, 2002 Jennifer Golbeck: Computing and Applying Trust in Web-based Social Networks, PhD Thesis, University of Maryland College Park, 2005 J. Herlocker et al.: Evaluating Collaborative Filtering Recommender Systems, ACM Transactions on Information Systems, Jan. 2004

CMPT 884, SFU, Martin Ester, Recommender Systems References Eric Horvitz, Jack S. Breese, David Heckerman, David Hovel, Koos Rommelse: The Lumière Project: Bayesian User Modeling for Inferring the Goals and Needs of Software Users, UAI 1998 Joseph A. Konstan: Introduction to Recommender Systems, Tutorial SIGMOD 2008 Paolo Massa, Paolo Avesani: Trust-aware Recommender Systems, ACM RecSys 2007 Paul Resnick, Neophytos Iacovou, Mitesh Suchak, Peter Bergstrom, John Riedl: GroupLens: An Open Architecture for Collaborative Filtering of Netnews, ACM Conference on Computer Supported Cooperative Work, 1994 Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl: ItemBased Collaborative Filtering Recommendation Algorithms, WWW 2001