1 Collaborative Filtering Rong Jin Department of Computer Science and Engineering Michigan State University.

Slides:



Advertisements
Similar presentations
Google News Personalization: Scalable Online Collaborative Filtering
Advertisements

1 Information Filtering Rong Jin. 2 Outline Brief introduction to information filtering Collaborative filtering Adaptive filtering.
1 Machine Learning Spring 2010 Rong Jin. 2 CSE847 Machine Learning Instructor: Rong Jin Office Hour: Tuesday 4:00pm-5:00pm Thursday 4:00pm-5:00pm Textbook.
Item Based Collaborative Filtering Recommendation Algorithms
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Jeff Howbert Introduction to Machine Learning Winter Collaborative Filtering Nearest Neighbor Approach.
1 RegionKNN: A Scalable Hybrid Collaborative Filtering Algorithm for Personalized Web Service Recommendation Xi Chen, Xudong Liu, Zicheng Huang, and Hailong.
COLLABORATIVE FILTERING Mustafa Cavdar Neslihan Bulut.
A. Darwiche Learning in Bayesian Networks. A. Darwiche Known Structure Complete Data Known Structure Incomplete Data Unknown Structure Complete Data Unknown.
Active Learning and Collaborative Filtering
George Lee User Context-based Service Control Group
6/2/ An Automatic Personalized Context- Aware Event Notification System for Mobile Users George Lee User Context-based Service Control Group Network.
2. Introduction Multiple Multiplicative Factor Model For Collaborative Filtering Benjamin Marlin University of Toronto. Department of Computer Science.
Evaluating Search Engine
1 Machine Learning Spring 2013 Rong Jin. 2 CSE847 Machine Learning  Instructor: Rong Jin  Office Hour: Tuesday 4:00pm-5:00pm TA, Qiaozi Gao, Thursday.
Rubi’s Motivation for CF  Find a PhD problem  Find “real life” PhD problem  Find an interesting PhD problem  Make Money!
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
Lecture 14: Collaborative Filtering Based on Breese, J., Heckerman, D., and Kadie, C. (1998). Empirical analysis of predictive algorithms for collaborative.
1 Collaborative Filtering and Pagerank in a Network Qiang Yang HKUST Thanks: Sonny Chee.
1 Machine Learning Spring 2010 Rong Jin. 2 CSE847 Machine Learning  Instructor: Rong Jin  Office Hour: Tuesday 4:00pm-5:00pm Thursday 4:00pm-5:00pm.
Modeling User Rating Profiles For Collaborative Filtering
Expectation Maximization Algorithm
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Collaborative Ordinal Regression Shipeng Yu Joint work with Kai Yu, Volker Tresp and Hans-Peter Kriegel University of Munich, Germany Siemens Corporate.
Collaborative Filtering Rong Jin Dept. of Computer Science and Engineering Michigan State University.
Recommender systems Ram Akella November 26 th 2008.
Algorithms for Efficient Collaborative Filtering Vreixo Formoso Fidel Cacheda Víctor Carneiro University of A Coruña (Spain)
Combining Content-based and Collaborative Filtering Department of Computer Science and Engineering, Slovak University of Technology
Chapter 12 (Section 12.4) : Recommender Systems Second edition of the book, coming soon.
Item-based Collaborative Filtering Recommendation Algorithms
1 Information Filtering & Recommender Systems (Lecture for CS410 Text Info Systems) ChengXiang Zhai Department of Computer Science University of Illinois,
Group Recommendations with Rank Aggregation and Collaborative Filtering Linas Baltrunas, Tadas Makcinskas, Francesco Ricci Free University of Bozen-Bolzano.
Presented By :Ayesha Khan. Content Introduction Everyday Examples of Collaborative Filtering Traditional Collaborative Filtering Socially Collaborative.
Google News Personalization: Scalable Online Collaborative Filtering
1 Recommender Systems Collaborative Filtering & Content-Based Recommending.
Learning Geographical Preferences for Point-of-Interest Recommendation Author(s): Bin Liu Yanjie Fu, Zijun Yao, Hui Xiong [KDD-2013]
EigenRank: A Ranking-Oriented Approach to Collaborative Filtering IDS Lab. Seminar Spring 2009 강 민 석강 민 석 May 21 st, 2009 Nathan.
Collaborative Filtering  Introduction  Search or Content based Method  User-Based Collaborative Filtering  Item-to-Item Collaborative Filtering  Using.
Evaluation of Recommender Algorithms for an Internet Information Broker based on Simple Association Rules and on the Repeat-Buying Theory WEBKDD 2002 Edmonton,
A Content-Based Approach to Collaborative Filtering Brandon Douthit-Wood CS 470 – Final Presentation.
1 Collaborative Filtering & Content-Based Recommending CS 290N. T. Yang Slides based on R. Mooney at UT Austin.
EigenRank: A ranking oriented approach to collaborative filtering By Nathan N. Liu and Qiang Yang Presented by Zachary 1.
Recommender Systems Debapriyo Majumdar Information Retrieval – Spring 2015 Indian Statistical Institute Kolkata Credits to Bing Liu (UIC) and Angshul Majumdar.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
Effective Automatic Image Annotation Via A Coherent Language Model and Active Learning Rong Jin, Joyce Y. Chai Michigan State University Luo Si Carnegie.
Pairwise Preference Regression for Cold-start Recommendation Speaker: Yuanshuai Sun
Google News Personalization Big Data reading group November 12, 2007 Presented by Babu Pillai.
Information Design Trends Unit Five: Delivery Channels Lecture 2: Portals and Personalization Part 2.
Collaborative Filtering via Euclidean Embedding M. Khoshneshin and W. Street Proc. of ACM RecSys, pp , 2010.
ICONIP 2010, Sydney, Australia 1 An Enhanced Semi-supervised Recommendation Model Based on Green’s Function Dingyan Wang and Irwin King Dept. of Computer.
Personalization Services in CADAL Zhang yin Zhuang Yuting Wu Jiangqin College of Computer Science, Zhejiang University November 19,2006.
Online Evolutionary Collaborative Filtering RECSYS 2010 Intelligent Database Systems Lab. School of Computer Science & Engineering Seoul National University.
User Modeling and Recommender Systems: recommendation algorithms
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Mining Tag Semantics for Social Tag Recommendation Hsin-Chang Yang Department of Information Management National University of Kaohsiung.
Learning in Bayesian Networks. Known Structure Complete Data Known Structure Incomplete Data Unknown Structure Complete Data Unknown Structure Incomplete.
Collaborative Deep Learning for Recommender Systems
Collaborative Filtering - Pooja Hegde. The Problem : OVERLOAD Too much stuff!!!! Too many books! Too many journals! Too many movies! Too much content!
Analysis of massive data sets Prof. dr. sc. Siniša Srbljić Doc. dr. sc. Dejan Škvorc Doc. dr. sc. Ante Đerek Faculty of Electrical Engineering and Computing.
Slope One Predictors for Online Rating-Based Collaborative Filtering Daniel Lemire, Anna Maclachlan In SIAM Data Mining (SDM’05), Newport Beach, California,
Collaborative Filtering With Decoupled Models for Preferences and Ratings Rong Jin 1, Luo Si 1, ChengXiang Zhai 2 and Jamie Callan 1 Language Technology.
Recommender Systems & Collaborative Filtering
Sampath Jayarathna Cal Poly Pomona
Methods and Metrics for Cold-Start Recommendations
Adopted from Bin UIC Recommender Systems Adopted from Bin UIC.
Collaborative Filtering Nearest Neighbor Approach
Movie Recommendation System
ITEM BASED COLLABORATIVE FILTERING RECOMMENDATION ALGORITHEMS
Probabilistic Latent Preference Analysis
Presentation transcript:

1 Collaborative Filtering Rong Jin Department of Computer Science and Engineering Michigan State University

2 Outline  Brief introduction information filtering  Collaborative filtering Major issues in collaborative filtering Main methods for collaborative filtering Flexible mixture model for collaborative filtering Decoupling model for collaborative filtering

3 Short vs. Long Term Info. Need  Short-term information need (Ad hoc retrieval) “Temporary need”, e.g., info about used cars Information source is relatively static User “pulls” information Application example: library search, Web search  Long-term information need (Filtering) “Stable need”, e.g., new data mining algorithms Information source is dynamic System “pushes” information to user Applications: news filter

4 Examples of Information Filtering  News filtering  filtering  Movie/book/product recommenders  Literature recommenders  And many others …

5 Information Filtering  Basic filtering question: Will user U like item X?  Two different ways of answering it Look at what U likes  characterize X  content-based filtering Look at who likes X  characterize U  collaborative filtering  Combine content-based filtering and collaborative filtering

6 Other Names for Information Filtering  Content-based filtering is also called “Adaptive Information Filtering” in TREC “Selective Dissemination of Information” (SDI) in Library & Information Science  Collaborative filtering is also called Recommender systems

7 Example: Content-based Filtering Description: A homicide detective and a fire marshall must stop a pair of murderers who commit videotaped crimes to become media darlings Rating: Description: Benjamin Martin is drawn into the American revolutionary war against his will when a brutal British commander kills his son. Rating: Description: A biography of sports legend, Muhammad Ali, from his early days to his days in the ring Rating: History What to Recommend? Description: A high-school boy is given the chance to write a story about an up-and-coming rock band as he accompanies it on their concert tour. Recommend: ? Description: A young adventurer named Milo Thatch joins an intrepid group of explorers to find the mysterious lost continent of Atlantis. Recommend: ? No Yes

8 Example: Collaborative Filtering User User User 32?354 User 3 is more similar to user 1 than user 2  5 for movie “15 minutes” for user 3 5

9 Collaborative Filtering (CF) vs. Content-based Filtering (CBF)  CF do not need content of items while CBF relies the content of items  CF is useful when content of items are not available or difficult to acquire are brief and insufficient  Example: movie recommendation  A movie is preferred may because its actor its director its popularity

10 Application of Collaborative Filtering

11 ? Collaborative Filtering  Goal: Making filtering decisions for an individual user based on the judgments of other users u1u2…umu1u2…um Users: U Objects: O o 1 o 2 … o j o j+1 … o n 3 1 …. … 4 2 ? 2 5 ? 4 3 ? 3 ? 1 2 u test 3 4…… 1

12 Collaborative Filtering  Goal: Making filtering decisions for an individual user based on the judgments of other users  General idea Given a user u, find similar users {u 1, …, u m } Predict u’s rating based on the ratings of u 1, …, u m

13 Example: Collaborative Filtering User User User 32?354 User 3 is more similar to user 2 than user 1  5 for movie “15 minutes” for user 3 5

14 Memory-based Approaches for CF  The key is to find users that are similar to the test user  Traditional approach Measure the similarity in rating patterns between different users Example: Pearson Correlation Coefficient

15 Pearson Correlation Coefficient for CF  Similarity between a training user y and a test user y 0 : Remove the rating bias from each training user

16 Pearson Correlation Coefficient for CF  Estimate ratings for the test user Weighted vote of normalized rates

17 Example User Normalized Rate User Normalized Rate User 32?354 Normalize Rate

18 Example User Normalized Rate User Normalized Rate User 32?354 Normalize Rate

19 Example User Normalized Rate User Normalized Rate User 32?354 Normalize Rate

20 Problems with Memory-based Approaches User 1?5342 User 2415?5 User 35?425 User 41535?  Most users only rate a few items Two similar users can may not rate the same set of items  Clustering users and items

21 Flexible Mixture Model (FMM) Cluster both users and items simultaneously User 1?5342 User 2415?5 User 35?425 User 41535? User clustering and item clustering are correlated !

22 Flexible Mixture Model (FMM) Cluster both users and items simultaneously User Class I1p(4)=1/4 p(5)=3/4 3 User Class IIp(4)=1/4 p(5)=3/4 p(1)=1/2 p(2)=1/2 p(4)=1/2 p(5)=1/2 Movie Type I Movie Type II Movie Type III Unknown ratings are gone!

23 Flexible Mixture Model (FMM) Z o Z u O U R P(o|Z o ) P(u|Z u ) P(Z o ) P(Z u ) P(r|Z o,Z u ) Z u : user class Z o : item class U: user O: item R: rating Hidden variable Observed variable

24  Annealed Expectation Maximization (AEM) algorithm  E-step: calculate posterior probability for hidden variables z u and Z o b: temperature for Annealed EM algorithm  M-step: updated parameters Flexible Mixture Model: Estimation

25 Flexible Mixture Model: Predication  Fold-in process Repeat the EM algorithm including ratings from the test user Fix all the parameters except for P(u t |z u ) Key issue: What user class does the test user belong to ?

26 Another Prob. with Memory-based Approaches User User User User Users with similar interests can have different rating patterns  Decoupling preference patterns from rating patterns

27 Decoupling Model (DM) Z o Z u O U Hidden variable Observed variable Z u : user class Z o : item class U: user O: item R: rating

28 Decoupling Model (DM) Z u : user class Z o : item class U: user O: item R: rating Z pref : whether users like items Z o Z u O U Z pref Hidden variable Observed variable

29 Decoupling Model (DM) Z u : user class Z o : item class U: user O: item R: rating Z pref : whether users like items Z R : rating class Z o Z u O U R Z pref Z R  Separating preference and rating patterns  User class + Rating class  rating R Z u  Z pref and Z R +Z pref  r Hidden variable Observed variable

30 Experiment  Datasets: EachMovie and MovieRating  Evaluation: Mean Absolute Error (MAE): average absolute deviation of the predicted ratings to the actual ratings on items. The smaller MAE, the better the performance MovieRatingEachMovie Number of Users Number of Items Avg. # of rated items/User Number of ratings56

31 Experiment Protocol  Test the sensitivity of the proposed model to the amount of training data Vary the number of training users MovieRating dataset: 100 and 200 training users EachMovie dataset: 200 and 400 training users  Test the sensitivity of the proposed model to the information needed for the test user Vary the number of rated items provided by the test user  5, 10, and 20 items are given with ratings

32 Experimental Results: FMM and other baseline algorithms Movie Rating, 100 Training Users Movie Rating, 200 Training Users Each Movie, 400 Training Users Each Movie, 200 Training Users Given: Given: Given: Given: MAEMAE MAEMAE MAEMAE MAEMAE A smaller MAE indicates better performance

33 FMM vs. DM Results on Movie Rating Results on Each Movie Training Users Size Algorithms 5 Items Given 10 Items Given 20 Items Given 100 FMM DM FMM DM Training Users Size Algorithms 5 Items Given 10 Items Given 20 Items Given 200 FMM DM FMM DM Smaller value indicates better performance