COLLABORATIVE FILTERING Mustafa Cavdar Neslihan Bulut.


OUTLINE
- Definition
- Problem Space
- Neighborhood based methods
- Experiments
- Weighting Possible Neighbors
- Selecting Neighborhoods
- Producing a Prediction
- Conclusions
- Future Work

AUTOMATED COLLABORATIVE FILTERING
- Reduces information load
- Complements content-based information filtering systems
- Collaborative filtering utilizes human judgments (ratings)
- Key advancements over content-based filtering:
  - Support for content not easily analyzed by automated processes
  - Filtering of items based on quality and taste
  - Ability to provide serendipitous recommendations

PROBLEM SPACE
- Predict how well a user will like an item
- History of preference judgments for a community of users
- Prediction engine
- Matrix of users and ratings
- Predict the values for specific empty cells
- Neighborhood-based methods are most prevalent
- Other methods: Bayesian networks, singular value decomposition with neural net classification, and induction rule learning
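The user-by-rating matrix described above can be sketched in a few lines of Python. The user names, item titles, and the `empty_cells` helper are hypothetical and serve only to illustrate which cells a prediction engine would be asked to fill.

```python
# Hypothetical toy data: each user maps to the items they have rated.
# Any (user, item) pair absent from the mapping is an empty matrix cell.
ratings = {
    "alice": {"Alien": 5, "Brazil": 3},
    "bob": {"Alien": 4, "Casablanca": 2},
    "carol": {"Brazil": 4, "Casablanca": 5},
}


def empty_cells(ratings):
    """Return the (user, item) pairs that have no rating yet."""
    items = sorted({item for rated in ratings.values() for item in rated})
    return [
        (user, item)
        for user in sorted(ratings)
        for item in items
        if item not in ratings[user]
    ]


missing = empty_cells(ratings)
print(missing)
```

A sparse dictionary is used here instead of a dense array because most cells in a real rating matrix are empty.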

NEIGHBORHOOD BASED METHODS

EXPERIMENTAL TECHNIQUE
- Compare results of different neighborhood-based prediction algorithms
- Data: anonymous reviews from the MovieLens movie recommendation site
  - 122,176 ratings from 1173 users, every user having at least 20 ratings
  - 10% of users were selected as test users; ratings for 5 items were withheld
  - For each item being predicted, the highest-ranking neighbors that have rated the item are used for the prediction
- Quality of a prediction algorithm:
  - Coverage: usually high
  - Accuracy: statistical accuracy or decision-support accuracy
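The two quality measures named above can be sketched as follows. The slide does not say which statistical accuracy metric was used, so mean absolute error is an assumption here; the `evaluate` function and the sample data are likewise hypothetical.

```python
def evaluate(predictions, withheld):
    """Compute coverage and mean absolute error over withheld ratings.

    predictions: {(user, item): predicted rating, or None if no prediction}
    withheld:    {(user, item): true rating that was hidden from the algorithm}
    """
    errors = [
        abs(predictions[key] - actual)
        for key, actual in withheld.items()
        if predictions.get(key) is not None
    ]
    # Coverage: fraction of withheld ratings the algorithm could predict at all.
    coverage = len(errors) / len(withheld)
    # Assumed accuracy metric: mean absolute error over the covered cells.
    mae = sum(errors) / len(errors) if errors else None
    return coverage, mae


# Hypothetical withheld ratings and predictions for two test users.
withheld = {("u1", "a"): 4, ("u1", "b"): 2, ("u2", "a"): 5, ("u2", "c"): 3}
predictions = {("u1", "a"): 3.5, ("u1", "b"): 3.0, ("u2", "a"): None, ("u2", "c"): 3.0}

coverage, mae = evaluate(predictions, withheld)
print(coverage, mae)
```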

EXPERIMENTAL TECHNIQUE
- Conclusions from empirical analysis of prediction algorithm components are tested
- Components:
  - Similarity weight
  - Significance weighting
  - Variance weighting
  - Selecting neighborhoods
  - Rating normalization
- Variations of the components' techniques were evaluated

WEIGHTING POSSIBLE NEIGHBORS
Similarity weighting
- Pearson correlation coefficient: measures the degree to which a linear relationship exists between two variables
- Spearman rank correlation coefficient: similar to Pearson but does not rely on model assumptions; performs about as well
- Vector similarity: performs well for information retrieval but not as well for collaborative filtering
- Entropy: not as good as Pearson correlation
- Mean-squared difference: not as good as Pearson correlation
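As a minimal sketch of Pearson similarity weighting, the function below correlates two users over the items both have rated. Computing the means over co-rated items only is one common convention and is an assumption here, as are the function name and the sample ratings.

```python
import math


def pearson(ratings_a, ratings_u):
    """Pearson correlation between two users over their co-rated items.

    Each argument maps item -> rating. Returns 0.0 when the correlation
    is undefined (fewer than two common items, or no variation).
    """
    common = sorted(set(ratings_a) & set(ratings_u))
    n = len(common)
    if n < 2:
        return 0.0
    # Assumption: means are taken over the co-rated items only.
    mean_a = sum(ratings_a[i] for i in common) / n
    mean_u = sum(ratings_u[i] for i in common) / n
    cov = sum((ratings_a[i] - mean_a) * (ratings_u[i] - mean_u) for i in common)
    var_a = sum((ratings_a[i] - mean_a) ** 2 for i in common)
    var_u = sum((ratings_u[i] - mean_u) ** 2 for i in common)
    if var_a == 0 or var_u == 0:
        return 0.0
    return cov / math.sqrt(var_a * var_u)


# Hypothetical users: "similar" rates one point lower than "active" on every
# shared item, while "opposite" reverses the active user's preferences.
active = {1: 5, 2: 3, 3: 4, 4: 1}
similar = {1: 4, 2: 2, 3: 3, 5: 5}
opposite = {1: 1, 2: 5, 3: 3}

w_similar = pearson(active, similar)
w_opposite = pearson(active, opposite)
print(w_similar, w_opposite)
```

Note that a constant offset between two users does not lower the correlation, which is why rating normalization is treated as a separate component.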

WEIGHTING POSSIBLE NEIGHBORS
Significance weighting
- Considers 'trust' in the correlation with a neighbor
- Neighbors with tiny samples are terrible candidates
- Similar results with Pearson correlation
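One way to express this 'trust' is to scale a correlation down when it rests on few co-rated items. The linear scaling and the cutoff of 50 items below are assumptions chosen for illustration; the slide does not state a formula or a threshold.

```python
def significance_weight(correlation, n_common, cutoff=50):
    """Devalue a correlation that is based on few co-rated items.

    cutoff is a hypothetical parameter: at or above it the correlation is
    trusted fully, below it the weight shrinks linearly with n_common.
    """
    if n_common >= cutoff:
        return correlation
    return correlation * n_common / cutoff


# The same raw correlation, backed by 5 versus 80 co-rated items.
weak_evidence = significance_weight(0.9, n_common=5)
strong_evidence = significance_weight(0.9, n_common=80)
print(weak_evidence, strong_evidence)
```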

WEIGHTING POSSIBLE NEIGHBORS
Variance weighting
- Ratings on some items are more valuable
- Increases the influence of items with higher variance

WEIGHTING POSSIBLE NEIGHBORS
Variance weighting (cont.)
- No significant effect on accuracy of the prediction algorithm
- Ignores disagreements with popular feeling
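The slides do not give the variance weighting formula, so the sketch below assumes one plausible form: each item's weight is its rating variance, shifted by the smallest variance and scaled by the largest. The function name and sample data are hypothetical.

```python
def item_variance_weights(item_ratings):
    """Map each item to a weight that grows with its rating variance.

    item_ratings: {item: list of at least two ratings}
    Assumed formula: weight = (var_item - var_min) / var_max.
    """
    variances = {}
    for item, values in item_ratings.items():
        mean = sum(values) / len(values)
        # Sample variance (divides by n - 1).
        variances[item] = sum((v - mean) ** 2 for v in values) / (len(values) - 1)
    var_min = min(variances.values())
    var_max = max(variances.values())
    if var_max == 0:
        # Every item has identical ratings, so no item is more informative.
        return {item: 0.0 for item in variances}
    return {item: (var - var_min) / var_max for item, var in variances.items()}


# Hypothetical items: one that nearly everyone likes, one that splits opinion.
weights = item_variance_weights({
    "consensus_hit": [5, 5, 4, 5],
    "divisive": [1, 5, 2, 5],
})
print(weights)
```

Under this assumption, agreement on the divisive item counts for far more than agreement on the item that almost everyone rates highly.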

SELECTING NEIGHBORHOODS Select a subset of users

SELECTING NEIGHBORHOODS
Correlation-thresholding
- Set an absolute correlation threshold
- Pick anyone whose correlation is greater
- High threshold => results in a small neighborhood
- Low threshold => nullifies the purpose of thresholding
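A minimal sketch of correlation-thresholding, using hypothetical neighbor names and correlation values, shows the trade-off between a high and a low threshold.

```python
def threshold_neighbors(correlations, threshold):
    """Keep only the neighbors whose correlation exceeds the threshold."""
    return {user: w for user, w in correlations.items() if w > threshold}


# Hypothetical correlations between the active user and four other users.
correlations = {"bob": 0.82, "carol": 0.35, "dave": 0.05, "erin": -0.4}

strict = threshold_neighbors(correlations, 0.8)  # small neighborhood
loose = threshold_neighbors(correlations, 0.0)   # keeps weak correlates too
print(sorted(strict), sorted(loose))
```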

SELECTING NEIGHBORHOODS
Best-n-neighbors
- Pick the best n correlates
- Large n => too much noise
- Small n => poor predictions
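Best-n-neighbors selection can be sketched as a sort followed by a cut. The function name, the sample correlations, and the choice of n = 2 are hypothetical.

```python
def best_n_neighbors(correlations, n):
    """Return the n (user, correlation) pairs with the highest correlation."""
    ranked = sorted(correlations.items(), key=lambda pair: pair[1], reverse=True)
    return ranked[:n]


# Hypothetical correlations between the active user and four other users.
correlations = {"bob": 0.82, "carol": 0.35, "dave": 0.05, "erin": -0.4}

top_two = best_n_neighbors(correlations, 2)
print(top_two)
```

Unlike thresholding, this always yields a neighborhood of the same size whenever at least n candidates exist, regardless of how strong the correlations are.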

PRODUCING A PREDICTION
- Combine ratings of neighbors
- Deviation-from-mean approach
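The deviation-from-mean approach can be sketched as below: each neighbor contributes how far its rating sits from its own mean, weighted by its correlation with the active user. The slide does not give the formula, so normalizing by the sum of absolute weights is an assumption, and the function name and numbers are hypothetical.

```python
def predict(active_mean, neighbors):
    """Predict a rating as the active user's mean plus a weighted deviation.

    neighbors: list of (weight, neighbor_rating_for_item, neighbor_mean_rating)
    """
    # Assumption: normalize by the sum of absolute weights.
    total_weight = sum(abs(w) for w, _, _ in neighbors)
    if total_weight == 0:
        # No usable neighbors: fall back to the active user's own mean.
        return active_mean
    weighted_deviation = sum(w * (rating - mean) for w, rating, mean in neighbors)
    return active_mean + weighted_deviation / total_weight


# Hypothetical neighbors: one rated the item above their mean, one below.
prediction = predict(
    active_mean=3.5,
    neighbors=[(0.8, 5, 4.0), (0.4, 2, 3.0)],
)
print(round(prediction, 3))
```

Working with deviations rather than raw ratings compensates for users who rate consistently higher or lower than others.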

CONCLUSIONS
- Filtering information based on quality and taste
- Best-n-neighbors is the best neighborhood selection technique
- A non-personalized average algorithm was found

FUTURE WORK
- Apply the framework to a diverse set of domains
- Integration with existing retrieval technology
- Scale algorithms to handle extremely large datasets

THANK YOU!