Data Mining: Concepts and Techniques

Presentation transcript:

Recommender Systems
Data Mining: Concepts and Techniques, May 4, 2018

Recommender Systems
- RS as an information filtering problem
- RS as a machine learning problem: seeks to predict the 'rating' that a user would give to an item she/he has not yet considered
- Enhance user experience
- Assist users in finding information
- Reduce search and navigation time

Types of RS
Three broad types:
- Content-based RS
- Collaborative RS
- Hybrid RS

Types of RS: Content-based RS
Content-based RS highlights
- Recommend items similar to those the user preferred in the past
- User profiling is the key
- Items/content are usually described by keywords
- Matching "user preferences" with "item characteristics" works well for textual information
- The Vector Space Model is widely used
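The matching of "user preferences" with "item characteristics" in a Vector Space Model can be sketched as follows. This is a minimal illustration, not the method of any particular system: the catalogue, the item names, and the helpers `cosine`, `build_profile` and `recommend` are all hypothetical, and the user profile is assumed to be a plain keyword count over liked items.

```python
import math
from collections import Counter

def cosine(a, b):
    """Cosine similarity between two sparse keyword vectors (dict-like)."""
    dot = sum(a[k] * b.get(k, 0) for k in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

# Hypothetical catalogue: each item is described by a few keywords.
items = {
    "book_a": ["python", "data", "mining"],
    "book_b": ["cooking", "italian", "pasta"],
    "book_c": ["data", "statistics", "python"],
}

def build_profile(liked):
    """User profile = keyword counts over the items the user liked."""
    profile = Counter()
    for name in liked:
        profile.update(items[name])
    return profile

def recommend(liked):
    """Rank items the user has not seen by similarity to the profile."""
    profile = build_profile(liked)
    scores = {
        name: cosine(profile, Counter(keywords))
        for name, keywords in items.items()
        if name not in liked
    }
    return sorted(scores, key=scores.get, reverse=True)

ranking = recommend(["book_a"])
print(ranking)
```

A user who liked only book_a gets book_c ranked ahead of book_b, because book_c shares two keywords with the profile and book_b shares none.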

Types of RS: Content-based RS
Content-based RS limitations
- Not all content is well represented by keywords, e.g. images
- Items represented by the same set of features are indistinguishable
- Users with thousands of purchases are a problem
- New user: no history available

Types of RS: Collaborative RS
Collaborative RS highlights
- Use other users' recommendations (ratings) to judge an item's utility
- The key is to find users/user groups whose interests match those of the current user
- The Vector Space Model is widely used (the direction of each vector is given by the user-specified ratings)
- More users, more ratings: better results
- Can also account for items dissimilar to the ones seen in the past
- Example: Movielens.org

Types of Collaborative Filtering
- User-based collaborative filtering
- Item-based collaborative filtering

User-based Collaborative Filtering
- Idea: people who agreed in the past are likely to agree again
- To predict a user's opinion of an item, use the opinions of similar users
- Similarity between users is decided by looking at the overlap in their opinions of other items

Example: User-based Collaborative Filtering
         Item 1  Item 2  Item 3  Item 4  Item 5
User 1     8       1       ?       2       7
[Rows for Users 2 to 6 are incomplete in this transcript. Surviving values: User 2: 5; User 3: 4; User 4: 3; User 5: 6.]

Similarity between users
         Item 1  Item 2  Item 3  Item 4  Item 5
User 1     8       1       ?       2       7
[Rows for the other users are incomplete in this transcript. Surviving values: User 2: 5; User 4: 3.]
- How similar are users 1 and 2?
- How similar are users 1 and 5?
- How do you calculate similarity?

Similarity between users: simple way
         Item 1  Item 2  Item 3  Item 4  Item 5
User 1     8       1       ?       2       7
[The row for User 2 is incomplete in this transcript. Surviving value: 5.]
- Only consider items both users have rated
- For each item: calculate the difference in the users' ratings
- Take the average of this difference over the items
Average over every Item j rated by both User 1 and User 2 of: | rating(User 1, Item j) - rating(User 2, Item j) |
A smaller average difference means the two users are more similar.
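The "simple way" above can be sketched in a few lines. The row for User 1 is the one shown on the slide; the second user's ratings are made up for illustration because the original table did not survive, and the function name `mean_abs_difference` is an assumption.

```python
def mean_abs_difference(ratings_a, ratings_b):
    """Average |a - b| over the items both users rated.

    Returns None when the two users have no rated item in common.
    """
    diffs = [
        abs(a - b)
        for a, b in zip(ratings_a, ratings_b)
        if a is not None and b is not None
    ]
    if not diffs:
        return None
    return sum(diffs) / len(diffs)

user_1 = [8, 1, None, 2, 7]  # from the slide; None marks the unknown Item 3
user_x = [7, 2, 5, 4, 7]     # hypothetical second user (values are made up)

d = mean_abs_difference(user_1, user_x)
print(d)
```

Item 3 is skipped because User 1 has not rated it, so the average runs over the four remaining items.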

Algorithm 1: using entire matrix
- Aggregation function: often a weighted sum
- Weight depends on similarity
[Figure: six users, with the target user in the middle (marked by a red circle). The distance between users reflects how similar they are. Numbers in yellow boxes are the other users' ratings for Item 3 (5, 7, 7, 8, 4). The ratings of all other users are fed to an aggregation function (often a weighted sum) to decide on the predicted rating for the target user.]

Algorithm 2: K-Nearest-Neighbour
- Neighbours are people who have historically had the same taste as our user
- Aggregation function: often a weighted sum
- Weight depends on similarity
[Figure: six users, with the target user in the middle (marked by a red circle). The distance between users reflects how similar they are. The blue area around the target user shows the nearest neighbours (two of the users in this case). Numbers in yellow boxes are the other users' ratings for Item 3 (5, 7, 7, 8, 4). Only the ratings of the nearest neighbours are fed to an aggregation function (often a weighted sum) to decide on the predicted rating for the target user.]
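Both algorithms can be sketched with one function: with `k=None` every user who rated the item contributes (Algorithm 1), and with an integer `k` only the k most similar users do (Algorithm 2). The slides only say that the weight depends on similarity, so the conversion `1 / (1 + distance)` is an assumption made here, and all ratings other than User 1's row are made up.

```python
def mean_abs_difference(a, b):
    """Average |a - b| over co-rated items; None if there is no overlap."""
    diffs = [abs(x - y) for x, y in zip(a, b) if x is not None and y is not None]
    return sum(diffs) / len(diffs) if diffs else None

def predict(target, others, item, k=None):
    """Weighted-sum prediction of `item` for `target`.

    k=None uses every user who rated the item; an integer k keeps
    only the k most similar users (the nearest neighbours).
    """
    candidates = []
    for other in others:
        if other[item] is None:
            continue
        dist = mean_abs_difference(target, other)
        if dist is None:
            continue
        weight = 1.0 / (1.0 + dist)  # assumed distance-to-weight conversion
        candidates.append((weight, other[item]))
    candidates.sort(key=lambda c: c[0], reverse=True)
    if k is not None:
        candidates = candidates[:k]
    total = sum(w for w, _ in candidates)
    if total == 0:
        return None
    return sum(w * r for w, r in candidates) / total

target = [8, 1, None, 2, 7]   # User 1 from the slides
others = [                    # hypothetical users (values are made up)
    [7, 2, 5, 4, 7],
    [8, 1, 7, 2, 7],
    [1, 8, 4, 7, 2],
]

pred_all = predict(target, others, item=2)        # Algorithm 1
pred_knn = predict(target, others, item=2, k=2)   # Algorithm 2
print(pred_all, pred_knn)
```

Restricting the sum to nearest neighbours drops the third user, whose taste is nearly opposite to the target's, so that user's low rating no longer pulls the prediction down.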

Item-based Collaborative Filtering
- Idea: a user is likely to have the same opinion of similar items [same idea as in content-based filtering]
- Similarity between items is decided by looking at how other users have rated them [different from content-based filtering, where item features are used]
- Advantages (compared to user-based CF):
  - Prevents the user cold-start problem
  - Improves scalability (similarity between items is more stable than similarity between users)

Example: Item-based Collaborative Filtering
User 1: 8, 1, ?, 2, 7
[Rows for Users 2 to 6 are incomplete in this transcript. Surviving values: User 2: 5; User 3: 4; User 4: 3; User 5: 6.]

Similarity between items
[Table of ratings; the cell values are scrambled in this transcript: ? 2 7 5 4 3 8 6]
Each row in the table holds one user's ratings of the items.
- How similar are items 3 and 4?
- How similar are items 3 and 5?
- How do you calculate similarity?

Similarity between items: simple way
[Table of ratings; the cell values are scrambled in this transcript: ? 2 5 7 4 3 6 8]
Each row in the table holds one user's ratings of the items.
- Only consider users who have rated both items
- For each user: calculate the difference in ratings for the two items
- Take the average of this difference over the users
Average over every User i who has rated both Item 3 and Item 4 of: | rating(User i, Item 3) - rating(User i, Item 4) |

Item-based Collaborative Filtering: Algorithms
- As in user-based CF: can use nearest neighbours or all items
- Aggregation function: often a weighted sum
- Weight depends on similarity
[Figure: five items; Item 3 is the one whose rating we need for User 1. Distances to Item 3 indicate similarity. Numbers in yellow boxes give User 1's ratings for the other items (Item 1: 8, Item 2: 1, Item 4: 2, Item 5: 7). The blue area shows the nearest neighbours, the items most similar to Item 3 based on past ratings by other users.]
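The item-based variant can be sketched the same way, comparing columns of the ratings matrix instead of rows. As before, this is only an illustration: the first row is User 1 from the slides, the other rows are made up, and the weight `1 / (1 + distance)` is an assumed way of turning the average difference into a similarity weight.

```python
def item_distance(matrix, i, j):
    """Average |rating_i - rating_j| over users who rated both items."""
    diffs = [
        abs(row[i] - row[j])
        for row in matrix
        if row[i] is not None and row[j] is not None
    ]
    return sum(diffs) / len(diffs) if diffs else None

def predict_item_based(matrix, user, item, k=None):
    """Weighted sum of the user's own ratings for items similar to `item`."""
    candidates = []
    for other in range(len(matrix[user])):
        if other == item or matrix[user][other] is None:
            continue
        dist = item_distance(matrix, item, other)
        if dist is None:
            continue
        weight = 1.0 / (1.0 + dist)  # assumed distance-to-weight conversion
        candidates.append((weight, matrix[user][other]))
    candidates.sort(key=lambda c: c[0], reverse=True)
    if k is not None:
        candidates = candidates[:k]  # keep only the nearest items
    total = sum(w for w, _ in candidates)
    if total == 0:
        return None
    return sum(w * r for w, r in candidates) / total

ratings = [
    [8, 1, None, 2, 7],  # User 1 from the slides (Item 3 unknown)
    [7, 2, 5, 4, 7],     # hypothetical users below (values are made up)
    [8, 1, 7, 2, 7],
    [1, 8, 4, 7, 2],
]

pred = predict_item_based(ratings, user=0, item=2, k=2)
print(pred)
```

With k=2 the two items closest to Item 3 in this made-up data are Item 5 and Item 1, so the prediction is a weighted average of User 1's ratings 7 and 8.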

Types of RS: Collaborative RS
Collaborative RS limitations
- Different users might use different scales. Possible solution: weighted ratings, i.e. deviations from the average rating
- Finding similar users/user groups isn't very easy
- New user: no preferences available (user cold-start problem)
- New item: no ratings available (item cold-start problem)
- Demographic filtering is required
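The "deviations from average rating" idea for users with different scales can be sketched as follows: each neighbour contributes how far their rating lies above or below their own average, and the result is added to the target user's average. The neighbour weights and ratings here are made up, and the function names are assumptions.

```python
def mean(values):
    """Average of the known (non-None) ratings."""
    known = [v for v in values if v is not None]
    return sum(known) / len(known)

def predict_mean_centred(target, neighbours, item):
    """Target's average plus the weighted average deviation of neighbours.

    `neighbours` is a list of (weight, ratings) pairs.
    """
    num = 0.0
    den = 0.0
    for weight, ratings in neighbours:
        if ratings[item] is None:
            continue
        num += weight * (ratings[item] - mean(ratings))
        den += abs(weight)
    if den == 0:
        return mean(target)  # no usable neighbour: fall back to the average
    return mean(target) + num / den

target = [8, 1, None, 2, 7]       # User 1 from the slides
neighbours = [                    # hypothetical (weight, ratings) pairs
    (1.0, [3, 1, 4, 2, 3]),       # a harsh rater: 4 is well above their average
    (0.5, [9, 8, 10, 9, 9]),      # a generous rater: 10 is only slightly above
]

pred = predict_mean_centred(target, neighbours, item=2)
print(pred)
```

The harsh rater's 4 and the generous rater's 10 both count as "better than usual for this user", which a plain weighted sum of raw ratings would not capture.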

Some ways to make a Hybrid RS
- Weighted: ratings from several recommendation techniques are combined to produce a single recommendation
- Switching: the system switches between recommendation techniques depending on the current situation
- Mixed: recommendations from several different recommenders are presented simultaneously (e.g. Amazon)
- Cascade: one recommender refines the recommendations given by another
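Three of these combination strategies can be sketched as small functions over score dictionaries. Everything here is illustrative: the function names, the `min_ratings` threshold used as the switching criterion, and the scores themselves are assumptions, not taken from any real system.

```python
def weighted_hybrid(scores_a, scores_b, weight_a=0.5):
    """Weighted: blend the scores of two recommenders into one score."""
    all_items = set(scores_a) | set(scores_b)
    return {
        i: weight_a * scores_a.get(i, 0.0) + (1 - weight_a) * scores_b.get(i, 0.0)
        for i in all_items
    }

def switching_hybrid(n_ratings, collaborative, content_based, min_ratings=5):
    """Switching: use collaborative scores only once the user has rated enough."""
    return collaborative if n_ratings >= min_ratings else content_based

def cascade_hybrid(first_scores, second_scores, keep=2):
    """Cascade: the first recommender shortlists, the second re-ranks."""
    shortlist = sorted(first_scores, key=first_scores.get, reverse=True)[:keep]
    return sorted(shortlist, key=lambda i: second_scores.get(i, 0.0), reverse=True)

# Hypothetical scores from a collaborative and a content-based recommender.
cf = {"a": 0.9, "b": 0.4, "c": 0.7}
cb = {"a": 0.2, "b": 0.8, "c": 0.6}

combined = weighted_hybrid(cf, cb)
chosen = switching_hybrid(2, cf, cb)      # new user with only 2 ratings
reranked = cascade_hybrid(cf, cb, keep=2)
print(combined, chosen, reranked)
```

The switching example shows one way to soften the user cold-start problem: a user with few ratings is served by the content-based scores until enough ratings accumulate.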

Model-based collaborative filtering
- Instead of using ratings directly, develop a model of user ratings
- Use the model to predict ratings for new items
- To build the model:
  - Bayesian network (probabilistic)
  - Clustering (classification)
  - Rule-based approaches (e.g., association rules between co-purchased items)
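As one small instance of the rule-based approach, the sketch below scores a rule "customers who bought X also bought Y" by its support and confidence over a set of purchase baskets. The baskets and product names are made up, and a real association-rule miner would search over many candidate rules rather than score a single one.

```python
# Hypothetical purchase baskets (one set of products per customer order).
baskets = [
    {"camera", "sd_card"},
    {"camera", "sd_card", "tripod"},
    {"camera", "tripod"},
    {"sd_card"},
]

def support(baskets, antecedent, consequent):
    """Fraction of all baskets that contain both products."""
    both = sum(1 for b in baskets if antecedent in b and consequent in b)
    return both / len(baskets)

def confidence(baskets, antecedent, consequent):
    """Fraction of baskets with the antecedent that also hold the consequent."""
    with_antecedent = [b for b in baskets if antecedent in b]
    if not with_antecedent:
        return 0.0
    both = sum(1 for b in with_antecedent if consequent in b)
    return both / len(with_antecedent)

s = support(baskets, "camera", "sd_card")
c = confidence(baskets, "camera", "sd_card")
print(s, c)
```

A recommender built on such rules would suggest the consequent to customers whose basket contains the antecedent, keeping only rules whose support and confidence exceed chosen thresholds.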

Model-based collaborative filtering: Cluster Models
- Create clusters or groups of customers
- Put each customer into a category
- Classification simplifies the task of user matching
- Better scalability and performance
- Lower accuracy than the standard collaborative filtering method
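A cluster model can be sketched with a tiny k-means: customers are grouped offline, and a new customer is matched against a handful of cluster centres instead of every other user. The slides do not name a clustering algorithm, so k-means is an assumption here, as are the fully rated (no missing values) made-up data and the fixed starting centroids.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two rating vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(point, centroids):
    """Index of the centroid closest to `point`."""
    return min(range(len(centroids)), key=lambda c: squared_distance(point, centroids[c]))

def k_means(points, centroids, iterations=10):
    """Plain k-means: assign points to centroids, then recompute the means."""
    for _ in range(iterations):
        groups = [[] for _ in centroids]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        centroids = [
            [sum(col) / len(g) for col in zip(*g)] if g else c
            for g, c in zip(groups, centroids)
        ]
    return centroids

def predict_from_cluster(known, centroids, item):
    """Match a customer by their known ratings, read the item off the centroid.

    `known` maps item index to rating for the items the customer has rated.
    """
    best = min(centroids, key=lambda c: sum((c[i] - r) ** 2 for i, r in known.items()))
    return best[item]

# Hypothetical customers who rated all three items (values are made up).
users = [[8, 1, 2], [7, 2, 3], [1, 8, 7], [2, 9, 8]]
centroids = k_means(users, [list(users[0]), list(users[2])])

# New customer rated items 0 and 1; predict item 2 from their cluster.
pred = predict_from_cluster({0: 7, 1: 1}, centroids, item=2)
print(centroids, pred)
```

The trade-off named on the slide shows up directly: matching costs one comparison per cluster rather than per user, but everyone in a cluster receives the same predicted rating.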

Possible Improvements in RS
- Better understanding of users and items
- Social network (social RS)
  - User level: highlighting the interests, hobbies, and keywords people have in common
  - Item level: linking the keywords to eCommerce