1 Introduction to Recommendation System Presented by HongBo Deng Nov 14, 2006 Refer to the PPT from Stanford: Anand Rajaraman, Jeffrey D. Ullman.

Slides:



Advertisements
Similar presentations
Recommender System A Brief Survey.
Advertisements

Recommender Systems & Collaborative Filtering
Content-based Recommendation Systems
Prediction Modeling for Personalization & Recommender Systems Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Similarity and Distance Sketching, Locality Sensitive Hashing
Jeff Howbert Introduction to Machine Learning Winter Collaborative Filtering Nearest Neighbor Approach.
1 RegionKNN: A Scalable Hybrid Collaborative Filtering Algorithm for Personalized Web Service Recommendation Xi Chen, Xudong Liu, Zicheng Huang, and Hailong.
Oct 14, 2014 Lirong Xia Recommender systems acknowledgment: Li Zhang, UCSC.
COLLABORATIVE FILTERING Mustafa Cavdar Neslihan Bulut.
Recommender Systems Aalap Kohojkar Yang Liu Zhan Shi March 31, 2008.
Rubi’s Motivation for CF  Find a PhD problem  Find “real life” PhD problem  Find an interesting PhD problem  Make Money!
CS345 Data Mining Recommendation Systems Netflix Challenge Anand Rajaraman, Jeffrey D. Ullman.
Database Management Systems, R. Ramakrishnan1 Computing Relevance, Similarity: The Vector Space Model Chapter 27, Part B Based on Larson and Hearst’s slides.
RECOMMENDATION SYSTEMS
Recommendations via Collaborative Filtering. Recommendations Relevant for movies, restaurants, hotels…. Recommendation Systems is a very hot topic in.
Chapter 8 Collaborative Filtering Stand
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Recommender systems Ram Akella November 26 th 2008.
Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University Note to other teachers and users of these.
Algorithms for Efficient Collaborative Filtering Vreixo Formoso Fidel Cacheda Víctor Carneiro University of A Coruña (Spain)
CONTENT-BASED BOOK RECOMMENDING USING LEARNING FOR TEXT CATEGORIZATION TRIVIKRAM BHAT UNIVERSITY OF TEXAS AT ARLINGTON DATA MINING CSE6362 BASED ON PAPER.
Recommendation Systems
Collaborative Filtering & Content-Based Recommending
Chapter 5: Information Retrieval and Web Search
Combining Content-based and Collaborative Filtering Department of Computer Science and Engineering, Slovak University of Technology
Chapter 12 (Section 12.4) : Recommender Systems Second edition of the book, coming soon.
Item-based Collaborative Filtering Recommendation Algorithms
Performance of Recommender Algorithms on Top-N Recommendation Tasks
Recommender Systems. >1,000,000,000 Finding Trusted Information How many cows in Texas?
Distributed Networks & Systems Lab. Introduction Collaborative filtering Characteristics and challenges Memory-based CF Model-based CF Hybrid CF Recent.
Item Based Collaborative Filtering Recommendation Algorithms Badrul Sarwar, George Karpis, Joseph KonStan, John Riedl (UMN) p.s.: slides adapted from:
1 Information Filtering & Recommender Systems (Lecture for CS410 Text Info Systems) ChengXiang Zhai Department of Computer Science University of Illinois,
Recommender systems Drew Culbert IST /12/02.
Clustering-based Collaborative filtering for web page recommendation CSCE 561 project Proposal Mohammad Amir Sharif
Sarah Fatima Varda Sarfraz.  What is Recommendation systems?  Three recommendation approaches  Content-based  Collaborative  Hybrid approach  Conclusions.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
Toward the Next generation of Recommender systems
1 Recommender Systems Collaborative Filtering & Content-Based Recommending.
Chapter 6: Information Retrieval and Web Search
1 Computing Relevance, Similarity: The Vector Space Model.
EigenRank: A Ranking-Oriented Approach to Collaborative Filtering IDS Lab. Seminar Spring 2009 강 민 석강 민 석 May 21 st, 2009 Nathan.
Collaborative Filtering  Introduction  Search or Content based Method  User-Based Collaborative Filtering  Item-to-Item Collaborative Filtering  Using.
A Content-Based Approach to Collaborative Filtering Brandon Douthit-Wood CS 470 – Final Presentation.
Evaluation of Recommender Systems Joonseok Lee Georgia Institute of Technology 2011/04/12 1.
1 Collaborative Filtering & Content-Based Recommending CS 290N. T. Yang Slides based on R. Mooney at UT Austin.
Recommender Systems Debapriyo Majumdar Information Retrieval – Spring 2015 Indian Statistical Institute Kolkata Credits to Bing Liu (UIC) and Angshul Majumdar.
Recommender Systems. Recommender Systems (RSs) n RSs are software tools providing suggestions for items to be of use to users, such as what items to buy,
DATA MINING LECTURE 5 Similarity and Distance Recommender Systems.
The Summary of My Work In Graduate Grade One Reporter: Yuanshuai Sun
CS425: Algorithms for Web Scale Data Most of the slides are from the Mining of Massive Datasets book. These slides have been modified for CS425. The original.
Pairwise Preference Regression for Cold-start Recommendation Speaker: Yuanshuai Sun
Ensemble Methods Construct a set of classifiers from the training data Predict class label of previously unseen records by aggregating predictions made.
Collaborative Filtering via Euclidean Embedding M. Khoshneshin and W. Street Proc. of ACM RecSys, pp , 2010.
Personalization Services in CADAL Zhang yin Zhuang Yuting Wu Jiangqin College of Computer Science, Zhejiang University November 19,2006.
User Modeling and Recommender Systems: recommendation algorithms
Item-Based Collaborative Filtering Recommendation Algorithms Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl GroupLens Research Group/ Army.
What do you recommend? THOSE … THIS … THAT 1. what you’ll read next summer (Amazon, Barnes&Noble) what movies you should watch… (Reel, RatingZone, Amazon)
Collaborative Deep Learning for Recommender Systems
Analysis of massive data sets Prof. dr. sc. Siniša Srbljić Doc. dr. sc. Dejan Škvorc Doc. dr. sc. Ante Đerek Faculty of Electrical Engineering and Computing.
Collaborative Filtering With Decoupled Models for Preferences and Ratings Rong Jin 1, Luo Si 1, ChengXiang Zhai 2 and Jamie Callan 1 Language Technology.
Announcements Paper presentation Project meet with me ASAP
Matrix Factorization and Collaborative Filtering
High dim. data Graph data Infinite data Machine learning Apps
Data Mining: Concepts and Techniques
Recommender Systems & Collaborative Filtering
CS728 The Collaboration Graph
مهندسی سيستم‌هاي تجارت الکترونيک
Adopted from Bin UIC Recommender Systems Adopted from Bin UIC.
Collaborative Filtering Nearest Neighbor Approach
Recommendation Systems
Presentation transcript:

1 Introduction to Recommendation System Presented by HongBo Deng Nov 14, 2006 Refer to the PPT from Stanford: Anand Rajaraman, Jeffrey D. Ullman

2 Netflix Prize - $1,000,000 Prize Netflix recently announced their NetflixPrize in which they will award $1 million dollars for an algorithm that can out-perform their recommendation approach Cinematch by 10%.

3 Outline  What is Recommendation systems?  Three recommendation approaches Content-based Collaborative Hybrid approach  Conclusions  Review of my previous work

4 What is Recommendation systems? Items SearchRecommendations Products, web sites, blogs, news items, … Recommendation systems are programs which attempt to predict items that a user may be interested in

5 Recommendation Types  Editorial  Simple aggregates Top 10, Most Popular, Recent Uploads  Tailored to individual users Amazon, Netflix, … Books, CDs, other products at amazon.com Movies by Netflix, MovieLens

6 Formal Model  C = set of Customers  S = set of Items, e.g. books, movies  The space S of possible items and the user space C can be very large.  Utility function u: C £ S ! R R = set of ratings R is a totally ordered set e.g., 0-5 stars, real number in [0,1]

7 Utility Matrix King Kong LOTRMatrix Nacho Libre Alice Bob Carol David

8 Recommendation Process  Collecting “known” ratings for matrix  Extrapolate unknown ratings from known ratings Estimate ratings for the items that have not been seen by a user  Recommend the items with the highest estimated ratings to a user

9 Collecting Ratings  Explicit data collection Ask people to rate items Doesn’t work well in practice – people can’t be bothered  Implicit data collection Learn ratings from user actions e.g., purchase implies high rating What about low ratings?

10 Extrapolating Utilities  Key problem: matrix U is sparse most people have not rated most items  Three approaches Content-based recommendation Collaborative recommendation Hybrid recommendation

11 Content-based recommendations  Main idea: recommend items to customer C similar to previous items rated highly by C  Movie recommendations recommend movies with same actor(s), director, genre, …  Websites, blogs, news recommend other sites with “similar” content

12 Plan of action likes Item profiles RedCirclesTriangles User profile match recommend build

13 Item Profiles  For each item, create an item profile  Profile is a set of features movies: author, title, actor, director,… text: set of “important” words in document  How to pick important words? Usual heuristic is TF.IDF (Term Frequency times Inverse Doc Frequency)

14 TF.IDF f ij = frequency of term t i in document d j n i = number of docs that mention term i N = total number of docs TF.IDF score w ij = TF ij £ IDF i Doc profile = set of words with highest TF.IDF scores, together with their scores

15 User profiles and prediction  User profile possibilities: Weighted average of rated item profiles Variation: weight by difference from average rating for item …  Traditional heuristic Given user profile c and item profile s, estimate u(c,s) = cos(c,s) = c.s/(|c||s|) Need efficient method to find items with high utility E.g.

16 Model-based approaches  For each user, learn a classifier that classifies items into rating classes liked by user and not liked by user e.g., Bayesian, regression, SVM  Apply classifier to each item to find recommendation candidates  Problem: scalability

17 Limitations of content-based approach  Finding the appropriate features e.g., images, movies, music  Overspecialization Never recommends items outside user’s content profile People might have multiple interests  Recommendations for new users How to build a profile? A new user, having very few ratings, would not be able to get accurate recommendations.

18 Collaborative Filtering  Consider user c  Find set D of other users whose ratings are “similar” to c’s ratings  Estimate user’s ratings based on ratings of users in D Set of other users Similar Ratings Estimate

19 Similar users  Let r x be the vector of user x’s ratings  Cosine similarity measure sim(x,y) = cos(r x, r y )  Pearson correlation coefficient S xy = items rated by both users x and y

20 Rating predictions  Let D be the set of k users that are the most similar to c and who have rated item s  Possibilities for prediction function (item s): r cs = 1/k  d 2 D r ds r cs = ( d 2 D sim(c,d) £ r ds )/( d 2 D sim(c,d)) Other options?

21 Complexity  Expensive step is finding k most similar customers O(|U|)  Too expensive to do at runtime Need to pre-compute  Naïve precomputation takes time O(N|U|) Simple trick gives some speedup  Can use clustering, partitioning as alternatives, but quality degrades

22 Item-Item Collaborative Filtering  So far: User-user collaborative filtering  Another view For item s, find other similar items Estimate rating for item based on ratings for similar items Can use same similarity metrics and prediction functions as in user-user model  In practice, it has been observed that item-item often works better than user- user

23 Pros and cons of collaborative filtering  Works for any kind of item No feature selection needed  New user problem The same problem as with content-based system  New item problem  Sparsity of rating matrix

24 Hybrid Methods  Implement two separate recommenders and combine their predictions  Add content-based methods to collaborative approach item profiles for new item problem deal with sparsity-related problems

25 Evaluating Recommendations  Precision Accuracy of predictions Compare predictions with known ratings, Root- mean-square error (RMSE)  Receiver operating characteristic (ROC) Tradeoff curve between false positives and false negatives  Recommendation Quality Top-n measures (e.g., Breese score)  Item-Set Coverage Number of items/users for which system can make predictions

26 Conclusions  Content-based The user will be recommended items similar to the ones the user preferred in the past  Collaborative The user will be recommended items that people with similar tastes and preferences liked in the past;  Hybrid Combine collaborative and content-based methods

27 Review of my previous work

28 Facial Expression Recognition Preprocessing procedure Rotate to line up eye coordinates Locate & Corp Face Region Geometrical Normalize Gabor Feature Extraction Normalize PCA&LDA Translation Matrix Train PhaseTemplates Test Phase Histogram Equalization Distance Classifier

29 Image Stitching Feature Points extraction Correlation Match Ransac eliminate pseudo match points Build the Model Perspective model Image alignmentImage Stitching Demo

30 Any questions or suggestions  Thank you