Amit Goyal Laks V. S. Lakshmanan RecMax: Exploiting Recommender Systems for Fun and Profit University of British Columbia

Slides:



Advertisements
Similar presentations
Numbers Treasure Hunt Following each question, click on the answer. If correct, the next page will load with a graphic first – these can be used to check.
Advertisements

Angstrom Care 培苗社 Quadratic Equation II
AP STUDY SESSION 2.
1
Feichter_DPG-SYKL03_Bild-01. Feichter_DPG-SYKL03_Bild-02.
1 Vorlesung Informatik 2 Algorithmen und Datenstrukturen (Parallel Algorithms) Robin Pomplun.
© 2008 Pearson Addison Wesley. All rights reserved Chapter Seven Costs.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Chapter 1 The Study of Body Function Image PowerPoint
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
Author: Julia Richards and R. Scott Hawley
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.
Objectives: Generate and describe sequences. Vocabulary:
UNITED NATIONS Shipment Details Report – January 2006.
RXQ Customer Enrollment Using a Registration Agent (RA) Process Flow Diagram (Move-In) Customer Supplier Customer authorizes Enrollment ( )
1 RA I Sub-Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Casablanca, Morocco, 20 – 22 December 2005 Status of observing programmes in RA I.
Conversion Problems 3.3.
Properties of Real Numbers CommutativeAssociativeDistributive Identity + × Inverse + ×
Custom Statutory Programs Chapter 3. Customary Statutory Programs and Titles 3-2 Objectives Add Local Statutory Programs Create Customer Application For.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt BlendsDigraphsShort.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Year 6 mental test 10 second questions
1 Discreteness and the Welfare Cost of Labour Supply Tax Distortions Keshab Bhattarai University of Hull and John Whalley Universities of Warwick and Western.
1 Click here to End Presentation Software: Installation and Updates Internet Download CD release NACIS Updates.
REVIEW: Arthropod ID. 1. Name the subphylum. 2. Name the subphylum. 3. Name the order.
Break Time Remaining 10:00.
Table 12.1: Cash Flows to a Cash and Carry Trading Strategy.
PP Test Review Sections 6-1 to 6-6
EU market situation for eggs and poultry Management Committee 20 October 2011.
Bright Futures Guidelines Priorities and Screening Tables
Bellwork Do the following problem on a ½ sheet of paper and turn in.
2 |SharePoint Saturday New York City
Green Eggs and Ham.
Exarte Bezoek aan de Mediacampus Bachelor in de grafische en digitale media April 2014.
VOORBLAD.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
RecMax – Can we combine the power of Social Networks and Recommender Systems? Amit Goyal and L. RecMax: Exploting Recommender Systems for Fun and Profit.
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
© 2012 National Heart Foundation of Australia. Slide 2.
Adding Up In Chunks.
Universität Kaiserslautern Institut für Technologie und Arbeit / Institute of Technology and Work 1 Q16) Willingness to participate in a follow-up case.
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Synthetic.
Artificial Intelligence
Note to the teacher: Was 28. A. to B. you C. said D. on Note to the teacher: Make this slide correct answer be C and sound to be “said”. to said you on.
Model and Relationships 6 M 1 M M M M M M M M M M M M M M M M
25 seconds left…...
1 hi at no doifpi me be go we of at be do go hi if me no of pi we Inorder Traversal Inorder traversal. n Visit the left subtree. n Visit the node. n Visit.
Analyzing Genes and Genomes
We will resume in: 25 Minutes.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Essential Cell Biology
Intracellular Compartments and Transport
PSSA Preparation.
Essential Cell Biology
Immunobiology: The Immune System in Health & Disease Sixth Edition
Energy Generation in Mitochondria and Chlorplasts
Murach’s OS/390 and z/OS JCLChapter 16, Slide 1 © 2002, Mike Murach & Associates, Inc.
Chapter 5 The Mathematics of Diversification
RecMax: Exploiting Recommender Systems for Fun and Profit RecMax – Recommendation Maximization Previous research in Recommender Systems mostly focused.
Presentation transcript:

Amit Goyal Laks V. S. Lakshmanan RecMax: Exploiting Recommender Systems for Fun and Profit University of British Columbia

Recommender Systems 2 Movies Products Music VideosNews Websites

RecMax – Recommendation Maximization 3  Previous research mostly focused on improving accuracy of recommendations.  In this paper, we propose a novel problem RecMax (short for Recommendation Maximization). Can we launch a targeted marketing campaign over an existing operational Recommender System?

Consider an item in a Recommender System 4 Some users rate the item (seed users) Because of these ratings, the item may be recommended to some other users. Flow of information RecMax: Can we strategically select the seed users?

RecMax 5 Seed Users Flow of information Users to whom the item is recommended Select k seed users such that if they provide high ratings to a new product, then the number of other users to whom the product is recommended (hit score) by the underlying recommender system algorithm is maximum.

RecMax – Problem Formulation 6 RecommendationsExpected Rating Harry Potter4.8 American Pie4.3 …. … The Dark Knight3.2 Number of recommendations are l Recommendation List for user v rating threshold of user v (denoted by θ v ) For a new item i, if expected rating R(v,i) > θ v, then the new item is recommended to v The goal of RecMax is to find a seed set S such that hit score f(S) is maximized.

Benefits of RecMax 7  Targeted marketing in Recommender Systems  Marketers can effectively advertise new products on a Recommender System platform.  Business opportunity to Recommender System platform.  Similar to Influence Maximization problem in spirit.  Beneficial to seed users  They get free/discounted samples of a new product.  Helpful to other users  They receive recommendations of new products – solution to cold start problem.

A key Challenge – Wide diversity of Recommender Systems 8 Recommender Systems Content Based Collaborative Filtering Model Based Matrix Factorization Memory Based User-basedItem-based Similarity functions: Cosine, Pearson, Adjusted Cosine etc Due to this wide diversity, it is very difficult to study RecMax

Outline 9  What is RecMax?  Does Seeding Help? – Preliminary Experiments.  Theoretical Analysis of RecMax.  Experiments.  Conclusions and Future Work.

Does Seeding Help?  Dataset: Movielens  Recommender System: User-based  Seeds are picked randomly.  Recall that Hit Score is the number of users to whom the product is recommended. 10 A budget of 500 can get a hit score of 5091 (10x) (User-based)

Does Seeding Help? 11  Dataset: Movielens  Recommender System: Item-based  Seeds are picked randomly.  Recall that Hit Score is the number of users to whom the product is recommended. A budget of 20 can get a hit score of 636 (30x) (Item-based)

Outline 12  What is RecMax?  Does Seeding Help? – Preliminary Experiments.  Theoretical Analysis of RecMax.  Experiments.  Conclusions and Future Work.

Key Theoretical Results 13  RecMax is NP-hard to solve exactly.  RecMax is NP-hard to approximate within a factor to 1/|V| (1- ε ) for any ε > 0.  No reasonable approximation algorithm can be developed.  RecMax is as hard as Maximum Independent Set Problem.  Under both User-based and under Item-based.

Why is RecMax so hard? (1/2) 14  We introduce a helper problem – Maximum Encirclement Problem  find a set S of size k such that it encircles maximum number of nodes in the graph. A A D D B B C C E E Nodes {B,C} encircle node A. Nodes {B,C,E} encircle node D. Thus, {B,C,E} encircle A and D.

Why is RecMax so hard? (2/2) 15 A A D D B B C C E E Set {B,C,E} is a solution to Maximum Encirclement Problem (for k=3). Nodes {A,D} form Maximum Independent Set. Reduction: Nodes {B,C,E} must rate the new item highly for the item to be recommended to A and D. RecMax is as hard as Maximum Independent Set, and hence NP-hard to approximate within a factor to 1/|V| (1- ε )

Discussion (1/2) 16  We show hardness for User-based and Item-based methods.  What about Matrix Factorization?  Most likely hardness would remain (future work).

Discussion (2/2) 17  Since the problem is hard to approximate, does it make sense to study?  YES, as we saw earlier, even a random heuristic fetches impressive gains.  We explore several natural heuristics and compare them.  What about sophisticated heuristics (future work).

Outline 18  What is RecMax?  Does Seeding Help? – Preliminary Experiments.  Theoretical Analysis of RecMax.  Experiments.  Conclusions and Future Work.

Datasets 19

Heuristics 20  Random: Seed set is selected randomly. The process is repeated several times and average is taken.  Most-Active: Top-k users with most number of ratings.  Most-Positive: Top-k users with most positive average ratings.  Most-Critical: Top-k users with most critical average ratings.

Heuristics 21  Most-Central: Top-k central users.

User-Based Recommender Systems 22

Comparison – Hit Score achieved 23  Dataset: Movielens  Recommender System: User-based Most Central, Most Positive and Random perform good here.

Comparison – Hit Score achieved 24  Dataset: Yahoo! Music  Recommender System: User-based Most Positive, Most Central perform good here.

Comparison – Hit Score achieved 25  Dataset: Jester Joke  Recommender System: User-based Most Central out-performs all other heuristics.

Key Takeaways 26  Even the simple heuristics perform well.  With a budget of 300, Most-Central heuristic achieves hit score of 4.4K, 3.4K and 15.6K on Movielens, Yahoo! and Jester respectively.  Depending on the data set, we may encounter a “tipping point” – a minimum seeding is needed for the results to be impressive.

Item-Based Recommender Systems 27

Comparison – Hit Score achieved 28  Dataset: Movielens  Recommender System: Item-based Most Central performs good here.

Comparison – Hit Score achieved 29  Dataset: Yahoo! Music  Recommender System: Item-based Most Central performs good here.

Comparison – Hit Score achieved 30  Dataset: Jester Joke  Recommender System: Item-based Most Central, Random and Most-Active performs good here.

Key Takeaways 31  Again, the simple heuristics perform well.  Hit score achieved in Item-based is much lower than in User-based.  Thus, much less seeding is required to achieve maximum possible hit score.  Overall, Most-Central performs well.  The difference of Most-Central with baseline Random is not much.  We need better heuristics (future work).

User-Based vs Item-Based 32

User-based vs Item-based 33  Dataset: Yahoo! Music  Initial rise of hit score is steeper in Item-based.  Hit score saturates much earlier in Item- based. Eventual hit score that can be achieved is much more in User-based.

User-based vs Item-based 34 Common Seeds (out of 1000 seeds) Movielens103 (10.3%) Yahoo! Music219 (21.9 %) Jester Joke62 (0.62 %) Seed Sets are different in both methods.

Outline 35  What is RecMax?  Does Seeding Help? – Preliminary Experiments.  Theoretical Analysis of RecMax.  Experiments.  Conclusions and Future Work.

Our Contributions 36  The main goal of the paper is to propose and study a novel problem that we call RecMax  Select k seed users such that if they endorse a new product by providing relatively high ratings, the number of users to whom the product is recommended (hit score) is maximum.  We focus on User-based and Item-based recommender systems.  We offer empirical evidence that seeding does help in boosting the number of recommendations

Our Contributions 37  We perform a thorough theoretical analysis of RecMax.  RecMax is NP-hard to solve exactly.  RecMax is NP-hard to approximate within any reasonable factor.  Given this hardness, we explore several natural heuristics on 3 real world datasets and report our findings.  Even simple heuristics like Most-Central provide impressive gains  This makes RecMax an interesting problem for targeted marketing in recommender systems.

Future Work 38  RecMax is a new problem and has real applications – our work is just the first work.  Developing better heuristics.  Studying RecMax on more sophisticated recommender systems algorithms  Matrix Factorization.

Thanks and Questions 39 University of British Columbia RecMax: Exploiting Recommender Systems for Fun and Profit Amit Goyal Laks V. S. Lakshmanan University of British Columbia

Heuristics 40  Most-Central: Top-k central users.