E NHANCING M ATRIX F ACTORIZATION T HROUGH I NITIALIZATION FOR I MPLICIT F EEDBACK D ATABASES Balázs Hidasi Domonkos Tikk Gravity R&D Ltd. Budapest University.

Slides:



Advertisements
Similar presentations
TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Advertisements

Alternative Algorithms
Subspace Embeddings for the L1 norm with Applications Christian Sohler David Woodruff TU Dortmund IBM Almaden.
Finite Element Method CHAPTER 6: FEM FOR FRAMES
OLA Library Building Award Architectural Design and Transformation Ottawa Public Library Sunnyside Branch submitted May 2012 BiblioOttawaLibrary.ca.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
1 Disable Pictures & Login. 2 Turn off Pop-up Blocker.
1 Beth McKillop, The British Library Library contacts with Korea.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Topic 1Topic Q 1Q 6Q 11Q 16Q 21 Q 2Q 7Q 12Q 17Q 22 Q 3Q 8Q 13Q 18Q 23 Q 4Q 9Q 14Q 19Q 24 Q 5Q 10Q 15Q 20Q 25.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
CALENDAR.
0 - 0.
ALGEBRAIC EXPRESSIONS
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
MULTIPLYING MONOMIALS TIMES POLYNOMIALS (DISTRIBUTIVE PROPERTY)
ADDING INTEGERS 1. POS. + POS. = POS. 2. NEG. + NEG. = NEG. 3. POS. + NEG. OR NEG. + POS. SUBTRACT TAKE SIGN OF BIGGER ABSOLUTE VALUE.
MULTIPLICATION EQUATIONS 1. SOLVE FOR X 3. WHAT EVER YOU DO TO ONE SIDE YOU HAVE TO DO TO THE OTHER 2. DIVIDE BY THE NUMBER IN FRONT OF THE VARIABLE.
SUBTRACTING INTEGERS 1. CHANGE THE SUBTRACTION SIGN TO ADDITION
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Addition Facts
Year 6 mental test 5 second questions
WARM UP What is the value of 4 5 x 4? Explain. WARM UP What is the value of 4 5 x 4 3 ? Explain.
L.O.1 To be able to recall multiplication and division facts involving the 2,3,4,6,7 and 8 times tables.
The National Certificate in Adult Numeracy
ANALYZING AND ADJUSTING COMPARABLE SALES Chapter 9.
CMU SCS : Multimedia Databases and Data Mining Lecture #17: Text - part IV (LSI) C. Faloutsos.
Requirements Diagrams With UML Models
Pole Placement.
The basics for simulations
1 Heating and Cooling of Structure Observations by Thermo Imaging Camera during the Cardington Fire Test, January 16, 2003 Pašek J., Svoboda J., Wald.
Vincent W. Zheng, Yu Zheng, Xing Xie, Qiang Yang Hong Kong University of Science and Technology Microsoft Research Asia This work was done when Vincent.
Mental Math Math Team Skills Test 20-Question Sample.
Introduction Recall that the imaginary unit i is equal to. A fraction with i in the denominator does not have a rational denominator, since is not a rational.
Context-Aware Mobile Music Recommendation for Daily Activities
O X Click on Number next to person for a question.
© S Haughton more than 3?
1 Directed Depth First Search Adjacency Lists A: F G B: A H C: A D D: C F E: C D G F: E: G: : H: B: I: H: F A B C G D E H I.
Squares and Square Root WALK. Solve each problem REVIEW:
1..
Introduction to Information Retrieval Outline ❶ Latent semantic indexing ❷ Dimensionality reduction ❸ LSI in information retrieval 1.
Memory vs. Model-based Approaches SVD & MF Based on the Rajaraman and Ullman book and the RS Handbook. See the Adomavicius and Tuzhilin, TKDE 2005 paper.
Adding Up In Chunks.
Past Tense Probe. Past Tense Probe Past Tense Probe – Practice 1.
This, that, these, those Number your paper from 1-10.
Chapter 5 Test Review Sections 5-1 through 5-4.
Event 4: Mental Math 7th/8th grade Math Meet ‘11.
Addition 1’s to 20.
Model and Relationships 6 M 1 M M M M M M M M M M M M M M M M
25 seconds left…...
Subtraction: Adding UP
Equal or Not. Equal or Not
Slippery Slope
Test B, 100 Subtraction Facts
U2 L5 Quotient Rule QUOTIENT RULE
Week 1.
Faculty Added Questions: Where do I begin? David Neiss & Laura Sandino.
We will resume in: 25 Minutes.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
1 Unit 1 Kinematics Chapter 1 Day
O X Click on Number next to person for a question.
PSSA Preparation.
Introduction Distance-based Adaptable Similarity Search
RollCaller: User-Friendly Indoor Navigation System Using Human-Item Spatial Relation Yi Guo, Lei Yang, Bowen Li, Tianci Liu, Yunhao Liu Hong Kong University.
Amit Goyal Laks V. S. Lakshmanan RecMax: Exploiting Recommender Systems for Fun and Profit University of British Columbia
Introduction Embedded Universal Tools and Online Features 2.
Which Recommender System Can Best Fit Social Learning Platforms?
C ONTEXT - AWARE SIMILARITIES WITHIN THE FACTORIZATION FRAMEWORK Balázs Hidasi Domonkos Tikk C A RR WORKSHOP, 5 TH F EBRUARY 2013, R OME.
Yue Xu Shu Zhang.  A person has already rated some movies, which movies he/she may be interested, too?  If we have huge data of user and movies, this.
Presentation transcript:

E NHANCING M ATRIX F ACTORIZATION T HROUGH I NITIALIZATION FOR I MPLICIT F EEDBACK D ATABASES Balázs Hidasi Domonkos Tikk Gravity R&D Ltd. Budapest University of Technology and Economics C A RR WORKSHOP, 14 TH F EBRUARY 2012, L ISBON

O UTLINE Matrix factoriztion Initialization concept Methods Naive SimFactor Results Discussion 2/19

M ATRIX F ACTORIZATION Collaborative Filtering One of the most common approaches Approximates the rating matrix as product of low-rank matrices R Q P Items Users 3/19

M ATRIX FACTORIZATION Initialize P and Q with small random numbers Teach P and Q Alternating Least Squares Gradient Descent Etc. Transforms the data to a feature space Separately for users and items Noise reduction Compression Generalization 4/19

I MPLICIT FEEDBACK No ratings User-item interactions (events) Much noisier Presence of an event might not be positive feedback Absence of an event does not mean negative feedback No negative feedback is available! More common problem MF for implicit feedback Less accurate results due to noise Mostly ALS is used Scalability problems (rating matrix is dense) 5/19

C ONCEPT Good MF model The feature vectors of similar entities are similar If data is too noisy similar entities wont be similar by their features Start MF from a good point Feature vector similarities are OK Data is more than just events Metadata Info about items/users Contextual data In what context did the event occured Can we incorporate those to help implicit MF? 6/19

N AIVE APPROACH Describe items using any data we have (detailed later) Long, sparse vectors for item description Compress these vectors to dense feature vectors PCA, MLP, MF, … Length of desired vectors = Number of features in MF Use these features as starting points 7/19

N AIVE APPROACH Compression and also noise reduction Does not really care about similarities But often feature similarities are not that bad If MF is used Half of the results is thrown out Description of items Descriptor features Item features Descriptor s Items 8/19

S IM F ACTOR ALGORITHM Try to preserve similarities better Starting from an MF of item description Similarities of items: DD Some metrics require transformation on D Description of items (D) Descriptors features Item features Descriptor s Items Description of items (D) Description of items (D) Item similaritie s (S) = 9/19

S IM F ACTOR ALGORITHM Similarity approximation YY KxK symmetric Eigendecomposition Item similaritie s (S) Descriptors features (Y) Item features (X) Descriptors features (Y) Item features (X) Item similaritie s (S) Item features (X) YY 10/19

S IM F ACTOR ALGORITHM λ diagonal λ = SQRT(λ) * SQRT(λ) X*U*SQRT(λ) = (SQRT(λ)*U*X)=F F is MxK matrix S F * F F used for initialization YY U SQRT (λ) U Item similaritie s (S) Item features (X) SQRT (λ) Item similaritie s (S) F F UU λ = 11/19

C REATING THE DESCRIPTION MATRIX Any data about the entity Vector-space reprezentation For Items : Metadata vector (title, category, description, etc) Event vector (who bought the item) Context-state vector (in which context state was it bought) Context-event (in which context state who bought it) For Users : All above except metadata Currently: Choose one source for D matrix Context used: seasonality 12/19

E XPERIMENTS : S IMILARITY PRESERVATION Real life dataset: online grocery shopping events SimFactor approximates similarities better 13/19

E XPERIMENTS : I NITIALIZATION Using different description matrices And both naive and SimFactor initialization Baseline: random init Evaluation metric: 14/19

E XPERIMENTS : G ROCERY DB Up to 6% improvement Best methods use SimFactor and user context data 15/19

E XPERIMENTS : I MPLICITIZED M OVIE L ENS Keeping 5 star ratings implicit events Up to 10% improvement Best methods use SimFactor and item context data 16/19

D ISCUSSION OF RESULTS SimFactor yields better results than naive Context information yields better results than other descriptions Context information separates well between entities Grocery: User context Peoples routines Different types of shoppings in different times MovieLens: Item context Different types of movies watched on different hours Context-based similarity 17/19

W HY CONTEXT ? Grocery example Correlation between context states by users low Why can context aware algorithms be efficient? Different recommendations in different context states Context differentiates well between entities Easier subtasks ITEM MonTueWedThuFriSatSun Mon1,000,79 0,780,760,700,74 Tue0,791,000,790,780,760,690,73 Wed0,79 1,000,790,760,700,74 Thu0,78 0,791,000,760,710,74 Fri0,76 1,000,710,72 Sat0,700,690,700,71 1,000,71 Sun0,740,730,74 0,720,711,00 USER MonTueWedThuFriSatSun Mon1,000,360,34 0,350,290,19 Tue0,361,000,34 0,330,290,19 Wed0,34 1,000,360,350,270,17 Thu0,34 0,361,000,390,300,16 Fri0,350,330,350,391,000,320,16 Sat0,29 0,270,300,321,000,33 Sun0,19 0,170,16 0,331,00 18/19

C ONCLUSION & F UTURE WORK SimFactor Similarity preserving compression Similarity based MF initialization: Description matrix from any data Apply SimFactor Use output as initial features for MF Context differentitates between entities well Future work: Mixed description matrix (multiple data sources) Multiple description matrix Using different context information Using different similarity metrics 19/19

T HANKS FOR YOUR ATTENTION ! For more of my recommender systems related research visit my website: