Presentation is loading. Please wait.

Presentation is loading. Please wait.

Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.

Similar presentations


Presentation on theme: "Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn."— Presentation transcript:

1 Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn WWW 2010 Presenter: Chenghui REN Supervisors: Dr Ben Kao, Prof David Cheung

2 Why do we care about Relationship Strength? Various aspects of online social networks (OSNs) are based on relationship strength: – Link prediction Suggesting new people with top relationship strength to users – Item recommendation Items may be groups to join, articles to read… – Newsfeeds Real-time updates about status change, activities, new posts… – People search –…–…

3 What has been done on Relationship Strength? Previous work analyzing OSNs has focused on binary friendship relations – E.g., friends or not Low cost of link formation leads to networks with different relationship strengths – E.g. close friends and acquaintances Treating all relationships as equal will increase the level of noise in a learned model and degrade performance.

4 Problem Typically, an OSN contains: – Profiles – Interaction activities To propose a method to infer a continuous- valued relationship strength for links based on the factors above

5 Roadmap Motivation Latent Variable Model Experimental Evaluation Conclusions

6 Latent Variable Model: Introduction The homophily is common in OSNs – People tend to form ties with other people who have similar characteristics – The stronger the tie, the higher the similarity Relationship strength is modeled as a hidden effect of nodal profile similarities – E.g. the schools and companies the users attended – E.g. the online groups they joined – E.g. the geographic locations that they belong to Relationship strength is modeled as a hidden cause of user interactions – E.g. profile viewing activities – E.g. picture tagging

7 Model Introduction (Cont’d) Profile attributes Relationship strength User interactions Have effect on Cause of Visible Invisible

8 Model: Introduction (Cont’d) Goal: Estimate z to maximize the overall observed data likelihood Figure 1: Graphical model representation of the general relationship strength model

9 Model Specification Profile attributes Relationship strength Affect Visible Invisible First model this part

10 Model Specification (Cont’d) Using Gaussian distribution to model the conditional probability of z given profile similarities:

11 Model Specification (Cont’d) Relationship strength User interactions Cause of Visible Invisible Then model this part

12 Model Specification (Cont’d) Using a logistic function to model the conditional probability of y given u: Figure 2: Graphical model representation of the specific instantiation

13 Model Specification (Cont’d) To avoid over-fitting, L2 regularizers are put on the parameters w and θ, which can be regarded as Gaussian priors:

14 Model Inference Two ways to estimate a latent variable model Future work Accepted!

15 Model Inference (Cont’d)

16 Roadmap Motivation Latent Variable Model Experimental Evaluation Conclusions

17 Experimental Evaluation Dataset: Purdue facebook data #nodes: 4500 #links: 144,712 Three profile similarity measures: Two types of user interactions: Auxiliary variables: #people whose wall i has posted i has tagged in pictures

18 Experiment Evaluation (Cont’d) Use the proposed latent variable model to estimate the relationship strengths for the 144,712 pairs of users How to evaluate the estimated weighted graph? Apply the estimated weighted graph in a number of collective classification tasks. Gender: Male? Relationship status: Single? Political views: Conservative? Religious views: Christian?

19 Classification Algorithm Gaussian Random Field Model – Autocorrelation is present in the graph – Information is propagated from the labeled portion of the graph to infer the values for unlabeled nodes Vary the proportion of labeled nodes in the graph from 30% to 90% Measure the resulting classification rankings using area under the ROC curve

20 ROC curve x-axis: False positive rate y-axis: True positive rate The larger the area under the ROC curve, the higher the overall accuracy of the classification

21 Comparisons to Six Graphs

22 Results Collective classification performance on various Facebook graphs Curves for the wall graph and the picture graph lie well below other curves, and are then omitted

23 Roadmap Motivation Latent Variable Model Experimental Evaluation Conclusions

24 A latent variable model was proposed to estimate relationship strength in OSNs The weighted graph formed by the estimated relationship strengths give rise to higher autocorrelation and better classification The model can facilitate many graph learning and social network mining tasks

25 Q&A Thanks!


Download ppt "Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn."

Similar presentations


Ads by Google