Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.

Slides:



Advertisements
Similar presentations
Google News Personalization: Scalable Online Collaborative Filtering
Advertisements

By Venkata Sai Pulluri ( ) Narendra Muppavarapu ( )
Learning on the Test Data: Leveraging “Unseen” Features Ben Taskar Ming FaiWong Daphne Koller.
Suleyman Cetintas 1, Monica Rogati 2, Luo Si 1, Yi Fang 1 Identifying Similar People in Professional Social Networks with Discriminative Probabilistic.
+ Multi-label Classification using Adaptive Neighborhoods Tanwistha Saha, Huzefa Rangwala and Carlotta Domeniconi Department of Computer Science George.
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
LEARNING INFLUENCE PROBABILITIES IN SOCIAL NETWORKS Amit Goyal Francesco Bonchi Laks V. S. Lakshmanan University of British Columbia Yahoo! Research University.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Other Classification Techniques 1.Nearest Neighbor Classifiers 2.Support Vector Machines.
Modeling Relationship Strength in Online Social Networks Rongjian Xiang 1, Jennifer Neville 1, Monica Rogati 2 1 Purdue University, 2 LinkedIn WWW 2010.
Trust Relationship Prediction Using Online Product Review Data Nan Ma 1, Ee-Peng Lim 2, Viet-An Nguyen 2, Aixin Sun 1, Haifeng Liu 3 1 Nanyang Technological.
1 Yuxiao Dong *$, Jie Tang $, Sen Wu $, Jilei Tian # Nitesh V. Chawla *, Jinghai Rao #, Huanhuan Cao # Link Prediction and Recommendation across Multiple.
Link creation and profile alignment in the aNobii social network Luca Maria Aiello et al. Social Computing Feb 2014 Hyewon Lim.
Relational Learning with Gaussian Processes By Wei Chu, Vikas Sindhwani, Zoubin Ghahramani, S.Sathiya Keerthi (Columbia, Chicago, Cambridge, Yahoo!) Presented.
1 Learning Semantics-Preserving Distance Metrics for Clustering Graphical Data Aparna S. Varde, Elke A. Rundensteiner, Carolina Ruiz, Mohammed Maniruzzaman.
Evaluating Search Engine
UNDERSTANDING VISIBLE AND LATENT INTERACTIONS IN ONLINE SOCIAL NETWORK Presented by: Nisha Ranga Under guidance of : Prof. Augustin Chaintreau.
Classification and risk prediction
CAP and ROC curves.
Statistical Methods Chichang Jou Tamkang University.
Influence and Correlation in Social Networks Aris Anagnostopoulos Ravi Kumar Mohammad Mahdian.
User Interactions in OSNs Evangelia Skiani. Do you have a Facebook account? Why? How likely to know ALL your friends? Why confirm requests? Why not remove.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
PEPA is based at the IFS and CEMMAP © Institute for Fiscal Studies Identifying social effects from policy experiments Arun Advani (UCL & IFS) and Bansi.
Models of Influence in Online Social Networks
Incomplete Graphical Models Nan Hu. Outline Motivation K-means clustering Coordinate Descending algorithm Density estimation EM on unconditional mixture.
Using Friendship Ties and Family Circles for Link Prediction Elena Zheleva, Lise Getoor, Jennifer Golbeck, Ugur Kuter (SNAKDD 2008)
Attention and Event Detection Identifying, attributing and describing spatial bursts Early online identification of attention items in social media Louis.
Social Networking and On-Line Communities: Classification and Research Trends Maria Ioannidou, Eugenia Raptotasiou, Ioannis Anagnostopoulos.
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Alignment and classification of time series gene expression in clinical studies Tien-ho Lin, Naftali Kaminski and Ziv Bar-Joseph.
Using Transactional Information to Predict Link Strength in Online Social Networks Indika Kahanda and Jennifer Neville Purdue University.
Data Mining and Machine Learning Lab Network Denoising in Social Media Huiji Gao, Xufei Wang, Jiliang Tang, and Huan Liu Data Mining and Machine Learning.
WALKING IN FACEBOOK: A CASE STUDY OF UNBIASED SAMPLING OF OSNS junction.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
To Blog or Not to Blog: Characterizing and Predicting Retention in Community Blogs Imrul Kayes 1, Xiang Zuo 1, Da Wang 2, Jacob Chakareski 3 1 University.
User Interests Imbalance Exploration in Social Recommendation: A Fitness Adaptation Authors : Tianchun Wang, Xiaoming Jin, Xuetao Ding, and Xiaojun Ye.
Presenter: Lung-Hao Lee ( 李龍豪 ) January 7, 309.
A Graph-based Friend Recommendation System Using Genetic Algorithm
TEMPLATE DESIGN © Zhiyao Duan 1,2, Lie Lu 1, and Changshui Zhang 2 1. Microsoft Research Asia (MSRA), Beijing, China.2.
Learning Geographical Preferences for Point-of-Interest Recommendation Author(s): Bin Liu Yanjie Fu, Zijun Yao, Hui Xiong [KDD-2013]
Feedback Effects between Similarity and Social Influence in Online Communities David Crandall, Dan Cosley, Daniel Huttenlocher, Jon Kleinberg, Siddharth.
Exploit of Online Social Networks with Community-Based Graph Semi-Supervised Learning Mingzhen Mo and Irwin King Department of Computer Science and Engineering.
Jiafeng Guo(ICT) Xueqi Cheng(ICT) Hua-Wei Shen(ICT) Gu Xu (MSRA) Speaker: Rui-Rui Li Supervisor: Prof. Ben Kao.
Tell Me What You See and I will Show You Where It Is Jia Xu 1 Alexander G. Schwing 2 Raquel Urtasun 2,3 1 University of Wisconsin-Madison 2 University.
Presenter: Jinhua Du ( 杜金华 ) Xi’an University of Technology 西安理工大学 NLP&CC, Chongqing, Nov , 2013 Discriminative Latent Variable Based Classifier.
Finding Experts Using Social Network Analysis 2007 IEEE/WIC/ACM International Conference on Web Intelligence Yupeng Fu, Rongjing Xiang, Yong Wang, Min.
GENDER AND AGE RECOGNITION FOR VIDEO ANALYTICS SOLUTION PRESENTED BY: SUBHASH REDDY JOLAPURAM.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
Active Sampling of Networks Joseph J. Pfeiffer III 1 Jennifer Neville 1 Paul N. Bennett 2 Purdue University 1 Microsoft Research 2 July 1, 2012 MLG, Edinburgh.
Collaborative Filtering via Euclidean Embedding M. Khoshneshin and W. Street Proc. of ACM RecSys, pp , 2010.
ICONIP 2010, Sydney, Australia 1 An Enhanced Semi-supervised Recommendation Model Based on Green’s Function Dingyan Wang and Irwin King Dept. of Computer.
Supervised Random Walks: Predicting and Recommending Links in Social Networks Lars Backstrom (Facebook) & Jure Leskovec (Stanford) Proc. of WSDM 2011 Present.
A Framework to Predict the Quality of Answers with Non-Textual Features Jiwoon Jeon, W. Bruce Croft(University of Massachusetts-Amherst) Joon Ho Lee (Soongsil.
1 Relational Factor Graphs Lin Liao Joint work with Dieter Fox.
Hierarchical Mixture of Experts Presented by Qi An Machine learning reading group Duke University 07/15/2005.
CSC Lecture 23: Sigmoid Belief Nets and the wake-sleep algorithm Geoffrey Hinton.
Dependency Networks for Inference, Collaborative filtering, and Data Visualization Heckerman et al. Microsoft Research J. of Machine Learning Research.
Inferring User Interest Familiarity and Topic Similarity with Social Neighbors in Facebook INSTRUCTOR: DONGCHUL KIM ANUSHA BOOTHPUR
Graphical Models for Segmenting and Labeling Sequence Data Manoj Kumar Chinnakotla NLP-AI Seminar.
Exploring Social Tagging Graph for Web Object Classification
Methods and Metrics for Cold-Start Recommendations
Multimodal Learning with Deep Boltzmann Machines
Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.
Learning Probabilistic Graphical Models Overview Learning Problems.
GANG: Detecting Fraudulent Users in OSNs
Hierarchical Relational Models for Document Networks
Mingzhen Mo and Irwin King
Discriminative Probabilistic Models for Relational Data
GhostLink: Latent Network Inference for Influence-aware Recommendation
Presentation transcript:

Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn WWW 2010 Presenter: Chenghui REN Supervisors: Dr Ben Kao, Prof David Cheung

Why do we care about Relationship Strength? Various aspects of online social networks (OSNs) are based on relationship strength: – Link prediction Suggesting new people with top relationship strength to users – Item recommendation Items may be groups to join, articles to read… – Newsfeeds Real-time updates about status change, activities, new posts… – People search –…–…

What has been done on Relationship Strength? Previous work analyzing OSNs has focused on binary friendship relations – E.g., friends or not Low cost of link formation leads to networks with different relationship strengths – E.g. close friends and acquaintances Treating all relationships as equal will increase the level of noise in a learned model and degrade performance.

Problem Typically, an OSN contains: – Profiles – Interaction activities To propose a method to infer a continuous- valued relationship strength for links based on the factors above

Roadmap Motivation Latent Variable Model Experimental Evaluation Conclusions

Latent Variable Model: Introduction The homophily is common in OSNs – People tend to form ties with other people who have similar characteristics – The stronger the tie, the higher the similarity Relationship strength is modeled as a hidden effect of nodal profile similarities – E.g. the schools and companies the users attended – E.g. the online groups they joined – E.g. the geographic locations that they belong to Relationship strength is modeled as a hidden cause of user interactions – E.g. profile viewing activities – E.g. picture tagging

Model Introduction (Cont’d) Profile attributes Relationship strength User interactions Have effect on Cause of Visible Invisible

Model: Introduction (Cont’d) Goal: Estimate z to maximize the overall observed data likelihood Figure 1: Graphical model representation of the general relationship strength model

Model Specification Profile attributes Relationship strength Affect Visible Invisible First model this part

Model Specification (Cont’d) Using Gaussian distribution to model the conditional probability of z given profile similarities:

Model Specification (Cont’d) Relationship strength User interactions Cause of Visible Invisible Then model this part

Model Specification (Cont’d) Using a logistic function to model the conditional probability of y given u: Figure 2: Graphical model representation of the specific instantiation

Model Specification (Cont’d) To avoid over-fitting, L2 regularizers are put on the parameters w and θ, which can be regarded as Gaussian priors:

Model Inference Two ways to estimate a latent variable model Future work Accepted!

Model Inference (Cont’d)

Roadmap Motivation Latent Variable Model Experimental Evaluation Conclusions

Experimental Evaluation Dataset: Purdue facebook data #nodes: 4500 #links: 144,712 Three profile similarity measures: Two types of user interactions: Auxiliary variables: #people whose wall i has posted i has tagged in pictures

Experiment Evaluation (Cont’d) Use the proposed latent variable model to estimate the relationship strengths for the 144,712 pairs of users How to evaluate the estimated weighted graph? Apply the estimated weighted graph in a number of collective classification tasks. Gender: Male? Relationship status: Single? Political views: Conservative? Religious views: Christian?

Classification Algorithm Gaussian Random Field Model – Autocorrelation is present in the graph – Information is propagated from the labeled portion of the graph to infer the values for unlabeled nodes Vary the proportion of labeled nodes in the graph from 30% to 90% Measure the resulting classification rankings using area under the ROC curve

ROC curve x-axis: False positive rate y-axis: True positive rate The larger the area under the ROC curve, the higher the overall accuracy of the classification

Comparisons to Six Graphs

Results Collective classification performance on various Facebook graphs Curves for the wall graph and the picture graph lie well below other curves, and are then omitted

Roadmap Motivation Latent Variable Model Experimental Evaluation Conclusions

A latent variable model was proposed to estimate relationship strength in OSNs The weighted graph formed by the estimated relationship strengths give rise to higher autocorrelation and better classification The model can facilitate many graph learning and social network mining tasks

Q&A Thanks!