Exponential random graph models for multilevel networks

Slides:

Advertisements

Similar presentations

An introduction to exponential random graph models (ERGM)

Advertisements

Network Matrix and Graph. Network Size Network size – a number of actors (nodes) in a network, usually denoted as k or n Size is critical for the structure.

Where we are Node level metrics Group level metrics Visualization

Analysis and Modeling of Social Networks Foudalis Ilias.

Normal Distribution Sampling and Probability. Properties of a Normal Distribution Mean = median = mode There are the same number of scores below and.

Joint social selection and social influence models for networks: The interplay of ties and attributes. Garry Robins Michael Johnston University of Melbourne,

Sunbelt 2009statnet Development Team ERGM introduction 1 Exponential Random Graph Models Statnet Development Team Mark Handcock (UW) Martina.

CS8803-NS Network Science Fall 2013

Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.

Random Graph Models of Social Networks Paper Authors: M.E. Newman, D.J. Watts, S.H. Strogatz Presentation presented by Jessie Riposo.

(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.

Analysis and Modeling of the Open Source Software Community Yongqin Gao, Greg Madey Computer Science & Engineering University of Notre Dame Vincent Freeh.

Principles of Social Network Analysis. Definition of Social Networks “A social network is a set of actors that may have relationships with one another”

Random-Graph Theory The Erdos-Renyi model. G={P,E}, PNP 1,P 2,...,P N E In mathematical terms a network is represented by a graph. A graph is a pair of.

Yaomin Jin Design of Experiments Morris Method.

A Graph-based Friend Recommendation System Using Genetic Algorithm

School of Information Sciences University of Pittsburgh TELCOM2125: Network Science and Analysis Konstantinos Pelechrinis Spring 2013 Figures are taken.

Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.

Yongqin Gao, Greg Madey Computer Science & Engineering Department University of Notre Dame © Copyright 2002~2003 by Serendip Gao, all rights reserved.

A two minute introduction to: Exponential random graph (p*)models for social networks SNAC Workshop, Illinois, November 2005 Garry Robins, University of.

1 Comparing multiple tests for separating populations Juliet Popper Shaffer Paper presented at the Fifth International Conference on Multiple Comparisons,

Most of contents are provided by the website Graph Essentials TJTSD66: Advanced Topics in Social Media.

Slides are modified from Lada Adamic

Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.

28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.

Sampling and estimation Petter Mostad

Clusters Recognition from Large Small World Graph Igor Kanovsky, Lilach Prego Emek Yezreel College, Israel University of Haifa, Israel.

1 Finding Spread Blockers in Dynamic Networks (SNAKDD08)Habiba, Yintao Yu, Tanya Y., Berger-Wolf, Jared Saia Speaker: Hsu, Yu-wen Advisor: Dr. Koh, Jia-Ling.

Informatics tools in network science

Urban Traffic Simulated From A Dual Perspective Hu Mao-Bin University of Science and Technology of China Hefei, P.R. China

Introduction to ERGM/p* model Kayo Fujimoto, Ph.D. Based on presentation slides by Nosh Contractor and Mengxiao Zhu.

CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.

The simultaneous evolution of author and paper networks

Mingze Zhang, Mun Choon Chan and A. L. Ananda School of Computing

Network (graph) Models

Why Model? Make predictions or forecasts where we don’t have data.

Connectivity and the Small World

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence

Groups of vertices and Core-periphery structure

Social Networks Analysis

Cyclomatic complexity

Applications of graph theory in complex systems research

Date of download: 11/12/2017 Copyright © ASME. All rights reserved.

Analyzing Redistribution Matrix with Wavelet

Empirical analysis of Chinese airport network as a complex weighted network Methodology Section Presented by Di Li.

Understanding Standards Event Higher Statistics Award

Section 8.6: Clustering Coefficients

Research Statistics Objective: Students will acquire knowledge related to research Statistics in order to identify how they are used to develop research.

Section 8.6 of Newman’s book: Clustering Coefficients

Effective Social Network Quarantine with Minimal Isolation Costs

Introduction to Summary Statistics

RAM XI Training Summit October 2018

+ – If we look at any two people in the group in isolation,

Network Science: A Short Introduction i3 Workshop

Clustering Coefficients

Emotions in Social Networks: Distributions, Patterns, and Models

Chapter 8: Estimating with Confidence

Psych 231: Research Methods in Psychology

Chapter 7: The Normality Assumption and Inference with OLS

Lecture 9: Network models CS 765: Complex Networks

Hypothesis Testing: The Difference Between Two Population Means

(Social) Networks Analysis II

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence

Qi Li,Qing Wang,Ye Yang and Mingshu Li

Marios Mattheakis and Pavlos Protopapas

Presentation transcript:

Exponential random graph models for multilevel networks Peng Wang, Garry Robins, Philippa Pattison, Emmanuel Lazega Melbourne School of Psychological Sciences, The University of Melbourne, Australia IRISSO-ORIO, University of Paris-Dauphine, France Social Networks, Volume 35, Issue 1, January 2013 An introduction to exponential random graph models (ERGM) Johan Koskinen http://www.ccsr.ac.uk/staff/jk.htm! johan.koskinen@manchester.ac.uk

Outline Introduction Exponential random graph models Multilevel networks ERGMs for two-level networks Models for French cancer researchers and their institutions Multilevel ERGM simulation studies Conclusion

Introduction To date, multilevel analysis has not been commonly used in social network analysis, although attention has been drawn to the theoretical importance of multilevel concepts. Recent methods to analyze cross-sectional network data include exponential random graph models (ERGMs). In this paper, we present a general formulation of a multilevel network structure, and extend ERGMs to multilevel networks.

Exponential random graph models

Exponential random graph models

Exponential random graph models

Exponential random graph models

Exponential random graph models

Exponential random graph models We want to model tie variables But structure – overall pattern – is evident What kind of structural elements can we incorporate in the model for the tie variables?

Exponential random graph models the parameter associated with zQ(y) a network instance the network statistic for the corresponding network configuration of type Q a normalizing constant defined based on the graph space of networks of size n and the actual model specification the network configurations which are based on the dependence assumptions of tie variables

Multilevel networks (u, v) two-level network M: a two-level network with u nodes at the macro-level, and v nodes at the micro-level, we label the macro-level network as network A, the micro-level network as network B, and the meso-level bipartite network as network X.

ERGMs for two-level networks the network statistics for structural effects within the bipartite affiliation network statistics for configurations involving ties from all three networks network statistics for the corresponding within level network configurations network statistics for configurations involving ties from one of the unipartite network (A or B) and the bipartite network (X ), representing the interactions between the two networks

Models for French cancer researchers and their institutions

Multilevel ERGM simulation studies The simulation studies were performed on (30, 30) two-level networks with 65 ties in each of the within-level (A, B) and meso-level (X) networks. For proposed effects only involving interactions between one of the within-level network and the meso-level network, network B was kept empty for simplicity, as the interaction effects between A and X are equally applicable to interactions between B and X. The densities of the networks are fixed, and we used strongly positive (+2) and negative (−2) parameter values for each of proposed configurations and left other effects at 0s to test the non-zero main effects on the network structure.

Interaction stars represent the dependence between two tie variables of different types established by a node in common

Interaction stars (Star2AX+) A-hub creator almost frozen graph distribution (the standard deviations close to 0 in the various graph statistics) SD and SK of A-node degree distributions in both networks A and X are much higher than the random graph distribution B-node degree distribution in network X is flat Standard Deviation SKewness Global Clustering Coefficient

Interaction stars (Star2AX-) tries to avoid any association between the within- and the meso-level ties, and created two components in the overall multilevel network Since more isolated nodes are involved, the available network ties had to constrain themselves to the connected component, so we have higher SDs and SKs for A- nodes in both networks A and X, and higher clustering coefficients than expected from random

Researcher–laboratory networks the B-level network contains advice seeking ties among the researchers the A-level network is defined based on the collaboration ties among the laboratories the researcher–laboratory affiliation network X a positive Star2BX parameter indicates that researchers popular in the advice seeking network are also popular in the affiliation network (i.e. have many affiliations)

Interaction stars (AAAXS+) higher SD for the A-node degree distribution than a simple random graph, but much smaller than for the previous model with positive Star2AX The SD(A) in network X, and SKs for A-nodes for both networks A and X are not very different from random more isolates in network A the unpopular nodes in within level network are also unpopular in the affiliation network

Interaction stars (AAAXS-) marginally smaller A-node SDs in both networks A and X, and smaller SK in network A than random networks more flat A-node degree distributions in both networks A and X tendency against centralization

Interaction triangles have two connected nodes within the same level, sharing one or more nodes from the other level through affiliations

Interaction triangles (TXAX+) clique-like structures in the within- , meso- and overall two-level networks greater than random SD and SK in network A, and greater SD(A) in network X, as there are centralization effects on network ties among nodes involved in the clique the global clustering coefficients for both networks A and X are also much greater than expected from random networks

Interaction triangles (TXAX-) the clique-like structures disappear, and most of the previously isolated nodes merge into the big components of the networks the connected A-nodes tend not to share common affiliations, we can see long circular-like structures in network X Compared with random networks, there is no obvious differences on the degree distribution and clustering measures

Interaction triangles (ATXAX+) clique-like core structures in both network A and X instead of being isolated, the rest of the A-nodes are connected to the core in network A; and only the core-members in network A are sharing multiple B-nodes in the bipartite network X and the overall multilevel network M The GCCs and SDs for A-node degree distributions in both networks A and X are higher than random networks but smaller than the case when TXAX is set at the same positive value

Interaction triangles (ATXAX-) higher values in most of the statistics, but generally lower compared with the model with positive ATXAX

Researcher–laboratory networks a positive TXBX or ATXAX effect may indicate that researchers from the same laboratories tend to seek advice from each other

Interaction three-paths defined by two X-ties and one within level tie (either A or B), we label the three-path statistics involving the interactions between the within- and meso-level networks as L3XAX and L3XBX

Interaction three-paths (L3XAX+) creates as many XAX three- paths as possible such that non-isolated A-nodes in network A are all involved in the bipartite network X, as an A-tie is an essential part of the L3XAX Compared with the random GCCs for both networks A and X, the networks with positive L3XAX have higher GCCs

Interaction three-paths (L3XAX-) very long paths in the bipartite network to make bipartite clustering less likely the clustering in network A is very similar to random networks Together, they create fewer XAX three-paths

Researcher–laboratory networks a positive L3XAX parameter may suggest that researchers (B) popular in the affiliation network tend to be associated with collaborating laboratories (A); or there is a degree assortativity effect such that labs tend to collaborate with other labs having a similar number of researchers

Cross-level three-path involves ties from all three (A, B and X) networks; it represents a tendency for nodes that are popular within levels to be affiliated through the meso-level network X, and can be seen as degree assortativity effect between networks A and B through meso-level network X

Cross-level three-path (L3AXB+) hubs emerge from one of the within level networks (in our case, network A) ties from the other network (B) form a clique-like strongly connected component All nodes that are part of the connected component in network (A) are connected to the hubs in network (B) through the bipartite network all of the SDs, SKs and GCCs are higher than random networks

Cross-level three-path (L3AXB-) to form as few as possible AXB three-paths, the parameter avoids any affiliations between nodes from the connected components of the within-level networks the isolated nodes in within level networks become the popular nodes in network X the global structures of the within-level networks are very similar

Researcher–laboratory networks a positive L3AXB parameter suggests high degree researchers are affiliated with high degree labs

Cross-level four-cycle involves ties from all three networks, representing a cross-level “mirroring” or alignment effect such that members of connected groups are themselves connected

Cross-level four-cycle (C4AXB+) generated clique-like cores in both networks A and B, and the members of the cores are affiliated with each other centralization on ties from both of the within-level networks with greater than random SDs and SKs with the clustering of high- degree nodes reflected by the high GCCs in all networks A, B and X

Cross-level four-cycle (C4AXB-) trying to misalign A- and B-ties through affiliation, and can be generally understood as a decentralization effect, with fewer isolated nodes especially in the affiliation network X In terms of degree distributions and global clustering, the SDs, SKs and GCCs for all three networks are not very different from random

Researcher–laboratory networks a positive C4AXB may suggest researchers from collaborating laboratories tend to seek advice from each other

Caveman Graph The (connected) caveman graph is a graph arising in social network theory formed by modifying a set of isolated k-cliques (or "caves") by removing one edge from each clique and using it to connect to a neighboring clique along a central cycle such that all n cliques form a single unbroken loop (Watts 1999). Weisstein, Eric W. "Caveman Graph." From MathWorld--A Wolfram Web Resource. http://mathworld.wolfram.com/CavemanGraph.html

Models for caveman graphs To date, there is not a typical set of one-mode ERGM parameters that could reproduce caveman graphs. However, it is not difficult to reproduce caveman like graphs using two-level ERGMs where within level networks form the clique like structures or “caves”, and the cross level bipartite network can be used to connect the caves. Simulations were conducted on (20, 20) two-level networks with fixed within level densities of 0.15, and a fixed bipartite density of 0.06.

Models for caveman graphs To have the highly clustered but separated components or “caves” in the within level networks, we use the following set of parameters for both within level networks A and B where: AS = −1, AT = 2 and A2P = −1. The positive alternating-triangles ensure high clustering; the negative alternating-stars decentralize the degrees, i.e. discourage formation of hubs; and the negative alternating- two-paths break down the network into separate components. The bipartite network (X) is left at random, and all between or cross-level interaction effects are set at 0s.

Models for caveman graphs

Models for caveman graphs some random overlapping between the caves where connected dyads from caves of different types are forming cross-level four- cycles (C4AXBs) For the interaction model, the following interaction effects were imposed where: AAAXS = 2, ABAXS = −2 and C4AXB = −2. These parameters in effect imply that popular nodes in the A network will be popular with B nodes as well; but that popular B nodes are not popular with A nodes; and that cross-level 4 cycles are less likely to be present.

Models for caveman graphs

Models for caveman graphs we have fewer but large clique like components, as well as more isolates, indicating stronger degree centralization in network A more isolates of type A than B, hence the average degree of A nodes are higher in the bipartite network as nodes from larger cliques have higher average degrees than smaller cliques, and the bipartite ties are centralized on A nodes that are not isolates, the interaction effects effectively created more AAAXS and less ABAXS than the previous model

Conclusion We proposed and formulated a new multilevel ERGM to model a generalized data structure which captures both within-level and meso-level networks. Our simulation studies demonstrated the properties of each of the proposed configurations, and showed that even simple cross-level effects can create highly structured within-level networks.