Where we are Node level metrics Group level metrics Visualization

Slides:



Advertisements
Similar presentations
Assumptions underlying regression analysis
Advertisements

CS188: Computational Models of Human Behavior
Statistical Social Network Analysis - Stochastic Actor Oriented Models Johan Koskinen The Social Statistics Discipline Area, School of Social Sciences.
An introduction to exponential random graph models (ERGM)
Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale Network Theory: Computational Phenomena and Processes Social Network.
Structural Equation Modeling
Network Matrix and Graph. Network Size Network size – a number of actors (nodes) in a network, usually denoted as k or n Size is critical for the structure.
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
The Statistical Analysis of the Dynamics of Networks and Behaviour. An Introduction to the Actor-based Approach. Christian Steglich and Tom Snijders ——————
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Analysis and Modeling of Social Networks Foudalis Ilias.
SOCI 5013: Advanced Social Research: Network Analysis Spring 2004.
Linear Regression t-Tests Cardiovascular fitness among skiers.
Inference for Regression
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Analysis of Social Media MLD , LTI William Cohen
Hypothesis Testing IV Chi Square.
Exponential random graph (p*) models for social networks Workshop Harvard University February 2002 Philippa Pattison Garry Robins Department of Psychology.
Centrality and Prestige HCC Spring 2005 Wednesday, April 13, 2005 Aliseya Wright.
Joint social selection and social influence models for networks: The interplay of ties and attributes. Garry Robins Michael Johnston University of Melbourne,
Exponential Random Graph Models (ERGM) Michael Beckman PAD777 April 9, 2010.
3-1 Introduction Experiment Random Random experiment.
Slide 1 Statistics Workshop Tutorial 4 Probability Probability Distributions.
Slide 1 Statistics Workshop Tutorial 7 Discrete Random Variables Binomial Distributions.
Sunbelt 2009statnet Development Team ERGM introduction 1 Exponential Random Graph Models Statnet Development Team Mark Handcock (UW) Martina.
CHAPTER 8 Estimating with Confidence
Chapter 12 Inferring from the Data. Inferring from Data Estimation and Significance testing.
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression Analysis
Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.
Connectivity and the Small World Overview Background: de Pool and Kochen: Random & Biased networks Rapoport’s work on diffusion Travers and Milgram Argument.
Chapter 8 Introduction to Hypothesis Testing
ECE 8443 – Pattern Recognition LECTURE 06: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Bias in ML Estimates Bayesian Estimation Example Resources:
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Connectivity and the Small World Overview Background: de Pool and Kochen: Random & Biased networks Rapoport’s work on diffusion Travers and Milgram Argument.
Statistics and Quantitative Analysis U4320 Segment 8 Prof. Sharyn O’Halloran.
3-2 Random Variables In an experiment, a measurement is usually denoted by a variable such as X. In a random experiment, a variable whose measured.
Scalable Statistical Bug Isolation Authors: B. Liblit, M. Naik, A.X. Zheng, A. Aiken, M. I. Jordan Presented by S. Li.
Social Network Metrics. Types of network metrics Group level – Density – Components Isolates – Cliques – Centralization Degree Closeness Betweenness –
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Neighbourhood-based models for social networks: model specification issues Pip Pattison, University of Melbourne [with Garry Robins, University of Melbourne.
Slide 26-1 Copyright © 2004 Pearson Education, Inc.
Analyze Improve Define Measure Control L EAN S IX S IGMA L EAN S IX S IGMA Chi-Square Analysis Chi-Square Analysis Chi-Square Training for Attribute Data.
Susan O’Shea The Mitchell Centre for Social Network Analysis CCSR/Social Statistics, University of Manchester
Ch 8. Graphical Models Pattern Recognition and Machine Learning, C. M. Bishop, Revised by M.-O. Heo Summarized by J.W. Nam Biointelligence Laboratory,
Chapter 14 Repeated Measures and Two Factor Analysis of Variance
A two minute introduction to: Exponential random graph (p*)models for social networks SNAC Workshop, Illinois, November 2005 Garry Robins, University of.
+ Big Data, Network Analysis Week How is date being used Predict Presidential Election - Nate Silver –
Sampling and estimation Petter Mostad
Introduction to Statistical Models for longitudinal network data Stochastic actor-based models Kayo Fujimoto, Ph.D.
Point Estimation of Parameters and Sampling Distributions Outlines:  Sampling Distributions and the central limit theorem  Point estimation  Methods.
11 Network Level Indicators Bird’s eye view of network Image matrix example of network level Many network level measures Some would argue this is the most.
© 2010 Pearson Prentice Hall. All rights reserved 7-1.
Introduction to Matrices and Statistics in SNA Laura L. Hansen Department of Sociology UMB SNA Workshop July 31, 2008 (SOURCE: Introduction to Social Network.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Outline of Today’s Discussion 1.Regression Analysis: Introduction 2.An Alternate Formula For Regression 3.Correlation, Regression, and Statistical Significance.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Introduction to ERGM/p* model Kayo Fujimoto, Ph.D. Based on presentation slides by Nosh Contractor and Mengxiao Zhu.
STA302/1001 week 11 Regression Models - Introduction In regression models, two types of variables that are studied:  A dependent variable, Y, also called.
Lecture Slides Elementary Statistics Twelfth Edition
Social Networks Analysis
Exponential random graph models for multilevel networks
Network Science: A Short Introduction i3 Workshop
CHAPTER 26: Inference for Regression
Regression Models - Introduction
Statistical Analysis of Social Networks
ENM 310 Design of Experiments and Regression Analysis Chapter 3
Regression Models - Introduction
Longitudinal Social Network Data
Presentation transcript:

Where we are Node level metrics Group level metrics Visualization Degree centrality Betweenness centrality Group level metrics Degree centralization Betweenness centralization Components Subgroups Visualization None of these address the probability that a dyad or triad exists They are broad summaries of structure

Mathematical versus Statistical Models Statistical models can tell you if the relationship observed between variables is due to chance Mathematical models describe the relationship between variables and suggest what we should observe E =𝑚 𝑐 2 This formula predicts: Nuclear fission Photoelectric cells Black holes The statistical analog would be to observe the characteristics of, say, a black hole and conclude they exist from those observations

Thinking about models Models let us try to test why a structure exists rather than just describing it QAP allows us to test whether a structure is explained by another structure or by an attribute or set of attributes Equivalence begins to let us see how nodes have roles in network structure Structural Regular Equivalence in Ucinet (Profile and CATREGE)

Network Models Network models make it possible to test the probability that a dyad or triad exists due to chance or not Dyads and triads are considered local structures Network modeling is based on the concept that patterns of local structures may aggregate to a global structure Ultimately, the global structure that is observed may in part emerge from local structures, from attributes or a combination of both

Five reasons to construct a network model (Garry Robins, Pip Pattison, Yuval Kalish, Dean Lusher (2007) An introduction to exponential random graph (p*) models for social networks Social Networks 29: 173–191) Regularities in processes that give rise to ties. Models let you understand the uncertainty associated with observed data Can determine if substructures are expected by chance Can distinguish between structural effects versus node attribute effects Simple measures (e.g. density, centrality) may not capture processes in complex networks Can traverse the micro-macro gap – Does the distribution of local structures explain macro structures?

Local structures -- Dyads Dyad – Two nodes There are two types of dyads in an undirected graph: Mutual Null There a re three types of dyads in a directed graph: Asymmetric P1 models (Holland and Leinhardt, 1975) are based on probabilities of dyadic relations

P1 in UCINET Network->P1 Three equations: P1 on Class data Probability of a reciprocated or asymmetric tie based on outdegree (expansiveness) Probability of a reciprocated or asymmetric tie based on indegree (attractiveness) Probability of a null tie (the residual of these two) P1 on Class data Analysis of residuals

Local structures -- Triads Triads are sets of three nodes Transitivity refers to the notion that if A knows B and B knows C then A should know C This is not always the case Some triads are transitive and some are intransitive

Transitivity and network models If you take all possible sub-graphs of triads there is some distribution of transitive and intransitive triads Holland, P.W., and Leinhardt, S. 1975. “Local structure in social networks." In D. Heise (ed.), Sociological Methodology. San Francisco: Jossey-Bass. For undirected graphs there are four types Empty One edge Two path Triangle For directed graphs there are 16 types Snijders Transitivity slides 14-15

Triads in UCINET Transitivity Index Transitive ties/Potentially Transitive Ties For random graphs the expected value is close to density of graph For actual networks values between .3 and .6 are typical (from Tom Snijders) Do Cohesion->Transitivity on class data Do Triad Census on class data

Triads in Pajek Info->Network->Triadic Census Compare to UCINET Triad Census

ERGM (p*) models (Exponential Random Graph Models) When observing a network there is the notion that the structure could have been different The idea of modeling is to propose a process by which the observed data ended up as they did For example, does the network demonstrate more reciprocity than you would expect due to chance – reciprocity can be a model parameter Recall the triad census and the distribution of the different types You can think of models as trying to explain that distribution, and in particular determining if the distribution is essentially random

p* models (cont.) Networks are graphs of nodes and edges The nodes are fixed – Meaning they are not a parameter to consider With models you create a probability distribution of the possible graphs with the fixed nodes The observed graph is located somewhere in this distribution If the observed graph has many reciprocated ties, then a model that is a good fit will also have many reciprocated ties Once you have a distribution of graphs it can be used to compare sampled graphs (from the distribution) to the observed one on other characteristics The idea is to use the model to understand the processes underlying the observed structure You can test whether node attributes (e.g. homophily) or local processes (e.g. transitivity) explain the global structure

Dependence assumptions The possible set of configurations of the set of nodes is constrained by (dependent on) the statistics of the observed network This limits the possibilities Graphs in the distribution a consequence of potentially overlapping configurations The evolution of ties is not random, it is in some way dependent on the environment around it In considering a parameter like reciprocity, it could be further subdivided into other parameters that use node attributes, like girl-girl reciprocity, or girl-boy reciprocity

Different Models Bernoulli graph – Assumes edges are independent Dyadic model – Assumes dyads are independent Markov random graphs – Assumes tie between two nodes is contingent on their ties to other nodes (conditional dependence)

ERGM on Class Data in R