Network Analysis Max Hinne Social Networks 6/1/20152Network Analysis.

Slides:



Advertisements
Similar presentations
Class 12: Communities Network Science: Communities Dr. Baruch Barzel.
Advertisements

Fast algorithm for detecting community structure in networks M. E. J. Newman Department of Physics and Center for the Study of Complex Systems, University.
Complex Networks for Representation and Characterization of Images For CS790g Project Bingdong Li 9/23/2009.
Mobile Communication Networks Vahid Mirjalili Department of Mechanical Engineering Department of Biochemistry & Molecular Biology.
Network analysis Sushmita Roy BMI/CS 576
Social network partition Presenter: Xiaofei Cao Partick Berg.
Analysis and Modeling of Social Networks Foudalis Ilias.
Modularity and community structure in networks
Community Detection Laks V.S. Lakshmanan (based on Girvan & Newman. Finding and evaluating community structure in networks. Physical Review E 69,
Graph Partitioning Dr. Frank McCown Intro to Web Science Harding University This work is licensed under Creative Commons Attribution-NonCommercial 3.0Attribution-NonCommercial.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
Information Networks Small World Networks Lecture 5.
V4 Matrix algorithms and graph partitioning
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Mining and Searching Massive Graphs (Networks)
Network Statistics Gesine Reinert. Yeast protein interactions.
Fast algorithm for detecting community structure in networks.
Complex networks and random matrices. Geoff Rodgers School of Information Systems, Computing and Mathematics.
Advanced Topics in Data Mining Special focus: Social Networks.
A scalable multilevel algorithm for community structure detection
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
CS8803-NS Network Science Fall 2013
Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Network properties Slides are modified from Networks: Theory and Application by Lada Adamic.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Science: Graph theory and networks Dr Andy Evans.
Self-Similarity of Complex Networks Maksim Kitsak Advisor: H. Eugene Stanley Collaborators: Shlomo Havlin Gerald Paul Zhenhua Wu Yiping Chen Guanliang.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Complex Networks: Models Lecture 2 Slides by Panayiotis TsaparasPanayiotis Tsaparas.
1. 2 CIShell Features A framework for easy integration of new and existing algorithms written in any programming language. CIShell Sci2 Tool NWB Tool.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Communities. Questions 1.What is a community (intuitively)? Examples and fundamental hypothesis 2.What do we really mean by communities? Basic definitions.
Models and Algorithms for Complex Networks Introduction and Background Lecture 1.
Clusters Recognition from Large Small World Graph Igor Kanovsky, Lilach Prego Emek Yezreel College, Israel University of Haifa, Israel.
University at BuffaloThe State University of New York Detecting Community Structure in Networks.
Community Discovery in Social Network Yunming Ye Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.
DM GROUP MEETING PRESENTATION PLAN Eigenvector-based Centrality Measures For Temporal Networks by D Taylor et.al. Uncovering the Small Community.
CS 590 Term Project Epidemic model on Facebook
Community detection via random walk Draft slides.
Finding community structure in very large networks
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Informatics tools in network science
Network Theory: Community Detection Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale.
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
James Hipp Senior, Clemson University.  Graph Representation G = (V, E) V = Set of Vertices E = Set of Edges  Adjacency Matrix  No Self-Inclusion (i.
1 Discovering Web Communities in the Blogspace Ying Zhou, Joseph Davis (HICSS 2007)
Alan Mislove Bimal Viswanath Krishna P. Gummadi Peter Druschel.
1 New metrics for characterizing the significance of nodes in wireless networks via path-based neighborhood analysis Leandros A. Maglaras 1 Dimitrios Katsaros.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Department of Computer and IT Engineering University of Kurdistan Social Network Analysis Communities By: Dr. Alireza Abdollahpouri.
Graph clustering to detect network modules
Random Walk for Similarity Testing in Complex Networks
Hierarchical Agglomerative Clustering on graphs
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Groups of vertices and Core-periphery structure
Greedy Algorithm for Community Detection
Algorithms and Networks
Community detection in graphs
Finding modules on graphs
Michael L. Nelson CS 495/595 Old Dominion University
3.3 Network-Centric Community Detection
Alan Kuhnle*, Victoria G. Crawford, and My T. Thai
Presentation transcript:

Network Analysis Max Hinne

Social Networks 6/1/20152Network Analysis

Networks & Digital Security Interdisciplinary Combination formal & ‘soft’ interpretation Security in the sense of a detective 6/1/2015Network Analysis3

Overview 1.Primer on graph theory 2.Centrality – Who is important? 3.Clustering – Who belong together? 4.Detecting & predicting changes – LIGA project Central theme: global vs. local approaches 6/1/2015Network Analysis4

GRAPH PRIMER 6/1/2015Network Analysis5

Graph primer - basics V = vertices, N = |V| A = arcs, M = |A| 6 (x points to y) 6/1/2015Network Analysis

Graph primer - concepts Neighborhood: Degree: Path: Similar concepts for undirected graphs G=(V,E) 6/1/2015Network Analysis7

Graph primer – graph types 6/1/2015Network Analysis Models for these graphs by: 1.Erdős-Renyi (1959) 2.Tsvetovat-Carley (2005) 3.Barabási-Albert (1999)

Graph primer – degree distributions Erdős-Renyi: number of vertices N, each edge occurs with probability p Barabási-Albert: start with a small set of vertices and add new ones. Each new vertex is connected to others with a probability based on their degree 6/1/2015Network Analysis9 Degree distributions: what is the chance a node has degree k? Poisson Power-law (scale-free)

Graph primer – small world effect Famous experiment by Milgram (1967) Everyone on the world is connected to everyone else in at most 6 steps Social graphs exhibit the ‘small world effect’: the diameter of a social graph scales logarithmically with N 6/1/2015Network Analysis10

CENTRALITY 6/1/2015Network Analysis11

Centrality 6/1/201512Network Analysis Importance, control of flow Ranking of most important (control) to least important (control)

Node centrality measures 1/4 6/1/2015Network Analysis13 – Degree Immediate effect

Node centrality measures 2/4 6/1/2015Network Analysis14 – Closeness ETA of flow to v c C inverted for visualization

Node centrality measures 3/4 6/1/2015Network Analysis15 – Eigenvector Influence or risk

Node centrality measures 4/4 6/1/2015Network Analysis16 – Betweenness Volume of flow/traffic

Obtaining c B Fastest current algorithm by Brandes in O(nm) Solves all shortest paths in one pass – For each vertex, consider all d=1 nearest neighbors, then d=2 and so on – For each shortest path, store which vertices are on it – Derive c B 6/1/2015Network Analysis17

Local approach No known algorithms calculate c B (v) faster than c B (v) for all v! We only want to rank nodes of interest, not all Local approach – Find c B for some specific nodes – If we can estimate c B, we can rank relevant nodes 6/1/201518Network Analysis

Ego betweenness 6/1/201519Network Analysis Ego-net: and corresponding edges Calculate c B considering only ego(v) Let A be the adjacency matrix:

No direct link between c B and c EB 6/1/2015Network Analysis20 Red circles + ego form a n+1 node star Green triangles form an p node complete graph K p Red circles + ego form a p+1 node star Green triangles + ego form an n node complete graph K n

Correlation c B and c EB Very strong positive correlation! 6/1/2015Network Analysis21

GRAPH CLUSTERING 6/1/2015Network Analysis22

Types of clustering What is a cluster? Supervised vs. unsupervised Partitional vs. hierarchical 6/1/2015Network Analysis23

Clustering quality – modularity C1C1 C2C2 C3C3 C4C4 C1C C2C C3C C4C C1C1 C2C2 C3C3 C4C4 C1C C2C C3C C4C Network Analysis6/1/2015 Cluster adjacency matrixCluster adjacency matrix E

Newman & Girvan clustering algorithm Edges that are the most ‘between’ connect large parts of the graph 1.Calculate edge betweenness A ij in n x n matrix A 2.Remove edge with highest score 3.Recalculate edge betweenness for affected edges 4.Goto 2 until no edges remain O(m 2 n), may be smaller on graphs with strong clustering 6/1/2015Network Analysis25

Greedy clustering algorithm Maximize Q to find clustering Greedy approach: Creates a bottom-up dendogram Cut corresponding to maximum Q is optimal clustering Still a costly process, O(n 2 ) 6/1/2015Network Analysis26 C := V; repeat (i,j) := argmax{∆Q|C i, C j C}; C := C - C j ; C i := C i + C j ; until |C| = 1

Practical applications of social clusters Find people related to someone Find out if people belong to the same cluster This does not require a partitioning of the entire network! 6/1/2015Network Analysis27

Local modularity C= collection nodes v ∈ V with known link structure U(C) = all nodes outside C to which nodes from C point: U(C) = {u ∈ V-C|A(C,u) ≠ ∅ } B(C) = all nodes in C with at least one neighbor outside C: B(C) = {b ∈ C|A(b,U) ≠ ∅ } C: cluster U: universe B: boundary 28Network Analysis6/1/2015

Local cluster algorithm C := Ø; v := v 0 ; repeat C := C+v; v := argmax{R(C+u)|u ∈ U(C)} until |C| = k or R ≥ d ∆R(C,u) = R(C+u) – R(C) Arcs removed from arcs(B(C),V) Arcs newly added to arcs(B(C),V) Arcs removed from arcs(B(C),C) Arcs newly added to arcs(B(C),C) ∆R(C+v 4 ) = 1/3 – 1/4 = 1/12 29Network Analysis6/1/2015

Example 1 on Zachary’s Karate Club (d=0.65) 6/1/2015Network Analysis30

Example 2 on Zachary’s Karate Club (d=0.65) 6/1/2015Network Analysis31

Local cluster quality vs. global clusters For each node v in each global cluster i – Find the local cluster with the same size – Average 6/1/2015Network Analysis32

Preliminary results on real graphs Network (size)Compiled bySim(L v,G i )STD Karate club (34)Zachary Dolphin social relations (62)Lusseau Les Miserables coappearance (75)Knuth American College Football (113)Girvan & Newman C. Elegans neural network (295)Watts & Strogatz 6/1/2015Network Analysis33 Experiment too small for real conclusions, but – edge vertices ruin the fun, – edge betweenness? Usefulness of local approach depends on the seed node

LOCAL INTELLIGENCE IN GLOBAL APPLICATIONS LIGA 6/1/2015Network Analysis34

Web graph ‘Social’ network of blogs and news sites Most graph models are static, but the Web is highly dynamic Stored copy is infeasible, continuous crawling intractable Change in relevance -> change in link structure 6/1/2015Network Analysis35

Fully connected triad (1 role) Node roles Frequently recurring sub graphs: motifs Nodes share a role iff there is a permutation of nodes and edges that preserves motif structure On the Web: 6/1/2015Network Analysis36 Uplinked mutual dyad (2 roles) Feedback with two mutual dyads (2 roles)

Dynamic graphs Changes in relevance cause changes in link structure Changes in specific roles imply changes in other node roles – Fanbase links to itself and their authorities – Learning relevant links through affiliated sites – etc. Relevance decays (half-life λ) 6/1/2015Network Analysis37

LIGA research questions How to model (Web) node relevance ? How does acquired or lost relevance change linkage? How can we predict consequential changes? How can such prediction models be approximated by local incremental algorithms? A. m. o.... 6/1/2015Network Analysis38

Putting it together Networks can be analyzed using an array of tools Network analysis is useful in various disciplines: – Information Retrieval – Security But also in: – Sociology – (Statistical) physics – Bioinformatics – AI 6/1/2015Network Analysis39

Most cited literature Centrality: – Borgatti S. P.: Centrality and Network Flow. Social Networks 27 (2005) – Brandes U.: A Faster Algorithm for Betweenness Centrality. Journal of Mathematical Sociology 25(2) (2001) – Freeman L. C.: A Set of Measures of Centrality Based on Betweennes. Sociometry 40 (1977) Clustering: – Clauset A.: Finding local community structure in networks. Physics Review E 72 (2005) – Girvan M., Newman M. E. J.: Community structure in social and biological networks. PNAS 99(12) (2002) – Newman M. E. J.: Fast algorithm for detecting community structure in networks. Physics Review E 69 (2004) /1/2015Network Analysis40