Multilevel compression of random walks on networks reveals hierarchical organization in large integrated systems M. RosvallM. Rosvall and C. T. BergstromC.

Slides:



Advertisements
Similar presentations
Presenter: Guoliang Liu Date:4/25/2012. Background Introduction Definition Basic idea of partition Quality Function Classification Based On Algorithms.
Advertisements

Information Retrieval Lecture 7 Introduction to Information Retrieval (Manning et al. 2007) Chapter 17 For the MSc Computer Science Programme Dell Zhang.
An Evaluation of Community Detection Algorithms on Large-Scale Traffic 1 An Evaluation of Community Detection Algorithms on Large-Scale Traffic.
Analysis and Modeling of Social Networks Foudalis Ilias.
Community Detection Algorithm and Community Quality Metric Mingming Chen & Boleslaw K. Szymanski Department of Computer Science Rensselaer Polytechnic.
Jure Leskovec, CMU Kevin Lang, Anirban Dasgupta and Michael Mahoney Yahoo! Research.
Bounds on Code Length Theorem: Let l ∗ 1, l ∗ 2,..., l ∗ m be optimal codeword lengths for a source distribution p and a D-ary alphabet, and let L ∗ be.
Network Coding in Peer-to-Peer Networks Presented by Chu Chun Ngai
Jure Leskovec, CMU Kevin Lang, Anirban Dasgupta, Michael Mahoney Yahoo! Research.
Network histograms and universality of blockmodel approximation Sofia C. Olhede and Patrick J. Wolfe PNAS 111(41):
Hierarchical Decompositions for Congestion Minimization in Networks Harald Räcke 1.
Clustering (1) Clustering Similarity measure Hierarchical clustering Model-based clustering Figures from the book Data Clustering by Gan et al.
Structural Inference of Hierarchies in Networks BY Yu Shuzhi 27, Mar 2014.
1 Network Coding: Theory and Practice Apirath Limmanee Jacobs University.
Lecture 21: Spectral Clustering
Reji Mathew and David S. Taubman CSVT  Introduction  Quad-tree representation  Quad-tree motion modeling  Motion vector prediction strategies.
On Computing Compression Trees for Data Collection in Wireless Sensor Networks Jian Li, Amol Deshpande and Samir Khuller Department of Computer Science,
An Introduction to Bayesian Inference Michael Betancourt April 8,
L16: Micro-array analysis Dimension reduction Unsupervised clustering.
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
A gentle introduction to Gaussian distribution. Review Random variable Coin flip experiment X = 0X = 1 X: Random variable.
Learning Low-Level Vision William T. Freeman Egon C. Pasztor Owen T. Carmichael.
A scalable multilevel algorithm for community structure detection
Introduction to Hierarchical Clustering Analysis Pengyu Hong 09/16/2005.
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
The Relative Vertex-to-Vertex Clustering Value 1 A New Criterion for the Fast Detection of Functional Modules in Protein Interaction Networks Zina Mohamed.
Models of Influence in Online Social Networks
Studying Visual Attention with the Visual Search Paradigm Marc Pomplun Department of Computer Science University of Massachusetts at Boston
1 A(n) (extremely) brief/crude introduction to minimum description length principle jdu
Genetic network inference: from co-expression clustering to reverse engineering Patrik D’haeseleer,Shoudan Liang and Roland Somogyi.
ENTROPIC CHARACTERISTICS OF QUANTUM CHANNELS AND THE ADDITIVITY PROBLEM A. S. Holevo Steklov Mathematical Institute, Moscow.
Community Detection by Modularity Optimization Jooyoung Lee
Community detection algorithms: a comparative analysis Santo Fortunato.
An Approximation Algorithm for Binary Searching in Trees Marco Molinaro Carnegie Mellon University joint work with Eduardo Laber (PUC-Rio)
1 Optimal Oblivious Routing in Hole-Free Networks Costas Busch Louisiana State University Malik Magdon-Ismail Rensselaer Polytechnic Institute.
1 Oblivious Routing in Wireless networks Costas Busch Rensselaer Polytechnic Institute Joint work with: Malik Magdon-Ismail and Jing Xi.
Combinational Problems: Unate Covering, Binate Covering, Graph Coloring and Maximum Cliques Example of application: Decomposition.
Dimension reduction : PCA and Clustering Slides by Agnieszka Juncker and Chris Workman modified by Hanne Jarmer.
Coding Theory Efficient and Reliable Transfer of Information
Recent Results in Combined Coding for Word-Based PPM Radu Rădescu George Liculescu Polytechnic University of Bucharest Faculty of Electronics, Telecommunications.
Relative Validity Criteria for Community Mining Evaluation
Overlapping Communities for Identifying Misbehavior in Network Communications 1 Overlapping Communities for Identifying Misbehavior in Network Communications.
Lecture 3 1.Different centrality measures of nodes 2.Hierarchical Clustering 3.Line graphs.
Communities. Questions 1.What is a community (intuitively)? Examples and fundamental hypothesis 2.What do we really mean by communities? Basic definitions.
Community Detection Algorithms: A Comparative Analysis Authors: A. Lancichinetti and S. Fortunato Presented by: Ravi Tiwari.
CHAPTER 1: Introduction. 2 Why “Learn”? Machine learning is programming computers to optimize a performance criterion using example data or past experience.
Author : 컴퓨터 공학과 김홍연 An Energy Efficient Hierarchical Clustering Algorithm for Wireless Sensor Networks. Seema Bandyopadhyay, Edward J. Coyle.
Data Structures and Algorithms in Parallel Computing Lecture 7.
1 CS612 Algorithms for Electronic Design Automation CS 612 – Lecture 8 Lecture 8 Network Flow Based Modeling Mustafa Ozdal Computer Engineering Department,
Kijung Shin Jinhong Jung Lee Sael U Kang
KAIST TS & IS Lab. CS710 Know your Neighbors: Web Spam Detection using the Web Topology SIGIR 2007, Carlos Castillo et al., Yahoo! 이 승 민.
Community detection via random walk Draft slides.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
James Hipp Senior, Clemson University.  Graph Representation G = (V, E) V = Set of Vertices E = Set of Edges  Adjacency Matrix  No Self-Inclusion (i.
Jevin West, Information School, University of Washington Ian Wesley-Smith, Information School, University of Washington Carl T. Bergstrom, Department of.
By Ping-Chu Hung Advisor: Ying-Ping Chen.  Introduction: background and objectives  Review of ECGA  ECGA for integer variables ◦ Experiments and performances.
Community Detection  Definition: Community Detection  Girwan Newman Approach  Hierarchical Clustering.
Graph clustering to detect network modules
Clustering (1) Clustering Similarity measure Hierarchical clustering
Cohesive Subgraph Computation over Large Graphs
Machine Learning Clustering: K-means Supervised Learning
Shannon Entropy Shannon worked at Bell Labs (part of AT&T)
Community detection in graphs
Quad-Tree Motion Modeling with Leaf Merging
Large Scale Metabolic Network Alignments by Compression
BNFO 602 Phylogenetics Usman Roshan.
Overcoming Resolution Limits in MDL Community Detection
Text Categorization Berlin Chen 2003 Reference:
Introducing complex networks into quantum regime
Clustering Techniques
Presentation transcript:

Multilevel compression of random walks on networks reveals hierarchical organization in large integrated systems M. RosvallM. Rosvall and C. T. BergstromC. T. Bergstrom Presented by Thang N. Dinh CISE Department, University of Florida

Introduction Flows of commuters in UK Community structure gives 2-level map of the network Real-world networks, the organization rarely is limited to two levels

Hierarchical Community structure How many hierarchical levels? How many modules at each level? What are members of each module?

Hierarchical Community structure Hierarchical clustering: ▫Agglomerative algorithms ▫Divisive algorithms Sales-Pardo, 2007: Deriving hierarchical from node similarities. Clauset et al. 2007, 2008: Divide the dendogram using Maximum Likelihood Lancichinetti et al. 2009: Local optimization of a fitness function

Encoding Network Flows Optimal compression of network flows  Network structure (M. Rosvall, C. T. Bergstrom, PNAS 2008) Top three performers (LFR benchmark, Fortunato et al. 2009) Can be extended to reveal hierarchical structure of network

Encoding network flows

Encoding Network Flows Compressing an infinite flow (random walk) Minimize the expected code length Shannon’s source coding theorem:

Two-level map encoding Each module has a code Within module: ▫Each node has a code ▫Special exit code

Two-level map encoding ndex.html

Two-level map encoding Community structure M, with modules P i Probability of entering modules i The rate in which code for modules used The entropy of code for nodes in module iEntropy of code for modules

Two-level encoding

Hierarchical map encoding

Algorithm Fast stochastic and recursive search algorithm ▫Join neighboring nodes -> modules -> super- modules (Brondel. 2008) ▫Recursive operate on modules at any levels ▫Local improvement  Submodules movement  Single-node movement

Experiments