A Local Seed Selection Algorithm for Overlapping Community Detection 1 A Local Seed Selection Algorithm for Overlapping Community Detection Farnaz Moradi,

Slides:



Advertisements
Similar presentations
1 Dynamics of Real-world Networks Jure Leskovec Machine Learning Department Carnegie Mellon University
Advertisements

Social network partition Presenter: Xiaofei Cao Partick Berg.
ICDE 2014 LinkSCAN*: Overlapping Community Detection Using the Link-Space Transformation Sungsu Lim †, Seungwoo Ryu ‡, Sejeong Kwon§, Kyomin Jung ¶, and.
An Evaluation of Community Detection Algorithms on Large-Scale Traffic 1 An Evaluation of Community Detection Algorithms on Large-Scale Traffic.
LEARNING INFLUENCE PROBABILITIES IN SOCIAL NETWORKS Amit Goyal Francesco Bonchi Laks V. S. Lakshmanan University of British Columbia Yahoo! Research University.
Modeling Blog Dynamics Speaker: Michaela Götz Joint work with: Jure Leskovec, Mary McGlohon, Christos Faloutsos Cornell University Carnegie Mellon University.
Community Detection Laks V.S. Lakshmanan (based on Girvan & Newman. Finding and evaluating community structure in networks. Physical Review E 69,
Beyond Trilateration: On the Localizability of Wireless Ad Hoc Networks Reported by: 莫斌.
Schema Summarization cong Yu Department of EECS University of Michigan H. V. Jagadish Department of EECS University of Michigan
1 Social Influence Analysis in Large-scale Networks Jie Tang 1, Jimeng Sun 2, Chi Wang 1, and Zi Yang 1 1 Dept. of Computer Science and Technology Tsinghua.
CIKM’2008 Presentation Oct. 27, 2008 Napa, California
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
Neighborhood Formation and Anomaly Detection in Bipartite Graphs Jimeng Sun Huiming Qu Deepayan Chakrabarti Christos Faloutsos Speaker: Jimeng Sun.
SCS CMU Proximity Tracking on Time- Evolving Bipartite Graphs Speaker: Hanghang Tong Joint Work with Spiros Papadimitriou, Philip S. Yu, Christos Faloutsos.
On Distinguishing between Internet Power Law B Bu and Towsley Infocom 2002 Presented by.
Network A/B Testing: From Sampling to Estimation
Models of Influence in Online Social Networks
Towards Modeling Legitimate and Unsolicited Traffic Using Social Network Properties 1 Towards Modeling Legitimate and Unsolicited Traffic Using.
Link Recommendation In P2P Social Networks Yusuf Aytaş, Hakan Ferhatosmanoğlu, Özgür Ulusoy Bilkent University, Ankara, Turkey.
Lifetime and Coverage Guarantees Through Distributed Coordinate- Free Sensor Activation ACM MOBICOM 2009.
Personalized Influence Maximization on Social Networks
A Generic Virtual Content Insertion System Based on Visual Attention Analysis H. Liu 1, 2, S. Jiang 1, Q. Huang 1, 2, C. Xu 2, 3 1 Institute of Computing.
Andreas Larsson, Philippas Tsigas SIROCCO Self-stabilizing (k,r)-Clustering in Clock Rate-limited Systems.
Efficient Identification of Overlapping Communities Jeffrey Baumes Mark Goldberg Malik Magdon-Ismail Rensselaer Polytechnic Institute, Troy, NY.
DETECTING SPAMMERS AND CONTENT PROMOTERS IN ONLINE VIDEO SOCIAL NETWORKS Fabrício Benevenuto ∗, Tiago Rodrigues, Virgílio Almeida, Jussara Almeida, and.
Influence Maximization in Dynamic Social Networks Honglei Zhuang, Yihan Sun, Jie Tang, Jialin Zhang, Xiaoming Sun.
December 7-10, 2013, Dallas, Texas
Xiaowei Ying, Xintao Wu Univ. of North Carolina at Charlotte PAKDD-09 April 28, Bangkok, Thailand On Link Privacy in Randomizing Social Networks.
Peer Centrality in Socially-Informed P2P Topologies Nicolas Kourtellis, Adriana Iamnitchi Department of Computer Science & Engineering University of South.
Computer Science Department, Peking University
Efficient Processing of Top-k Spatial Preference Queries
A Passive Approach to Sensor Network Localization Rahul Biswas and Sebastian Thrun International Conference on Intelligent Robots and Systems 2004 Presented.
Guided Learning for Role Discovery (GLRD) Presented by Rui Liu Gilpin, Sean, Tina Eliassi-Rad, and Ian Davidson. "Guided learning for role discovery (glrd):
Overlapping Communities for Identifying Misbehavior in Network Communications 1 Overlapping Communities for Identifying Misbehavior in Network Communications.
LOGO Identifying Opinion Leaders in the Blogosphere Xiaodan Song, Yun Chi, Koji Hino, Belle L. Tseng CIKM 2007 Advisor : Dr. Koh Jia-Ling Speaker : Tu.
Community Detection Algorithms: A Comparative Analysis Authors: A. Lancichinetti and S. Fortunato Presented by: Ravi Tiwari.
Community-enhanced De-anonymization of Online Social Networks Shirin Nilizadeh, Apu Kapadia, Yong-Yeol Ahn Indiana University Bloomington CCS 2014.
Stefanos Antaris A Socio-Aware Decentralized Topology Construction Protocol Stefanos Antaris *, Despina Stasi *, Mikael Högqvist † George Pallis *, Marios.
Panther: Fast Top-k Similarity Search in Large Networks JING ZHANG, JIE TANG, CONG MA, HANGHANG TONG, YU JING, AND JUANZI LI Presented by Moumita Chanda.
RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs Leman Akoglu, Mary McGlohon, Christos Faloutsos Carnegie Mellon University School.
Kijung Shin Jinhong Jung Lee Sael U Kang
Overlapping Community Detection in Networks
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
Supervised Random Walks: Predicting and Recommending Links in Social Networks Lars Backstrom (Facebook) & Jure Leskovec (Stanford) Proc. of WSDM 2011 Present.
Robust Local Community Detection: On Free Rider Effect and Its Elimination 1 Case Western Reserve University Yubao Wu 1, Ruoming Jin 2, Jing Li 1, Xiang.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Informatics tools in network science
Project funded by the Future and Emerging Technologies arm of the IST Programme Search in Unstructured Networks Niloy Ganguly, Andreas Deutsch Center for.
1 Distributed Vertex Coloring. 2 Vertex Coloring: each vertex is assigned a color.
KAIS T Sensor Deployment Based on Virtual Forces Reference: Yi Zou and Krishnendu Chakarabarty, “Sensor Deployment and Target Localization Based on Virtual.
Community Detection based on Distance Dynamics Reporter: Yi Liu Student ID: Department of Computer Science and Engineering Shanghai Jiao Tong.
PhD Thesis Proposal Evolution in Social Networks Candidate Giulio Rossetti Supervisor Dino Pedreschi Supervisor Fosca Giannotti Pisa, Computer Science.
1 Link Privacy in Social Networks Aleksandra Korolova, Rajeev Motwani, Shubha U. Nabar CIKM’08 Advisor: Dr. Koh, JiaLing Speaker: Li, HueiJyun Date: 2009/3/30.
Presented by: Mi Tian, Deepan Sanghavi, Dhaval Dholakia
Hao Ma, Haixuan Yang, Michael R. Lyu and Irwin King CIKM, 2008.
Greedy Algorithm for Community Detection
Finding Communities by Clustering a Graph into Overlapping Subgraphs
Synchronization trade-offs in GPU implementations of Graph Algorithms
Generative Model To Construct Blog and Post Networks In Blogosphere
Distributed Representations of Subgraphs
The likelihood of linking to a popular website is higher
Statistical properties of network community structure
Degree-aware Hybrid Graph Traversal on FPGA-HMC Platform
Department of Computer Science University of York
Probabilistic Latent Preference Analysis
Hierarchical Search on DisCSPs
Enriching Taxonomies With Functional Domain Knowledge
Ho-Ramamoorthy 2-Phase Deadlock Detection Algorithm
Efficient Processing of Top-k Spatial Preference Queries
Presentation transcript:

A Local Seed Selection Algorithm for Overlapping Community Detection 1 A Local Seed Selection Algorithm for Overlapping Community Detection Farnaz Moradi, Tomas Olovsson, Philippas Tsigas

A Local Seed Selection Algorithm for Overlapping Community Detection 2 Community detection in large-scale real networks Global and local algorithms Local algorithms for global community detection –Seed expansion –Seed selection Motivation

A Local Seed Selection Algorithm for Overlapping Community Detection 3 Community detection algorithms –Global and local algorithms Seed selection –Challenges Proposed seed selection algorithm –Link prediction –Graph coloring Experimental Results Conclusions Outline

A Local Seed Selection Algorithm for Overlapping Community Detection 4 Global algorithms require the global structure of the network to be known –High quality communities –Not scalable Local algorithms only need the knowledge of local neighborhood of the seed nodes –Easily paralelizable –Low coverage if seeds are not selected carefully Community Detection Algorithms

A Local Seed Selection Algorithm for Overlapping Community Detection 5 Naive approach –All nodes being expanded –Expensive Challenges in local seed selection –Unaccessible global information –Unknown number of seeds –Well distributed over the network –No neighboring seeds Seed Selection

A Local Seed Selection Algorithm for Overlapping Community Detection 6 Spread hub (SH) [CIKM 2013] –Highest degree nodes (k or higher) Low conductance cuts (EC) [KDD 2012] –Egonets with low conductance Local maximal degree (MD) [SNA 2012] –Local maximal degree nodes Seed Selection Algorithms Global Local

A Local Seed Selection Algorithm for Overlapping Community Detection 7 Properties –Local –Parameter free –Distributed/parallelizable Approach –Link prediction –Graph coloring Proposed Local Seed Selection Algorithm

A Local Seed Selection Algorithm for Overlapping Community Detection 8 Predicting the relations that should exist or are very likely to be formed in a network Local similarity indices –CN: Common Neighbors, PA: Preferrential Attachment, HP: Hub Promoted, LHN: Leich-Holme-Newman, RA: Resource Allocation We define a similarity score for seed selection as sum of the similarities of a node with its neighbors Link Prediction

A Local Seed Selection Algorithm for Overlapping Community Detection 9 Intution: a node which has high similarity with its neighbors is expected to be in the same community with its neighbors Link Prediction-Based Seeding Similarity score calculation using common neighbors (CN) Local seed selection based on similarity scores SS(5)= CN(5,0)+CN(5,1)+CN(5,2) +CN(5,3)+CN(5,4)+CN(5,6) = =

A Local Seed Selection Algorithm for Overlapping Community Detection 10 Enhancing our seed selection algorithm –Well distributed seeds –No neighboring seeds Steps of the algorithm: 1.Calculate the similarity scores 2.Nodes with the highest local similarity score pick a specific color (in contrast to basic random coloring) 3.Other nodes pick a color at random 4.Color conflicts are resolved locally 5.Nodes with the specific color are selected as seeds Biased Coloring-Based Seeding

A Local Seed Selection Algorithm for Overlapping Community Detection 11 Biased Coloring-Based Seeding C1 C2 C3 C4 C5 C6 1. Similarity score calculation using common neighbors 2,3. Local color assignment based on similarity scores 4,5. Local color conflict resolution and seed selection Specific color

A Local Seed Selection Algorithm for Overlapping Community Detection 12 The selected seed nodes are expanded into overlapping communities –Local community detection Personalized PageRank-based community detection algorithm –Yang and Leskovec [ICDM 2012] Local Community Detection

A Local Seed Selection Algorithm for Overlapping Community Detection 13 Large-scale real networks Compare local seed selection algorithms –Number of seeds –Quality of the communities (F1-score and conductance) –Coverage of the communities –Execution time Experimental Evaluation

A Local Seed Selection Algorithm for Overlapping Community Detection 14 Experimental Results Link Prediction-Based Seeding

A Local Seed Selection Algorithm for Overlapping Community Detection 15 Experimental Results Biased Coloring-Based Seeding

A Local Seed Selection Algorithm for Overlapping Community Detection 16 Experimental Results Execution Time Seeding Community Detection F1-ScoreConductanceCoverage PA+Coloring52 s2 h 38 m AmazonAll-17 h 15 m DEMON-37 h 40 m PA+Coloring2 m 16 s1 h 12 m DBLPAll-8 h 42 m DEMON-32 h 54 m

A Local Seed Selection Algorithm for Overlapping Community Detection 17 A novel seed selection algorithm –Link prediction-based and biased coloring-based Our biased coloring algorithm can be used to improve existing seed selection algorithms Experiments on large-scale real networks –Well distributed seeds over the network –Communities with high coverage and quality –Reduced execution time Conclusions

A Local Seed Selection Algorithm for Overlapping Community Detection Similarity score calculation using common neighbors (CN) C1 C2 C3 C4 C5 C6 2,3. Local color assignment based on similarity scores Specific color ,5. Local color conflict resolution and seed selection