Social Network Analysis & Network Optimization Dimitrios Katsaros, Ph.D. Koblenz, February 18 th, Dept. of Computer & Communication Engineering,

Slides:



Advertisements
Similar presentations
Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
Advertisements

Future and Emerging Technologies (FET) Future and Emerging Technologies (FET) The roots of innovation Proactive initiative on: Global Computing (GC) Proactive.
1 Routing Techniques in Wireless Sensor networks: A Survey.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
Dimitrios Katsaros* † Yannis Manolopoulos* † Aristotle University, Greece *University of Thessaly, Greece Suffix Tree Based Prediction for Pervasive Computing.
1 Cooperative Caching in Wireless Multimedia Sensor Nets Nikos Dimokas 1 Dimitrios Katsaros 1,2 (presentation) Yannis Manolopoulos 1 3 rd MobiMedia Conference,
Some contributions to the management of data in grids Lionel Brunie National Institute of Applied Science (INSA) LIRIS Laboratory/DRIM Team – UMR CNRS.
Information Retrieval in Practice
P2p, Spring 05 1 Topics in Database Systems: Data Management in Peer-to-Peer Systems March 29, 2005.
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
1 Placement of Continuous Media in Wireless Peer-to-Peer Networks Shahram Ghadeharizadeh, Bhaskar Krishnamachari, Shanshan Song, IEEE Transactions on Multimedia,
Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
1 Efficient Retrieval of User Contents in MANETs Marco Fiore, Claudio Casetti, Carla-Fabiana Chiasserini Dipartimento di Elettronica, Politecnico di Torino,
Database caching in MANETs Based on Separation of Queries and Responses Author: Hassan Artail, Haidar Safa, and Samuel Pierre Publisher: Wireless And Mobile.
SESSION 9 THE INTERNET AND THE NEW INFORMATION NEW INFORMATIONTECHNOLOGYINFRASTRUCTURE.
7DS: Node Cooperation in Mostly Disconnected Networks Henning Schulzrinne (joint work with Arezu Moghadan, Maria Papadopouli, Suman Srinivasan and Andy.
CDNs Content Outsourcing via Generalized Communities Dimitrios Katsaros, Ph.D. Heraklion, March 20 th, Dept. of Computer & Communication Engineering,
Overview of Web Data Mining and Applications Part I
1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.
Overview of Search Engines
Internet basics, Browsers, application, advantages and disadvantages, architecture, WWW, URL, HTML Week 10 Mr. Mohammed Rahmath.
© 2009 AT&T Intellectual Property. All rights reserved. Multimedia content growth: From IP networks to Medianets Cisco-IEEE ComSoc Webinar. Sept. 23, 2009.
09/07/2004Peer-to-Peer Systems in Mobile Ad-hoc Networks 1 Lookup Service for Peer-to-Peer Systems in Mobile Ad-hoc Networks M. Tech Project Presentation.
1 Caching in Wireless Multimedia Sensor Dept. of Computer & Communication Engineering, University of Dept. of Informatics, Aristotle.
Page 1 WEB MINING by NINI P SURESH PROJECT CO-ORDINATOR Kavitha Murugeshan.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Jong Y. Choi, Joshua Rosen, Siddharth Maini, Marlon E. Pierce, and Geoffrey C. Fox Community Grids Laboratory Indiana University.
Master Thesis Defense Jan Fiedler 04/17/98
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
Web Engineering we define Web Engineering as follows: 1) Web Engineering is the application of systematic and proven approaches (concepts, methods, techniques,
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
Young-sam Kim / M.D Course School of Information Technology Dept. of Computer Engineering Korea University of Technology and Education Location Tracking.
The Internet 8th Edition Tutorial 4 Searching the Web.
Locating Mobile Agents in Distributed Computing Environment.
Autumn Web Information retrieval (Web IR) Handout #1:Web characteristics Ali Mohammad Zareh Bidoki ECE Department, Yazd University
1 Searching the Physical World: Distributed Protocols for Data Coverage and Caching in Dept. of Computer & Communication Engineering, University.
Peer Centrality in Socially-Informed P2P Topologies Nicolas Kourtellis, Adriana Iamnitchi Department of Computer Science & Engineering University of South.
Internet Architecture and Governance
Feb 5, ECET 581/CPET/ECET 499 Mobile Computing Technologies & Apps Data Dissemination and Management 2 of 3 Lecture 7 Paul I-Hai Lin, Professor Electrical.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
Plethora: Infrastructure and System Design. Introduction Peer-to-Peer (P2P) networks: –Self-organizing distributed systems –Nodes receive and provide.
1 1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Taxonomy Caching: A Scalable Low- Cost Mechanism for Indexing Remote Contents in Peer-to-Peer Systems Kjetil Nørvåg Norwegian University of Science and.
1/16/20161 Introduction to Graphs Advanced Programming Concepts/Data Structures Ananda Gunawardena.
Copyright © 2002 Pearson Education, Inc. Slide 3-1 Internet II A consortium of more than 180 universities, government agencies, and private businesses.
IoT Meets Big Data Standardization Considerations
Content Delivery Networks: Status and Trends Speaker: Shao-Fen Chou Advisor: Dr. Ho-Ting Wu 5/8/
第 1 讲 分布式系统概述 §1.1 分布式系统的定义 §1.2 分布式系统分类 §1.3 分布式系统体系结构.
Personalization Services in CADAL Zhang yin Zhuang Yuting Wu Jiangqin College of Computer Science, Zhejiang University November 19,2006.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
/ Fast Web Content Delivery An Introduction to Related Techniques by Paper Survey B Li, Chien-chang R Sung, Chih-kuei.
Research Directions in Databases Technological Education Institution of Larisa in collaboration with Staffordshire University Larisa Dr. Theodoros.
Chapter 8: Web Analytics, Web Mining, and Social Analytics
4WARD Networking of Information 4WARD WP6. © 4WARD Consortium Confidential Networking of Information Telephony Interconnecting wires 4WARD Future Internet.
GUILLOU Frederic. Outline Introduction Motivations The basic recommendation system First phase : semantic similarities Second phase : communities Application.
Overview Issues in Mobile Databases – Data management – Transaction management Mobile Databases and Information Retrieval.
1 New metrics for characterizing the significance of nodes in wireless networks via path-based neighborhood analysis Leandros A. Maglaras 1 Dimitrios Katsaros.
WEB STRUCTURE MINING SUBMITTED BY: BLESSY JOHN R7A ROLL NO:18.
Data mining in web applications
Information Retrieval in Practice
Nikos Dimokas1 Dimitrios Katsaros2 (presentation)
Wireless Sensor Network Architectures
CHAPTER 3 Architectures for Distributed Systems
Content-Based Music Information Retrieval in Wireless Ad-hoc Networks
Mobile P2P Data Retrieval and Caching
Nikos Dimokas1 Dimitrios Katsaros (presentation) Leandros Tassiulas2
The Internet and Electronic mail
Presentation transcript:

Social Network Analysis & Network Optimization Dimitrios Katsaros, Ph.D. Koblenz, February 18 th, Dept. of Computer & Communication Engineering, University of Dept. of Informatics, Aristotle University

2 Outline of the talk A summary of my research Latest results: “ Social Network Analysis for Network Optimization” Web (2 nd round IEEE Transactions on Knowledge & Data Engineering) PRIMITIVE: Community Identification PROTOCOL: Content Outsourcing GOAL: Latency Reduction Wireless Multimedia Sensor Nets (2 nd round ACM Mobile Networks & Applications) PRIMITIVE: “Important” Sensor Nodes Identification PROTOCOL: Cooperative Caching GOAL: Latency Reduction Collective Intelligence: Latest step of cyberspace

3 My Research Areas (chronological info) WIRELESS NETWORKS Mobile & Pervasive Computing Data Management Caching ( ’04 ) Air-Indexing ( ’07 ) Data Dissemination Broadcast Scheduling ( ’04 ) Prediction Mobility Prediction ( ’03+’08 ) Prefetching ( ’03 ) Mobile Ad Hoc Networks Content-based Multimedia Retrieval ( ’05+’08 ) Broadcasting ( ’06+’08 ) Wireless Sensor Networks Sensor Network Clustering ( ’07 ) (Distr+Local) Data Indexing ( ’06+’08 ) Cooperative Caching ( ’07+’08 ) Data Dissemination ( ’08 ) WIRED NETWORKS Conventional and Streaming Media Distribution in the Web Replication ( ’03 ) Prefetching ( ’01+’02+03 ) Caching ( ’04 ) Overlay and P2P Networks Content Distribution Networks ( ’05+’06 ) Content Placement in CDNs ( ’07+’08 ) Indexing & Query Routing in P2P ( progress ) Distributed Structures over P2P ( progress ) Web Information Retrieval and Data Mining Web Link Mining ( ’05 ) Web Ranking ( ’07+’08 ) Rank Aggregation ( ’07+’08 ) Social Network Analysis ( ’07+’08 ) Bibliometrics (’06+’07+’08)

4 Research areas: Ultimately  ??? Overlay Nets Mobile/Pervasive Computing Sensors Ad Hoc Information Retrieval Web Location Tracking Caching & Air-Indexing Peer-to-Peer Networks Content Distribution Networks Caching & Prefetching & Replication & Semistructured Data & Web views Web Ranking & Search Engines Social Network Analysis Cooperative Caching & Sensor Node Clustering & Distributed Indexing & Coverage/Connectivity & Flash storage & Content-Based MIR Broadcasting & Data Dissemination Webcasting INTELLIGENCE Pervasive Web

5 Social Network Analysis A social network is a social structure to describe social relations (wikipedia) The history of Social Network is older than everybody who is here More than 100 years (Cooley 1909, Durkheim 1893) Focusing on small groups Information Techniques give it a new life [ book: Stanley Wasserman & Katherine Faust ] 1.Mathematical Representation 2.Structural & Locational Properties 3.Roles & Positions 4.Dyadic & Triadic Methods

6 Social Network Analysis [Stanley Wasserman & Katherine Faust] 1.Mathematical Representation 2.Structural & Locational Properties 1.Centrality 1.Betweenness Centrality 3.Roles & Positions 4.Dyadic & Triadic Methods

7 Betweenness Centrality Let σ uw = σ wu denote the number of shortest paths from u  V to w  V (by definition, σ uu = 0) Let σ uw (v) denote the number of shortest paths from u to w that some vertex v  V lies on The Betweenness Centrality index NI(v) of a vertex v is defined as: Large values for the NI index of a node v indicate that this node can reach others on relatively short paths, or that v lies on considerable fractions of shortest paths connecting others

8 The NI index in sample graphs In parenthesis, the NI index of the respective node; i.e., 7(156): node with ID 7 has NI equal to 156. Nodes with large NI:  Articulation nodes (in bridges), e.g., 3, 4, 7, 16, 18  With large fanout, e.g., 14, 8, U Therefore: geodesic nodes

9 The NI index in a localized algorithm For any node v, the NI indexes of the nodes in N 12 (v) calculated only for the subgraph of the 2-hop (in general, k -hop) neighborhood reveal the relative importance of the nodes in N 12 For a node u (of the 2-hop neighbourhood of a node v ), the NI index of u will be denoted as NI v (u)

10 Betweenness Centrality in … [ WEB ] Performing graph clustering and recognizing communities in Web site graphs [ WIRELESS MULTIMEDIA SENSOR NETWORKS ] Recognizing (in a distributed fashion) important sensor nodes, the mediators, that coordinate cooperative caching decisions

Community Identification & Content Outsourcing for the Web

12 The need for content outsourcing

13 CiBC Method Target: is true CiBC method: Building cliques and clusters around representative (pole) nodes (with low CB) Earlier methods have Defined “hard communities”:  node deg(inCom)>deg(outCom) exploited “edge betweenness” to perform hierarchical agglomerative clustering

14 CiBC Method IDNI index Phase 1: NI Computation -O(nm) Phase 2: Initialization of cliques O(n)

15 CiBC Method IDNI index Phase 2: Initialization of cliques O(n)

16 CiBC Method IDNI index Phase 2: Initialization of cliques O(n)

17 CiBC Method IDNI index Phase 2: Initialization of cliques O(n)

18 CiBC Method IDNI index Phase 2: Initialization of cliques O(n)

19 CiBC Method A B ABCD A3300 B3311 C0134 D CD Phase 3: Clique Merging & Creation of Communities Complexity: O(l 2 ) l is the number of cliques

20 CiBC Method A B ABCD A3300 B3311 C01 34 D CD Phase 3: Clique Merging & Creation of Communities 4343

21 CiBC Method A B ABC A330 B332 C C Phase 3: Clique Merging & Creation of Communities

22 CiBC Method A B ABC A 33 0 B332 C C Phase 3: Clique Merging & Creation of Communities

23 CiBC Method A AC A92 C C Phase 3: Clique Merging & Creation of Communities Phase 4: Check constraints

24 CiBC vs. Clique Percolation Method, LRU

Cooperative Caching in Wireless Multimedia Sensor Networks

26 The NICoCa protocol Each node is aware of its 2-hop neighborhood Uses NI to characterize some neighbors as mediators A node can be either a mediator or an ordinary node Each sensor node stores the dataID, and the actual multimedia datum the data size, TTL interval for each cached item, the timestamps of the K most recent accesses each cached item is characterized either as O (i.e., own) or H (i.e., hosted)

27 The cache discovery protocol (1/2) A sensor node issues a request for a multimedia item Searches its local cache and if it is found ( local cache hit ) then the K most recent access timestamps are updated Otherwise ( local cache miss ), the request is broadcasted and received by the mediators These check the 2-hop neighbors of the requesting node whether they cache the datum ( proximity hit ) If none of them responds ( proximity cache miss ), then the request is directed to the Data Center

28 The cache discovery protocol (2/2) When a mediator receives a request, searches its cache If it deduces that the request can be satisfied by a neighboring node ( remote cache hit ), forwards the request to the neighboring node with the largest residual energy If the request can not be satisfied by this mediator node, then it does not forward it recursively to its own mediators, since this will be done by the routing protocol, e.g., AODV If none of the nodes can help, then requested datum is served by the Data Center ( global hit )

29 Cache vs. hits (MB files & uniform access) in a sparse WMSN (d = 4) HYBRID: appears at: L. Yin and G. Cao, “Supporting cooperative caching in ad hoc networks”, IEEE Transactions on Mobile Computing, 5(1):77-89, 2006

30 Cache vs. hits (MB files & uniform access) in a dense WMSN (d = 7) HYBRID: appears at: L. Yin and G. Cao, “Supporting cooperative caching in ad hoc networks”, IEEE Transactions on Mobile Computing, 5(1):77-89, 2006

31 Evolution of cyberspace … Semantic Web + Pervasive Computing WWW + Broadband + WIFI + grid computing Unicode + XML + RDF + Ontologies Internet + Multimedia + URL + HTTP + HTML Servers + Telecom Networks + PCs + TCP-IP + + FTP Computers + Micro-chips + Application Software + WYSIWYG Interfaces Transistors+Formal Logic+Digital Coding+Program. Languages Collective Intelligence Net Semantic Web WWW Internet PC Computer

32 Why Collective Intelligence? Users/ devices generate data at an unprecedented rate Blogs Tags Sensor measurements Web pages Rankings by search engines They could be treated as “opinions” or “votes” Under some conditions: group IQ > individual IQ [So far] Opinion/Vote fusion: PageRank (i.e., collective linking preferences) Metasearching (ranked list merging) Collaborative filtering ( what is interesting from what other people say, what people like you say) …..

33 Collective Intelligence: Some challenges Statistical analysis of social networks Identification of influential opinions and/or producers Discover social context to provide personalization Opinion spam Bias filtering

34 Collective Intelligence: Some challenges Finding high-quality content Opinion mining Dealing with controversies Metadata from data analysis Storage of metadata ……………. MOST IMPORTANTLY In Centralized and/or Distributed settings

Thank you for your attention! Questions?

36 References Our work D. Katsaros , G. Pallis, K. Stamos, A. Sidiropoulos, A. Vakali, Y. Manolopoulos. “ CDNs Content Outsourcing via Generalized Communities ”. IEEE Transactions on Knowledge and Data Engineering, (under second round review), December, N. Dimokas, D. Katsaros, and Y. Manolopoulos, “ Cooperative Caching in Wireless Multimedia Sensor Networks ” ACM Mobile Networks and Applications, (under second round review), February, Competing methods [ CPM community identification method ] G. Palla, I.Derenyi, I.Farkas, and T.Vicsek. Uncovering the overlapping community structure of complex networks in nature and society. Nature, 435(7043):814–818, [ Hybrid cooperative caching method ] L. Yin and G. Cao. Supporting cooperative caching in ad hoc networks. IEEE Transactions on Mobile Computing, 5(1):77–89, 2006.