George Anadiotis, Spyros Kotoulas and Ronny Siebes VU University Amsterdam.

Slides:



Advertisements
Similar presentations
You have been given a mission and a code. Use the code to complete the mission and you will save the world from obliteration…
Advertisements

1 Senn, Information Technology, 3 rd Edition © 2004 Pearson Prentice Hall James A. Senns Information Technology, 3 rd Edition Chapter 7 Enterprise Databases.
Advanced Piloting Cruise Plot.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Chapter 1 The Study of Body Function Image PowerPoint
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
Performance in Decentralized Filesharing Networks Theodore Hong Freenet Project.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
…to Ontology Repositories Mathieu dAquin Knowledge Media Institute, The Open University From…
Document #07-12G 1 RXQ Customer Enrollment Using a Registration Agent Process Flow Diagram (Switch) Customer Supplier Customer authorizes Enrollment.
Relational Database and Data Modeling
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
Scalable Routing In Delay Tolerant Networks
Designing Services for Grid-based Knowledge Discovery A. Congiusta, A. Pugliese, Domenico Talia, P. Trunfio DEIS University of Calabria ITALY
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
My Alphabet Book abcdefghijklm nopqrstuvwxyz.
Multiplying binomials You will have 20 seconds to answer each of the following multiplication problems. If you get hung up, go to the next problem when.
0 - 0.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
FACTORING Think Distributive property backwards Work down, Show all steps ax + ay = a(x + y)
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Addition Facts
Year 6 mental test 5 second questions
Peer-to-peer and agent-based computing Peer-to-Peer Computing: Introduction.
|epcc| NeSC Workshop Open Issues in Grid Scheduling Ali Anjomshoaa EPCC, University of Edinburgh Tuesday, 21 October 2003 Overview of a Grid Scheduling.
ZMQS ZMQS
Date : 2012/09/20 Author : Sina Fakhraee, Farshad Fotouhi Source : KEYS12 Speaker : Er-Gang Liu Advisor : Dr. Jia-ling Koh 1.
17 th International World Wide Web Conference 2008 Beijing, China XML Data Dissemination using Automata on top of Structured Overlay Networks Iris Miliaraki.
Database Systems: Design, Implementation, and Management
BT Wholesale October Creating your own telephone network WHOLESALE CALLS LINE ASSOCIATED.
Report Card P Only 4 files are exported in SAMS, but there are at least 7 tables could be exported in WebSAMS. Report Card P contains 4 functions: Extract,
Information Systems Today: Managing in the Digital World
How To Use Google Forms to Create A Test Quick Easy Self-Graded!! Instant Reports.
© 2011 TIBCO Software Inc. All Rights Reserved. Confidential and Proprietary. Towards a Model-Based Characterization of Data and Services Integration Paul.
ABC Technology Project
1 Web-Enabled Decision Support Systems Access Introduction: Touring Access Prof. Name Position (123) University Name.
Megastore: Providing Scalable, Highly Available Storage for Interactive Services. Presented by: Hanan Hamdan Supervised by: Dr. Amer Badarneh 1.
1 Undirected Breadth First Search F A BCG DE H 2 F A BCG DE H Queue: A get Undiscovered Fringe Finished Active 0 distance from A visit(A)
Microsoft Office Illustrated Fundamentals Unit C: Getting Started with Unit C: Getting Started with Microsoft Office 2010 Microsoft Office 2010.
VOORBLAD.
Luca Maria Aiello, Università degli Studi di Torino, Computer Science department 1 Tempering Kademlia with a robust identity based system.
CAR Training Module PRODUCT REGISTRATION and MANAGEMENT Module 2 - Register a New Document - Without Alternate Formats (Run as a PowerPoint show)
1 Breadth First Search s s Undiscovered Discovered Finished Queue: s Top of queue 2 1 Shortest path from s.
Squares and Square Root WALK. Solve each problem REVIEW:
Do you have the Maths Factor?. Maths Can you beat this term’s Maths Challenge?
© 2012 National Heart Foundation of Australia. Slide 2.
Lets play bingo!!. Calculate: MEAN Calculate: MEDIAN
Executional Architecture
CSCE 668 DISTRIBUTED ALGORITHMS AND SYSTEMS Fall 2011 Prof. Jennifer Welch CSCE 668 Set 14: Simulations 1.
Chapter 5 Test Review Sections 5-1 through 5-4.
GG Consulting, LLC I-SUITE. Source: TEA SHARS Frequently asked questions 2.
Addition 1’s to 20.
25 seconds left…...
Week 1.
We will resume in: 25 Minutes.
1 Unit 1 Kinematics Chapter 1 Day
PSSA Preparation.
TASK: Skill Development A proportional relationship is a set of equivalent ratios. Equivalent ratios have equal values using different numbers. Creating.
How Cells Obtain Energy from Food
McGraw-Hill©The McGraw-Hill Companies, Inc., 2001 Chapter 16 Integrated Services Digital Network (ISDN)
CpSc 3220 Designing a Database
1 Evaluating Conjunctive Triple Pattern Queries over Large Structured Overlay Networks Erietta Liarou, Stratos Idreos, and Manolis Koubarakis Waled.
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
Topics in Reliable Distributed Systems Fall Dr. Idit Keidar.
Peer-to-Peer Data Management
Presentation transcript:

George Anadiotis, Spyros Kotoulas and Ronny Siebes VU University Amsterdam

Why do we need distribution… Why do we need anytime behavior… Why is should be (very) scalable… Why should we drop consistency and completeness… Why do we need trust/ontology ranking… etc 2

What is P2P? (1 slide) Relationship between P2P and SW(3 slides) Our Goal (1 slide) Distributed SW stores(1 slide) Structured P2P stores (3 slides) Federated stores (2 slides) Our approach (6 slides) Future work (1 slide) 3

Class of distributed systems Most important characteristics Same functionality across peers Peer autonomy Formation of overlay networks Common interface They respect some agreed-upon way to organize File-sharing networks are NOT the only Peer- to-Peer systems. 4

5

Source of semantic information to self- organize Interoperability 6

Scalable infrastructure for Storage Reasoning Collaboration Self-organization Autonomy – control of data Privacy Scalable algorithms Robustness No censorship No preferential treatment of information 7 Common misconception: All Peer-to-Peer systems can offer the above

Global-scale semantic web storage and reasoning Scalability Computation Administration 8

Structured peer-to-peer Use DHTs One global distributed store Peers do not maintain their own data Federated stores Each peer maintains its own store Stores are interconnected Either global schema or mappings between schemata 9

The mathematical abstraction for hashtables is a Map Functionality: put(key,value) get(key) Similar to normal hash-tables with the difference that each bucket now is a peer Accessing different buckets involves network traffic Routing to a bucket is done bothering approx. log(N) peers, N is network size 10

Values are stored in the peer with ID starting with the first letter of the key 11 adcbef gjihkl mponqr suvtwx

Peer 1 Peer 2 12 adcbef gjihkl mponqr suvtwx

13 adcbef gjihkl mponqr suvtwx RDFS class axioms (1), (2),

14 adcbef gjihkl mponqr suvtwx RDFS class axioms (1) FORALL O,V O[rdfs:subClassOf->V] W] AND W[rdfs:subClassOf->V]). (2) FORALL O,T O[rdf:type->T] T] AND O[rdf:type->S]).

As shown, the transitive closure has to be calculated – backwards chaining would require many DHT messages But it does not scale to large number of ontologies. E.g. a animal hierarchy: Adding the triple means that for all triples with animal, we need to insert an additional triple. Control over ontologies Provenance of information Ontologies and instance data are made public Publishers are not in control of their ontologies/data One super-user inserts all data 15

Each peer maintains its ontology and instance data Mappings are (manually) defined between ontologies Thus, a semantic topology is created Queries are posted according to such a schema and forwarded following these mappings Semantic Web counterpart of Federated Databases 16

Bootstrapping New peers have to manually map their ontologies to the ontology of a peer already in the network Finding relevant ontologies requires flooding Routing The overlay is created according to the ontologies understood by peers, not the data they contain. Possible scalability problem. Searching for instances requires flooding 17

Effort to combine both approaches Use a DHT to efficiently find ontologies and instance data Exploit semantic locality by keeping ontologies local to the publisher Whenever possible, perform reasoning peer-to- peer 18

19 Peer 1 Peer 2 19 adcbef gjihkl mponqr suvtwx animal:P1 rabbit:P1 monk_seal:P2 mseal1:P2 habitat:P1lives_in:P1 seal:P1,P2 subClassOf:P1, P2

20 Peer 1 Peer 2 20 adcbef gjihkl mponqr suvtwx animal:P1 rabbit:P1 seal:P1,P2 subClassOf:P1, P2 monk_seal:P2 mseal1:P2 habitat:P1 Query seal? P1, P2 lives_in:P1 Peer 3

21 Peer 1 Peer 2 21 adcbef gjihkl mponqr suvtwx animal:P1 rabbit:P1 seal:P1,P2 subClassOf:P1, P2 monk_seal:P2 mseal1:P2 habitat:P1 Query monk_seal? P2 lives_in:P1 Peer 3 seal? P1

Control Access Control Select which data is published on the index Trust – ban spammers, remember good peers Privacy It is possible to obfuscate descriptors stored in the DHT Responsibility Publisher has the responsibility to maintain own data Scalability DHTs can scale to millions of nodes Data is up-to-date 22

Based on the data of swoogle, there is currently small overlap between ontologies The distribution of ontology popularity follows a power-law pattern If most answers reside on the same peer, our approach outperforms those that rely on triple distribution on top of a DHT 23

Simulations using SWD from Swoogle and Watson (around ) Integration of privacy in the index Selecting the right ontologies/peers 24

? 25