Gossip-Based Computation of Aggregation Information

Slides:



Advertisements
Similar presentations
3- 1 Chapter 3 Introduction to Numerical Methods Second-order polynomial equation: analytical solution (closed-form solution): For many types of problems,
Advertisements

Random Forest Predrag Radenković 3237/10
P2P data retrieval DHT (Distributed Hash Tables) Partially based on Hellerstein’s presentation at VLDB2004.
Fast Algorithms For Hierarchical Range Histogram Constructions
JetStream: Achieving Predictable Gossip Dissemination by Leveraging Social Network Principles Jay A. Patel 1, Indranil Gupta 1, and Noshir Contractor 2.
Distributed Clustering for Robust Aggregation in Large Networks Ittay Eyal, Idit Keidar, Raphi Rom Technion, Israel.
Square root of an arbitrary positive number: Determination by estimation & iteration Given an area (A) of a square, determine its width. ? Area.
Implementation of ICP Variants Pavan Ram Piratla Janani Venkateswaran.
Error Measurement and Iterative Methods
Expediting Searching Processes via Long Paths in P2P Systems 05/30 IDEA Lab.
Peer-to-Peer Networks as a Distribution and Publishing Model Jorn De Boever (june 14, 2007)
Distributed Data Classification in Sensor Networks DE: Verteilte Daten-Klassifikation in Sensor-Netzwerken FR: Classification distribuée de données dans.
Adaptive Data Collection Strategies for Lifetime-Constrained Wireless Sensor Networks Xueyan Tang Jianliang Xu Sch. of Comput. Eng., Nanyang Technol. Univ.,
Fault-tolerant Routing in Peer-to-Peer Systems James Aspnes Zoë Diamadi Gauri Shah Yale University PODC 2002.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
Distributed Clustering for Robust Aggregation in Large Networks Ittay Eyal, Idit Keidar, Raphi Rom Technion, Israel.
Presented by Ozgur D. Sahin. Outline Introduction Neighborhood Functions ANF Algorithm Modifications Experimental Results Data Mining using ANF Conclusions.
Aggregating Information in Peer-to-Peer Systems for Improved Join and Leave Distributed Computing Group Keno Albrecht Ruedi Arnold Michael Gähwiler Roger.
Vassilios V. Dimakopoulos and Evaggelia Pitoura Distributed Data Management Lab Dept. of Computer Science, Univ. of Ioannina, Greece
Gossip-Based Aggregation of Trust in Decentralized Reputation Systems Ariel D. Procaccia, Yoram Bachrach, and Jeffrey S. Rosenschein.
Jana van Greunen - 228a1 Analysis of Localization Algorithms for Sensor Networks Jana van Greunen.
15-1 Introduction Most of the hypothesis-testing and confidence interval procedures discussed in previous chapters are based on the assumption that.
Intelligible Models for Classification and Regression
CISE-301: Numerical Methods Topic 1: Introduction to Numerical Methods and Taylor Series Lectures 1-4: KFUPM.
INTRODUCTION TO PEER TO PEER NETWORKS Z.M. Joseph CSE 6392 – DB Exploration Spring 2006 CSE, UT Arlington.
Introduction and Analysis of Error Pertemuan 1
PIC: Practical Internet Coordinates for Distance Estimation Manuel Costa joint work with Miguel Castro, Ant Rowstron, Peter Key Microsoft Research Cambridge.
1 GossipTrust for Fast Reputation Aggregation in Peer-to-Peer Networks Runfang Zhou, Kai Hwang, and Min Cai University of Southern California IEEE Transaction.
PNear Combining Content Clustering and Distributed Hash-Tables Ronny Siebes Vrije Universiteit, Amsterdam The netherlands
Distributed Load Balancing for Key-Value Storage Systems Imranul Hoque Michael Spreitzer Malgorzata Steinder.
Recursion, Complexity, and Searching and Sorting By Andrew Zeng.
Lecture 2 Number Representation and accuracy
22 July 2008 John Hart Toroid Field Parameterisation 1 Toroid Field Parameterisation An informal report to the RAL ATLAS meeting John Hart 22 July 2008.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved Chapter 2 ARCHITECTURES.
Introduction to Square Roots. 1. Find the exact value of each square root without a calculator. (similar to p.534 #38)
Vincent Matossian September 21st 2001 ECE 579 An Overview of Decentralized Discovery mechanisms.
Hit-and-Miss (or Rejection) Monte Carlo Method:
Section 8.4: Other Convergence Tests Practice HW from Stewart Textbook (not to hand in) p. 592 # 3-7, 12-17, odd, 33.
Mean and Standard Deviation of Discrete Random Variables.
The Application of The Improved Hybrid Ant Colony Algorithm in Vehicle Routing Optimization Problem International Conference on Future Computer and Communication,
Today Ensemble Methods. Recap of the course. Classifier Fusion
ADVANCED PERCEPTRON LEARNING David Kauchak CS 451 – Fall 2013.
A new Ad Hoc Positioning System 컴퓨터 공학과 오영준.
1 Peer-to-Peer Technologies Seminar by: Kunal Goswami (05IT6006) School of Information Technology Guided by: Prof. C.R.Mandal, School of Information Technology.
Peer to Peer A Survey and comparison of peer-to-peer overlay network schemes And so on… Chulhyun Park
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
The new protocol of freenet Taken from Ian Clarke and Oskar Sandberg (The Freenet Project)
Chapter 6 Cross Validation.
Estimating Fractions.
Decision Trees IDHairHeightWeightLotionResult SarahBlondeAverageLightNoSunburn DanaBlondeTallAverageYesnone AlexBrownTallAverageYesNone AnnieBlondeShortAverageNoSunburn.
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 37 – Introduction to P2P (Part 1) Klara Nahrstedt.
Evolving RBF Networks via GP for Estimating Fitness Values using Surrogate Models Ahmed Kattan Edgar Galvan.
P2P Search COP6731 Advanced Database Systems. P2P Computing  Powerful personal computer Share computing resources P2P Computing  Advantages: Shared.
1 Structure Learning (The Good), The Bad, The Ugly Inference Graphical Models – Carlos Guestrin Carnegie Mellon University October 13 th, 2008 Readings:
Project on Newton’s Iteration Method Presented by Dol Nath Khanal Project Advisor- Professor Dexuan Xie 05/11/2015.
Copyright © 2012 Pearson Education, Inc. Publishing as Prentice Hall. 4.2 Adding and Subtracting Decimals.
An Evolutionary Algorithm for Neural Network Learning using Direct Encoding Paul Batchis Department of Computer Science Rutgers University.
CASCADE: AN ATTACK-RESISTANT DHT WITH MINIMAL HARD STATE
Introduction to Square Roots
Controlling the Cost of Reliability in Peer-to-Peer Overlays
Taylor series in numerical computations (review)
Partial Products Algorithm for Multiplication
Spatial Online Sampling and Aggregation
Errors in Numerical Methods
DHT Routing Geometries and Chord
Errors in Numerical Methods
Fundamental Statistics for the Behavioral Sciences, 4th edition
Let A = {image} and B = {image} . Compare A and B.
3.8 Newton’s Method How do you find a root of the following function without a graphing calculator? This is what Newton did.
Linear Algebra Lecture 16.
Presentation transcript:

Gossip-Based Computation of Aggregation Information David Kempe Alin Dobra Johannes Gehrke Presented by Hao Zhou

Introduction Gossip-based Algorithm Analyze Gossip-based Algorithm Content Introduction Gossip-based Algorithm Analyze Gossip-based Algorithm

Introduction Peer to peer network Unstructured network Gnutella, Napster Structured network DHT-based systems such as Pastry, Chord, Tepastry, CAN Advantages of DHT-based systems Fast: O (log n) Can exactly find a publishing object in a gigantic network space

Gossip-based Algorithm But if we want to get the aggregation information for the whole network Such as sum value, average value Our objective is to calculate the average value of Xavg =(x1+x2+x3…+x12)/12 Disadvantage of DHT-based systems Gossip-based algorithm Objective: let the estimation average value close to Xavg for every node X2 X3 X1 X11 X4 X10 X12 X5 X9 X6 X8 X7

Gossip-based Algorithm Xavg = (X1+X2+X3+X4)/4 is a real average value in a peer to peer network Xeavg is the estimated average value for the P2P network in a node (X4+x2)/2 time=0, Xeavg1=X1, Xeavg2=x2, Xeavg3=x3, Xeavg4=x4 Time=1, Randomly pick up another node Xeavg1=X1/ 2, Xeavg2=(X4+x2)/ 2 Xeavg3=(X2+X3)/ 2 Xeavg4= (X1+X3+X4)/ 2 X1 X2/2 (X1+x1+x3+x4)/4 X1/2 X2/2 X2 X1/2 X1/2 (X2+x2+x3+x4)/4 X3 X4/2 (X2+x3)/2 X3/2 X4 X4/2 (X1+x3+x4)/4 (X1+x3+x4)/2 X3/2 (X2+x2+x3+x4)/4 Time = 2, Xeavg1=(X1+X1+X3+X4)/ 4, Xeavg2=(X2+X2+X3+X4)/4, Xeavg3=(X2+X2+X3+X4)/ 4, Xeavg4=(X1+X3+X4)/ 4,

Gossip-based Algorithm After m rounds/iterations, Xeavg is very close to Xavg We can see Xeavg as Xavg

Define a variance error= | Xeavg-Xavg | Converge Speed Define a variance error= | Xeavg-Xavg | Our objective is to make the variance close to 0 Calculate the converge speed of this variance In every round, the variance drops to less than half its previous value var(t+1) = ( ) var(t) Xeavg Xavg

Analyze Gossip-based Algorithm Gossip-based algorithm is an approximation method We can control the accuracy Xeavg never = Xavg, but Xeavg can be very close to Xavg When variance error=| Xeavg – Xavg| <= ε, we can say Xeavg is Xavg.

Analyze Gossip-based Algorithm Roughly say, after O(logn+log(1/ ε)) rounds, can we say variance error <= ε in every node Maybe there are broken network connections

Analyze Gossip-based Algorithm We have to control the percentage of nodes who obtain err<=ε We say with probability at least 1-δ, after O(logn+log(1/ε)+log(1/δ)) rounds, The err=|Xeavg – Xavg| <= ε Their contribution: The diffusion speed of uniform gossip is O(logn+log(1/ε)+log(1/δ)) , with probability at least 1- δ, and variance error <= ε

Advantages of Gossip Algorithm Algorithm is very simple Converge speed is very fast Can automatically adjust itself Nodes join the network Nodes leave the network

Disadvantages of Gossip Algorithm From their theory, we know after O(logn+log(1/ε)+ log(1/δ)) rounds, the estimation average value in a local node can be see as a global average value. But in practice, If we do not know the size of the network, how do we know how many rounds a estimation average value is close enough to the real average value.

Thank you !