KDDCS: A Load-Balanced In- Network Data-Centric Storage Scheme for Sensor Networks Mohamed Aly In collaboration with Kirk Pruhs and Panos K. Chrysanthis.

Slides:



Advertisements
Similar presentations
Scalable Content-Addressable Network Lintao Liu
Advertisements

1 Data-Centric Storage in Sensornets with GHT, A Geographic Hash Table Sylvia Ratnasamy, Scott Shenker, Brad Karp, Ramesh Govindan, Deborah Estrin, Li.
Geo – Routing in ad hoc nets References: Brad Karp and H.T. Kung “GPSR: Greedy Perimeter Stateless Routing for Wireless Networks”, Mobicom 2000 M. Zorzi,
1 Routing Techniques in Wireless Sensor networks: A Survey.
The University of Iowa. Copyright© 2005 A. Kruger 1 Introduction to Wireless Sensor Networks WSN Routing II 21 March 2005.
Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Schenker Presented by Greg Nims.
Geographic Routing Without Location Information A. Rao, S. Ratnasamy, C. Papadimitriou, S. Shenker, I. Stoica Paper and Slides by Presented by Ryan Carr.
Good afternoon everyone.
A Framework for Secure Data Aggregation in Sensor Networks Yi Yang Xinran Wang, Sencun Zhu and Guohong Cao The Pennsylvania State University MobiHoc’ 06.
Span: An Energy-Efficient Coordination Algorithm for Topology Maintenance in Ad Hoc Wireless Networks ACM Wireless Networks Journal, 2002 BENJIE CHEN,
Common approach 1. Define space: assign random ID (160-bit) to each node and key 2. Define a metric topology in this space,  that is, the space of keys.
Data-Centric Storage in Sensor Networks With GHT Khaldoun A. Ibrahim,
1 One Torus to Rule Them All: Multi-dimensional Queries in P2P Systems Prasanna Ganesan Beverly Yang Hector Garcia-Molina Stanford University.
1 Data-Centric Storage in Sensornets Sylvia Ratnasamy, Scott Shenker, Brad Karp, Ramesh Govindan, Deborah Estrin ICSI/UCB/USC/UCLA Presenter: Vijay Sundaram.
Multi-dimensional Range Query in Sensor Networks Xin Li,Young Jim Kim, Ramesh Govindan (University of Southern California ) Wei Hong (Intel Research Lab.
1 Distributed Navigation Algorithms for Sensor Networks Chiranjeeb Buragohain, Divyakant Agrawal, Subhash Suri Dept. of Computer Science, University of.
Distributed Quad-Tree for Spatial Querying in Wireless Sensor Networks (WSNs) Murat Demirbas, Xuming Lu Dept of Computer Science and Engineering, University.
Computer Science Approximately Uniform Random Sampling in Sensor Networks Boulat A. Bash, John W. Byers and Jeffrey Considine.
A Scalable Content-Addressable Network Authors: S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker University of California, Berkeley Presenter:
A Scalable and Load-Balanced Lookup Protocol for High Performance Peer-to-Peer Distributed System Jerry Chou and Tai-Yi Huang Embedded & Operating System.
UNIVERSITY OF SOUTHERN CALIFORNIA Embedded Networks Laboratory 1 Wireless Sensor Networks Ramesh Govindan Lab Home Page:
Georouting in ad hoc nets References: Brad Karp and H.T. Kung “GPSR: Greedy Perimeter Stateless Routing for Wireless Networks”, Mobicom 2000 M. Zorzi,
1 A Scalable Content- Addressable Network S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker Proceedings of ACM SIGCOMM ’01 Sections: 3.5 & 3.7.
Data-Centric Storage in Sensornets Submitted to Sigcomm 2002 Authors: Sylvia Ratnasamy et al. ICIR, UCLA, UC-Berkeley Presenter:Shang-Chieh Wu
SCALLOP A Scalable and Load-Balanced Peer- to-Peer Lookup Protocol for High- Performance Distributed System Jerry Chou, Tai-Yi Huang & Kuang-Li Huang Embedded.
Distributed Quad-Tree for Spatial Querying in Wireless Sensor Networks (WSNs) Murat Demirbas, Xuming Lu Dept of Computer Science and Engineering, University.
1 Load Balance and Efficient Hierarchical Data-Centric Storage in Sensor Networks Yao Zhao, List Lab, Northwestern Univ Yan Chen, List Lab, Northwestern.
1 Load Balance and Efficient Hierarchical Data-Centric Storage in Sensor Networks Yao Zhao, List Lab, Northwestern Univ Yan Chen, List Lab, Northwestern.
SIGMOD'061 Energy-Efficient Monitoring of Extreme Values in Sensor Networks Adam Silberstein Kamesh Munagala Jun Yang Duke University.
ICDE A Peer-to-peer Framework for Caching Range Queries Ozgur D. Sahin Abhishek Gupta Divyakant Agrawal Amr El Abbadi Department of Computer Science.
Geographic Routing Without Location Information A. Rao, C. Papadimitriou, S. Shenker, and I. Stoica In Proceedings of the 9th Annual international Conference.
1 Algorithms for Bandwidth Efficient Multicast Routing in Multi-channel Multi-radio Wireless Mesh Networks Hoang Lan Nguyen and Uyen Trang Nguyen Presenter:
Roadmap-Based End-to-End Traffic Engineering for Multi-hop Wireless Networks Mustafa O. Kilavuz Ahmet Soran Murat Yuksel University of Nevada Reno.
Roger ZimmermannCOMPSAC 2004, September 30 Spatial Data Query Support in Peer-to-Peer Systems Roger Zimmermann, Wei-Shinn Ku, and Haojun Wang Computer.
Symmetric Replication in Structured Peer-to-Peer Systems Ali Ghodsi, Luc Onana Alima, Seif Haridi.
Lifetime and Coverage Guarantees Through Distributed Coordinate- Free Sensor Activation ACM MOBICOM 2009.
Energy Efficient Routing and Self-Configuring Networks Stephen B. Wicker Bart Selman Terrence L. Fine Carla Gomes Bhaskar KrishnamachariDepartment of CS.
M-GEAR: Gateway-Based Energy-Aware Multi-Hop Routing Protocol
1 SD-Rtree: A Scalable Distributed Rtree Witold Litwin & Cédric du Mouza & Philippe Rigaux.
Load Balancing of In-Network Data-Centric Storage Schemes in Sensor Networks Mohamed Aly In collaboration with Kirk Pruhs and Panos K. Chrysanthis Advanced.
GeoGrid: A scalable Location Service Network Authors: J.Zhang, G.Zhang, L.Liu Georgia Institute of Technology presented by Olga Weiss Com S 587x, Fall.
Sensor Network Databases1 Overview: Chapter 6  Sensor Network Databases  Sensor networks are conceptually a distributed DB  Store collected data  Indexes.
Geographic Hash Table S. Ratnasamy, B. Karp, S. Shenker, D. Estrin, R. Govindan, L. Yin and F. Yu.
Data centric Storage In Sensor networks Based on Balaji Jayaprakash’s slides.
Decomposing Data-Centric Storage Query Hot-Spots in Sensor Netwokrs Mohamed Aly, Panos K. Chrysanthis, and Kirk Pruhs University of Pittsburgh Proceeding.
Content Addressable Network CAN. The CAN is essentially a distributed Internet-scale hash table that maps file names to their location in the network.
Benjamin AraiUniversity of California, Riverside Reliable Hierarchical Data Storage in Sensor Networks Song Lin – Benjamin.
Multi-Criteria Routing in Pervasive Environment with Sensors Santhanakrishnan, G., Li, Q., Beaver, J., Chrysanthis, P.K., Amer, A. and Labrinidis, A Department.
Data Centric Storage: GHT Brad Karp UCL Computer Science CS 4C38 / Z25 17 th January, 2006.
Zone Sharing: A Hot-Spots Decomposition Scheme for Data-Centric Storage in Sensor Networks Mohamed Aly Nicholas Morsillo Panos K. Chrysanthis Kirk Pruhs.
Zone Sharing: A Hot-Spots Decomposition Scheme for Data-Centric Storage in Sensor Networks Mohamed Aly, Nicholas Morsillo, Panos K. Chrysanthis, and Kirk.
1 Shape Segmentation and Applications in Sensor Networks Xianjin Xhu, Rik Sarkar, Jie Gao Department of CS, Stony Brook University INFOCOM 2007.
Rendezvous Regions: A Scalable Architecture for Service Location and Data-Centric Storage in Large-Scale Wireless Sensor Networks Karim Seada, Ahmed Helmy.
Modeling In-Network Processing and Aggregation in Sensor Networks Ajay Mahimkar The University of Texas at Austin March 24, 2004.
STDCS: A Spatio-Temporal Data-Centric Storage Scheme For Real-Time Sensornet Applications Mohamed Aly (University of Pittsburgh & Yahoo, Inc.) In collaboration.
A Dynamic Query-tree Energy Balancing Protocol for Sensor Networks H. Yang, F. Ye, and B. Sikdar Department of Electrical, Computer and systems Engineering.
By: Gang Zhou Computer Science Department University of Virginia 1 Medians and Beyond: New Aggregation Techniques for Sensor Networks CS851 Seminar Presentation.
Massively Distributed Database Systems In-Network Query Processing (Ad-Hoc Sensor Network) Fall 2015 Ki-Joune Li Pusan.
An Adaptive Zone-based Storage Architecture for Wireless Sensor Networks Thang Nam Le, Dong Xuan and *Wei Yu Department of Computer Science and Engineering,
Energy Efficient Data Management for Wireless Sensor Networks with Data Sink Failure Hyunyoung Lee, Kyoungsook Lee, Lan Lin and Andreas Klappenecker †
TreeCast: A Stateless Addressing and Routing Architecture for Sensor Networks Santashil PalChaudhuri, Shu Du, Ami K. Saha, and David B. Johnson Department.
Event query processing based on data-centric storage in wireless sensor networks Longjian Guo, Yingshu Li, and Jianzhong Li IEEE GLOBECOM Technical Conference.
Query-based wireless sensor storage management for real time applications Ravinder Tamishetty, Lek Heng Ngoh, and Pung Hung Keng Proceedings of the 2006.
On Mobile Sink Node for Target Tracking in Wireless Sensor Networks Thanh Hai Trinh and Hee Yong Youn Pervasive Computing and Communications Workshops(PerComW'07)
Incrementally Improving Lookup Latency in Distributed Hash Table Systems Hui Zhang 1, Ashish Goel 2, Ramesh Govindan 1 1 University of Southern California.
Distributed P2P Protocols Gabber-Galil overlay network for data storage in sensor networks.
Introduction to Wireless Sensor Networks
Distributed database approach,
Spatial Indexing I Point Access Methods.
Net 435: Wireless sensor network (WSN)
Presentation transcript:

KDDCS: A Load-Balanced In- Network Data-Centric Storage Scheme for Sensor Networks Mohamed Aly In collaboration with Kirk Pruhs and Panos K. Chrysanthis Advanced Data Management Technologies Lab Dept. of Computer Science University of Pittsburgh ACM CIKM 2006

CIKM’06 Mohamed Aly 2 Motivating Application: Disaster Management

CIKM’06 Mohamed Aly 3 Disaster Management Sensor Networks Sensors are deployed to monitor the disaster area. First responders moving in the area issue ad-hoc queries to nearby sensors. The sensor network is responsible for answering these queries. First responders use query results to improve the decision making process in the management process of the disaster.

CIKM’06 Mohamed Aly 4 Where to Store Sensor Readings?? In-Network storage Events are temporarily stored in the sensor nodes. Best suited for ad-hoc queries issued from users roaming within the network service area. All in-Network storage schemes presented in literature are Data Centric Storage schemes. Why not simply Local Storage??

CIKM’06 Mohamed Aly 5 Data-Centric Storage (DCS) Quality of Data (QoD) of ad-hoc queries Define an event owner based on the event value Examples: Distributed Hash Tables (DHT) [Shenker et. al., HotNets’03] Geographic Hash Tables (GHT) [Ratsanamy et. al., WSNA’02] Distributed Index for Multi-dimensional data (DIM) [Li et. al., SenSys’03] Greedy Perimeter Stateless Routing algorithm (GPSR) [Karp & Kung, Mobicom’00] Among the above schemes, DIM has been shown to exhibit the best performance

CIKM’06 Mohamed Aly 6 Problem Statement: Storage Hot-Spots in DCS Schemes S1 x є [1,10] S2 x є [10,20] S3 x є [20,30] S4 x є [30,40] 50% 40% 7% 3% Definition: A storage hot-spot occurs when relatively many events are assigned to a small number of sensors. Causes: Irregular data distribution and/or non-uniform sensor deployment.

CIKM’06 Mohamed Aly 7 Roadmap Motivating Application: Disaster Management Data-Centric Storage (DCS) Problem Statement: Storage Hot-spots Previous Solutions: The Zone Sharing algorithm (ZS) running on top of DIM The K-D Tree Based DCS scheme (KDDCS) Experimental Results Conclusions

CIKM’06 Mohamed Aly 8 Roadmap Motivating Application: Disaster Management Data-Centric Storage (DCS) Problem Statement: Storage Hot-spots Previous Solutions: The Zone Sharing algorithm (ZS) running on top of DIM The K-D Tree Based DCS scheme (KDDCS) Experimental Results Conclusions

CIKM’06 Mohamed Aly 9 Distributed Index for Multi-Dimensional Data (DIM) S2 x є [10,20] y є [1,10] S3 x є [10,20] y є [10,20] S1 x є [1,10] y є [1,20] Z = 0 Z = 10 Z = 11 S1 S2S3 A = 0 A = 10 A = 11 1.Distributed physical address assignment 2.Event to bit-code mapping

CIKM’06 Mohamed Aly 10 Zone Sharing [Aly et al., DMSN’05] A hot-spot detection and decomposition scheme. Each sensor regularly compares its storage load with that of its neighbors. In case a hot-spot is detected, logical node migration takes place in the K-D tree. Ability to cope with small to medium storage hot-spots.

CIKM’06 Mohamed Aly 11 Roadmap Motivating Application: Disaster Management Data-Centric Storage (DCS) Problem Statement: Storage Hot-spots Previous Solutions The Zone Sharing algorithm running on top of DIM The K-D Tree Based DCS scheme (KDDCS) Experimental Results Conclusions

CIKM’06 Mohamed Aly 12 K-D Tree Based DCS Scheme (KDDCS) Maps the sensor network into a k-d tree as in DIM. Motivation: Hot-spot “avoidance” instead of “hot-spot detection and decomposition”. How?? Keep the K-D tree balanced all the time.

CIKM’06 Mohamed Aly 13 KDDCS versus DIM

CIKM’06 Mohamed Aly 14 KDDCS versus DIM Z = 0010Z = 0110 Z = 1000 Z = 11 Z = 000 Z = Z = Z = 010 Z = Z = 1001 Z = 1010Z = 1011 Z = 0011 Z = 100 Z = 000 Z = 0010 Z = 001Z = 0011 Z = 101 Z = 110 Z = 111 DIM resulting k-d tree KDDCS resulting k-d tree with Non-uniform Split Points Orphan Zone Orphan zone

CIKM’06 Mohamed Aly 15 KDDCS Balanced Tree Z = 100 Z = 000 Z = 0010 Z = 001Z = 0011 Z = 101 Z = 110 Z =

CIKM’06 Mohamed Aly 16 KDDCS Abstracted Theoretical Problem The Weighted Split Median Problem: Each sensor s i has an associated value, w i W is the sum of all values w i Goal: sensors to agree on a split value V such that approximately half of the values are larger than V and half of the values are smaller than V A Distributed Algorithm for the Problem: Time complexity:  O(log n) times the network diameter D  O(1) times network diameter if the number of sensors is known a priori within a constant factor Messaging complexity:  Each sensor sends O(log n) sensor IDs

CIKM’06 Mohamed Aly 17 Top Level Steps of the Algorithm Elect a leader s l and form a BFS tree T routed at s l of the network The number of sensors n and the aggregate of values W are reported to s l The leader s l collects a log-sized uniform random sample L of the values such that the expected number of times that a value from sensor s i included in the sample is around (w i. Log n) / W The value V is then the median of the reported values in L s l reports V to all sensors

CIKM’06 Mohamed Aly 18 KDDCS Components Distributed logical address assignment algorithm 1. Partitioning the region horizontally/vertically by running the algorithm on node y/x coordinates 2. Appending addresses of nodes falling above/below (left/right to) splitting line with 0/1 rightmost bit 3. Reiterating till each node falls by itself in a region (algorithm applied in parallel for different regions) Event to bit-code mapping Use a pre-known distribution of the events or assume storage is uniformly distributed as an easy startup choice Assign ranges to nodes in a way to balance storage among nodes based on the use of non-uniform split points

CIKM’06 Mohamed Aly 19 KDDCS Components Definition: Node Storage Its pertinent information and that of all its ancestors in the tree The pertinent information of a node is: Geographic region covered Split line separating its two children The attribute range, attribute split point, associated with this region Incremental event hashing and routing Logical Stateless Routing algorithm (LSR) Operates in O(log n) rounds (depth of the tree) Uses logical addresses of nodes and geographic direction of destination (Unlike GPSR where node addresses are physical)

CIKM’06 Mohamed Aly 20 Logical Stateless Routing (LSR)

CIKM’06 Mohamed Aly 21 KDDCS Components K-D Tree Rebalancing Algorithm (KDTR) Selection of tree to be rebalanced Find the highest unbalanced node in the K-D tree Rebalancing threshold h O(log n) rounds from the leaves to the root of the K-D tree Tree rebalancing Solve the weighted split median problem on the unbalanced subtree Change the split points intermediate nodes of the unbalanced subtree (the dynamic split points concept)

CIKM’06 Mohamed Aly 22 KDTR: K-D Tree Re-balancing Algorithm

CIKM’06 Mohamed Aly 23 KDDCS Components 1. Distributed logical address assignment algorithm 2. Event to bit-code mapping 3. Incremental event hashing and routing 4. K-D Tree Rebalancing algorithm (KDTR)

CIKM’06 Mohamed Aly 24 Roadmap Motivating Application: Disaster Management Data-Centric Storage (DCS) Problem Statement: Storage Hot-spots Previous Solutions: Zone Sharing Algorithm on top of DIM The K-D Tree Based DCS scheme (KDDCS) Experimental Results Conclusions

CIKM’06 Mohamed Aly 25 Simulation Description Compare: KDDCS and DIM. Simulator similar to that of DIM [Li et. al., SenSys’03]. Two phases: insertion & query. Insertion phase Each sensor initiates 5 events (1 event = 1 message). Events forwarded to owners. Query phase Queries of sizes 10% to 80% of the attributes ranges.

CIKM’06 Mohamed Aly 26 Experimental Setup ParameterValue Network size50 to 500 sensors Node Capacity10 events Initial energy50 units Energy unitEnergy needed to send 1 event Or Receive 2 events Number of hot-spots1 Hot-spot sizes (X,Y)10% to 80% of the events (X) falling into 5% to 10% of the attribute ranges (Y) Re-balancing threshold3

CIKM’06 Mohamed Aly 27 Experimental Results: QoD Result Size of a 50% Query for a network with a (80%, 10%) Hot-Spot

CIKM’06 Mohamed Aly 28 Experimental Results: Data Persistence Dropped Events for a network with a (80%, 10%) Hot-Spot

CIKM’06 Mohamed Aly 29 Experimental Results: Load Balancing Avg. Node Storage for a network with a (70%, 10%) Hot-Spot Note: Numbers are rounded to the closest integer

CIKM’06 Mohamed Aly 30 Experimental Results: Energy Consumption Average Node Energy for a network with a (70%, 10%) Hot-Spot

CIKM’06 Mohamed Aly 31 Experimental Results: Event Movements Moved Events for networks with an (X%, 10%) Hot-Spot

CIKM’06 Mohamed Aly 32 Conclusions & Future Work Storage Hot-Spots: Serious problem in DCS schemes Previous Solutions: ZS: A storage hot-spots decomposition algorithm running on top of DIM [Aly et al., DMSN’05]. Contribution: KDDCS: A new DCS scheme avoiding the formation of storage hot-spots.

CIKM’06 Mohamed Aly 33 Conclusions & Future Work KDDCS Advantages: Achieving a better data persistence by balancing storage responsibility among nodes Increasing the QoD by distributing the storage hot-spot events among a larger number of nodes Increasing the energy savings by achieving a well balanced energy consumption overhead among sensor nodes. Possible Extensions: Compare KDDCS with DIM + ZS Compare KDDCS performance against new DCS schemes like RESTORE [Krishnamurthy et al., INSS’06] Adapt KDDCS to cope with Query Hot-Spots (large number of queries accessing small number of sensors) [Aly et al., MOBIQUITOUS’06]

CIKM’06 Mohamed Aly 34 Acknowledgment This work is part of the “Secure CITI: A Secure Critical Information Technology Infrastructure for Disaster Management (S-CITI)” project funded through the ITR Medium Award ANI from the National Science Foundation (NSF). For more information, please visit:

CIKM’06 Mohamed Aly 35 Thank You Questions ? Advanced Data Management Technologies Lab