Presentation is loading. Please wait.

Presentation is loading. Please wait.

A. Kemper, R. Kuntschke, and B. Stegmaier

Similar presentations


Presentation on theme: "A. Kemper, R. Kuntschke, and B. Stegmaier"— Presentation transcript:

1 StreamGlobe Adaptive Query Processing and Optimization in Streaming P2P Environments
A. Kemper, R. Kuntschke, and B. Stegmaier TU München – Fakultät für Informatik Lehrstuhl III: Datenbanksysteme I would like to welcome everybody to this talk. My name is Bernhard Stegmaier and I will present the Vision of the StreamGlobe DMS, which is currently developed at our group at the TU München.

2 Outline Motivation StreamGlobe Current and Future Research Conclusion
The StreamGlobe Approach Architecture Overview Current and Future Research Conclusion 19/04/2019 StreamGlobe

3 Exemplary Initial Situation
WLAN Network Consists of peers Given or grown topology Data Sources Provide XML data stream Possibly infinite streams (e.g., sensor measurements) User requests Continuous queries Query language XQuery Registered at a peer B A Request a Requests a/ab erklären Request ab Request a 19/04/2019 StreamGlobe

4 General Traditional Approach
Register requests Establish data transfer → Peers may connect arbitrarily Process / Execute requests Routing of streams → Map streams to network B A Request a Request ab Request a 19/04/2019 StreamGlobe

5 General Traditional Approach (ctd.)
Drawbacks Transmission of useless data Redundant transmissions Multiple request evaluation  Network congestion and processing overhead B A 2 1 3 Request a 3 Request ab Request a 19/04/2019 StreamGlobe

6 Why StreamGlobe? Other Systems / previous work E.g. Cougar, TelegraphCQ, Multicast techniques: Focus on specific aspects (e.g., query optimization) Tailored to specific domains StreamGlobe Contribution is combination of techniques: In-network query processing combined with routing Constitutes a generic infrastructure Independent of domain Efficient data stream transformation and distribution 19/04/2019 StreamGlobe

7 Outline Motivation StreamGlobe Current and Future Research Conclusion
The StreamGlobe Approach Architecture Overview Current and Future Research Conclusion 19/04/2019 StreamGlobe

8 The StreamGlobe Approach
Intelligent Routing Multicast routing techniques Data Stream Clustering Push query execution into network Multi-query optimization Reduce network traffic Avoid redundant transmissions Reduce processing cost B A ab a Request a Request a Request ab 19/04/2019 StreamGlobe

9 Basic Concepts P2P Network Topology Classification of peers
No arbitrary communication → Communication via transfer paths No fixed P2P topology Classification of peers Thin-Peers Super-Peers Constitution of a super-peer backbone Hierarchical organization → Speaker-peer responsible for certain subnet 19/04/2019 StreamGlobe

10 StreamGlobe Peer Architecture
Based upon Open Grid Services Architecture (OGSA) Integration similar to OGSA-DAI or OGSA-DQP Layers as grid-services Availability according to peer capabilities Message exchange via RPC and notifications Data stream transfer via direct TCP connections XQuery Subscriptions XML Data Streams register StreamGlobe Interface Optimization Management Metadata Überleitung: I will show some details of the different layers on the next few slides. Query Engine Globus Toolkit 19/04/2019 StreamGlobe

11 Optimization Goals Achievement
Registration of arbitrary subscriptions at any peer Achieve good distribution of data streams Optimize evaluation of many subscriptions Achievement Pushing query execution into the network → (1) and (3) Multiquery optimization → (3) Early filtering of data streams resp. evaluation of subscriptions → (2) Data stream clustering → (2) Since multiquery optimization and data stream clustering are main aspects of StreamGlobe, I will now give a slightly more detailed view on what is done there. 19/04/2019 StreamGlobe

12 Multi-Query Optimization
Performed by speaker-peer Analyze subscriptions and streams Common subqueries Re-usability of streams Based on properties of subscriptions / streams Computes Filters and queries Data stream clustering Execution locations Request a Request a Request ab Query a Filter a Filter b Query ab In this simple example, only queries representing the two request and two filters for each data stream are computed. But, in real life, much more queries will be generated. For example, query ab might get split into several smaller queries, which are re-used by other subscriptions. 19/04/2019 StreamGlobe

13 Query Execution Basic concepts Evaluation of subscriptions with FluX
Streaming evaluation and push-based techniques Preclude unbounded buffering by requiring window constraints Extensibility by means of mobile code Evaluation of subscriptions with FluX Designed for streaming processing of XQuery Event-based extension to XQuery Usage of schema information for buffer minimization → Visit my talk at the VLDB: Tomorrow, Research Session 6: XML(II) 19/04/2019 StreamGlobe

14 Outline Motivation StreamGlobe Current and Future Research Conclusion
The StreamGlobe Approach Architecture Overview Current and Future Research Conclusion 19/04/2019 StreamGlobe

15 Current and Future Research
Current Research Optimization techniques Extension of FluX Future Research Quality-of-Service management Explicit load balancing Load shedding techniques Construction of overlay network 19/04/2019 StreamGlobe

16 Conclusion StreamGlobe
Exploiting in-network query processing capabilities In combination with data stream clustering  Minimization of network traffic Query execution with FluX  Efficient and scalable execution of subscriptions Multi-query optimization  Parallelization and load balancing in the network 19/04/2019 StreamGlobe

17 Related Work Aberer, Cudré-Mauroux, Datta, Despotovic, Hauswirth, Punceva, Schmidt. “P-Grid: a self-organizing structured P2P system”. SIGMOD Record 32(3), 2003 Arasu, Babcock, Babu, Datar, Ito, Motwani, Nishizawa, Srivastava, Thomas, Varma, Widom. “STREAM: The Stanford Stream Data Manager”. Data Engineering Bulletin 26(1), 2003 Carney, Cetintemel, Cherniack, Convey, Lee, Seidman, Stonebraker, Tatbul, Zdonik. “Monitoring Streams – A New Class of Data Management Applications”. VLDB 2002 Chandrasekaran, Cooper, Deshpande, Franklin, Hellerstein, Hong, Krishnamurthy, Madden, Raman, Reiss, Shah. “TelegraphCQ: Continuous Dataflow Processing for an Uncertain World”. CIDR 2003 Cherniack, Balakrishnan, Balazinska, Carney, Cetintemel, Xing, Zdonik. “Scalable Distributed Stream Processing”. CIDR 2003 Krämer, Seeger. “PIPES – A Public Infrastructure for Processing and Exploring Streams”. SIGMOD 2004 Madden, Shah, Hellerstein, Raman. “Continuously Adaptive Continuous Queries over Streams”. SIGMOD 2002 Sellis. “Multiple-Query Optimization”. TODS 1988 Yang, Garcia-Molina. “Designing a Super-Peer Network”. ICDE 2003 Yao, Gehrke. “The Cougar Approach to In-Network Query Processing in Sensor Networks”. SIGMOD Record 31(3), 2002 19/04/2019 StreamGlobe


Download ppt "A. Kemper, R. Kuntschke, and B. Stegmaier"

Similar presentations


Ads by Google