Query Evaluation Techniques for Cluster Database Systems Andrey V. Lepikhov, Leonid B. Sokolinsky South Ural State University Russia 22 September 2010.

1 Query Evaluation Techniques for Cluster Database Systems. Andrey V. Lepikhov, Leonid B. Sokolinsky, South Ural State University, Russia. ADBIS 2010, 22 September 2010.

2 Outline
- Motivation
- Problem Statement
- Background
- Partial mirroring method
- Results
- Future work

3 Motivation
Top500 share by architecture: Cluster 84.8%, MPP 14.8%, Others 0.4%

4 Problem Statement
- Inexpensive parallel hardware needs an inexpensive parallel database management system.
- Today no such cheap parallel database management system exists.

5 Background

6 Exchange operator
- p: port
- ψ: distributing function
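
The role of the distributing function ψ can be illustrated with a small sketch. It assumes that ψ maps a tuple to the number of the node that should process it and that the port p only identifies the exchange operator inside the plan; the slide does not spell out either detail. The names `Exchange`, `route`, and `Row` and the modulo-based ψ are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Tuple

Row = Tuple[int, str]  # hypothetical tuple layout: (join key, payload)


@dataclass
class Exchange:
    port: int                   # p: identifies this operator in the plan (assumed role)
    psi: Callable[[Row], int]   # ψ: maps a tuple to a destination node number

    def route(self, rows: Iterable[Row], local_node: int
              ) -> Tuple[List[Row], Dict[int, List[Row]]]:
        """Split rows into those kept locally and those sent to other nodes."""
        local: List[Row] = []
        outgoing: Dict[int, List[Row]] = {}
        for row in rows:
            target = self.psi(row)
            if target == local_node:
                local.append(row)
            else:
                outgoing.setdefault(target, []).append(row)
        return local, outgoing


NODES = 3  # assumed cluster size for the example
ex = Exchange(port=1, psi=lambda row: row[0] % NODES)
rows = [(0, "a"), (1, "b"), (2, "c"), (3, "d"), (4, "e")]
local, outgoing = ex.route(rows, local_node=0)
```

Here node 0 keeps the tuples whose key maps to 0 and groups the remaining tuples by destination node, which is the redistribution step a join on that key would need.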

7 Parallel plan for query Q = R ⋈ S

8 Query processing in cluster system

9 The problem
- Load balancing

10 Partial mirroring method
- Fragmentation strategy
- Replication strategy

11 Fragmentation strategy
[Figure: fragmentation of the source relation across P0, P1, ..., Pn]
- The relation is divided into fragments distributed among the cluster nodes.
- Each fragment is divided into a sequence of segments of equal length.
- The segment is the minimal unit of replication...
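
The two levels described on this slide (relation to fragments, fragment to segments) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the fragmentation function (key modulo node count), the segment length, and the names `fragment` and `segments` are all assumptions.

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")


def fragment(relation: Sequence[T], n_nodes: int,
             key: Callable[[T], int]) -> List[List[T]]:
    """Distribute tuples among n_nodes fragments (assumed rule: key mod n_nodes)."""
    fragments: List[List[T]] = [[] for _ in range(n_nodes)]
    for t in relation:
        fragments[key(t) % n_nodes].append(t)
    return fragments


def segments(frag: Sequence[T], seg_len: int) -> List[List[T]]:
    """Cut a fragment into consecutive segments of seg_len tuples.

    The final segment is shorter when len(frag) is not a multiple of seg_len.
    """
    return [list(frag[i:i + seg_len]) for i in range(0, len(frag), seg_len)]


relation = list(range(12))                      # toy relation of 12 integer tuples
frags = fragment(relation, n_nodes=3, key=lambda t: t)
segs0 = segments(frags[0], seg_len=2)           # segments of the fragment on node 0
```

With this toy input each of the three nodes receives four tuples, and the fragment on node 0 splits into two segments of two tuples each.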

12 Replication strategy
[Figure: disks D_i and D_j; R_i; ρ_j = 50%]
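
A partial replica can be described by the set of segments it holds. The sketch below assumes that ρ is the fraction of a fragment's segments copied to another disk and that the replica holds the trailing segments; the slide shows only ρ_j = 50%, so both the rounding rule and the choice of trailing segments are assumptions, and `replica_segments` is a hypothetical helper.

```python
import math


def replica_segments(n_segments: int, rho: float) -> range:
    """Indices of the segments held by a partial replica.

    Assumption: the replica keeps the last ceil(rho * n_segments) segments.
    """
    if not 0.0 <= rho <= 1.0:
        raise ValueError("rho must lie in [0, 1]")
    count = math.ceil(rho * n_segments)
    return range(n_segments - count, n_segments)


half = replica_segments(8, 0.5)   # a 50% replica of an 8-segment fragment
```

Under these assumptions ρ = 0 yields an empty replica and ρ = 1 yields a full mirror of the fragment.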

13 Load balancing method
[Figure: disks D1 and D2 with P1 and P2 at times t1 to t4; legend: scheduled to process, processed, not used]

14 Parallel agent with two input streams

15 Load balancing algorithm
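
The transcript does not reproduce the algorithm itself, so the following is only a sketch of one plausible scheme built on partial replicas: a node that has run out of work picks the most loaded node whose data it partially replicates and takes over some of that node's unprocessed segments. The assumptions are that owners process their fragment from the head, that replicas hold the trailing segments, and that the helper takes at most half of the remaining work. The function `pick_work` and its parameters are hypothetical.

```python
import math
from typing import Dict, Optional, Tuple


def pick_work(idle: int,
              remaining: Dict[int, int],
              replica_size: Dict[Tuple[int, int], int]
              ) -> Optional[Tuple[int, int]]:
    """Choose (owner, n_segments) for an idle node, or None if it cannot help.

    remaining[owner]          : unprocessed segments of the owner's fragment
    replica_size[(idle, own)] : segments of own's fragment replicated on idle
    """
    best: Optional[int] = None
    for owner, left in remaining.items():
        if owner == idle or left == 0:
            continue
        # Unprocessed segments that the idle node can actually read locally.
        if min(left, replica_size.get((idle, owner), 0)) == 0:
            continue
        if best is None or left > remaining[best]:
            best = owner
    if best is None:
        return None
    available = min(remaining[best], replica_size[(idle, best)])
    share = min(available, math.ceil(remaining[best] / 2))  # assumed split rule
    return best, share


remaining = {0: 0, 1: 10, 2: 4}           # node 0 is idle
replica_size = {(0, 1): 3, (0, 2): 4}     # node 0 holds 3 segments of node 1, 4 of node 2
choice = pick_work(0, remaining, replica_size)
```

In this example node 0 helps node 1, the most loaded node, but can take only the three segments it holds in its replica, which shows how the replica size bounds the amount of work that can be moved.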

16 Parameters of experiments

17 Speedup versus replication factor

18 Speedup versus skew factor θ
- θ = 0.68 corresponds to the "80-20" rule (80 percent of the relation's tuples are stored in 20 percent of the fragments).
- θ = 0 corresponds to the uniform distribution.
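
To experiment with skew, fragment sizes can be generated from a Zipf-like law. The sketch assumes that fragment i receives a weight proportional to 1/i^θ; the slides do not state the formula, so this parameterization and the helper names `zipf_shares` and `top_share` are assumptions, and the share obtained for a given θ depends on both the parameterization and the number of fragments.

```python
from typing import List


def zipf_shares(n_fragments: int, theta: float) -> List[float]:
    """Fraction of tuples stored in each fragment, assuming weight 1 / i**theta."""
    weights = [1.0 / (i ** theta) for i in range(1, n_fragments + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def top_share(shares: List[float], fraction: float = 0.2) -> float:
    """Fraction of tuples held by the largest `fraction` of the fragments."""
    k = max(1, round(fraction * len(shares)))
    return sum(sorted(shares, reverse=True)[:k])


uniform = zipf_shares(10, 0.0)            # theta = 0: every fragment gets the same share
skewed = zipf_shares(10, 0.5)             # theta > 0: the first fragments are larger
```

Calling `top_share` on a generated distribution gives a quick check of how concentrated the data is for a chosen θ and fragment count.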

19 Future Work
- To incorporate the proposed technique of parallel query execution into the open source PostgreSQL DBMS.
- To extend this approach to GRID DBMS for clusters with multicore processors.

20 Thank you

