Presentation is loading. Please wait.

Presentation is loading. Please wait.

Query Execution on NetTraveler Angel L. Villalaín-García Manuel Rodríguez-Martínez University of Puerto Rico - Mayaguez Campus.

Similar presentations


Presentation on theme: "Query Execution on NetTraveler Angel L. Villalaín-García Manuel Rodríguez-Martínez University of Puerto Rico - Mayaguez Campus."— Presentation transcript:

1 Query Execution on NetTraveler Angel L. Villalaín-García Manuel Rodríguez-Martínez University of Puerto Rico - Mayaguez Campus

2 WALSAIP 1/6/2016 2 Objectives Develop a framework for Parallel and Distributed Query Optimization and Execution on NetTraveler Develop a framework for Parallel and Distributed Query Optimization and Execution on NetTraveler Facilitate and Optimize the access of data across WANs Facilitate and Optimize the access of data across WANs Transparent data access Transparent data access Uniform access interface Uniform access interface Robust operation by exploiting replication Robust operation by exploiting replication

3 WALSAIP 1/6/2016 3 Road Map Objectives Objectives Motivation Motivation Problem Formulation Problem Formulation Proposed Solution Proposed Solution Execution Example Execution Example Contributions Contributions Technical Details Technical Details Status Status Next Steps Next Steps Summary Summary Questions & Demo Questions & Demo

4 WALSAIP 1/6/2016 4 Motivation

5 5 Problem Formulation Dispersed and Heterogeneous data sources No uniformity on WANs Several limitations Bandwidth Memory Power Processing Capabilities

6 WALSAIP 1/6/2016 6 Problem Formulation (cont.) Traditional DBMS Plan Centralized Query Optimizer Scan Relations Select A.id, A.name from A,B where A.id = B.id and A.sage <30

7 WALSAIP 1/6/2016 7 Proposed Solution Decentralized Query Optimizer Distributed and Parallel DBMS Plan Scan Replicated Relations Select A.id, A.name from A,B where A.id = B.id and A.sage <30

8 WALSAIP 1/6/2016 8 Execution Example Q QSB 1 Knows: QSB 2, QSB 3, QSB 4 QSB 2 QSB 3 QSB 4 QQ IG 2 IG 1 R R R R IG 3

9 WALSAIP 1/6/2016 9 Replicates Management Manage Partitions Pre HashedOn The Fly Hashing Mechanism Parallel Ops IG Level Operations Memory Management Schedulers Mechanisms Hash Join Exchange Op Parallel Ops QSB Level Operations Site Management Scheduling Management Parallel & Distributed Ops Physical Optimizer Logical Optimizer Optimizer Level Operations Technical Details

10 WALSAIP 1/6/2016 10 Contributions Facilitate Integration for scientific applications Facilitate Integration for scientific applications Heterogeneous data sources Heterogeneous data sources Heterogeneous schemas Heterogeneous schemas Load Balancing Load Balancing Spread work to various nodes Spread work to various nodes Robustness Robustness Can get data from multiple sources Can get data from multiple sources Asynchronous Asynchronous Dynamically replace nodes used for processing Dynamically replace nodes used for processing Decentralized Query Optimization Decentralized Query Optimization

11 WALSAIP 1/6/2016 11 Status Manage Replicates Manage Partitions Pre HashedOn The Fly Hashing Mechanism Parallel Ops IG Level Operations Memory Management Schedulers Mechanisms Hash Join Exchange Op Parallel Ops QSB Level Operations Site Management Scheduling Management Parallel & Distributed Ops Physical Optimizer Logical Optimizer Optimizer Level Operations

12 WALSAIP 1/6/2016 12 Next Steps Study Scheduling Effect and improvements Study Scheduling Effect and improvements Hash Join operators and functionalities Hash Join operators and functionalities User interface for configuration and demonstration purposes User interface for configuration and demonstration purposes

13 WALSAIP 1/6/2016 13 Additional Areas of Research Distributed Catalog Manager Distributed Catalog Manager Oliver Moreno Oliver Moreno Server-Side Query Recovery Mechanism Server-Side Query Recovery Mechanism Victor Kareh Victor Kareh NetTraveler System Administration NetTraveler System Administration Osvaldo Ferrero Osvaldo Ferrero

14 WALSAIP 1/6/2016 14 Summary Facilitate and Optimize the access of data across WANs Facilitate and Optimize the access of data across WANs Query Parallelization and Execution Query Parallelization and Execution Exploiting Replication Exploiting Replication Response Time improvement Response Time improvement See website for API and user manual See website for API and user manual http://amadeus.ece.uprm.edu/~s08413 http://amadeus.ece.uprm.edu/~s08413

15 WALSAIP 1/6/2016 15 Demo and Questions


Download ppt "Query Execution on NetTraveler Angel L. Villalaín-García Manuel Rodríguez-Martínez University of Puerto Rico - Mayaguez Campus."

Similar presentations


Ads by Google