Department of Computer Science Johns Hopkins University Xiaodan Wang Advisor: Randal Burns Processing Data-Intensive Queries in Petabyte-Scale Scientific.

Slides:



Advertisements
Similar presentations
Yahoo! Research Johns Hopkins University Chris Olston Anish Das Sarma Xiaodan Wang Randal Burns Shared Scan Batch Scheduling in Cloud Computing.
Advertisements

Choosing an Order for Joins
The Big Picture Scientific disciplines have developed a computational branch Models without closed form solutions solved numerically This has lead to.
Hopkins Storage Systems Lab, Department of Computer Science Automated Physical Design in Database Caches T. Malik, X. Wang, R. Burns Johns Hopkins University.
SkewReduce YongChul Kwon Magdalena Balazinska, Bill Howe, Jerome Rolia* University of Washington, *HP Labs Skew-Resistant Parallel Processing of Feature-Extracting.
Johns Hopkins University Xiaodan Wang Eric Perlman Randal Burns Tamas Budavari Charles Meneveau Alexander Szalay Purdue University Tanu Malik JAWS: J ob-
László Dobos 1,2, Tamás Budavári 2, Nolan Li 2, Alex Szalay 2, István Csabai 1 1 Eötvös Loránd University, Budapest,
Chapter 1 Introduction 1.1A Brief Overview - Parallel Databases and Grid Databases 1.2Parallel Query Processing: Motivations 1.3Parallel Query Processing:
Distributed and Streaming Evaluation of Batch Queries for Data-Intensive Computational Turbulence Kalin Kanov Department of Computer Science Johns Hopkins.
Kangseok Kim, Marlon E. Pierce Community Grids Laboratory, Indiana University
MATE-EC2: A Middleware for Processing Data with Amazon Web Services Tekin Bicer David Chiu* and Gagan Agrawal Department of Compute Science and Engineering.
Dynamic and Decentralized Approaches for Optimal Allocation of Multiple Resources in Virtualized Data Centers Wei Chen, Samuel Hargrove, Heh Miao, Liang.
Introduction and Overview Questions answered in this lecture: What is an operating system? How have operating systems evolved? Why study operating systems?
Hopkins Storage Systems Lab, Department of Computer Science A Workload-Driven Unit of Cache Replacement for Mid-Tier Database Caching Xiaodan Wang, Tanu.
OPTIMAL SERVER PROVISIONING AND FREQUENCY ADJUSTMENT IN SERVER CLUSTERS Presented by: Xinying Zheng 09/13/ XINYING ZHENG, YU CAI MICHIGAN TECHNOLOGICAL.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
Physical Database Design & Performance. Optimizing for Query Performance For DBs with high retrieval traffic as compared to maintenance traffic, optimizing.
Ohio State University Department of Computer Science and Engineering 1 Cyberinfrastructure for Coastal Forecasting and Change Analysis Gagan Agrawal Hakan.
So far we have covered … Basic visualization algorithms Parallel polygon rendering Occlusion culling They all indirectly or directly help understanding.
Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters Hung-chih Yang(Yahoo!), Ali Dasdan(Yahoo!), Ruey-Lung Hsiao(UCLA), D. Stott Parker(UCLA)
Supercomputing Cross-Platform Performance Prediction Using Partial Execution Leo T. Yang Xiaosong Ma* Frank Mueller Department of Computer Science.
Benchmarking MapReduce-Style Parallel Computing Randal E. Bryant Carnegie Mellon University.
The exponential growth of data –Challenges for Google,Yahoo,Amazon & Microsoft in web search and indexing The volume of data being made publicly available.
Parallel dynamic batch loading in the M-tree Jakub Lokoč Department of Software Engineering Charles University in Prague, FMP.
Xiaodan Wang Department of Computer Science Johns Hopkins University Processing Data Intensive Queries in Scientific Database Federations.
Xiaodan Wang, Randal Burns Department of Computer Science Johns Hopkins University Tanu Malik Cyber Center Purdue University LifeRaft: Data-Driven, Batch.
Workshop on Networking Meets Databases (NetDB’07) Throughput-Optimized, Global-Scale Join Processing in Scientific Federations Xiaodan Wang, Randal Burns,
Copyright © 2006, GemStone Systems Inc. All Rights Reserved. Increasing computation throughput with Grid Data Caching Jags Ramnarayan Chief Architect GemStone.
Scheduling MPI Workflow Applications on Computing Grids Juemin Zhang, Waleed Meleis, and David Kaeli Electrical and Computer Engineering Department, Northeastern.
Ohio State University Department of Computer Science and Engineering Servicing Range Queries on Multidimensional Datasets with Partial Replicas Li Weng,
AQWA Adaptive Query-Workload-Aware Partitioning of Big Spatial Data Dimosthenis Stefanidis Stelios Nikolaou.
Latest Improvements in the PROOF system Bleeding Edge Physics with Bleeding Edge Computing Fons Rademakers, Gerri Ganis, Jan Iwaszkiewicz CERN.
Hopkins Storage Systems Lab, Department of Computer Science Network-Aware Join Processing in Global-Scale Database Federations X. Wang, R. Burns, A. Terzis.
VIEWS b.ppt-1 Managing Intelligent Decision Support Networks in Biosurveillance PHIN 2008, Session G1, August 27, 2008 Mohammad Hashemian, MS, Zaruhi.
Performance Assurance for Large Scale Big Data Systems
OPERATING SYSTEMS CS 3502 Fall 2017
CPU SCHEDULING.
Applying Control Theory to Stream Processing Systems
A Black-Box Approach to Query Cardinality Estimation
R SE to the challenges of ntelligent systems
So far we have covered … Basic visualization algorithms
Network Requirements Javier Orellana
Auburn University COMP7500 Advanced Operating Systems I/O-Aware Load Balancing Techniques (2) Dr. Xiao Qin Auburn University.
Chapter 1: Introduction
Chapter 6: CPU Scheduling
به نام خدا Big Data and a New Look at Communication Networks Babak Khalaj Sharif University of Technology Department of Electrical Engineering.
April 30th – Scheduling / parallel
Department of Computer Science University of California, Santa Barbara
Predictive Performance
Module 5: CPU Scheduling
Chapter 17: Database System Architectures
High Throughput Route Selection in Multi-Rate Ad Hoc Wireless Networks
KISS-Tree: Smart Latch-Free In-Memory Indexing on Modern Architectures
Ch 4. The Evolution of Analytic Scalability
Selected Topics: External Sorting, Join Algorithms, …
3: CPU Scheduling Basic Concepts Scheduling Criteria
Chapter 6: CPU Scheduling
(A Research Proposal for Optimizing DBMS on CMP)
Introduction to Operating Systems
GATES: A Grid-Based Middleware for Processing Distributed Data Streams
Operating System , Fall 2000 EA101 W 9:00-10:00 F 9:00-11:00
Distributed Databases
Chapter 6: CPU Scheduling
Database System Architectures
Module 5: CPU Scheduling
Chapter 6: CPU Scheduling
On the Role of Burst Buffers in Leadership-Class Storage Systems
Chapter 1: Introduction
Module 5: CPU Scheduling
SANDIE: Optimizing NDN for Data Intensive Science
Presentation transcript:

Department of Computer Science Johns Hopkins University Xiaodan Wang Advisor: Randal Burns Processing Data-Intensive Queries in Petabyte-Scale Scientific Databases

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Big Picture Ensure high throughput for concurrent accesses to peta- scale Scientific datasets Data-Intensive analysis queries – Correlate, mine, and extract features – Batch workloads with multiple simultaneous queries – Join data partitioned and distributed across multiple nodes Scale of exploration limited – I/O: Scanning vast amounts of data over hours or days – Network: Transferring lots of data over large distances

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Querying on Global-Scale SkyQuery database federation for Astronomy – Publicly accessible virtual telescope – Sharing of heterogeneous data – Geographically dispersed (30 across NA, EA, EU) High network cost for federated join queries – Joins on terabyte datasets between nodes – Queries last minutes producing hundreds of MB in results – Network transfers consume up to 70% of the time Data volume and geography limit scale

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Incorporating Network Structure

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Network-Aware Join Scheduling Capture network heterogeneity – Metric that exploits excess capacity for routing – Decentralized local optimizations – Two-approximate, MST-based solution – Supports parallelism and trade-offs with I/O cost Ten-fold reduction in network utilization for SkyQuery (ICDE’08)

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Scanning Peta-Scale Data Data intensive scan queries – Executed against a clustered index – Span multiple nodes (partitioned by space/time in cluster) Incredibly I/O bound – Full DB scans lasting hours or days – Multiple concurrent queries (millions/month) – Significant data reuse between queries TurbulenceAstronomy

Processing Data Intensive Queries in Petabyte Scale Scientific Databases LifeRaft: Data-Driven Batch Scheduling Schedule queries greedily based on contention – Contentious regions amortize I/O over more queries – Two-fold improvement in throughput (CIDR’09) SELECT... FROM … WHERE region(‘circle ’) and specclass = 2 and … and specclass = 2 and … SELECT... FROM … WHERE region(‘circle ’) and specclass = 2 and … and specclass = 2 and … Pre-processing & Decomposition Pre-processing & Decomposition HTM Query Q1Q1Q1Q1 Q1Q1Q1Q1 Q2Q2Q2Q2 Q2Q2Q2Q2 Q3Q3Q3Q3 Q3Q3Q3Q3 LifeRaft Scheduling LifeRaft Scheduling Sub-query regions AstronomersAstronomers Query Results Query Results Reordering & Co-scheduling

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Job-Aware Batch Scheduling Sequence of queries related to the same experiment – Predict I/O for long-running experiments – Queries may be order dependent Batch interface for Scientists – Session IDs to explicitly link queries – Pre-declare time/space regions of interest – Pre-package operations – Submit all queries at once Pre-fetching to improve response time – Bounding box over the data accessed – Extrapolate trajectory of job based on time/space

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Job-Aware Batch Scheduling Delays evaluation of time-steps that are accessed in the future T5T5T5T5 T4T4T4T4 T1T1T1T1 T2T2T2T2 T3T3T3T3 Time Steps T1T1T1T1 JOBSJOBS J1J1J1J1 J2J2J2J2 J3J3J3J3 J4J4J4J4 J5J5J5J5 T2T2T2T2 T3T3T3T3 T4T4T4T4 T5T5T5T5 J1J1J1J1 J2J2J2J2 J3J3J3J3 J4J4J4J4 J5J5J5J5 Job-Aware LifeRaft Revisit LifeRaft T4T4T4T4 T2T2T2T2 T1T1T1T1 T2T2T2T2 T3T3T3T3 T4T4T4T4 T3T3T3T3 T4T4T4T4 T5T5T5T5 T4T4T4T4 T3T3T3T3 J1J1J1J1 J2J2J2J2 J5J5J5J5 J4J4J4J4 J3J3J3J3

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Extending Batch Scheduling Provide starvation resistance – Short interactive queries that focus on small region – Soft constraints on completion order – Hard constraints on response time User Perceived Delay (Turbulence July 22 nd ) 4x overhead

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Extending Batch Scheduling Cooperative LifeRaft – Beyond single node LifeRaft – Coordinate scheduling across multiple nodes Communicate to refine local decisions Avoid delaying a query that spans multiple nodes Heterogeneity in workload allocation and performance

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Thank You!

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Supplementary Slides

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Extending Batch Scheduling Query buffering – Large intermediate results – May need to page results to disk

Processing Data Intensive Queries in Petabyte Scale Scientific Databases A Case for Batch Processing 70% of queries reuse turbulence simulation results from a dozen timesteps Varied query sizes ranging from <1s to several hours <1 sec >1hr

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Scheduling Behavior Q i – Q i1, Q i2, Q i3 B1B1 B2B2 B3B3 B4B4 B5B5 B6B6 B7B7 B8B8 QiQi QjQj QkQk Sub-divide queries by bucket: Q j – Q j3, Q j4, Q j5, Q j6, Q j7, Q j8 Assumptions: - Inter-query time of 1 sec - I/O for each bucket of 1 sec - Cache size of 2 - Join cost is negligible Q j – Q j5, Q j6, Q j7, Q j8 QkQk

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Arrival order with no sharing Qi1Qi1 B1B1 Q i Arr Qi2Qi2 B2B2 Qi3Qi3 B3B3 Qj1Qj1 B1B1 Q j ArrQ k Arr Qj3Qj3 B3B3 Q i End Qj4Qj4 B4B4 Qj6Qj6 B6B6 Qj7Qj7 B7B7 Qj8Qj8 B8B8 Q j End Qk1Qk1 B1B1 Qk4Qk4 B4B4 Qk8Qk8 B8B8 Q k End Q i – 3 sec Completion Times: Q j – 8 secQ k – 13 secAvg – 8 sec B1B1 B2B2 B3B3 B4B4 B5B5 B6B6 B7B7 B8B8 QiQi QjQj QkQk QkQk … Tp –.2 qry/sec

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Age based scheduling (bias 1) Qi1Qi1 B1B1 Q i Arr Qi2Qi2 B2B2 Qi5Qi5 B5B5 Qi3Qj3Qi3Qj3 B3B3 Q j ArrQ k ArrQ i End Q j End Q k End Qj1Qk1Qj1Qk1 B1B1 Qj4Qk4Qj4Qk4 B4B4 Qj6Qk6Qj6Qk6 B6B6 Q i – 3 sec Completion Times: Q j – 7 secQ k – 7 secAvg – 5.6 secTp –.33 qry/sec B1B1 B2B2 B3B3 B4B4 B5B5 B6B6 B7B7 B8B8 QiQi QjQj QkQk QkQk Qj8Qk8Qj8Qk8 B8B8 Qj7Qk7Qj7Qk7 B7B7

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Contention based scheduling (bias 0) Qi1Qi1 B1B1 Q i Arr Qi2Qi2 B2B2 Qi3Qj3Qi3Qj3 B3B3 Q j ArrQ k Arr Q i End Q j End Qk5Qk5 B5B5 Q k End Q j1 Q k1 Q j4 Q k4 B 1 B 4 Qj6Qk6Qj6Qk6 B6B6 Qj7Qk7Qj7Qk7 B7B7 Q i – 7 sec Completion Times: Q j – 5 secQ k – 6 secAvg – 6 secTp –.38 qry/sec B1B1 B2B2 B3B3 B4B4 B5B5 B6B6 B7B7 B8B8 QiQi QjQj QkQk QkQk Qj8Qk8Qj8Qk8 B8B8 (5.6) (.33)

Processing Data Intensive Queries in Petabyte Scale Scientific Databases Minimize cost of query execution and transitioning – 40% reduction in I/O Reducing I/O: Adaptive Physical Design