Technische Universität München + Hewlett Packard Laboratories Dynamic Workload Management for Very Large Data Warehouses Juggling Feathers and Bowling.

Slides:



Advertisements
Similar presentations
Numbers Treasure Hunt Following each question, click on the answer. If correct, the next page will load with a graphic first – these can be used to check.
Advertisements

1
Motivation Business analysis Order entry Maintenance
Feichter_DPG-SYKL03_Bild-01. Feichter_DPG-SYKL03_Bild-02.
1 Vorlesung Informatik 2 Algorithmen und Datenstrukturen (Parallel Algorithms) Robin Pomplun.
Pricing for Utility-driven Resource Management and Allocation in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS)
© 2008 Pearson Addison Wesley. All rights reserved Chapter Seven Costs.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Chapter 1 The Study of Body Function Image PowerPoint
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
Author: Julia Richards and R. Scott Hawley
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.
Objectives: Generate and describe sequences. Vocabulary:
UNITED NATIONS Shipment Details Report – January 2006.
RXQ Customer Enrollment Using a Registration Agent (RA) Process Flow Diagram (Move-In) Customer Supplier Customer authorizes Enrollment ( )
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination. Introduction to the Business.
1 RA I Sub-Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Casablanca, Morocco, 20 – 22 December 2005 Status of observing programmes in RA I.
Conversion Problems 3.3.
Properties of Real Numbers CommutativeAssociativeDistributive Identity + × Inverse + ×
Exit a Customer Chapter 8. Exit a Customer 8-2 Objectives Perform exit summary process consisting of the following steps: Review service records Close.
Create an Application Title 1A - Adult Chapter 3.
Custom Statutory Programs Chapter 3. Customary Statutory Programs and Titles 3-2 Objectives Add Local Statutory Programs Create Customer Application For.
CALENDAR.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Year 6 mental test 10 second questions
1 Discreteness and the Welfare Cost of Labour Supply Tax Distortions Keshab Bhattarai University of Hull and John Whalley Universities of Warwick and Western.
Rhesy S.ppt proRheo GmbH
Evaluating Window Joins over Unbounded Streams Author: Jaewoo Kang, Jeffrey F. Naughton, Stratis D. Viglas University of Wisconsin-Madison CS Dept. Presenter:
1 Click here to End Presentation Software: Installation and Updates Internet Download CD release NACIS Updates.
1 The Impact of Buy-Down on Sell Up, Unconstraining, and Spiral-Down Edward Kambour, Senior Scientist E. Andrew Boyd, SVP and Senior Scientist Joseph Tama,
REVIEW: Arthropod ID. 1. Name the subphylum. 2. Name the subphylum. 3. Name the order.
Computer Literacy BASICS
Break Time Remaining 10:00.
Advance Nano Device Lab. Fundamentals of Modern VLSI Devices 2 nd Edition Yuan Taur and Tak H.Ning 0 Ch9. Memory Devices.
PP Test Review Sections 6-1 to 6-6
EU market situation for eggs and poultry Management Committee 20 October 2011.
Bright Futures Guidelines Priorities and Screening Tables
Bellwork Do the following problem on a ½ sheet of paper and turn in.
Discrete Event (time) Simulation Kenneth.
Exarte Bezoek aan de Mediacampus Bachelor in de grafische en digitale media April 2014.
VOORBLAD.
Sample Service Screenshots Enterprise Cloud Service 11.3.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
© 2012 National Heart Foundation of Australia. Slide 2.
Adding Up In Chunks.
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Synthetic.
Note to the teacher: Was 28. A. to B. you C. said D. on Note to the teacher: Make this slide correct answer be C and sound to be “said”. to said you on.
Model and Relationships 6 M 1 M M M M M M M M M M M M M M M M
25 seconds left…...
Analyzing Genes and Genomes
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Essential Cell Biology
Converting a Fraction to %
Intracellular Compartments and Transport
PSSA Preparation.
Essential Cell Biology
Immunobiology: The Immune System in Health & Disease Sixth Edition
Physics for Scientists & Engineers, 3rd Edition
Energy Generation in Mitochondria and Chlorplasts
Murach’s OS/390 and z/OS JCLChapter 16, Slide 1 © 2002, Mike Murach & Associates, Inc.
Presentation transcript:

Technische Universität München + Hewlett Packard Laboratories Dynamic Workload Management for Very Large Data Warehouses Juggling Feathers and Bowling Balls Stefan KrompassHarumi Kuno Alfons KemperUmeshwar Dayal TU München Germany HP Labs, Palo Alto, CA USA

Technische Universität München + Hewlett Packard Laboratories 1 Outline Problem statement Proposed solution Evaluation –Approach and settings for experiments –Impact of problem queries on a workload –Impact of execution control Conclusion and ongoing work

Technische Universität München + Hewlett Packard Laboratories 2 Background HP has been building NeoView, a highly-parallel database engine for business intelligence Challenges for DBAs –How long should they wait to kill an unexpectedly long-running query? –When should they admit a newly arriving query if the currently executing batch of queries is in danger of missing its deadline? –What if the newly arrived query was submitted by the CEO? Automate workload management

Technische Universität München + Hewlett Packard Laboratories 3 Why BI Workloads Differ from OLTP Workloads Complexity Resource demands Different types of queries Unpredictability Problem queries Objectives Time

Technische Universität München + Hewlett Packard Laboratories 4 Vision: Automate Workload Management Our approach Optimize execution of workload subject to service level objectives Explicitly consider problem queries as an inherent part of the workload Propose an architecture that allows us to … –… model problem queries with different characteristics –… implement and test workload management actions for dealing with problem queries based on their observed behavior

Technische Universität München + Hewlett Packard Laboratories 5 Outline Problem statement Proposed solution Evaluation –Implementation and settings for experiments –Impact of problem queries on a workload –Impact of execution control Conclusion and ongoing work

Technische Universität München + Hewlett Packard Laboratories 6 Workload Management Architecture

Technische Universität München + Hewlett Packard Laboratories 7 Service Level Objectives and Jobs

Technische Universität München + Hewlett Packard Laboratories 8 Service Level Objectives (SLOs) Job-facing SLOs (e.g., penalty functions used to optimize the scheduling of queries) Customer-facing SLOs –Minimize response time (derived from challenges) –Deadline-driven –Concrete quantities of computing time

Technische Universität München + Hewlett Packard Laboratories 9 Job Types Batch (e.g., reports) –Usually repetitive –All queries arrive at the database system at once –Queries may/may not have precedence constraints –SLO is deadline driven Interactive (e.g., business analysis) –All queries arrive at the database sequentially –Arrival time of the first query is not known in advance –SLO (ASAP) Submitted by a special request for business reasons

Technische Universität München + Hewlett Packard Laboratories 10 Execution Engine

Technische Universität München + Hewlett Packard Laboratories 11 Workload Manger Admission Control Scheduling Execution Control –Set of actions that apply when certain conditions hold –Example: IF relDBTime IS high AND progress IS low THEN cancel IS applicable

Technische Universität München + Hewlett Packard Laboratories 12 Workload Manger Admission Control Scheduling Execution Control –Set of actions that apply when certain conditions hold –Example: IF relDBTime IS high AND progress IS low THEN cancel IS applicable

Technische Universität München + Hewlett Packard Laboratories 13 Monitored Metrics Relative database time (derived from elapsed time of queries and processing time estimates) Query progress (derived from progress indicator) Number of cancellations Resource contention Priority

Technische Universität München + Hewlett Packard Laboratories 14 Monitored Metrics

Technische Universität München + Hewlett Packard Laboratories 15 Outline Problem statement Proposed solution Evaluation –Implementation and settings for experiments –Impact of problem queries on a workload –Impact of execution control Conclusion and ongoing work

Technische Universität München + Hewlett Packard Laboratories 16 Implementation Use simulated execution engine instead of real database system installation –Inject problem queries –Real workloads can take days to process Number of queries in a job Number of jobs in a workload Number of problem queries …

Technische Universität München + Hewlett Packard Laboratories 17 Settings for Experiments Interactive job –~ 1100 feathers –Queries arrive sequentially Inter-arrival time 0 Does not span entire workload interval Batch job –~ 1700 feathers, baseballs, and bowling balls –Average execution time of batch queries ~1000 times higher than execution time of interactive queries derived from commercial workload runs

Technische Universität München + Hewlett Packard Laboratories 18 Settings for Experiments Normal workload –Interactive and batch job executed in parallel –No problem queries Problem workload –Interactive and batch job executed in parallel –Problem queries injected into batch workload (75 queries with different stretch factors) Time Estimated execution time Actual execution time

Technische Universität München + Hewlett Packard Laboratories 19 Settings for Experiments Normal workload –Interactive and batch job executed in parallel –No problem queries Problem workload –Interactive and batch job executed in parallel –Problem queries injected into batch workload (75 queries with different stretch factors) –Problem queries have a probability for showing the problem behavior after restarting them Admit interactive queries first

Technische Universität München + Hewlett Packard Laboratories 20 Admission Control: Admit Interactive First Queue for interactive queries Queue for batch queries Admit query Execution engine

Technische Universität München + Hewlett Packard Laboratories 21 Admission Control: Admit Interactive First Queue for interactive queries Queue for batch queries Execution engine Admit query

Technische Universität München + Hewlett Packard Laboratories 22 Outline Problem statement Proposed solution Evaluation –Implementation and settings for experiments –Impact of problem queries on a workload –Impact of execution control Conclusion and ongoing work

Technische Universität München + Hewlett Packard Laboratories 23 Impact of Problem Queries on Batch Job ParallelismThrashing

Technische Universität München + Hewlett Packard Laboratories 24 Impact of Problem Queries on Batch Job Stretched queries

Technische Universität München + Hewlett Packard Laboratories 25 Impact of Problem Queries on Interactive Job Wait time

Technische Universität München + Hewlett Packard Laboratories 26 Impact of Problem Queries on Interactive Job Execution engine Batch Interactive

Technische Universität München + Hewlett Packard Laboratories 27 Outline Problem statement Proposed solution Evaluation –Implementation and settings for experiments –Impact of problem queries on a workload –Impact of execution control Conclusion and ongoing work

Technische Universität München + Hewlett Packard Laboratories 28 Workload Management Policies Fix the MPL at 5 Varying aggressiveness –If query exceeds estimated database time, take action relative database time=actual database time/estimated database time –If query is almost finished, do not execute action Queries identified as problems are killed and immediately resubmitted (cancel) Canceled queries get two more chances to run to completion If queries do not complete, they are killed (aborted)

Technische Universität München + Hewlett Packard Laboratories 29 Impact of Workload Management Actions Batch job: Reduce elapsed time by 81% (problem queries) Interactive job: Reduce wait time by 67% (wait time) But…

Technische Universität München + Hewlett Packard Laboratories 30 False Positives Lead to Unnecessary Actions Relative Database Time

Technische Universität München + Hewlett Packard Laboratories 31 False Positives Lead to Unnecessary Actions Relative Database Time

Technische Universität München + Hewlett Packard Laboratories 32 False Positives Lead to Unnecessary Actions Relative Database Time

Technische Universität München + Hewlett Packard Laboratories 33 Number of False Positives and Actions Executed Progress Reduce number of false positives

Technische Universität München + Hewlett Packard Laboratories 34 Elapsed Time for Batch and Interactive Jobs

Technische Universität München + Hewlett Packard Laboratories 35 Elapsed Time for Batch and Interactive Jobs Increased elapsed time (queries are restarted over and over again)

Technische Universität München + Hewlett Packard Laboratories 36 Elapsed Time for Batch and Interactive Jobs Decreased elapsed time (wait time for queries is reduced)

Technische Universität München + Hewlett Packard Laboratories 37 Outline Problem statement Proposed solution Evaluation –Implementation and settings for experiments –Impact of problem queries on a workload –Impact of execution control Conclusion and ongoing work

Technische Universität München + Hewlett Packard Laboratories 38 Conclusion We implemented a workload management test bed Our experiments show that … –… even few problem queries have a significant impact on the execution of a mixed workload –… the number of false positives leads to an increase in execution time Lessons we learned –Applying actions too aggressively leads to unnecessary actions –Use controller and adjust parameters to right level of aggression

Technische Universität München + Hewlett Packard Laboratories 39 Ongoing Work Evaluate impact of admission control and scheduling of BI workloads Model query execution on a more detailed level Model additional problem types Evaluate new workload management techniques

Technische Universität München + Hewlett Packard Laboratories 40 Any Questions?