SQL Server 2005 Performance Enhancements for Large Queries Joe Chang

Slides:



Advertisements
Similar presentations
Module 13: Performance Tuning. Overview Performance tuning methodologies Instance level Database level Application level Overview of tools and techniques.
Advertisements

Parallel Query Processing in SQL Server Lubor Kollar.
1 Jacob Thomas Basu Vaidyanathan Bret Olszewski Session II April 2014 POWER8 Benchmark and Performance.
SQL Performance 2011/12 Joe Chang, SolidQ
IBM 1350 Cluster Expansion Doug Johnson Senior Systems Developer.
Automating Performance … Joe Chang SolidQ
Performance and Scalability. Optimizing PerformanceScaling UpScaling Out.
VSphere vs. Hyper-V Metron Performance Showdown. Objectives Architecture Available metrics Challenges in virtual environments Test environment and methods.
©2010 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice ©2011 Hewlett-Packard Development.
University of Konstanz Advances in Database Query Processing Sahak Maloyan Avoiding Sorting and Grouping In Processing Queries Sahak Maloyan.
C-Store: Introduction to TPC-H Jianlin Feng School of Software SUN YAT-SEN UNIVERSITY Mar 20, 2009.
Presented by Marie-Gisele Assigue Hon Shea Thursday, March 31 st 2011.
Meanwhile RAM cost continues to drop Moore’s Law on total CPU processing power holds but in parallel processing… CPU clock rate stalled… Because.
SQL Server Query Optimizer Cost Formulas Joe Chang
VMware vCenter Server Module 4.
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1 Preview of Oracle Database 12 c In-Memory Option Thomas Kyte
Parallel Execution Plans Joe Chang
Analyzing the Energy Efficiency of a Database Server Hanskamal Patel SE 521.
SQL Server Parallel Data Warehouse: Supporting Large Scale Analytics José Blakeley, Software Architect Database Systems Group, Microsoft Corporation.
© 2011 IBM Corporation 11 April 2011 IDS Architecture.
Performance and Scalability. Performance and Scalability Challenges Optimizing PerformanceScaling UpScaling Out.
#SQLSatRiyadh Special Topics Joe Chang
Shilpa Seth.  Centralized System Centralized System  Client Server System Client Server System  Parallel System Parallel System.
Early Hash Join: A Configurable Algorithm for the Efficient and Early Production of Join Results Ramon Lawrence University of Iowa
Scalable Data Warehouse & Data Marts ReportsAnalysis SQL Server DBMS SQL Server Integration Services Custom OLTP Increase usage & trust.

© 2008 Quest Software, Inc. ALL RIGHTS RESERVED. Benchmarking Advice & Recommendations August 2008.
Quantitative Performance Analysis Joe Chang
A Paradigm Shift in Database Optimization: From Indices to Aggregates Presented to: The Data Warehousing & Data Mining mini-track – AMCIS 2002 as Research-in-Progress.
Getting Started With Ingres VectorWise
Oracle Challenges Parallelism Limitations Parallelism is the ability for a single query to be run across multiple processors or servers. Large queries.
March 19981© Dennis Adams Associates Tuning Oracle: Key Considerations Dennis Adams 25 March 1998.
Module 7 Reading SQL Server® 2008 R2 Execution Plans.
Ashwani Roy Understanding Graphical Execution Plans Level 200.
Insert, Update & Delete Performance Joe Chang
© Dennis Shasha, Alberto Lerner, Philippe Bonnet 2004 DBMS Performance Monitoring.
Primary Key, Cluster Key & Identity Loop, Hash & Merge Joins Joe Chang
Applications hitting a wall today with SQL Server Locking/Latching Scale-up Throughput or latency SLA Applications which do not use SQL Server.
DBI313. MetricOLTPDWLog Read/Write mixMostly reads, smaller # of rows at a time Scan intensive, large portions of data at a time, bulk loading Mostly.
Achieving Scalability, Performance and Availability on Linux with Oracle 9iR2-RAC Grant McAlister Senior Database Engineer Amazon.com Paper
Log-structured Memory for DRAM-based Storage Stephen Rumble, John Ousterhout Center for Future Architectures Research Storage3.2: Architectures.
Parallel Execution Plans Joe Chang
Large Data Operations Joe Chang
Parallel Execution Plans Joe Chang
© 1999 FORWISS FORWISS MISTRAL Performance of TPC-D Benchmark and Datawarehouses Prof. R. Bayer, Ph.D. Dr. Volker Markl Dept. of Computer Science, Technical.
1 Chapter 10 Joins and Subqueries. 2 Joins & Subqueries Joins – Methods to combine data from multiple tables – Optimizer information can be limited based.
TPC-H Studies Joe Chang
Srik Raghavan Principal Lead Program Manager Kevin Cox Principal Program Manager SESSION CODE: DAT206.
SQL Server Scaling on Big Iron (NUMA) Systems Joe Chang TPC-H.
Exploiting Asynchronous IO using the Asynchronous Iterator Model Suresh Iyengar * S. Sudarshan Santosh Kumar # Raja Agrawal & IIT Bombay Current affiliations:
Multi-Way Hash Join Effectiveness M.Sc Thesis Michael Henderson Supervisor Dr. Ramon Lawrence 2.
Query Optimizer Execution Plan Cost Model Joe Chang
Buffer-pool aware Query Optimization Ravishankar Ramamurthy David DeWitt University of Wisconsin, Madison.
Solution to help customers and partners accelerate their data.
1 Chapter 13 Parallel SQL. 2 Understanding Parallel SQL Enables a SQL statement to be: – Split into multiple threads – Each thread processed simultaneously.
7 Strategies for Extracting, Transforming, and Loading.
Your Data Any Place, Any Time Performance and Scalability.
Scalable data access with Impala Zbigniew Baranowski Maciej Grzybek Daniel Lanza Garcia Kacper Surdy.
Contact Sambit Samal (sambits) for additional information on Benchmarks.
Execution Plans Detail From Zero to Hero İsmail Adar.
DESIGNING HIGH PERFORMANCE ETL FOR DATA WAREHOUSE. Best Practices and approaches. Alexei Khalyako (SQLCAT) & Marcel Franke (pmOne)
Understanding and Improving Server Performance
SQL Server 2000 and Access 2000 limits
IDISK Cluster 8 disks, 8 CPUs, DRAM /shelf
Introduction of Week 3 Assignment Discussion
The Key to the Database Engine
Query Execution in Main Memory DBMS
SQL Server Query Optimizer Cost Formulas
EXECUTION PLANS Quick Dive.
Database System Architectures
Presentation transcript:

SQL Server 2005 Performance Enhancements for Large Queries Joe Chang

When & Why Migrate to 2005 Performance gains in large queries In memory Disk performance Data Warehouse Applications Easiest to migrate Usually internal users Read-only queries

TPC-H benchmark Decision Support Large volumes of data Complex queries – 22 queries, 2 data refresh Power & Throughput metrics Power – run 1 queries at a time Throughput – run multiple concurrent streams

System Details 2003 Oct: SQL Server 2000 EE build Itanium 2 1.5GHz/6M, 64GB, 215 disks 2004 Aug: Oracle 10g EE 16 Itanium 2 1.5GHz/6M, 64GB, 166 disks Unisys ES7000 Aries Jun: SQL 2005 HP Integrity rx Itanium 2 1.6GHz/9M, 64GB, 342 disks 2005 Jul: Oracle 10g R disks OS: Windows Server 2003 Datacenter, SP1 for last

TPC-H 1000GB Results PowerThroughputComposite SQL 20007,330.63,687.45,199.1 Oracle 10g13,614.27,131.39,853.3 SQL 2005*19,241.09, ,637.8 Oracle 10g R2*20, , ,069.9 S2K5 SP1*23, , ,725.9 *1.6GHz/9M processor, others: 1.5GHz/6M

TPC-H 1000GB - Power Q1Q2Q3Q4Q5Q6Q7Q8Q9 Q10Q11Q12Q13Q14Q15Q16Q17Q18 Q19Q20Q21Q22 SQL Oracle 10g SQL X

TPC-H 1000GB - Power

TPC-H 1000GB - Throughput Q1Q2Q3Q4Q5Q6Q7Q8Q9 Q10Q11Q12Q13Q14Q15Q16Q17Q18 Q19Q20Q21Q22 SQL Oracle 10g SQL

TPC-H 1000GB - Throughput

TPC-H Query 1 SQL 2000 to X Power 1.30X Throughput SELECT L_RETURNFLAG,L_LINESTATUS,SUM(L_QUANTITY) AS SUM_QTY, SUM(L_EXTENDEDPRICE) AS SUM_BASE_PRICE, SUM(L_EXTENDEDPRICE*(1-L_DISCOUNT)) AS SUM_DISC_PRICE, SUM(L_EXTENDEDPRICE*(1-L_DISCOUNT)*(1+L_TAX)) AS SUM_CHARGE, AVG(L_QUANTITY) AS AVG_QTY,AVG(L_EXTENDEDPRICE) AS AVG_PRICE, AVG(L_DISCOUNT) AS AVG_DISC,COUNT(*) AS COUNT_ORDER FROM LINEITEM WHERE L_SHIPDATE <= DATEADD(dd, -90, cast('1998/12/01' as smalldatetime)) GROUP BY L_RETURNFLAG,L_LINESTATUS ORDER BY L_RETURNFLAG,L_LINESTATUS

TPC-H Query 3 SQL 2000 to X Power 3.3X Throughput SELECT TOP 10 L_ORDERKEY, SUM(L_EXTENDEDPRICE*(1-L_DISCOUNT)) AS REVENUE, O_ORDERDATE,O_SHIPPRIORITY FROM CUSTOMER INNER JOIN ORDERS ON C_CUSTKEY = O_CUSTKEY INNER JOIN LINEITEM ON L_ORDERKEY = O_ORDERKEY WHERE C_MKTSEGMENT = 'BUILDING' AND O_ORDERDATE ' ' GROUP BY L_ORDERKEY,O_ORDERDATE,O_SHIPPRIORITY ORDER BY REVENUE DESC,O_ORDERDATE

Query 8 SQL 2000 to X Power 8.9X Throughput

10GB Line Item in memory test SQL Server 2000 SP4 Query time in milli- seconds versus Max Degree of Parallelism Limited parallelism gains beyond 4-8P SQL Server 2005 June CTP Continued parallelism gains beyond 4P

10GB Line Item in memory test 1 SQL Server 2000 SP4 Query time in milli- seconds versus Max Degree of Parallelism Limited parallelism gains beyond 4-8P SQL Server 2005 June CTP Continued parallelism gains beyond 4P

10GB Line Item in memory test 2 SQL Server 2000 SP4 Query time in milli- seconds versus Max Degree of Parallelism Limited parallelism gains beyond 4-8P SQL Server 2005 June CTP Continued parallelism gains beyond 4P

10GB in memory test Max Degree of Parallelism 1 Max DOP 4 Max DOP 16 30% reduction in total time at 1P 50% reduction in total time at 16P

In-Memory Table Scan Xeon systems have better table scan performance on SQL 2000 SQL 2000 default table scan performance depends on build

Table Scan to Disk SQL 2005 Table Scan performance probably limited by disk system

Loop Join – SQL 2000 & 2005 SQL 2000 SQL 2005 No scaling from 1-2P Better overall performance, scaling from 1-8P

Hash Join SQL 2000 SQL 2005 Unpredictable behavior below 3M rows Limited scaling Consistent behavior < 1M rows Not as much fall off in large joins

Merge Join – SQL 2000 SQL 2005 does not use parallel merge joins? Slight decrease in non-parallel performance. SQL 2000 parallel merge joins has serious problems, runs slower, consumes more CPU. It should be possible to performance a parallel merge efficiently?

LiteSpeed Backup Performance HP rx8620, 16 x 1.5GHz Itanium 2 processors 2 EVA 5000 (4 HSV110 controllers, 8 enclosures x14 disks each 8 x 2Gbit/sec FC ports

SQL 2005 Performance Summary Significant improvements in large queries Data Warehouse Applications Improved disk performance

More Information SQL Server Quantitative Performance Analysis Server System Architecture Processor Performance Direct Connect Gigabit Networking Parallel Execution Plans Large Data Operations Transferring Statistics SQL Server Backup Performance with LiteSpeed

System Configuration

Bookmark Lookup Performance SQL 2005 has better bookmark lookup performance to Clustered Indexes SQL 2000 has better bookmark lookup performance to Heap organized tables