Download presentation
Presentation is loading. Please wait.
1
From adaptive to intelligent:
query processing in SQL Server 2019 Hugo Kornelis @Hugo_Kornelis
3
Hugo Kornelis I make SQL Server Fast I make SQLServerFast.com
Execution Plan Reference: Detailed description of all operators, and other relevant information Other content Blog: Articles Longer, even more detailed in-depth information Resources Deck and demo for this session (and others)
4
Hugo Kornelis I make SQLServerFast.com I do other community things
I work (consulting, training) Contact details
5
Query Processing Query optimization Query
6
Query Processing Query optimization Query execution Query
Cardinality estimates Reality
7
Adaptive Query Processing
SQL Server 2017
8
Adaptive Query Processing
Source:
9
Adaptive Intelligent Query Processing
SQL Server 2019
10
Intelligent Query Processing
Source:
11
Memory Grant Feedback Memory Grant: memory required to execute plan
Determined during compile time Based on cardinality estimates So … can be wrong
12
Memory Grant Feedback Memory Grant: memory required to execute plan
Determined during compile time Under-estimated? Spills to tempdb! Can be very slow!
13
Memory Grant Feedback Memory Grant: memory required to execute plan
Determined during compile time Under-estimated? Spills to tempdb! Over-estimated? Resources wasted! Impacts concurrency
14
Memory Grant Feedback Memory Grant: memory required to execute plan
Determined during compile time Under-estimated? Spills to tempdb! Over-estimated? Resources wasted! Memory Grant Feedback Compilation and execution as normal Cached plan updated after execution
15
Memory Grant Feedback Memory Grant Feedback: Details
Adjusts down if >50% unused Adjusts up after any spill Based on last execution only Can “chase own tail” in alternating patterns Increases thresholds after X consecutive adaptations Stops adapting after Y consecutive adaptations
16
Memory Grant Feedback Memory Grant Feedback: Details
Adjusts down if >50% unused Adjusts up after any spill Stores last execution only Information stored in plan cache Forgotten when plan recompiles
17
Memory Grant Feedback Memory Grant Feedback: Details
Adjusts down if >50% unused Adjusts up after any spill Stores last execution only Information stored in plan cache Extended events available spilling_report_to_memory_grant_feedback memory_grant_updated_by_feedback memory_grant_feedback_loop_disabled (debug channel)
18
Intelligent Query Processing
Source:
19
Table Variable Deferred Compilation
20
Table Variable Deferred Compilation
21
Table Variable Deferred Compilation
22
Table Variable Deferred Compilation
5944 ∗0.3 -- (inequality)
23
Table Variable Deferred Compilation
5944 -- (equality)
24
Intelligent Query Processing
Source:
25
? ? Batch Mode on Rowstore Traditional processing (row mode)
GetNext() GetNext() ? GetNext() ? GetNext()
26
? Batch Mode on Rowstore Batch mode processing GetNext() GetNext()
27
Batch Mode on Rowstore Batch mode processing
Introduced in SQL Server 2012 Benefits analytic operations (join, aggregation, window functions) Benefits processing of large data collections Overhead / startup cost Requires columnstore index on at least one table in query Trickery to get batch mode without columnstore index
28
Batch Mode on Rowstore Batch mode on rowstore New in SQL Server 2019
Batch mode without columnstore index – no trickery! “Interesting” tables used? CTP 2.2: >= 131,702 rows “Interesting” operations used? CTP 2.2: join, aggregation, windows aggregate with >= 131,702 rows Compare estimated cost No non-supported features
29
Batch Mode on Rowstore Batch mode on rowstore New in SQL Server 2019
Batch mode without columnstore index – no trickery! Limitations In-memory tables and indexes Can only be read in row mode, rest of plan can still use batch mode LOB data, XML, spatial, full-text search, cursors No batch mode at all
30
Batch Mode on Rowstore Batch mode on rowstore New in SQL Server 2019
Batch mode without columnstore index – no trickery! Limitations Sometimes batch mode can actually be slower than row mode Many of these cases will improve before RTM But … do test and monitor your workloads!
31
Intelligent Query Processing
Source:
32
Scalar UDF Inlining Scalar user-defined functions Good for development
Code encapsulation Code reuse Bad for performance Executed once per row Optimizer crippled No parallelism
33
Scalar UDF Inlining The solution: FROID Runs at parse time
Tries to convert UDF into equivalent query expression (In internal represenatation) Does some optimization (constant folding, dead code elimination, and others) Injects this with APPLY in query Result goes to Query Optimizer
34
Scalar UDF Inlining Example
35
Scalar UDF Inlining FROID equivalent (simplified!)
36
Scalar UDF Inlining FROID execution plan (CTP 2.2)
37
Scalar UDF Inlining FROID execution plan (CTP 3.0)
38
Scalar UDF Inlining FROID performance (measured on RC1)
Old behavior (compatibility level 140 or lower) 10 executions took 8.6 seconds FROID enabled (compatibility level 150) 10 executions took 1.7 seconds
39
Scalar UDF Inlining Limitations of FROID Limitations on UDF itself
No functions that can change between calls Time-related, e.g. GETDATE(), CURRENT_TIMESTAMP, … Affects state for future call, e.g. NEWSEQUENTIALID(), RAND() , … No loops (WHILE) in UDF code No table variables used Check sys.sql_modules.is_inlineable to verify if UDF qualifies
40
Scalar UDF Inlining Limitations of FROID Limitations on UDF itself
Limitations on how/where UDF is used UDF itself not used in GROUP BY Nesting and recursion are partly supported Replacement stops at certain level
41
http://tinyurl.com/FROID-dox Scalar UDF Inlining Limitations of FROID
Limitations on UDF itself Limitations on how/where UDF is used Actual list is too long to include here Check full documentation on MSDN
42
Intelligent Query Processing
Source:
43
Approximate Count Distinct
Sometimes, a close estimate is “good enough” Can be very beneficial for some specific problems For example “how many unique visitors on our web site this month?” (On a site that logs 10 million pageviews per day) We care about trend, not about the exact correct number
44
Approximate Count Distinct
Standard approach (using COUNT DISTINCT)
45
Approximate Count Distinct
New alternative (using APPROX_COUNT_DISTINCT)
46
Approximate Count Distinct
How does it work? HyperLogLog algorithm (based on Flajolet-Martin algorithm) Flajolet-Martin: Hash to get “pseudo-random” value for each input value Find number of zeroes after last one in the binary representation of the hash (Can also use left-most zeroes, left-most ones, etc. – just a choice) Track highest number of zeroes in the input (50% of values end in 1 / 25% in 10 / 12.5% in 100 / 6.25% in 1000 / etc) The higher this number, the less likely – so probably more distinct values
47
Approximate Count Distinct
How does it work? HyperLogLog algorithm (based on Flajolet-Martin algorithm) Flajolet-Martin Weaknesses of Flajolet-Martin High variance (even single row can have value that hashes to 0x ) Hash collisions can also cause incorrect results
48
Approximate Count Distinct
How does it work? HyperLogLog algorithm (based on Flajolet-Martin algorithm) Flajolet-Martin Weaknesses of Flajolet-Martin Flajolet-Martin HyperLog HyperLogLog Divides input in separate subsets (based on first bits in binary hash) Computes Flajolet-Martin for each subset Final result based on harmonic mean of the results per subset Error margin: within 2% for at least 97% of all use cases (In other words: 3% chance to be more than 2% wrong)
49
Intelligent Query Processing
Source:
50
Twitter: @Hugo_Kornelis
T H E E N D Questions?
52
Twitter: @Hugo_Kornelis
T H E E N D Questions?
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.