Download presentation
Presentation is loading. Please wait.
1
Azure SQL DWH: Tips and Tricks for developers
SQL Saturday Cork Welcome to Cork Sergiy Lunyakin Azure SQL DWH: Tips and Tricks for developers
2
About me Big Data Architect at CoE in SoftServe Inc.
Data Platform MVP, MCSE BI, MCSA Cloud Platform Leader of Speaker at SQL conferences Organizer of SQLSaturday Lviv Contacts: @slunyakin
3
Agenda What is Azure SQL DW Architecture of Azure SQL DW Limitations
Check compatibility Handling cross-database query Handling Identity Handling ANSI Update/Delete/Merge/SCD Handling Compute columns Handling Cursor
4
What is Azure SQL DW Microsoft Azure Platform as a Service
It’s a Massively Parallel Processing system (MPP) Distributed Compute and Distributed Storage Scale up and down in couple minutes Pause compute resources Supports a subset of T-SQL Join with external data in Azure Blob Storage and ADLS
5
Architecture of Azure SQL DW
Dist_DB_1 Dist_DB_2 Dist_DB_15 Dist_DB_16 Dist_DB_17 Dist_DB_30 Dist_DB_46 Dist_DB_47 Dist_DB_60 … … … … … …
6
Logical Overview Control Compute Storage Microsoft Build 2016
11/23/2018 6:38 AM Logical Overview Compute Control Storage © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.
7
Distributed queries Query Result Control Compute Storage
Microsoft Build 2016 11/23/2018 6:38 AM Distributed queries Result Query Control Compute Scale-out distributed query engine Storage © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.
8
Distributed Query SELECT SUM(*) FROM dbo.[FactInternetSales] ; Control
SELECT COUNT_BIG(*) FROM dbo.[FactInternetSales] ; SELECT SUM(*) FROM dbo.[FactInternetSales] ; Control Compute SELECT COUNT_BIG(*) FROM dbo.[FactInternetSales] ; SELECT COUNT_BIG(*) FROM dbo.[FactInternetSales] ; SELECT COUNT_BIG(*) FROM dbo.[FactInternetSales] ; SELECT COUNT_BIG(*) FROM dbo.[FactInternetSales] ;
9
Limitations Primary/Foreign Keys Identity Computed Columns Triggers
Cross-database joins Sequences Cursors MERGE ANSI joins on updates/deletes More limitations:
10
Check compatibility Data warehouse migration utility Free tool
Helps to identify unsupported features Helps to identify HASH distribution column Migrate scheama Migrate data (BCP tool)
11
Cross-database query Azure SQL DW doesn’t support cross-database query. Use ELT approach. Separate schemas. Use External tables as staging tables.
12
CTAS CTAS is super-charched version of SELECT...INTO Parallelized
Better for Data import Data copy Workarounds CREATE TABLE [dbo].[FactInternetSales_new] WITH ( DISTRIBUTION = ROUND_ROBIN , CLUSTERED COLUMNSTORE INDEX ) AS SELECT * FROM [dbo].[FactInternetSales];
13
Identity Handle it on source side IDENTITY property
Explicit import Doesn’t support CTAS Custom Identity with ROW_NUMBER
14
ANSI JOINS Update/Del/Merge
Update/Delete doesn’t support JOINS in FROM Use CTAS for preparing interim table with JOINS Use CTAS for Merge workaround Split Merge to operation steps and use UNION ALL Use interim table for big number of steps Use partitioning for big tables, don’t reload the whole table
15
Compute columns Handle it in source system Use CTAS during import
Create a View Use explicit data type and nullability check Wrong data during migration Schema error during partition switch
16
Cursor WHILE for lopping List of elements as a table
Loop through list using While clause and variable
17
Summary MPP PaaS Service in Azure Cloud
Storing and processing huge amount of structure data Limitation: Identity, ANSI JOINS, MERGE CTAS - Super-charged version of SELECT...INTO CTAS good way for workarounds Better reload data with CTAS than Row-By-Row operations
18
SQL Saturday Cork Thank you! Questions?
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.