A developers guide to Azure SQL Data Warehouse

Slides:



Advertisements
Similar presentations
Windows Azure Migrating SQL Server Workloads Speaker Title Organization.
Advertisements

Training Workshop Windows Azure Platform. Presentation Outline (hidden slide): Technical Level: 200 Intended Audience: Developers Objectives (what do.
Technology Drill Down: Windows Azure Platform Eric Nelson | ISV Application Architect | Microsoft UK |
Azure SQL DW – Elastic Data Analytics in the cloud Josh Sivey | Microsoft TSP #492 | Phoenix.
3 Ways to Integrate Business Systems to Partners
Microsoft Connect /6/ :05 AM
Use relational database as a service
Cloud BI with Azure Analysis Services
4/18/2018 6:56 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Data Platform and Analytics Foundational Training
Data Platform Modernization
Azure SQL Data Warehouse for Beginners
5/7/ :44 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
5/9/2018 7:28 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS.
Advanced Topics for Azure SQL Data Warehouse
System Center Marketing
Creating Enterprise Grade BI Models with Azure Analysis Services
System Center Marketing
Business Critical Application Platform
Microsoft /2/2018 3:42 PM BRK3129 Query Big Data using the Expanded T-SQL footprint with PolyBase in SQL Server 2016 Casey Karst Program Manager.
Microsoft Azure: The only consistent Hybrid Cloud
Why Is My SQL DW Query Slow?
SQL DB Elastic Pools for SaaS
Azure Functions and Automation: The SQL Agent in the Cloud
Microsoft Build /22/ :52 PM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY,
7/22/2018 9:21 PM BRK3270 Building a Better Data Solution: Microsoft SQL Server and Azure Data Services Joey D’Antoni Principal Consultant Denny Cherry.
SQL Server + PHP: What’s New
Windows Azure Migrating SQL Server Workloads
Installation and database instance essentials
Azure API Management Jothi Prakash A
Mission-critical performance with Microsoft SQL Server 2016
9/13/2018 © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks.
Azure SQL Datawarehouse - Datawarehouse on Cloud
Machine Learning, Analytics, & Data Science Conference
02 | Design and implement database
A developers guide to Azure SQL Data Warehouse
9/21/2018 3:41 AM BRK3180 Architect your big data solutions with SQL Data Warehouse & Azure Analysis Services Josh Caplan & Matt Usher Program Managers.
Azure SQL Data Warehouse Scaling: Configuration and Guidance
Analytics for Apps: Landing and Loading Data into SQL Data Warehouse
Microsoft Build /9/2018 5:08 AM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY,
Migrating Your BI Platform To Azure
CloudSimplified.IO.
What is the Azure SQL Datawarehouse?
Data Platform Modernization
Azure SQL Data Warehouse Performance Tuning
Server & Tools Business
BRK2279 Real-World Data Movement and Orchestration Patterns using Azure Data Factory Jason Horner, Attunix Cathrine Wilhelmsen, Inmeta -
Azure SQL DWH: Tips and Tricks for developers
20 Questions with Azure SQL Data Warehouse
Cloud BI with Azure Analysis Services
Azure SQL DWH: Tips and Tricks for developers
Azure SQL DWH: Optimization
Managing batch processing Transient Azure SQL Warehouse Resource
Microsoft Build /14/2019 8:42 AM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY,
Sunil Agarwal | Principal Program Manager
Microsoft Connect /17/2019 9:55 PM
Welcome to Azure Notebooks
Context about the Data Warehouse
Azure SQL DWH: Tips and Tricks for developers
Azure SQL DWH: Tips and Tricks for developers
Windows Device Portal Hirsch Singhal Program
5/30/2019 1:59 PM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION.
ETL Patterns in the Cloud with Azure Data Factory
Microsoft Virtual Academy
Microsoft Virtual Academy
New, Old or Pink - it is all good
Microsoft Virtual Academy
SQL Server 2019 Bringing Apache Spark to SQL Server
Visual Data Flows – Azure Data Factory v2
Visual Data Flows – Azure Data Factory v2
Presentation transcript:

A developers guide to Azure SQL Data Warehouse Microsoft Build 2016 11/23/2018 3:39 AM A developers guide to Azure SQL Data Warehouse James Rowland-Jones (JRJ) © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

IoT Scenario Persist Export Query Stream Import Write

What is SQL DW? (and when to use it)

Analytical workloads Store large volumes of data Microsoft Build 2016 11/23/2018 3:39 AM Analytical workloads Store large volumes of data Consolidate disparate data into a single location Shape, model, transform and aggregate data Perform query analysis across large datasets Ad-hoc reporting across large data volumes All using simple SQL constructs “SQL on SQL” © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Unsuitable workloads Operational workloads (OLTP) High frequency reads & writes Large numbers of singleton selects High volumes of single row inserts Procedural ETL Row by row processing needs Incompatible formats (JSON, XML)

Logical Overview Control Compute Storage Microsoft Build 2016 11/23/2018 3:39 AM Logical Overview Compute Control Storage © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Distributed queries Query Result Control Compute Storage Microsoft Build 2016 11/23/2018 3:39 AM Distributed queries Query Result Control Compute Storage © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Fully managed PaaS Microsoft Build 2016 11/23/2018 3:39 AM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Geo-redundant Microsoft Build 2016 11/23/2018 3:39 AM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Connectivity Windows or Linux ODBC JDBC ADO.NET PHP Microsoft Build 2016 11/23/2018 3:39 AM Connectivity Windows or Linux ODBC JDBC ADO.NET PHP © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Summary Scale-out distributed query engine Microsoft Build 2016 11/23/2018 3:39 AM Summary Scale-out distributed query engine De-coupled storage from compute Fully managed Completely elastic Platform as a Service (PaaS) Petabyte scale Leveraging cloud ecosystem Broad range of connectivity options © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Provisioning SQL DW

Demo: Provisioning

Summary Full provisioning experience Partial provisioning experience PowerShell Azure portal REST API Partial provisioning experience T-SQL CREATE DATABASE T-SQL scale with ALTER DATABASE sys.database_service_objectives shows current configuration sys.dm_operation_status shows progress performing provisioning operations

Designing tables Microsoft Build 2016 11/23/2018 3:39 AM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

HASH DISTRIBUTION 02 01 01 03 HASH ( ) 01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60

Demo: Table design

Sizing for partitioning Microsoft Build 2016 11/23/2018 3:39 AM Sizing for partitioning Factors 1 TiB Scenario 50 TiB Scenario Size of dataset 1 TiB 50 TiB Distribution Count 60 Compression Ratio 5 Skew? No GB per distribution 3.3 166.7 # Partitions 36 GB per partition 0.092 4.62 © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Guidance Keep column definitions strongly typed Distribution key is read only Aim for a not null definition for distribution key Ensure columns are defined consistently Especially true for the distribution key Hash distribution optimises data layout JOIN and GROUP BY columns are often the best candidates Partition for data management Consider columns used in the WHERE clause Evaluate query date ranges (month, quarter) as part of the partitioning strategy

Loading data

Demo: Loading Data

Trickle loading guidance >= 102,400 rows per distribution = Columnstore <102,400 rows per distribution = Row storage Assuming even distribution: 6,144,000+ rows required in Bulk Insert 500 Rows/Sec 1000 2000 Load threshold exceeded (hours) <3.5 hours < 2 hours < 1 hour

Loading compressed text Guidance Evenly split the data into multiple files One file per reader DWU Readers Writers DW100 8 60 DW200 16 DW300 24 DW400 32 DW500 40 DW600 48 DW1000

Other loading methods Azure Data Factory SSIS Bcp 3rd party data loading tools

Resources PolyBase loading: http://aka.ms/acom-polybase-load Re-visit Build on Channel 9. Continue your education at Microsoft Virtual Academy online.