SQL Server 2008 R2 Parallel Data Warehouse: Under the Hood Brian Mitchell Senior Premier Field Engineer.

Slides:



Advertisements
Similar presentations
Tableau Software Australia
Advertisements

2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN TechTalk Beste Skalierbarkeit dank massiv.
Big Data Working with Terabytes in SQL Server Andrew Novick
WSUS Presented by: Nada Abdullah Ahmed.
Microsoft Data Warehouse Vision Massive Scalability at Low Cost Improved Business Agility and Alignment Democratized Business Intelligence Hardware.
High Performance Analytical Appliance MPP Database Server Platform for high performance Prebuilt appliance with HW & SW included and optimally configured.
Danny Tambs Solution Architect. VOLUME (Size) VARIETY (Structure) VELOCITY (Speed)
Microsoft Ignite /16/2017 5:47 PM
Fast Track, Microsoft SQL Server 2008 Parallel Data Warehouse and Traditional Data Warehouse Design BI Best Practices and Tuning for Scaling SQL Server.
Simplify your Job – Automatic Storage Management Angelo Session id:
April 10-12, Chicago, IL PDW Architecture Gets Real: Customer Implementations Brian Walker | Microsoft Corporation PDW Center of Excellence Murshed Zaman.
Build it yourself Custom configurations High IT expertise “Cooking recipe” Probably higher success Can be ‘sold’ to customers Tied to HW vendor Very.
Windows ® Powered NAS. Agenda Windows Powered NAS Windows Powered NAS Key Technologies in Windows Powered NAS Key Technologies in Windows Powered NAS.
Cluster computing facility for CMS simulation work at NPD-BARC Raman Sehgal.
Chapter 10 : Designing a SQL Server 2005 Solution for High Availability MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design.
SQL Server Warehousing (Fast Track 4.0 & PDW)
SPONSORS. Microsoft PowerPivot for SQL Server, Excel 2010, and SharePoint 2010 Michael Herman Syntergy, Inc.
Maintaining a Microsoft SQL Server 2008 Database SQLServer-Training.com.
Bob Thome, Senior Director of Product Management, Oracle SIMPLIFYING YOUR HIGH AVAILABILITY DATABASE.
Training Workshop Windows Azure Platform. Presentation Outline (hidden slide): Technical Level: 200 Intended Audience: Developers Objectives (what do.
Microsoft ® System Center Operations Manager Infrastructure Planning and Design Published: November 2012.
SQL Server Data Warehousing Overview
DBI332 ilikesql brianwmitchelll UNSTRUCTURED UNBALANCED UNPREDICTABLE.
Copyright © 2010, Scryer Analytics, LLC. All rights reserved. Optimizing SAS System Performance − A Platform Perspective Patrick McDonald Scryer Analytics,
DBSQL 14-1 Copyright © Genetic Computer School 2009 Chapter 14 Microsoft SQL Server.
SQL Server 2008 R2 for the DBA Patrick LeBlanc. Objectives  New Editions  Datacenter  Parallel Data Warehouse  Multi-server management  Utility Control.
Data Warehousing at Acxiom Paul Montrose Data Warehousing at Acxiom Paul Montrose.
Windows Azure Conference 2014 Deploy your Java workloads on Windows Azure.
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. HP Enterprise Data Warehouse.
SESSION CODE: BIE07-INT Eric Kraemer Senior Program Manager Microsoft Corporation.
Data Management Conference Data Warehousing John Plummer TSP Architect
2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN SQL Server 2012 Parallel Data Warehouse.
Building BI Solutions with SQL Server PDW AU3 Ruwen Hess Senior Program Manager Microsoft Corporation DBI321.
Data Staging Data Loading and Cleaning Marakas pg. 25 BCIS 4660 Spring 2012.
Srik Raghavan Principal Lead Program Manager Kevin Cox Principal Program Manager SESSION CODE: DAT206.
2012 © Trivadis BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. HAMBURG MÜNCHEN STUTTGART WIEN Welcome November 2012 Vorstellung Parallel.
Solution to help customers and partners accelerate their data.
Dell Confidential Dell Database Solutions: SQL Server 2005 APR 2006.
Managing and Monitoring the Microsoft Application Platform Damir Bersinic Ruth Morton IT Pro Advisor Microsoft Canada
Infrastructure for Data Warehouses. Basics Of Data Access Data Store Machine Memory Buffer Memory Cache Data Store Buffer Bus Structure.
Last Updated : 27 th April 2004 Center of Excellence Data Warehousing Group Teradata Performance Optimization.
Rushabh Mehta Managing Director (India) | Solid Quality Mentors
Storage Netværk Mød Microsoft Feb 2005, Agenda Data Protection Server (opdatering) Microsoft og iSCSI Demo.
SMP MPP with PDW ** Workload requirements usually drive the architecture decision.
Microsoft Analytics Platform System Stefan Cronjaeger, Microsoft.
Business Connectivity Services BCS (Old BDC) New read and write access to external data –Line-of-business (LOB) systems –Web services –Databases –Other.
Configuring SQL Server for a successful SharePoint Server Deployment Haaron Gonzalez Solution Architect & Consultant Microsoft MVP SharePoint Server
DESIGNING HIGH PERFORMANCE ETL FOR DATA WAREHOUSE. Best Practices and approaches. Alexei Khalyako (SQLCAT) & Marcel Franke (pmOne)
Apache Hadoop on Windows Azure Avkash Chauhan
Sql Server Architecture for World Domination Tristan Wilson.
IBM Systems and Technology Group © 2008 IBM Corporation Oracle Exadata Storage and the HP Oracle Database Machine Competitive Seller Podcast Mark Wulf.
Presented by: Aaron Stanley King.  Benefits of SQL Azure  Features of SQL Azure  Demos, Demos, Demos!  How to query in SQL Azure  More Demos!  Recent.
…the secret sauce! Diagrams and video from Microsoft white papers and slide decks.
Data Platform and Analytics Foundational Training
Netscape Application Server
with the Microsoft BI Ecosystem
Welcome! Thank you for joining us. We’ll get started in a few minutes.
Data Warehouse in the Cloud – Marketing or Reality?
The New Possibilities in Microsoft Business Intelligence
SQL Server 2008 R2 – The Newest and the Best
Planning Data Warehouse Infrastructure
Data Warehousing: SQL Server Parallel Data Warehouse AU3 update
Auditing in SQL Server 2008 DBA-364-M
Overview of Fast Track and PDW
SQL 2014 In-Memory OLTP What, Why, and How
Oracle Architecture Overview
Designing Business Intelligence Solutions with Microsoft SQL Server
Dana Kaufman SQL Server Appliance Engineering
Microsoft Analytics Platform System 03 – Distribution Theory & Design
Dell EMC SQL Server Solutions Doug Bernhardt
Presentation transcript:

SQL Server 2008 R2 Parallel Data Warehouse: Under the Hood Brian Mitchell Senior Premier Field Engineer

Tier 1 Enterprise Data Warehouse Appliance Offering –High scalability from 10s to100s of terabytes –High performance through MPP system Flexibility and Choice –Choice of deployment options through distributed architecture – Highly Scalable Most Comprehensive Solution –Complete data warehouse solution spanning desktop, enterprise data warehouse (EDW), and data marts –Deep integration with Microsoft business intelligence (BI) –Comprehensive toolset for BI, ETL, MDM, and streaming data Introducing Parallel Data Warehouse

Agenda SQL Server 2008 R2 PDW Overview Disk CPU Memory

Appliance Model Sold as a “black box” to customers End-to-end solution includes software and hardware Preconfigured from vendor Based on a balanced reference architecture Hardware specifications promote data warehousing workloads Provides enterprise-level redundancy

Appliance Hardware Schema

PDW: High Availability Failover Clustering Dual Networking –Dual Infiniband –Dual Ethernet –Dual Fiber Channel Dual Power Storage –RAID 0 –Hot Spare

PDW Benefits Appliance Model –System arrives assembled with software pre- installed Appliance optimized for DW Workloads CPU and IO bandwidth is balanced for scan-intensive queries Simple to get running and productive

PDW Advantages All loads and queries are highly parallel, automatically All DML (Inserts, Updates) are also parallel * Can increase scale and reduce execution time by adding compute racks Fewer ‘knobs’, less complexity at DBA level –Eliminates physical file layout considerations from database and table creation –Memory, parallelism, and many other SQL configuration options preset and fixed

PDW: Built on Tech You Know Windows Server 2008 (SP2) SQL Server 2008 (SP2) Failover Clustering Web Based Admin Console SQL Server 2008 R2 BI Tools connect natively –Analysis Services –Reporting Services –Integration Services –PowerPivot

Demo: PDW Built on Tech You Know

PDW: Basic Concepts

Create Database Syntax CREATE DATABASE database_name WITH ( [ AUTOGROW = ON | OFF, ] REPLICATED_SIZE = replicated_size [ GB ], DISTRIBUTED_SIZE = distributed_size [ GB ], LOG_SIZE = log_size [ GB ] ) [;] Example CREATE DATABASE BigData WITH (AUTOGROW = ON, REPLICATED_SIZE = 1024, DISTRIBUTED_SIZE = 16384, LOG_SIZE = 1024 )

Create Table Examples Replicated Table CREATE TABLE myTable ( id integer NOT NULL, lastName varchar(20), zipCode varchar(6) ); Distributed Table CREATE TABLE myTable ( id integer NOT NULL, lastName varchar(20), zipCode varchar(6) WITH ( DISTRIBUTION = HASH (id)) );

PDW: Handling Disk I/O

Replicating Tables 15 dimTime Date Dim ID Calendar Year Calendar Qtr Calendar Mo Calendar Day Date Dim ID Calendar Year Calendar Qtr Calendar Mo Calendar Day dimStore Store Dim ID Store Name Store Mgr Store Size Store Dim ID Store Name Store Mgr Store Size dimProduct Prod Dim ID Prod Category Prod Sub Cat Prod Desc Prod Dim ID Prod Category Prod Sub Cat Prod Desc factSales Date Dim ID Store Dim ID Prod Dim ID Mktg Camp Id Qty Sold Dollars Sold DimMktCampaign Mktg Camp ID Camp Name Camp Mgr Camp Start Camp End TDTD TDTD PDPD PDPD SDSD SDSD MDMD MDMD TDTD TDTD PDPD PDPD SDSD SDSD MDMD MDMD TDTD TDTD PDPD PDPD SDSD SDSD MDMD MDMD Smaller Dimension Tables are Replicated on Every Compute Node TDTD TDTD PDPD PDPD SDSD SDSD MDMD MDMD

Ultra Shared Nothing SQL Server PDW –Stores a portion of each table in each compute node –Stores 8 “portions” per compute node Called Distributions –Table Scan: all distributions on all nodes

Distributing Tables 17 dimTime Date Dim ID Calendar Year Calendar Qtr Calendar Mo Calendar Day Date Dim ID Calendar Year Calendar Qtr Calendar Mo Calendar Day dimStore Store Dim ID Store Name Store Mgr Store Size Store Dim ID Store Name Store Mgr Store Size dimProduct Prod Dim ID Prod Category Prod Sub Cat Prod Desc Prod Dim ID Prod Category Prod Sub Cat Prod Desc factSales Date Dim ID Store Dim ID Prod Dim ID Mktg Camp Id Qty Sold Dollars Sold dimMktCampaign Mktg Camp ID Camp Name Camp Mgr Camp Start Camp End TDTD TDTD PDPD PDPD SDSD SDSD MDMD MDMD TDTD TDTD PDPD PDPD SDSD SDSD MDMD MDMD TDTD TDTD PDPD PDPD SDSD SDSD MDMD MDMD TDTD TDTD PDPD PDPD SDSD SDSD MDMD MDMD Larger Fact Table is Hash Distributed Across All Compute Nodes SF -1 SF -2 SF -3 SF -4

PDW: Database Filegroups

PDW: Database Files

PDW Compute Node SAN Architecture **N+1 cluster architecture

Handling Processing Throughput

CPU Each Compute Node is set up using Soft-Numa Each Compute Node Listens on Multiple Ports Each Port is mapped to a Soft-Numa Node

PDW: Affinity PDW Engine SELECT Name FROM tableA WHERE state = ‘TX ’ A B C D E F G H Filegroups Soft-Numa LUNs 8 Connections to SQL Server Affinitized to Cores Affinitized to Tables on FileGroups Affinitized to Disks Compute Node

PDW: Memory For Everyone

PDW Memory: Resource Governor QueryGroup_A QueryGroup_B QueryGruop_D QueryGroup_C QueryGroup_E QueryGroup_F QueryGroup_G QueryGroup_H QueryPool_A 11% QueryPool_B 11% QueryPool_D 11% QueryPool_C 11% QueryPool_E 11% QueryPool_F 11% QueryPool_G 11% QueryPool_H 11% PDW Engine SELECT Name FROM tableA WHERE state = ‘TX ’ RAM Compute Node 1

Monitoring PDW Admin Console DMV’s DBCC Commands PDW Logs DMS Logs SQL Server Logs Event Logs Cluster Logs

Monitoring PDW - Demo

Please Complete the Evaluation Form Pick up your evaluation form: In each presentation room Drop off your completed form Near the exit of each presentation room At the registration area SQL Server 2008 R2 Parallel Data Warehouse: Under the Hood 28 Presented by Dell

THANK YOU! For attending this session and PASS SQLRally Orlando, Florida Session Code | Session Title 29 Presented by Dell