Presentation is loading. Please wait.

Presentation is loading. Please wait.

Roger Moore – Data Warehouse SSP 972-955-0426.

Similar presentations


Presentation on theme: "Roger Moore – Data Warehouse SSP 972-955-0426."— Presentation transcript:

1 Roger Moore – Data Warehouse SSP

2 Microsoft Confidential Microsoft Data Warehouse Strategy SQL DW & BI SQL Server Fast Track Madison Overview – SQL MPP (DATAllegro) Hub and Spoke Multi-Temperature MTP – Technology Preview (PoC) Summary

3 END USER TOOLS & PERFORMANCE MANAGEMENT APPS Excel PerformancePoint Server BI PLATFORM SQL Server Reporting Services SQL Server Analysis Services SQL Server DBMS SQL Server Integration Services SharePoint Server DELIVERY ReportsDashboardsExcelWorkbooksAnalyticViewsScorecardsPlans

4 Heterogeneous Connectivity & Workloads Data Integrity & Quality Compliance & Security Data Warehouse Scale Data Warehouse Management Futures PB Warehouses >64 Core Processing Scale out through MPP Perf. Management Tools BI Resource Governance Improved Predictability Mixed workload support Continuous Loading Integrated DQ Services (Zoomix) Master Data Management (Stratature Integration) Rights Management 10s of TB Warehouses Parallel partitioning Data compression New Reference Architectures Policy Based Admin. DB Resource Governance High Perf. Connectors (Oracle, Teradata, SAP BW) Data Profiling Policy based auditing Multi TB Warehouses Enterprise scalability DW Reference Architectures Unified manageability Enterprise class ETL tool Data Cleansing (Fuzzy lookup/matching) Data Protection & Tracing

5 Building a traditional DW Time consuming Expensive Performance varies Scalability issues Potential bottlenecks in standard DW architecture The DW appliance model Pre-built & tuned h/w + s/w Views entire stack holistically Known performance & scalability Encapsulates best practices Leverages Sequential I/O Lower TCO Faster deployment Better performance Minimised DBA time Benefits

6 Helping Customers & Partners Accelerate Their Data Warehouse Deployments

7 7 Microsoft NDA-only Software: SQL Server 2008 Enterprise Windows Server 2008 Hardware: Tight specifications for servers, storage and networking Per core building block Configuration guidelines: Physical table structures Indexes Compression SQL Server settings Windows Server settings Loading

8 FC HBA A B A B FC HBA A B A B FC SWITCH STORAGE CONTROLLER A B A B CACHE SERVER CACHE SQL SERVER WINDOWS CPU CORES CPU Feed RateHBA Port RateSwitch Port RateSP Port Rate A B DISK LUN DISK LUN SQL Server Read Ahead Rate LUN Read RateDisk Feed Rate SQL Server 2008 Potential Performance Bottlenecks

9 Sequential I/O Ideal for data warehousing Scalable, predictable performance Large reads & writes Requires 1/3 or fewer drives for same performance Random I/O Ideal for OLTP Not as predictable & scalable for data warehousing Small reads and writes Requires large number of drives

10 2 Processor Configuration Server: HP ProLiant DL385 G5p with 2 Quad-core AMD Opteron processors Storage server: EMC or MSA Storage Scalability: up to 8 TB 4 Processor Configuration Server: HP ProLiant DL 585 G5 with 4 Quad-core AMD Opteron processors Storage server: EMC or MSA Storage Scalability: 4 – 16 TB 8 Processor Configuration Server: HP ProLiant DL 785 G5 with 8 Quad-core AMD Opteron processors Storage server: EMC or MSA Storage Scalability: 16 – 32 TB Note - Compression assumes 2.5:1

11 2 Processor Configuration Server: Dell Power Edge 2950 MLK with 2 Quad-core Intel Xeon processors Storage server: EMC CX4-240 & AX4 Scalability: up to 8 TB 4 Processor Configuration Server: Dell Power Edge R900 with 4 6-core Intel Xeon processors Storage server: EMC CX4-240 & AX4 Scalability: 12 – 24 TB Note - Compression assumes 2.5:1 - Fully loaded only adds drives to minimum HW required - Data space can be increased by using 450GB drives

12 Current Environment Teradata 4-node (5450 model) with 6TB of user data BI: Business Objects ETL: Informatica and BTEQ scripts Proposed Microsoft Platform SQL Server Fast Track Data Warehouse HP DL580 Server - 4 Quadcore Processors (16 core total) 256 GB Memory SAN Storage: MSA 2000 (Qty 4) – 8TB User Data Capacity BI: Business Objects ETL: SQL Server and SSIS

13 TeradataSQL Server Fast Track DW Comparison Loading – Subject Area 1 5:10:21 total time51:31 total time SQL Server 6x faster Loading – Subject Area 2 4:36:08 total time1:50.01 total time SQL Server 2.5x faster Query times – Subject Area 1 3:03 avg query time (using 9 benchmark queries) 0:15 avg query time (using 9 benchmark queries) SQL Server 12x faster Query times – Subject Area 2 56:44 avg query time (using 4 benchmark queries) 8:09 avg query time (using 4 benchmark queries) SQL Server 7x faster

14 14 Microsoft NDA-only

15 Fast Track offers appliance-like ease of deployment, scalability and performance for SMP Madison to offer massively parallel (MPP) scale and performance Madison hub-and-spoke architecture to include support for SMP spokes

16 Scale Out Scale Up INDUSTRY STANDARD NETWORKING INDUSTRY STANDARD STORAGE INDUSTRY STANDARD SERVERS Fast Track Data Warehouses Project Madison

17 INDUSTRY STANDARD NETWORKING INDUSTRY STANDARD SERVERS Reference Hardware Platforms Project Madison INDUSTRY STANDARD STORAGE

18 Compute Nodes Dual Infiniband Spare Compute Node Storage Node Control Nodes Active / Passive Landing Zone Backup Node Storage Servers

19 Date Dim D_ DATE _ SK D_ DATE _ ID D_ DATE D_ MONTH … Date Dim D_ DATE _ SK D_ DATE _ ID D_ DATE D_ MONTH … Store Sales Ss_sold_date_sk Ss_item_sk Ss_customer_sk Ss_cdemo_sk Ss_store_sk Ss_promo_sk Ss_quantity … Store Sales Ss_sold_date_sk Ss_item_sk Ss_customer_sk Ss_cdemo_sk Ss_store_sk Ss_promo_sk Ss_quantity … Promotion P_ PROMO _ SK P_ PROMO _ ID P_ START _ DATE _ SK P_ END _ DATE _ SK … Promotion P_ PROMO _ SK P_ PROMO _ ID P_ START _ DATE _ SK P_ END _ DATE _ SK … Customer C-C USTOMER _ SK C_ CUSTOMER _ ID C_ CURRENT _ ADDR … Customer C-C USTOMER _ SK C_ CUSTOMER _ ID C_ CURRENT _ ADDR … Item I _ ITEM _ SK I _ ITEM _ ID I _ REC _ START _ DATE I _ ITEM _ DESC … Item I _ ITEM _ SK I _ ITEM _ ID I _ REC _ START _ DATE I _ ITEM _ DESC … Store S_ STORE _ SK S_ STORE _ ID S_ REC _ START _ DATE S_ REC _ END _ DATE S_ STORE _ NAME … Store S_ STORE _ SK S_ STORE _ ID S_ REC _ START _ DATE S_ REC _ END _ DATE S_ STORE _ NAME … Customer Demographics C D _ DEMO _ SK C D _ GENDER C D _ MARITAL _ STATUS C D _ EDUCATION … Customer Demographics C D _ DEMO _ SK C D _ GENDER C D _ MARITAL _ STATUS C D _ EDUCATION … 1TrillionRows1TrillionRows 100 Million 73, Million 1, 902 2, , 000

20 Date Dim D_ DATE _ SK D_ DATE _ ID D_ DATE D_ MONTH … Date Dim D_ DATE _ SK D_ DATE _ ID D_ DATE D_ MONTH … Item I _ ITEM _ SK I _ ITEM _ ID I _ REC _ START _ DATE I _ ITEM _ DESC … Item I _ ITEM _ SK I _ ITEM _ ID I _ REC _ START _ DATE I _ ITEM _ DESC … Store Sales Ss_sold_date_sk Ss_item_sk Ss_customer_sk Ss_cdemo_sk Ss_store_sk Ss_promo_sk Ss_quantity … Store Sales Ss_sold_date_sk Ss_item_sk Ss_customer_sk Ss_cdemo_sk Ss_store_sk Ss_promo_sk Ss_quantity … Promotion P_ PROMO _ SK P_ PROMO _ ID P_ START _ DATE _ SK P_ END _ DATE _ SK … Promotion P_ PROMO _ SK P_ PROMO _ ID P_ START _ DATE _ SK P_ END _ DATE _ SK … Store S_STORE_SK S_STORE_ID S_REC_START_DATE S_REC_END_DATE S_STORE_NAME … Store S_STORE_SK S_STORE_ID S_REC_START_DATE S_REC_END_DATE S_STORE_NAME … Customer C-C USTOMER _ SK C_ CUSTOMER _ ID C_ CURRENT _ ADDR … Customer C-C USTOMER _ SK C_ CUSTOMER _ ID C_ CURRENT _ ADDR … Customer Demographics C D _ DEMO _ SK C D _ GENDER C D _ MARITAL _ STATUS C D _ EDUCATION … Customer Demographics C D _ DEMO _ SK C D _ GENDER C D _ MARITAL _ STATUS C D _ EDUCATION … Database Distributed & Replicated Tables C C I I D D CD S S P P SS C C I I D D CD S S P P SS C C I I D D CD S S P P SS C C I I D D CD S S P P SS C C I I D D CD S S P P SS C C I I D D CD S S P P SS C C I I D D CD S S P P SS C C I I D D CD S S P P SS

21

22 22 Microsoft NDA-only Central EDW Hub Regional Reporting Departmental Reporting ETL Tools High Performance HQ Reporting

23 Auto Publish FRESH DATA LOADING Most Recent - 3 Months 2 Years 7 Years User Queries BI Server Queries User Data Hot -> Warm -> Cold Stage -> ODS -> Prod Back-up / Archive Data structure in synch Fast response to users Easy Data Movement High Availability

24 UP TO 500M ROWS/DAY HIGH-SPEED PARALLEL UPDATES COST MGT REVENUE ASSURANCE MARGIN ANALYSIS 120 TB HIGH CAPACITY WARM CDRs FRAUD DETECTION BILLING 60 TB HIGH PERFORMANCE FOR MEDIATION & AUGMENTATION USING ETL TOOLS 220TB ARCHIVE DW ROLL OFF TO ARCHIVE

25 All hardware from a single vendor Multiple vendors to chose from Orderable at the rack or cluster Vendor will Assemble appliances Image appliances with OS, SQL Server and Madison software Appliance installed in less than a day Support – Vendor provides hardware support Microsoft provides software support

26 Two Programs MTP – Madison Technology Preview participants Duration of 4 to 6 weeks TAP – Beta production implementation 4-6 customers First iteration 9 to 12 weeks Requirements Focus on EDW and large data marts Migration projects, not green field Open to customers & prospects

27 Requirements Existing DW Volume of end-user data 1TB+ Considering change to BI or DW infrastructure On site survey Interview of key stake holders in Data Warehouse environment Performed by Microsoft Architect Service also available from selected Microsoft partners with deep Data Warehouse expertise 2-5 days duration Deliverables Presentation of key findings Report detailing findings Results delivered approximately 10 days after survey

28 Microsoft has a compelling EDW vision BI, ETL, scale up and out Hub & Spoke architecture Fast Track available today Up to 30TB Scale up today with SMP, scale out tomorrow with MPP MTP and TAP for Madison in June 2009 Scales up SQL Server to >1PB Sets a new bar in appliance pricing and performance Hub-and-Spoke will integrate Fast Track with Madison

29 END USER TOOLS & PERFORMANCE MANAGEMENT APPS Excel PerformancePoint Server BI PLATFORM SQL Server Reporting Services SQL Server Analysis Services SQL Server DBMS SQL Server Integration Services SharePoint Server DELIVERY ReportsDashboardsExcelWorkbooksAnalyticViewsScorecardsPlans

30 © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.


Download ppt "Roger Moore – Data Warehouse SSP 972-955-0426."

Similar presentations


Ads by Google