Presentation is loading. Please wait.

Presentation is loading. Please wait.

Scott Hulke Microsoft Technology Center - Dallas.

Similar presentations


Presentation on theme: "Scott Hulke Microsoft Technology Center - Dallas."— Presentation transcript:

1 Scott Hulke Microsoft Technology Center - Dallas

2 What’s new in R2 for Data Warehousing Data Warehousing Architectures Customer Examples Demos

3

4

5 StreamInsight Parallel Data Warehouse RDBMS Engine Enhancements Enhanced Compression DataCenter Edition Up to 256 processor cores Not R2-specific, but… Hardware advances such as increased core density, solid state disks (SSDs)

6

7 Web Analytics: Click-stream data Online customer behavior Page layout 100,000 events /sec Manufacturing: Sensor on plant floor React through device controllers Aggregated data 10,000 events/sec Financial Services: Stock & news feeds Algorithmic trading Patterns over time Super-low latency 100,000 events /sec Healthcare: Patient monitoring Medical devices Pharmacy RFID 100,000 events/sec

8 8 Data Stream Stream Data Store & Archive Event Processing Engine Data Stream Asset Specs & Parameters Power, Utilities: Energy consumption Outages Smart grids 100,000 events/sec Visual trend-line and KPI monitoring Batch & product management Automated anomaly detection Real-time customer segmentation Algorithmic trading Proactive condition-based maintenance Visual trend-line and KPI monitoring Batch & product management Automated anomaly detection Real-time customer segmentation Algorithmic trading Proactive condition-based maintenance Web Analytics: Click-stream data Online customer behavior Page layout 100,000 events /sec Manufacturing: Sensor on plant floor React through device controllers Aggregated data 10,000 events/sec Threshold queries Event correlation from multiple sources Pattern queries Threshold queries Event correlation from multiple sources Pattern queries Lookup Asset Instrumentation for Data Acquisition, Subscriptions to Data Feeds Financial Services: Stock & news feeds Algorithmic trading Patterns over time Super-low latency 100,000 events /sec

9

10

11 Choice of hardware vendor High scale through Massively Parallel Processing (MPP) system Hub and Spoke architecture Deep integration with Microsoft BI 11

12

13 Memory becoming increasingly affordable 99% of all BI apps of fortune 5000 companies can fit in 1 TB of RAM 8-12 core processors will be standard Client Computers by 2012: Taking Advantage of Latest Trends

14 What’s new in R2 for Data Warehousing Data Warehousing Architectures Customer Examples Demos

15 SSISSSIS Microsoft & Partner Services

16 SQL 2008 Data Warehouse SMP Server Shared Network Bandwidth Enterprise Shared SAN Storage Dedicated Network Bandwidth SQL Classic DW Architecture Leverages Shared SAN Fast Track SQL DW Architecture Architecture modeled after DW Appliances “ Appliance Like” solutions Uses Dedicated SAN arrays and Network SAN Arrays 1:4 cpu cores 8 Data Disk / Array – 4 Raid 1 Pairs Simultaneous SQL Server Reads 2 Log and 1 Hot Spare EMC AX4 – HP MSA2312 IBM 3400 OLTP Applications SQL Fast Track DW supports “Scan Centric” DW workloads that are index light Dedicated SAN

17 A method for designing a cost-effective, balanced system for Data Warehouse workloads Reference hardware configurations developed in conjunction with hardware partners using this method Best practices for data layout, loading and management Relational Database Only – Not SSAS, SSIS, SSRS

18 Software: SQL Server 2008 Enterprise Windows Server 2008 Software: SQL Server 2008 Enterprise Windows Server 2008 Configuration guidelines: Physical table structures Indexes Compression SQL Server settings Windows Server settings Loading Configuration guidelines: Physical table structures Indexes Compression SQL Server settings Windows Server settings Loading Hardware: Tight specifications for servers, storage and networking ‘Per core’ building block Hardware: Tight specifications for servers, storage and networking ‘Per core’ building block

19 2 Processor Configuration Server: Dell Power Edge R710 with 2 Quad-core Intel Xeon processors 8 CPU Cores 32GB Memory Storage server: EMC CLARiiON AX4 Scalability: 4 – 8 TB 4 Processor Configuration Server: Dell Power Edge R900 with 4 6-core Intel Xeon processors 24 CPU Cores 96 GB Memory Storage server: EMC CLARiiON AX4 Scalability: 12 – 24 TB

20

21 Database Servers Dual Infiniband Control Nodes Active / Passive Spare Database Server Dual Fiber Channel Client Drivers ETL Load Interface Corporate Backup Solution Data Center Monitoring Corporate Network Private Network

22 22 Microsoft NDA-only Central EDW Hub Regional Reporting Departmental Reporting ETL Tools High Performance HQ Reporting

23 What’s new in R2 for Data Warehousing Data Warehousing Architectures Customer Examples Demos

24 CategoryMetric Largest single database80 TB Largest table20 TB Biggest total data 1 customer2.5 PB Highest transactions per second 1 db36,000 Fastest I/O subsystem in production20 GB/sec Fastest “real time” cube15 sec latency Data load for 1TB20 minutes Largest cube4.2 TB

25 Largest Astronomy project in history 4 telescopes capturing 1.5 giga pixel images Largest DB approaching 80TB+ Total data managed > 1PB 5+TB added per day HA/DR Relying on backups of the input files for now.

26 CDR Analytics 70TB Relational 4TB largest cube 100+ concurrent queries Itanium 64 core with storage system rated over 20GB/sec throughput Loading 1TB in < 30 minutes Processing 1m rec/sec in AS cubes

27 Business Online gaming applications - Europe‘s largest betting line-up Sports Poker Casino Skill Games 90 different sports covered in 22 languages > 12,000 different bets offered per day > 3 million individual and combination bets placed every day Bwin.com sponsors top world soccer teams Real Madrid AC Milan FC Bayern Munich Key Technologies Running on SQL Server 2008 & Windows 2008 Enterprise Windows Communication Foundation Synchronous database mirroring between two centers 12 km apart Added 1 ms delay on transaction 99.99x% availability @ 24 x 7 since migrating to SQL from Oracle. 100.00% uptime in 2008 and 2009 (since moving to SQL 2008 and Windows 2008) Zero data loss (financial transactions are involved) Replication and Log shipping for most databases DB Mirroring for betting data base. Full suite of SQL products - IS, AS and RS ASP.NET for application

28 Some numbers Peak financial transactions 6000 per second Peak db transactions 30,000 per second Databases800+ Instances 100+ Largest table2 billion rows Total data in SQL Server100+ TB Backup of 2 TB over network under 1 hr Largest machines64 core 512 GB IA2 HP 6 x 32 core IA2 http://sqlcat.com/whitepapers/archive/2009/08/13/a-technical- case-study-fast-and-reliable-backup-and-restore-of-a-vldb- over-the-network.aspx http://www.microsoft.com/casestudies/Case_Study_Detail.asp x?casestudyid=4000001470

29 CustomerProblemSolutionBenefits Premier Bankcard Credit Card Company Runs its Business with 17- Terabyte Mission Critical BI Solution Premier needed to enhance scalability and performance for its business intelligence (BI) data warehouse and online transaction processing (OLTP) databases. Enhanced BI infrastructure by upgrading 17- terabyte data warehouse to Microsoft® SQL Server™ 2008 Enterprise (64-bit), hosted on 16 Intel Itanium 2 processors We have about 9,000 concurrent users generating a continuous 700 transactions per second, sometimes more than doubling to 2,000 transactions per second. With the 64-bit version of SQL Server 2008 running on Itanium 2 processors we see no limit to our ability to scale our transaction processing MySpace MySpace Uses SQL Server Service Broker to Protect Integrity of 1 Petabyte of Data MySpace needed to find a data platform to support 130 million monthly active users, with 300,000 new users added each day,8 billion friend relationships it manages, 34 billion e-mail messages it stores, while adding 41 million more daily. The site’s 1 petabyte of data is managed by 440 Microsoft® SQL Server® instances and resides on 3PAR® Utility Storage. We needed to see if Service Broker could handle loads of 4,000 messages per second. Our testing found it could handle more than 18,000 messages a second. We were delighted that we could build our solution using Service Broker, rather than creating a custom solution on our own Entergy Entergy needed a data store for 3 trillion SCADA records in order to control their power grid The system’s Microsoft SQL Server handles 80- terabytes of data compressed down to 8. It continues to grow at a rate of 2 terabytes (20-terabytes compressed) per year. The ability to act proactively is the holy grail in our industry, and that is what we are gaining from our Pegasus RDS hosting of trillions of SCADA records on the Microsoft Application Platform

30 What’s new in R2 for Data Warehousing Data Warehousing Architectures Customer Examples Demos

31 Best price/performance for DW workload High scan rate through sequential I/O Data Compression reduces disk footprint

32 Load 100GB in 8 min. Note: significantly faster loads possible given more powerful HW – see SSIS World Record Benchmark Parallel data load using SSIS Take advantage of available hardware

33 © 2010 Microsoft Corporation. All rights reserved. This presentation is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.


Download ppt "Scott Hulke Microsoft Technology Center - Dallas."

Similar presentations


Ads by Google