Data Warehouse Accelerator Michael Wallace Principal Systems Consultant.

Slides:



Advertisements
Similar presentations
A TO Z MINI BILLBOARD AD NETWORK
Advertisements

From Startup to Enterprise A Story of MySQL Evolution Vidur Apparao, CTO Stephen OSullivan, Manager of Data and Grid Technologies April 2009.
Chapter 13: Query Processing
Tivoli Storage Manager v. 4.1 Executive Presentation.
Data Services for Service Oriented Architecture in Finance D. Britton Johnston Chief Technology Evangelist.
1 Senn, Information Technology, 3 rd Edition © 2004 Pearson Prentice Hall James A. Senns Information Technology, 3 rd Edition Chapter 7 Enterprise Databases.
Analysis of Computer Algorithms
The 4 T’s of Test Automation:
The ANSI/SPARC Architecture of a Database Environment
Query optimisation.
ORACLE DATABASE HIGH AVAILABILITY & ORACLE 11GR2 DATA GUARD 1 Güneş EROL.
Case Study: Photo.net March 20, What is photo.net? An online learning community for amateur and professional photographers 90,000 registered users.
Database Systems: Design, Implementation, and Management
Extreme Performance with Oracle Data Warehousing
1 Software Engineering II The Business Aspects of Software Engineering.
DiskCon September 2004 Solid State Disks: The Future of Storage?
The IP Revolution. Page 2 The IP Revolution IP Revolution Why now? The 3 Pillars of the IP Revolution How IP changes everything.
Enterprise Document Management Symposium October 5 th – 6 th 2010 Niagara Falls, Canada.
Information Systems Today: Managing in the Digital World
Blazing Queries: Using an Open Source Database for High Performance Analytics July 2010.
Database Performance Tuning and Query Optimization
Chapter 4 Memory Management Basic memory management Swapping
Go-Faster Consultancy Ltd.1 Experiences of Global Temporary Tables in Oracle 8.1 David Kurtz Go-Faster Consultancy Ltd.
Describing Complex Products as Configurations using APL Arrays.
Activity 1………………Saving vs. Investing Activity 2……….….Saving for a Rainy Day Activity 3…………………… = Saving Activity 4…..Investing for the Long Term.
© 2010 TIBCO Software Inc. All Rights Reserved. Confidential and Proprietary. TIBCO Spotfire Application Data Services TIBCO Spotfire European User Conference.
1 Use or disclosure of data contained on this sheet is subject to the restriction on the title page of this proposal or quotation. An Introduction to Data.
Database System Concepts and Architecture
BY LECTURER/ AISHA DAWOOD DW Lab # 2. LAB EXERCISE #1 Oracle Data Warehousing Goal: Develop an application to implement defining subject area, design.
Chapter 9: The Client/Server Database Environment
Introduction to Databases
Executional Architecture
1. SQL Server 2014 In-Memory by Design Arthur Zubarev June 21, 2014.
We will resume in: 25 Minutes.
Marketing Strategy and the Marketing Plan
Chapter 13 The Data Warehouse
Databases MMG508. DB Properties  Definition of a database: “A database is a collection of interrelated data items that are managed as a single unit”
Big Data Working with Terabytes in SQL Server Andrew Novick
High Performance Analytical Appliance MPP Database Server Platform for high performance Prebuilt appliance with HW & SW included and optimally configured.
A Fast Growing Market. Interesting New Players Lyzasoft.
Tableau Visual Intelligence Platform
Presented by Marie-Gisele Assigue Hon Shea Thursday, March 31 st 2011.
Designing a Data Warehouse
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Tableau Visual Intelligence Platform
Passage Three Introduction to Microsoft SQL Server 2000.
Fast Track, Microsoft SQL Server 2008 Parallel Data Warehouse and Traditional Data Warehouse Design BI Best Practices and Tuning for Scaling SQL Server.
Performance and Scalability. Performance and Scalability Challenges Optimizing PerformanceScaling UpScaling Out.
Bring Consolidation Into Focus The Value of Compaq AlphaServer and Storage Consolidation Solutions Joseph Batista Director Enterprise & Internet Initiatives.
Oracle Challenges Parallelism Limitations Parallelism is the ability for a single query to be run across multiple processors or servers. Large queries.
September 2011Copyright 2011 Teradata Corporation1 Teradata Columnar.
© Dennis Shasha, Philippe Bonnet – 2013 Communicating with the Outside.
Data Warehousing at Acxiom Paul Montrose Data Warehousing at Acxiom Paul Montrose.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
 2009 Calpont Corporation 1 Calpont Open Source Columnar Storage Engine for Scalable MySQL Data Warehousing April 22, 2009 MySQL User Conference Santa.
© 2009 IBM Corporation Maximize Cost Savings While Improving Visibility Into Lines of Business Wendy Tam, CDC Product Marketing Manager
Introduction.  Administration  Simple DBMS  CMPT 454 Topics John Edgar2.
Infrastructure for Data Warehouses. Basics Of Data Access Data Store Machine Memory Buffer Memory Cache Data Store Buffer Bus Structure.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
Your Data Any Place, Any Time Performance and Scalability.
ORACLE & VLDB Nilo Segura IT/DB - CERN. VLDB The real world is in the Tb range (British Telecom - 80Tb using Sun+Oracle) Data consolidated from different.
Get the Most out of SQL Server Standard Edition Or How to be a SQL Miser.
Configuring SQL Server for a successful SharePoint Server Deployment Haaron Gonzalez Solution Architect & Consultant Microsoft MVP SharePoint Server
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
Data Platform and Analytics Foundational Training
What is Cloud Computing - How cloud computing help your Business?
Introduction to Teradata
11 Simplex or Multiplex?.
Database System Architectures
Presentation transcript:

Data Warehouse Accelerator Michael Wallace Principal Systems Consultant

2 Why a Data Warehouse Accelerator? Forces related to the new business climate may be fueling an acceleration in the growth of the data warehouse.... Doing more with less is both the mandate and the mantra across the board. More than ever, businesses need dramatic increases in the price/performance of their systems. -Richard Winter, VLDB Expert, Winter Corporation

3 Sound All Too Familiar? Sound All Too Familiar? How do I protect my revenue that is at risk? How can I better leverage detailed (non-aggregate)data? How can I keep more data online without incurring prohibitive storage expense? How do I perform real-time analysis? How do I run ad hoc queries anytime? How do I get queries to return within seconds or minutes, not hours or days? How do I free up my DBAs from constant warehouse tuning? How do I add users and data without causing performance or architectural disruptions? How can I help my business users be more independent and productive? How do I get this project up and running this quarter? How do I demonstrate a quick and solid return on my investment? How do I minimize my risk?

4 Need to get your data back under control?

5 Hundreds of companies have benefited from the from the power, flexibility, scalability and low, low TCO of the IQ Warehouse Accelerator Use all your warehouse data…. a subset of your warehouse data…. Or a new set of data….. Heres how.

6 Typical Data Warehouse Architecture

7 The IQ Accelerator enhances the performance and ROI of any existing data warehouse. Gauranteed!

8 Sybase IQ – What is it? The Database for Analytic Applications The Only RDBMS 100% designed for decision support Its FasterIts More Scalable Its More EconomicIt has huge market momentum

9 Sybase IQ Architecture An IQ Database Is a database with tables The tables have columns Uses indexes to speed retrieval Schema Independent Star Relational Flat Applications Connect to IQ via: ODBC JDBC Open Client An IQ Database Provides: Stored Procedures, Functions, and Batches Views On-Line Backup Concurrent Readers/Writers Transactions User Defined Functions and User Defined data types Crash Recovery and logging Common Language Processor ANSI 92 SQL Transact SQL JAVA

10 Sybase IQ – Whats Different? Data is Stored Vertically Each column is stored separately Bit-Mapped Index Index on every column Optimized Storage Input data is typically compressed Usually = 30-40% Database smaller than input data Even with all the indexes NOT an MPP Solution Much simpler implementation Much simpler management Query Engine Retrieves Only Columns Used in the Query Reduces system I/O dramatically Average 90% Less than competition Permits better data manipulation Schema Design Not Restricted Design based on application use Flat, Star, Relational, Snowflake Any Schema

11 IQ Multiplex Architecture Scalability A single copy of data shared across multiple computer nodes All data and indexes stored in the shared database No partitioning of data required No distributed lock management System does not lock on queries or refresh Individual nodes are Independent of Other Nodes. Each IQ Node has its own local Temp Space and catalog. Individual nodes can be different configurations (CPUs, memory, disk). Data Store (SAN) Fiber Channel Backbone ASIQ Reader ASIQ Writer/Reader

12 IQ Multiplex Architecture Vertical Scalability Data Store (SAN) Fiber Channel Backbone ASIQ Reader ASIQ Writer/Reader ASIQ Reader Individual nodes can be different configurations (CPUs, memory, disk). Each IQ engine runs independently, using all available CPUs on its own node. Additional CPUs scale linearly when added to existing nodes IQ is CPU, not I/O, bound

13 IQ Multiplex Architecture Horizontal Scalability Data Store (SAN) Fiber Channel Backbone ASIQ Reader ASIQ Writer/Reader ASIQ Reader No data redistribution No change in schema Start small and grow HUGE. Load balancing can be used to spread out users. Up to 120 nodes

14 Data Store (SAN) IQ Multiplex Architecture Flexible Scalability Data Store (SAN) Fiber Channel Backbone ASIQ Reader ASIQ Writer/Reader ASIQ Reader Re-use old hardware Grow writer node as needed Increase disk storage without adding nodes Up to 30+ PetaBytes of disk storage SMP-like management & tuning High Availability provided through multiplexing

15 IQ Multiplex Architecture User Scalability Enterprise Workgroup VLDB Web Flexible Scalability = User Scalability

16 Why Sybase IQ? Speed: IQ speeds up queries of your existing warehouse by X Scalability: Scales to thousands to users with virtually no degradation of performance Flexibility: Allows ad hoc queries anytime with no additional tuning required Low risk: Already tested, tuned, configured to insure success (worlds largest data warehouse built and certified on IQ/Sun boasts 48.2TB input data, 22TB final size) Simplicity and elegance: Leading-edge, patented architecture guarantees quick installation, low management and maintenance costs. Economy: Saves approx. $1 million per terabyte of input data

17 Need Proof? Best price-performance in both 300GB and 1000 GB scales Lowest disk to data ratio: 3 to 10 X better than any other system At 1000GB, IQ used 54 disks compared to 1263 and 1408 for competing systems Best storage efficiency by factor of TB raw data = 2.4TB storage in IQ, 22TB in competitor At 300 and 1000 GB scales, IQ is 9 to 25 times less expensive than competitive systems IQ scores big in TPCH benchmarks

18 Need proof? GIGA study illustrates hefty ROI* The organizations interviewed by Giga information group showed actual or expected returns on their investment ….that ranged from 72% to 175%. Simply put, for every dollar invested in the Sun/Sybase RA (IQ running on Sun hardware), $1.63 would be returned to the organization in direct cost savings or increased bottom-line profit as a result of increased business. Giga Information Group projects that a composite organization facing some of the same business and IT pressures will likewise achieve a return on investment greater than most standard IT hurdles, and such an investment will pay back its investment in a period of between 13 and 15 months of use. * The Total Economic Impact of Deploying the Sun-Sybase Enterprise Data Warehouse Reference Architecture, c. 2003

19 Need Proof? If you are used to queries that take 24 or more hours to run, and then you are told that you can run them in a matter of minutes with Sybase IQ, this may be hard to swallow. The truth is that using a column- based approach really can produce such performance improvements.… IT managers should be ready to fall off their chairs. --Bloor Research, 2002 We are able to deliver one data warehouse for all of our applications at one third the storage of conventional technologies, while seeing performance gains as advertised with IQ. -Kim Ross, CIO Nielsen Media Research Conservatively speaking, the CDWs (Compliance Data Warehouse) Return on Investment is expected to be 200 to 1. --Jeffrey Kmonk IRS Sybase IQ reduced loading and indexing from 30 minutes to 2.5 to 3 minutes. Query speeds were 20 – 50 times faster than Oracle. Time to add a column was reduced from 4 hours with Oracle to 15 minutes with IQ. Jeff Butler Department of Transportation Bureau of Transportation Statistics

20 Need Proof? Fortis Bank BeforeAfter ONE Data Mart (Marketing)Many Data Marts 4 days to load and create all mainframe data into the data warehouse 1.5 TB data loaded every month 8 days to run (in batch) all Business objects reports 115,000 ad-hoc queries/month Average response time is 4 hours75% of queries executed within 3 seconds. Raw Data is only 6 Gigabyte450 Gigabyte IQ storage Data warehouse can only be refreshed once a quarter Data warehouse refreshed every day 20 users1,000 users 7 external (PWC) consultants write each report 0 external consultants, users write reports themselves. Full time DBA workDBA work = 10 min. a day This is the strongest product I have come across in my career, something I wouldnt admit that often. There is no doubt whatsoever that another technology would not have offered our users the same service as Sybase IQ. --Jean-Louis Catin, IT

21 Hard to believe? Challenge Us! Nothing to loose – Everything to Gain! Proof of Concept.

Sybase IQ: The Ultimate Data Warehouse Accelerator