Oracle BIWA SIG Basics Worldwide association of 2000 professionals interested in Oracle Database-centric business intelligence, data warehousing, and analytical.

Slides:



Advertisements
Similar presentations
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any.
Advertisements

Extreme Performance with Oracle Data Warehousing
Exadata Goals Ideal Oracle Database Platform
Tuning: overview Rewrite SQL (Leccotech)Leccotech Create Index Redefine Main memory structures (SGA in Oracle) Change the Block Size Materialized Views,
CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
<Insert Picture Here>
Arjun Suresh S7, R College of Engineering Trivandrum.
Exadata Distinctives Brown Bag New features for tuning Oracle database applications.
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Copyright © 2012, Oracle and/or its affiliates. All rights reserved. 1.
Living with Exadata Presented by: Shaun Dewberry, OS Administrator, RDC Tom de Jongh van Arkel, Database Administrator, RDC Komaran Hansragh, Data Warehouse.
Big Data Working with Terabytes in SQL Server Andrew Novick
Data warehousing with MySQL MySQLMS-SQLOracleDB2 MySQL Flat Files.
Parallel Databases Michael French, Spencer Steele, Jill Rochelle When Parallel Lines Meet by Ken Rudin (BYTE, May 98)
10 REASONS Why it makes a good option for your DB IN-MEMORY DATABASES Presenter #10: Robert Vitolo.
High Performance Analytical Appliance MPP Database Server Platform for high performance Prebuilt appliance with HW & SW included and optimally configured.
1. Aim High with Oracle Real World Performance Andrew Holdsworth Director Real World Performance Group Server Technologies.
A Fast Growing Market. Interesting New Players Lyzasoft.
Oracle Data Warehouse Strategic Update Ray Roccaforte.
Oracle Database Compression with Oracle Database 12c
Presented by Marie-Gisele Assigue Hon Shea Thursday, March 31 st 2011.
Chapter 1 Introduction 1.1A Brief Overview - Parallel Databases and Grid Databases 1.2Parallel Query Processing: Motivations 1.3Parallel Query Processing:
Data Warehousing - 3 ISYS 650. Snowflake Schema one or more dimension tables do not join directly to the fact table but must join through other dimension.
Alternative: Bitmap Indexing Imagine the following query in huge table Find customers living in London, with 2 cars and 3 children occupying a 4 bed house.
Extreme Performance Data Warehousing
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
© 2009 IBM Corporation Statements of IBM future plans and directions are provided for information purposes only. Plans and direction are subject to change.
Fast Track, Microsoft SQL Server 2008 Parallel Data Warehouse and Traditional Data Warehouse Design BI Best Practices and Tuning for Scaling SQL Server.
© 2009 Oracle Corporation. S : Slash Storage Costs with Oracle Automatic Storage Management Ara Vagharshakian ASM Product Manager – Oracle Product.
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1 Preview of Oracle Database 12 c In-Memory Option Thomas Kyte
MySQL Data Warehousing Survival Guide Marius Moscovici Steffan Mejia
1  MyOnlineITCourses.com 1 MyOnlineITCourses.com Oracle Partitioning -- A Primer.
The Sun Oracle Database Machine Barry Hodges Senior Solution Architect Oracle New Zealand.
Database Services for Physics at CERN with Oracle 10g RAC HEPiX - April 4th 2006, Rome Luca Canali, CERN.
STEALTH Content Store for SharePoint using Caringo CAStor  Boosting your SharePoint to the MAX! "Optimizing your Business behind the scenes"
5 Database Features Every DBA Needs to Know About THT11267 Doug Chamberlain - Principal Product Manger, Oracle Copyright © 2014, Oracle and/or its affiliates.
Physical Database Design & Performance. Optimizing for Query Performance For DBs with high retrieval traffic as compared to maintenance traffic, optimizing.
September 2011Copyright 2011 Teradata Corporation1 Teradata Columnar.
Data Warehousing at Acxiom Paul Montrose Data Warehousing at Acxiom Paul Montrose.
OnLine Analytical Processing (OLAP)
Oracle Advanced Compression – Reduce Storage, Reduce Costs, Increase Performance Session: S Gregg Christman -- Senior Product Manager Vineet Marwah.
1.
1 Data Warehouses BUAD/American University Data Warehouses.
1 Chapter 14 DML Tuning. 2 DML Performance Fundamentals DML Performance is affected by: – Efficiency of WHERE clause – Amount of index maintenance – Referential.
Views Lesson 7.
 The End to the Means › (According to IBM ) › 03.ibm.com/innovation/us/thesmartercity/in dex_flash.html?cmp=blank&cm=v&csr=chap ter_edu&cr=youtube&ct=usbrv111&cn=agus.
Chapter 4 Logical & Physical Database Design
Infrastructure for Data Warehouses. Basics Of Data Access Data Store Machine Memory Buffer Memory Cache Data Store Buffer Bus Structure.
Chapter 5 Index and Clustering
SAM for SQL Workloads Presenter Name.
Session id: Darrell Hilliard Senior Delivery Manager Oracle University Oracle Corporation.
ORACLE & VLDB Nilo Segura IT/DB - CERN. VLDB The real world is in the Tb range (British Telecom - 80Tb using Sun+Oracle) Data consolidated from different.
Copyright © 2009 Rolta International, Inc., All Rights Reserved Michael R. Messina, Management Consultant Rolta-TUSC, Oracle Open World 2009 (60 min) ID#:
Tackling I/O Issues 1 David Race 16 March 2010.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Database Growth: Problems & Solutions.
OM. Platinum Level Sponsors Gold Level Sponsors Pre Conference Sponsor Venue Sponsor Key Note Sponsor.
Configuring SQL Server for a successful SharePoint Server Deployment Haaron Gonzalez Solution Architect & Consultant Microsoft MVP SharePoint Server
Exadata Distinctives 988 Bobby Durrett US Foods. What is Exadata? Complete Oracle database platform Disk storage system Unique to Exadata – intelligent.
9 Copyright © 2006, Oracle. All rights reserved. Summary Management.
SQL IMPLEMENTATION & ADMINISTRATION Indexing & Views.
Pure Columnar technology
Index An index is a performance-tuning method of allowing faster retrieval of records. An index creates an entry for each value that appears in the indexed.
Hitting the SQL Server “Go Faster” Button
HPE Persistent Memory Microsoft Ignite 2017
Azure SQL Data Warehouse Scaling: Configuration and Guidance
Blazing-Fast Performance:
Fast Track Data Warehouse for SQL SERVER 2012
Hitting the SQL Server “Go Faster” Button
Data Lifecycle Review and Outlook
C Practice Test 2019
Presentation transcript:

Oracle BIWA SIG Basics Worldwide association of 2000 professionals interested in Oracle Database-centric business intelligence, data warehousing, and analytical products, features and options. Membership is FREE Open forum to foster success in use and development of Oracle BIWA products. BIWA’s goals include sharing: “best practices” and “novel and interesting use cases” of Oracle BIWA-centric technology. See Mission Statement and Charter at oraclebiwa.org.oraclebiwa.org

Next Oracle BIWA SIG Conference BIWA Training Days at COLLABORATE 10 – IOUG Forum “Get Analytical with BIWA Training Days” April 18-22, 2010 Mandalay Bay Convention Center, Las Vegas, Nevada REGISTER with Offer Code “BIWA2010” for IOUG Member Discount Rate See oraclebiwa.org for details and links

Top 5 Tips on: Reducing Storage Cost while Improving Performance Jean-Pierre Dijcks Data Warehouse Product Management

Agenda Tip 1: Appropriate Hardware Tip 2: Tier your Storage Tip 3: Partition your Data Tip 4: Compress your Data Tip 5: Think, Plan and Design Q&A

Agenda Tip 5: Think, Plan and Design Tip 1: Appropriate Hardware Tip 2: Tier your Storage Tip 3: Partition your Data Tip 4: Compress your Data Software Forces a Paradigm Shift Q&A

Tip #5: Think, Plan and Design Understand the requirements Data retention rates What to do with the older data What are you doing with the newer data Performance requirements for all data (not just the latest stuff) Plan for the worst (kind of) What is the performance goal and can you still achieve that in 6 months or 2 years What is the data retention rate and can you deal with this at double your data size? Design the system to still work tomorrow

Tip #5: Think, Plan and Design Understand or learn about the trends Hardware Low-price commodity servers High refresh rates of components (CPUs etc.) Ever growing sizes, speeds at ever dropping prices Software More aligned with hardware Push down into storage of data intensive tasks Consolidation and more workloads are thrown at software

Tip #5: Think, Plan and Design An interpretation of the meaning of these trends is: We will see self-provisioning of vast resources by (end) users This will be achieved by a flexible grid of resources being made available More people will get and use more compute power More and more workloads are run on the “same” hardware Integrated software services will provide the value add for these users and make consolidation work… This has major implications for all of us… I think…

Tip #1: Balance your Hardware Driver: Flexibility in Performance Solid State Disks Flash Cards and Disks Solid State Disks Flash Cards and Disks 2TB 10K RPM SATA Disks Other high capacity media 2TB 10K RPM SATA Disks Other high capacity media ILM Upward Downward Memory Etc… Cost Speed Higher Lower

Tip #1: Balance your Hardware Driver: Flexibility in Performance SATA drives Flash Technology Memory SAS drives Off-line Data Archives < 10% of your data < 50% of your data 100% of your data Disclaimer: Illustration purposes only! 60% of your queries 35% of your queries 5% of your queries

Tip #2: Tier your Storage Driver: Cost and Performance Speed and Cost SATA drives Flash Technology Memory SAS drives % of capacity Performance % of capacity Capacity Disclaimer: Illustration purposes only! => cost indicator

Tip #2: Tier your Storage Driver: Cost and Performance Speed and Cost SATA drives Flash Technology Memory SAS drives % of capacity Balanced Perform Disclaimer: Illustration purposes only! => cost indicator % of capacity Balanced Capacity

Tip #1 and Tip #2 Balance and Flexibility Create a grid of compute and storage resources Allow for a hierarchy of storage solutions within the grid Balance the hardware to: Achieve acceptable performance for the majority workload Achieve great performance for mission critical actions Achieve a reasonable price / performance balance Do not size just for performance, nor just for capacity Very diverse workloads + same hardware = need for flexibility

Tip #3: Partition your Data Impact: Performance and Ease of Maintenance Maintenance: Easier to work on smaller chunks of data Allows specification of separate management and performance strategies on a smaller chunk Performance: In maintenance operations (as shown above) By reducing the data volume to scan A potential way of allowing parallel operations to optimize data processing

The Concept of Partitioning Simple yet powerful Large Table Difficult to Manage Partition Divide and Conquer Easier to Manage Improve Performance Composite Partition Higher Performance More flexibility to match business needs SALES JanFeb SALES JanFeb Europe USA

Q : What was the total sales amount for May 20 and May ? Select sum(sales_amount) From SALES Where sales_date between to_date(‘05/20/2009’,’MM/DD/YYYY’) And to_date(‘05/22/2009’,’MM/DD/YYYY’); Sales Table 5/17 5/18 5/19 5/20 5/21 5/22 Only the 2 relevant partitions are read Partition for Performance Partition Pruning

Both tables have the same degree of parallelism and are partitioned the same way on the join column (cust_id) Range partition May 18 th 2008 Sub part 2 Sub part 3 Sub part 4 Sub part 1 Parallel Processing Partition Wise Join Customer Sales Sub part 2 Sub part 3 Sub part 4 Sub part 1 Sub part 2 Sub part 3 Sub part 4 Sub part 1 A large join is divided into multiple smaller joins, each joins a pair of partitions in parallel Select sum (sales_amount) From Sales s, Customer c Where s.cust_id = c.cust_id; Sub part 2 Sub part 3 Sub part 4 Sub part 1

18 Order_dateShip_dateCust _ID Prod _ID Amount 03-SEP SEP , SEP SEP , SEP OCT , SEP OCT , SEP OCT , SEP NOV , Exadata Storage Index Transparent I/O Elimination with No Overhead Exadata Storage Indexes maintains summary information about table data in memory Stores MIN and MAX values of filter columns Typically one index entry for every MB of disk Eliminates disk I/Os if MIN and MAX can never match “where” clause of a query “Negative index” Completely automatic and transparent MIN ship_date = ’01-OCT-2009’ MAX ship_date = ’03-NOV-2009’ Select * from orders where ship_date < ’30-SEP-2009’ Only first set of rows can match MIN ship_date = ’19-SEP-2009’ MAX ship_date = ’07-OCT-2009’

Tip #3: Partition your Data Impact: Performance and Ease of Maintenance This does not mean “Apply Partitioning”, there is more that Oracle can do to allow better performance: Partitioning – of course… To improve scan speeds and maintenance operations Storage Indexes – To improve Scan Speeds Smart Scans – To reduce data moved around Breaking up a large data set delivers both performance and ease of maintenance

Tip #3: Partition your Data Driver: Flexibility in Performance SATA drives SAS drives Off-line Data Archives < 75% of your data 100% of your data Disclaimer: Illustration purposes only! 95% of your queries 5% of your queries Improve scan rates to leverage slower storage tiers = Downward Mobility

Tip #4: Compress your Data Impact: Cost and Performance Compression in Oracle: “Data Warehouse” compression 2 – 3x compression ratio Included in DB license “OLTP” compression 3 – 4x compression ratio Database Option – Advanced Compression Exadata Hybrid Columnar Compression 10 – 50x compression ratio Included in Oracle Exadata license

Tip #4: Compress your Data Hybrid Columnar Compression Data is grouped by column and then compressed Query Mode for data warehousing Optimized for speed 10X compression typical Scans improve proportionally Archival Mode for infrequently accessed data Optimized to reduce space 15X compression is typical Up to 50X for some data

Tip #4: Compress your Data Usage Matrix Apply compression on a per Partition or higher level Change compression over the lifetime of a Partition or Table Both EHCC and DW compression will start to “decompress” when data is updated WorkloadPreferredPossible Bulk Load (write once, read many times) EHCC – QueryDW Operational DW (write, update, delete, update, read) OLTP--- Archive (static data, read once in a while) EHCC – ArchiveDW

EHCC Archive Mode EHCC Query Mode OLTP Compression Tip #4: Compress your Data Applying Compression across Partitions Day 1Day 2Day 8Day 9Day 10 Month 7Month 8

Tip #4: Compress your Data Improving performance and lowering cost SATA drives Flash Technology Memory SAS drives < 30% of your data 100% of your data Disclaimer: Illustration purposes only! 75% of your queries 25% of your queries Move more data onto high performance storage tiers = Upward Mobility

Software forces a Paradigm Shift Applying software changes the balance Adding software into the mix fundamentally changes the way we use and think about storage Software driven partitioning of data changes the cost per scanned TB in relation to total data volume Compression changes the cost per TB stored significantly Compression changes the cost per scanned TB in relation to non- compressed data

Compounding Returns Less Storage, Better Performance 1 TB with compression 10 TB of user data Requires 10 TB of IO 100 GB with partition pruning 20 GB with Storage Indexes 5 GB with Smart Scans Sub-second Response times

Conclusion Software and Hardware is the Solution Building a hierarchy of storage solutions allows you to be: More Flexible Deliver better performance for lower cost Partitioning and compression are technologies that change the hardware status-quo Partitioning allows slower HW deliver better performance Compression allows faster HW to hold more data and delivers better performance With today’s technology you can improve performance while reducing storage cost

A Q &