©2011 Quest Software, Inc. All rights reserved.. Guy Harrison Director, R&D Melbourne Web:http://www.guyharrison.net.

Slides:



Advertisements
Similar presentations
Symantec 2010 Windows 7 Migration EMEA Results. Methodology Applied Research performed survey 1,360 enterprises worldwide SMBs and enterprises Cross-industry.
Advertisements

Symantec 2010 Windows 7 Migration Global Results.
Monitoring and Testing I/O
David Burdett May 11, 2004 Package Binding for WS CDL.
CALENDAR.
The Impact of Soft Resource Allocation on n-tier Application Scalability Qingyang Wang, Simon Malkowski, Yasuhiko Kanemasa, Deepal Jayasinghe, Pengcheng.
© Tally Solutions Pvt. Ltd. All Rights Reserved Shoper 9 License Management December 09.
Extreme Performance with Oracle Data Warehousing
Welcome. © 2008 ADP, Inc. 2 Overview A Look at the Web Site Question and Answer Session Agenda.
Break Time Remaining 10:00.
SE-292 High Performance Computing
Storing Data: Disk Organization and I/O
Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved.
Exadata Embracing Change What is familiar and what is new? The statements and opinions expressed here are my own and do not necessarily represent those.
I/O Management and Disk Scheduling
Solid State Drive. Advantages Reliability in portable environments and no noise No moving parts Faster start up Does not need spin up Extremely low.
This picture has been released into the public domain by its author, ChiemseeMan at the German Wikipedia projectpublic domainChiemseeManGerman Wikipedia.
M AINTAINING L ARGE A ND F AST S TREAMING I NDEXES O N F LASH Aditya Akella, UW-Madison First GENI Measurement Workshop Joint work with Ashok Anand, Steven.
Database Performance Tuning and Query Optimization
13 Copyright © 2005, Oracle. All rights reserved. Monitoring and Improving Performance.
Database Performance Tuning and Query Optimization
PP Test Review Sections 6-1 to 6-6
Removing the I/O Bottleneck with Virident PCIe Solid State Storage Solutions Jan Silverman VP Operations.
Operating Systems Operating Systems - Winter 2012 Chapter 2 - Processes Vrije Universiteit Amsterdam.
Operating Systems Operating Systems - Winter 2010 Chapter 3 – Input/Output Vrije Universiteit Amsterdam.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
MaK_Full ahead loaded 1 Alarm Page Directory (F11)
Artificial Intelligence
: 3 00.
5 minutes.
Figure 10–1 A 64-cell memory array organized in three different ways.
Converting a Fraction to %
SE-292 High Performance Computing Memory Hierarchy R. Govindarajan
Clock will move after 1 minute
MS SQL Server & Solid State Storage November 2013 Gavin McLaughlin Solutions Development Director X-IO International Cutting through the marketing hype.
Flash storage memory and Design Trade offs for SSD performance
Select a time to count down from the clock above
Copyright Tim Morris/St Stephen's School
1 DIGITAL INTERACTIVE MEDIA Wednesday, October 28, 2009.
Exadata Distinctives Brown Bag New features for tuning Oracle database applications.
Storing Data: Disks and Files: Chapter 9
Query Processing and Optimizing on SSDs Flash Group Qingling Cao
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 11 Database Performance Tuning and Query Optimization.
Introduction to Database Systems 1 The Storage Hierarchy and Magnetic Disks Storage Technology: Topic 1.
Solid State Drive Feb 15. NAND Flash Memory Main storage component of Solid State Drive (SSD) USB Drive, cell phone, touch pad…
SQL Server 2008 & Solid State Drives Jon Reade SQL Server Consultant SQL Server 2008 MCITP, MCTS Co-founder SQLServerClub.com, SSC
 A basic overview  Presented by:  Steve Jones, Gran-IT Consulting, Inc.
Buying a Laptop. 3 Main Components The 3 main components to consider when buying a laptop or computer are Processor – The Bigger the Ghz the faster the.
Operating Systems CMPSC 473 I/O Management (2) December Lecture 24 Instructor: Bhuvan Urgaonkar.
Lecture 11: DMBS Internals
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 10 Database Performance Tuning and Query Optimization.
Physical Database Design & Performance. Optimizing for Query Performance For DBs with high retrieval traffic as compared to maintenance traffic, optimizing.
A Case for Flash Memory SSD in Enterprise Database Applications Authors: Sang-Won Lee, Bongki Moon, Chanik Park, Jae-Myung Kim, Sang-Woo Kim Published.
+ CS 325: CS Hardware and Software Organization and Architecture Memory Organization.
DMBS Internals I. What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the database concurrently.
DMBS Internals I February 24 th, What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the.
DMBS Internals I. What Should a DBMS Do? Store large amounts of data Process queries efficiently Allow multiple users to access the database concurrently.
1 Paolo Bianco Storage Architect Sun Microsystems An overview on Hybrid Storage Technologies.
Exadata Distinctives 988 Bobby Durrett US Foods. What is Exadata? Complete Oracle database platform Disk storage system Unique to Exadata – intelligent.
Indexing strategies and good physical designs for performance tuning Kenneth Ureña /SpanishPASSVC.
CMPE Database Systems Workshop June 16 Class Meeting
Lecture 16: Data Storage Wednesday, November 6, 2006.
Database Management Systems (CS 564)
The (Solid) State Of Drive Technology
File Processing : Storage Media
Lecture 11: DMBS Internals
Lecture 9: Data Storage and IO Models
File Processing : Storage Media
Presentation transcript:

©2011 Quest Software, Inc. All rights reserved.. Guy Harrison Director, R&D Melbourne Web: Making the most of Solid State Disk in Oracle 11g

Introductions

2

3 ©2011 Quest Software, Inc. All rights reserved.. Agenda Brief History of Magnetic Disk Solid State Disk (SSD) technologies SSD internals Oracle DB flash cache architecture Performance comparisons Recommendations and Suggestions

4 ©2011 Quest Software, Inc. All rights reserved.. A brief history of disk

5MB HDD circa 1956

28MB HDD RPM

The more that things change....

8 ©2011 Quest Software, Inc. All rights reserved.. Moores law Transistor density doubles every 18 months Exponential growth is observed in most electronic components: CPU clock speeds RAM Hard Disk Drive storage density But not in mechanical components Service time (Seek latency) – limited by actuator arm speed and disk circumference Throughput (rotational latency) – limited by speed of rotation, circumference and data density

Disk trends

10 ©2011 Quest Software, Inc. All rights reserved.. Solid State Disk

SSD to the rescue?

Power consumption

Economics of SSD $/GB $/IOPS

Tiered storage management Main MemoryDDR SSDFlash SSDFast Disk (SAS, RAID 0+1)Slow Disk (SATA, RAID 5)Tape, Flat Files, Hadoop $/IOP $/GB

15 ©2011 Quest Software, Inc. All rights reserved.. Storage Tiering Storage Tiering For Dummies,® Oracle Special Edition, Wiley 2011

16 ©2011 Quest Software, Inc. All rights reserved.. SSD technology and internals

Flavours of Flash SSD DDR RAM Drive SATA flash drive PCI flash drive SSD storage Server

PCI SSD vs SATA SSD PCI vs SATA SATA was designed for traditional disk drives with high latencies PCI is designed for high speed devices PCI SSD has latency ~ 1/3 rd of SATA

19 Booth 1107

20 ©2011 Quest Software, Inc. All rights reserved.. Flash SSD Technology Cell: One (SLC) or Two (MLC) bits Page: Typically 4K Block: Typically K Storage Hierarchy: Read and first write require single page IO Overwriting a page requires an erase & overwrite of the block Writes: 100,000 erase cycles for SLC before failure 5,000 – 10,000 erase cycles for MLC Write endurance:

21 ©2011 Quest Software, Inc. All rights reserved.. Flash SSD performance

Flash Disk write degradation All Blocks empty: Write time=250 us 25% part full: Write time= ( ¾ * 250 us + 1/4 * 2000 us) = 687 us 75% part full Write time = ( ¼ * 250 us + ¾ * 2000 us ) = 1562 us Empty Partially Full

Valid Data Page Empty Data Page InValid Data Page Free Block Pool Used Block Pool SSD Controller Insert Data Insert

Valid Data Page Empty Data Page Invalid Data Page Free Block Pool Used Block Pool SSD Controller Update Data Update

Valid Data Page Empty Data Page Invalid Data Page Free Block Pool Used Block Pool SSD Controller Garbage Collection

26 ©2011 Quest Software, Inc. All rights reserved..

27 ©2011 Quest Software, Inc. All rights reserved.. 11g DB flash Cache

28 ©2011 Quest Software, Inc. All rights reserved.. Oracle DB flash cache Introduced in 11gR2 for OEL and Solaris only Secondary cache maintained by the DBWR, but only when idle cycles permit Architecture is tolerant of poor flash write performance

Buffer cache and Free buffer waits Database files Buffer cache DBWR Oracle process Free Buffer Waits Write dirty blocks to disk Write to buffer cache Read from disk Read from buffer cache Free buffer waits often occur when reads are much faster than writes....

Flash Cache Database files Buffer cache DBWR Oracle process Write dirty blocks to disk Write to buffer cache Read from disk Read from buffer cache Flash Cache Write clean blocks (time permitting) Read from flash cache DB Flash cache architecture is designed to accelerate buffered reads

31 ©2011 Quest Software, Inc. All rights reserved.. Configuration Create filesystem from flash device Set DB_FLASH_CACHE_FILE and DB_FLASH_CACHE_SIZE. Consider Filesystemio_options=setall

32 ©2011 Quest Software, Inc. All rights reserved.. Flash KEEP pool You can prioritise blocks for important objects using the FLASH_CACHE clause:

33 ©2011 Quest Software, Inc. All rights reserved.. Oracle Db flash cache statistics

Flash Cache Efficiency

Flash cache Contents

36 ©2011 Quest Software, Inc. All rights reserved.. Performance tests

37 ©2011 Quest Software, Inc. All rights reserved.. Test systems Low end system: Dell Optiplex dual-core 4GB RAM 2xSeagate 7500RPM Baracuda SATA HDD Intel X-25E SLC SATA SSD Higher end system: Dell R510 2xquad core, 32 GB RAM 4x300GB 15K RPM,6Gbps Dell SAS HDD 1xFusionIO ioDrive SLC PCI SSD

Performance: indexed reads(X-25)

Performance: Read/Write (X-25)

Random reads – FusionIO

Updates – Fusion IO

Full table scan – FusionIO

43 Sorting – what we expect Memory Sort Single Pass Disk Sort Multi-pass Disk Sort

44 Disk Sorts – temporary tablespace Single Pass Disk Sort Multi-pass Disk Sort

Redo performance – Fusion IO

46 Concurrent redo workload (x10)

47 ©2011 Quest Software, Inc. All rights reserved.. Buffer Cache bottlenecks Flash cache architecture avoids free buffer waits due to flash IO, but write complete waits can still occur on hot blocks. Free buffer waits are still likely against the database files, due to high physical read rates created by the flash cache

48 ©2011 Quest Software, Inc. All rights reserved.. Write degradation In theory, high sustained write IO can lead to SSD degradation when GC fails to cope with the block erase/update cycle In practice, this is rarely noticeable from Oracle: Oracle write IO is largely asynchronous (DBWR) Almost all write activity has at least an equal amount of read activity Garbage collection and wear levelling algorithms are sophisticated in decent SSD drives

49 ©2011 Quest Software, Inc. All rights reserved..

50 ©2011 Quest Software, Inc. All rights reserved..

Fusion IO direct cache 51 Read- intensive, potentially massive tablespaces Temp Tablespace Hot Segments Hot Partitions DB Flash Cache (limited to the size of the SSD) Regular Block Device ioMemory VSL File System/ Raw Devices/ ASM directCache File System/ Raw Devices/ ASM Caching Block Device ioMemory VSL LUN

Fusion IO direct cache – Table scans

53 ©2011 Quest Software, Inc. All rights reserved.. Exadata 53

55 ©2011 Quest Software, Inc. All rights reserved.. Exadata flash storage 4x96GB PCI Flash drives on each storage server Flash can be configured as: Exadata Smart Flash Cache (ESFC) Solid State Disk available to ASM disk groups ESFC is not the same as the DB flash cache: Maintained by cellsrv, not DBWR DOES support full table scans DOES NOT support smart scans Unless CELL_FLASH_CACHE= KEEP, Statistics accessed via the cellcli program Considerations for cache vs. SSD are similar

Exadata: Flash grid disk vs ESFC 100M row table, 200,000 random PK lookups, 1M possible keys

57 ©2011 Quest Software, Inc. All rights reserved.. Summary

58 ©2011 Quest Software, Inc. All rights reserved.. Recommendations Dont wait for SSD to become as cheap as HDD Magnetic HDD will always be cheaper per GB, SSD cheaper per IO Consider a mixed or tiered storage strategy Using DB flash cache, selective SSD tablespaces or partitions Use SSD where your IO bottleneck is greatest and SSD advantage is significant DB flash cache offers an easy way to leverage SSD for OLTP workloads, but has few advantages for OLAP or Data Warehouse

59 ©2011 Quest Software, Inc. All rights reserved.. How to use SSD Database flash cache If your bottleneck is single block (indexed reads) and you are on OEL or Solaris 11GR2 Flash tablespace Optimize read/writes against hot segments or partitions Flash temp tablespace If multi-pass disk sorts or hash joins are your bottleneck FusionIO direct cache If you want to optimize both scans and index reads OR you are not on OEL/Solaris 11GR2 59

60 ©2011 Quest Software, Inc. All rights reserved..

61 ©2011 Quest Software, Inc. All rights reserved..

62 ©2011 Quest Software, Inc. All rights reserved.. References Latest version of this presentation: Quest whitepaper: Guys SSD guide