Presentation on theme: "Gavin Cole Storage Consultant SEE"— Presentation transcript:
1 Archiving and Data Protection. Gavin Cole, Storage Consultant SEE, Gavin.firstname.lastname@example.org
2 Agenda: Introduction; Data Protection Techniques; Data Archiving Techniques; Tools
4 Difference between Backup & Archive. Backups are created to recover from errors: cost-effective data protection; off-site storage; backup is important, restore is critical. Archiving is for Information Lifecycle Management: storing data on different media based on policy; “Right Data, Right Place, Right Price”.
5 Data Protection Techniques: Disk Mirroring / Replication; Disk to Disk Backup; Disk to Disk to Tape Backup; Virtual Tape Libraries; Continuous Data Protection; Disk to Tape
6 Optimizing The Life Cycle of Data. [Chart: data criticality over time for R&D, financial and marketing data.]
8 Why tape?
Average price per GB by storage device category:
  Automated Tape: $0.50-$3.00
  Economy Disk (JBOD, MAID, SATA): $3-$15
  Midrange Disk (SCSI, FC): $15-$35
  Optical Disk, Blue Laser (SCSI): $15-$35
  Enterprise Disk (FICON, ESCON, SCSI, FC): $40-$70
  Solid State Disk (SCSI, FC): $2000-$5000
All prices are ASP per gigabyte for working subsystems. Automated tape prices include drives, media and library and use a 2-to-1 compression factor. The price per gigabyte decreases as the ratio of cartridges to drives increases, diverging from disk costs. Tape cartridge capacity is growing faster than disk drive capacity, and tape pricing per gigabyte is now diverging from disk pricing.
Notes: The value advantage of tape over disk is expected to hold in the coming years. The value of tape increases as the cartridges-per-drive ratio increases. Virtual tape will enable: easy migration; a reduced number of RTDs and media; improved performance (random access and time to first byte). Open systems virtual solutions are in development.
Source: Horison Information Strategies
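The point that tape's price per gigabyte falls as the cartridge-to-drive ratio rises can be checked with simple arithmetic. The sketch below is only an illustration: the function name and every price and capacity in it are made-up placeholders, not figures from Horison or from the slide. It follows the slide's convention of counting drives, media and library, and of applying a 2-to-1 compression factor.

```python
def tape_price_per_gb(n_drives, n_cartridges,
                      drive_price=20000.0,    # hypothetical price per drive
                      cartridge_price=80.0,   # hypothetical price per cartridge
                      library_price=50000.0,  # hypothetical library price
                      native_gb=400.0,        # hypothetical native GB per cartridge
                      compression=2.0):       # 2-to-1 factor, as on the slide
    """Price per (compressed) gigabyte for a whole automated tape subsystem."""
    total_price = (n_drives * drive_price
                   + n_cartridges * cartridge_price
                   + library_price)
    total_gb = n_cartridges * native_gb * compression
    return total_price / total_gb


if __name__ == "__main__":
    # Keep the drive count fixed and grow only the number of cartridges.
    for cartridges in (50, 200, 1000):
        ratio = cartridges / 4
        print(f"{ratio:>5.0f} cartridges per drive: "
              f"${tape_price_per_gb(4, cartridges):.2f}/GB")
```

With the drive and library cost fixed, each added cartridge spreads that fixed cost over more capacity, which is why the per-gigabyte figure keeps dropping.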
9 Agenda: Introduction; Data Protection Techniques; Data Archiving Techniques; Tools
10 Traditional Tape Based Backup: primary storage to archival storage; direct relationship with tape devices; additional device sharing software. [Diagram: application servers, backup servers, tape automation.] Data moves directly from primary storage to tape and is managed by the backup application.
11 Disk to Disk Backup: add secondary disk storage, either as part of the existing disk subsystem or as an external appliance. [Diagram: application servers, backup servers, data protection appliance (disk-based snapshot/mirroring, continuous data protection (CDP)), tape automation.] Data moves from primary storage to a temporary disk buffer. Backups usually take place from the temporary location to tape.
12 Disk to Disk to Tape Backup: add disk to existing backup servers; no additional hardware or software. [Diagram: application servers, backup servers, tape automation.] Data moves from primary storage to the temporary disk buffer and then on to tape. Data movement is managed by the backup application. Recoveries will normally come from the disk buffer(s).
13 Disk to Disk with Virtual Tape: add virtual tape services with a storage appliance. [Diagram: application servers, backup servers, VTL (virtual tape library), tape automation.] Data moves from primary storage to the VTL disk buffer under control of the backup application. Recoveries normally take place from the VTL buffer. If a real tape cartridge is required, the backup application is responsible for creating it.
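The disk-to-disk-to-tape flow on this slide can be sketched as a small catalog that records where each backup currently lives and picks the restore source accordingly. This is a minimal sketch, not how any particular backup application works: the class and method names are hypothetical, and the rule that a backup may leave the disk buffer only once a tape copy exists is an assumption added for the example.

```python
from dataclasses import dataclass, field


@dataclass
class BackupCatalog:
    """Hypothetical catalog tracking where each backup copy currently lives."""
    on_disk: set = field(default_factory=set)  # backups still in the disk buffer
    on_tape: set = field(default_factory=set)  # backups already copied to tape

    def backup(self, backup_id):
        # Stage 1: primary storage -> temporary disk buffer.
        self.on_disk.add(backup_id)

    def copy_to_tape(self, backup_id):
        # Stage 2: disk buffer -> tape, driven by the backup application.
        if backup_id not in self.on_disk:
            raise KeyError(f"{backup_id} is not in the disk buffer")
        self.on_tape.add(backup_id)

    def expire_from_disk(self, backup_id):
        # Assumption: buffer space is freed only once a tape copy exists.
        if backup_id not in self.on_tape:
            raise ValueError(f"{backup_id} has no tape copy yet")
        self.on_disk.discard(backup_id)

    def restore_source(self, backup_id):
        # Recoveries normally come from the disk buffer; tape is the fallback.
        if backup_id in self.on_disk:
            return "disk"
        if backup_id in self.on_tape:
            return "tape"
        raise KeyError(f"no copy of {backup_id}")
```

A restore request for a recent backup is answered from disk; only backups that have aged out of the buffer require a tape mount.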
14 Continuous Data Protection: a disk-based data protection system that continuously echoes changes as they happen. [Diagram: CDP engine, IP subnet, successive copies of data blocks A, B and C.]
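One way to picture "continuously echoes changes as they happen" is a journal that records every write in order, so the data can be rebuilt as of any moment. The sketch below is a toy model under that assumption; `CdpJournal` and its methods are hypothetical names, and real CDP products differ in how they capture and store changes.

```python
class CdpJournal:
    """Toy continuous-data-protection journal: every write is recorded in order."""

    def __init__(self):
        self._entries = []  # (timestamp, block_id, data) tuples, in arrival order

    def record_write(self, timestamp, block_id, data):
        # Echo the change as it happens rather than waiting for a backup window.
        if self._entries and timestamp < self._entries[-1][0]:
            raise ValueError("writes must arrive in timestamp order")
        self._entries.append((timestamp, block_id, data))

    def restore(self, as_of):
        """Rebuild the block contents as they were at time `as_of`."""
        image = {}
        for timestamp, block_id, data in self._entries:
            if timestamp > as_of:
                break
            image[block_id] = data  # later writes overwrite earlier ones
        return image
```

Because nothing is overwritten in the journal, a restore can target the moment just before an error rather than the last scheduled backup.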
15 Agenda: Introduction; Data Protection Techniques; Data Archiving Techniques; Tools
16 Data Access Requirements are Always Changing (inferred by business use)
Notes: The reality is that the information lifecycle is dynamic. Data may need to be recalled at various times in its lifecycle, in a very unpredictable manner. For example, when a mortgage is refinanced or a patient gets sick, records need to be recalled and are then accessed quite often for a period of time. It may not be effective to keep such data on secondary or archive storage. The problem is that most ILM strategies move data in one direction only, from primary to secondary or tertiary storage. In a traditional ILM strategy, when data moves to tape, it stays on tape, even if the data goes active again. When users recall a file, they either access it on tape and get less-than-optimal performance, or they make a new “active” copy of the file, contributing to data sprawl.
Sources: Horison Information Strategies, StorageTek
17 New Data “Lifecycle” Journey: Challenge is Inclusive Mgt., Access & Cost of 'new' Archive Data. [Diagram: data consumers; new access; security & ID mgt. envelope; data creation; data in use; backup; data archive; offsite tape vault; retention @ date; data disposal; eternal and ephemeral data; data management.]
18 Archive Requirements Are Changing
Notes: The dynamic nature of data is also changing the nature of archive. In the past, archive meant data was protected for the purpose of recovery: data was saved to tape, which was later moved to an offsite location and only recalled when there was some type of failure. Because of the high cost of storing data, more and more organizations are using archive solutions, on disk or tape, to store “persistent” data: data that is not accessed often but cannot be moved offsite for long-term archive. Compliance regulations that require particular data to be accessible within a set period of time also drive the need for “intelligent” archive devices that can quickly search for and retrieve specific data sets.
19 Agenda: Introduction; Data Protection Techniques; Data Archiving Techniques; Tools. [Award logos: Most Valuable Product Award 2004, StorageTek L700; Product of the Year, Enterprise Server 2005, StreamLine™ SL500; Backup Hardware Product of the Year, StreamLine™ SL500; Storage Product of the Year 2005, StorageTek L1400; MVP Award 2005, Backup Hardware, StreamLine™ SL500; two logos marked (UK Edition).]
20 Tools of the Trade: SATA disk arrays; virtual tape libraries; nearline robotic libraries; tape drives; software
23 Tape Drive Technology Choices: Match the Technology to the Needs. [Chart: access versus capacity, from desktop through department servers and high-end server to enterprise server. Drives shown: T9840A, T9840B, T9840C; T9940A, T9940B; T10000; SDLT220, SDLT320, SDLT600, DLT-S4; LTO-1, LTO-2, LTO-3, LTO-4. Current models are marked.] Notes: Portfolio map of automation. The family spans size and operating environments.
24 Tape Drive Characteristics. [Table not preserved in the transcript; three columns were labelled "Enterprise class".] All figures are ‘native’. Average access time includes 4 seconds robotic time, load and average search.
25 Let’s see the difference for archive! Components of recall time: mount tape (by robot); load tape; find block; transfer data; rewind; unload; dismount tape (by robot). It doesn’t help if one component is fast when the rest are slow.
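The remark that one fast component does not help when the rest are slow can be made concrete by adding up the recall steps. The timings below are hypothetical placeholders chosen only for illustration; they are not drive specifications and do not come from the slides, and the helper names are likewise invented.

```python
# Hypothetical timings in seconds for one recall; none come from the slides.
RECALL_STEPS = {
    "mount tape (by robot)": 8.0,
    "load tape": 15.0,
    "find block": 40.0,
    "transfer data": 10.0,
    "rewind": 40.0,
    "unload": 15.0,
    "dismount tape (by robot)": 8.0,
}


def recall_time(steps):
    """Total time the drive and robot are tied up by one recall."""
    return sum(steps.values())


def speedup_from(steps, name, factor):
    """Overall speedup if only the step `name` becomes `factor` times faster."""
    faster = dict(steps)
    faster[name] = steps[name] / factor
    return recall_time(steps) / recall_time(faster)
```

With these placeholder numbers, doubling the transfer rate shortens the whole recall by under 4 percent, because positioning and handling dominate the total.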
26 Choose the Right Tape Technology. Capacity: important for backup; very important for most archive requirements; not so important for HSM functionality. Throughput: important for backup and archive; backup must write several stripes because of the speed. Load time: important for HSM functionality. Reliability.
28 SL8500. SL8500 has been adopted at more than twice the rate of other new StorageTek large libraries. Broad acceptance in financial, government and broadcasting segments. 39% of units shipped went to customers who had never purchased Sun/StorageTek enterprise libraries before.
29 StreamLine SL8500 architecture. SL8500 can be shared across heterogeneous applications and servers: ACSLS software support; allows multiple tape drives and media types. SL8500 is designed for consolidation: 24x7 design; multiple and simultaneous mounts/dismounts; scalability in capacity and tape drives, from 1 to 448 tape drives [cartridge slot range not preserved]; multiple media types; high density; high performance.
31 Library base size and expansion frames: ~1,500 cartridges; ~3,000/5,000 cartridges; ~10,000 cartridges. Customer requirement: flexibility and scalability.
32 24x7 availability. Redundant power; redundant robots; all critical internal components are redundant. “Live” library upgrades: microcode; addition and replacement of tape drives; addition and replacement of robots; addition and replacement of power and electronics; add new library; add cartridge slots.
33 THE L1400 TAPE LIBRARY EMPOWERS CONSOLIDATION: share library resources; grow with ease; customize your configurations; use multiple technologies; increase availability; work within flat or shrinking budgets.
34 KEY L1400 LIBRARY FEATURES. Continuous Capacity™ technology: superior scalability, grow easily when you need to. Partitioning: share the L1400 tape libraries without additional components. High performance: reduce backup times with high robotic and throughput performance. Any Cartridge Any Slot™ technology: configuration flexibility and investment protection. Sun Services: experience and coverage to support your consolidated data.
35 The Judges Reach Their Decision: Sun StorageTek L1400M, Product of the Year. As one of our judges put it, it may be "somewhat contrarian" to give the gold to a tape library in 2006, but Sun Microsystems Inc.'s L1400M solves a top problem in tape: the ability to use four different tape formats within a single library. “It's a solution that's been a long time coming.”
36 Over 3000 Installed Worldwide... Sun StorageTek SL500
Awards shown: Product of the Year, Enterprise Server 2005: StreamLine™ SL500 modular library system; Editor’s Choice Award: StreamLine™ SL500; (UK Edition); 25 Best Plants 2004: Puerto Rico Manufacturing Operations; MVP Award, Backup Hardware: StreamLine™ SL500; Most Valuable Product Award 2004: StorageTek L700e Tape Library.
Notes: Main point: awards build credibility. Script: StorageTek and Sun combined have over 6200 patents and a long history of innovation in data management. Our automated tape solution portfolio has been winning awards for years. These are a few recent awards.
37 THE SL500 MODULAR ARCHITECTURE SET A NEW INDUSTRY STANDARD. Modular rack-mount architecture: modular growth is simple to manage. Scalability without sacrificing reliability; reliability without sacrificing scalability. Cost-effective TCO: improves TCO by 25-80% over alternative architectures.
Notes: StorageTek brought a new architecture to market, one that met ALL of the major customer needs in ONE box. It is scalable, reliable, simple to own, and affordable: the customer buys what they need, when they need it. No more, no less.
38 SL500 Modular Library: protect your growing data without buying more and more libraries. Base Module: 30 or 50 LTO slots; 2 tape drive bays; 5 CAP slots.
39 SL500 Modular Library: protect your growing data without buying more and more libraries. Scalable growth from 30 slots to 575 slots: add up to 4 modules as and when you need them. Drive Expansion (8U): 90 LTO slots; 4 tape drive bays; 10 CAP slots. Cartridge Expansion (8U): 120 LTO slots; 0 tape drive bays; 10 CAP slots.
40 Sun StorageTek SL24 Tape Autoloader. The Sun StorageTek SL24 Autoloader provides high-capacity automated backup and recovery in a space-efficient, highly manageable product. Features: 2U form factor (24 slots); initially supports 1 LTO-3 drive (SCSI or FC interface) and up to 2 half-height LTO-2 drives (SCSI interface); two removable 12-slot magazines; web-based remote management capability; integrated barcode reader; rack-ready or tabletop configurations.
41 Sun StorageTek SL48 Tape Library. The Sun StorageTek SL48 is an entry-level library for automated backup and recovery. Features: 4U form factor (48 slots); initial support for up to two LTO-3 SCSI or FC tape drives, or up to four half-height LTO-2 SCSI tape drives; four removable 12-slot magazines; web-based GUI; tool-free drive upgrades; barcode reader; redundant power; rack-ready or tabletop configurations.
42 Choose the Right Tape Library. Robotic speed: unimportant for backup; very important for most archive requirements. Mixed media: support for different drive types; different drives for different requirements. Size/scalability: initially just a sizing exercise; growth is good, so plan for it!
44 ‘Infinite Disk’ Technology. SAM-FS creates a virtualized or “infinite disk” that extends primary or secondary storage onto any number of additional media and devices. Virtualization optimizes storage capacity, automatically migrating inactive data to near-line secondary media, while also giving users immediate access to ALL of their data, regardless of where it is stored. [Diagram: disk holding free space, active data (frequently used files), and pointers to inactive data on secondary media.]
45 SAM-FS Function: Archive. Populate all tiers of storage based on data policy
Slide points: data copied to media by policy; automatically populate all desired tiers of storage; generate up to 4 copies concurrently; data written in TAR format; optimized writes based on file size; shell command for immediate response.
[Diagram: users and LAN/SAN clients; file servers; on-line secondary storage; off-site storage. Data types: documents, spreadsheets, presentations, images, printstreams, databases, video/audio, messaging.]
Notes: The ARCHIVE function interrogates the file system for new data or for data that has been updated. A data policy defined during setup governs how often this function runs, which tiers of storage it uses, and how much storage resource is available to it (e.g. 2 tape drives, 3 tape drives, etc.). Data policies governing the behavior of the ARCHIVE function can be established by application, by file system, by directory, even by file size and file type. The ARCHIVE function is transparent to the application being managed. Up to 4 tiers of storage can be populated with newly created or updated data. The data is written in TAR format, which is important for worst-case scenarios where the application environment has to be rebuilt with nothing more than the data itself being available. This approach to data management enables users of ASM to automate the data protection and business continuance function. The ARCHIVE function can be set up to run at a specific time interval (e.g. once an hour), at a specific time (e.g. 6pm each evening), or on a specific application event (e.g. an end-of-day process). There is a shell command for manual execution as well.
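The two ideas in these notes, finding new or updated files and writing them out in tar format as up to four copies, can be sketched as follows. This is not SAM-FS code or its configuration syntax: the function names and data structures are hypothetical, and the tar images are built in memory with Python's standard `tarfile` module, whose default tar variant may differ from the one SAM-FS writes.

```python
import io
import tarfile

MAX_COPIES = 4  # the slide states that up to 4 copies can be generated


def files_to_archive(mtimes, archived_at):
    """Return names that are new, or modified since their last archive copy.

    mtimes:      {name: last-modified time}
    archived_at: {name: modification time captured by the last archive pass}
    """
    return sorted(name for name, mtime in mtimes.items()
                  if archived_at.get(name, float("-inf")) < mtime)


def make_archive_copies(contents, names, copies):
    """Write the named files into `copies` identical in-memory tar images."""
    if not 1 <= copies <= MAX_COPIES:
        raise ValueError(f"copies must be between 1 and {MAX_COPIES}")
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w") as tar:
        for name in names:
            data = contents[name]
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    image = buffer.getvalue()
    # One image per storage tier; a real system would write each to its own media.
    return [image for _ in range(copies)]
```

Because the result is a plain tar image, any standard tar reader can recover the files without the archiving software, which is the worst-case scenario the notes describe.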
46 SAM-FS Function: RELEASE. Automate primary disk space management
Slide points: release only after a valid copy exists; release directly after archiving; release at watermarks; never release (data stays on disk); shell command for immediate response.
[Diagram: users and LAN/SAN clients; file servers; on-line secondary storage; off-site storage. Data types: documents, spreadsheets, presentations, images, printstreams, databases, video/audio, messaging.]
Notes: The RELEASE function, like the ARCHIVE function, has a data policy that is established during product setup. Its function is to automate space management for primary disk. These data policies can be changed on the fly. The customer can even have one set of data policies for normal business hours and a completely different set, usually to reallocate shared resources, for off-hour operations. Primary disk space can be freed as copies of the primary data are created, or at specific watermarks. The policies for the RELEASE function can even be set to “Never Release” certain critical data that is under ASM’s control and populated to secondary storage, but must remain on disk at all times. As mentioned for the ARCHIVE function, the ASM administrator can manually run the RELEASE function from any ASM master console.
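Release at watermarks can be sketched as a small selection routine: once primary disk usage passes a high watermark, release eligible files until usage falls to a low watermark. This is an illustration only, not the SAM-FS releaser: the names, the 80%/60% thresholds, and the least-recently-accessed ordering are all assumptions. The two rules taken from the slide are that a file is released only after a valid copy exists and that "never release" files stay on disk.

```python
from dataclasses import dataclass


@dataclass
class ManagedFile:
    name: str
    size: int            # space occupied on primary disk
    last_access: float   # timestamp of the most recent access
    archived: bool       # True once a valid archive copy exists
    never_release: bool = False


def release_at_watermarks(files, capacity, high=0.80, low=0.60):
    """Pick files to release once usage passes `high`, stopping at `low`.

    Returns the names to release, least recently accessed first.
    """
    used = sum(f.size for f in files)
    if used <= high * capacity:
        return []  # below the high watermark: nothing to do
    released = []
    for f in sorted(files, key=lambda f: f.last_access):
        if used <= low * capacity:
            break
        if f.archived and not f.never_release:
            released.append(f.name)
            used -= f.size
    return released
```

Files without an archive copy are skipped rather than released, so the routine can fall short of the low watermark if too little data has been archived yet.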
47 SAM-FS Functions: STAGE. Managing retrievals from secondary storage
Slide points: associative staging (files can be staged in groups); read-behind stage (data immediately available while the file is being retrieved); never stage (direct read from secondary storage on a retrieval request).
[Diagram: users and LAN/SAN clients; file servers; on-line secondary storage; off-site storage. Data types: documents, spreadsheets, presentations, images, printstreams, databases, video/audio, messaging.]
Notes: Policies for the STAGE function control how data is managed when it is retrieved from secondary storage. For seldom-retrieved data, I can set a policy that mounts a tape, reads the requested file, and presents it directly to the requestor without the data ever being staged back to primary disk. At the same time, I can have a completely different application whose data has been archived to secondary storage and released from primary disk, and which, once requested, may see many requests for the same data over a short period. That data can be staged back to primary disk and retained there for the period of use (e.g. a few days, a week, a month). During the STAGE function, a feature called Read-Behind-Stage allows portions of a large file to be available immediately as each logical container is written back to disk. This greatly improves response time from an end-user perspective. Without this ASM feature, the user would have to wait for the entire file to be restored to disk before any part of it could be accessed.
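The difference between read-behind staging and "never stage" can be sketched with two generators. This is a simplified model, not the SAM-FS stager: the function names are hypothetical, a `bytearray` stands in for the file on primary disk, and an iterable of byte chunks stands in for the tape read.

```python
def stage_read_behind(read_chunks, disk_copy):
    """Stage a file back to disk while handing each chunk to the reader.

    read_chunks: iterable yielding bytes chunks from secondary storage
    disk_copy:   bytearray standing in for the file on primary disk
    """
    for chunk in read_chunks:
        disk_copy.extend(chunk)  # write the chunk back to primary disk
        yield chunk              # make it available without waiting for the rest


def read_never_stage(read_chunks):
    """'Never Stage': pass data straight to the requester, no disk copy."""
    yield from read_chunks
```

With read-behind staging the first chunk reaches the reader as soon as it has been written to disk, while the disk copy keeps filling in for later requests; with never stage nothing is left on primary disk afterwards.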
48 THANK YOU! Sun, Sun Microsystems, StorageTek, the Sun and StorageTek logos, StreamLine, Virtual Storage Manager, VSM, T-Series and Sun StorageTek are trademarks or registered trademarks of Sun Microsystems, Inc. in the United States and other countries.