Presentation on theme: "EMC VNXe AND DATADOMAIN"— Presentation transcript:
1 EMC VNXe AND DATADOMAIN Capabilities, options and use casesMadis PärnSenior Technology ConsultantEMC
2 EMC VNXe SERIESNote to Presenter: Present to customers and prospects to provide them with a high-level discussion of the EMC VNXe series that delivers simplicity, efficiency, and affordability.
3 Next-Generation Unified Storage Optimized for today’s virtualized ITEMC UnisphereThe EMC VNX family of hardware and software solutions are simple to provision, efficient with lower-capacity requirements, affordable for any budget, and powerful enough to handle the demands of virtual applications.In fact, the family delivers the world’s simplest array to manage and the world's highest midtier performance.The VNX family is designed from the ground up for virtual application environments—from simple, money-saving server and storage consolidation for small business, to the next- generation virtual data center applications.The VNX family is comprised of two series: VNXe series and VNX series.The VNXe series represents the entry point of the VNX family, and is designed specifically for small-to-medium businesses (SMB), remote offices or branch offices (ROBO), or departmental applications where traditional storage administration skills may not be available.The VNX series is the next-generation midrange platform. For those of you familiar with EMC’s current CLARiiON and Celerra platforms, the VNX series combines the capabilities of these systems into a single modular unified storage offering.Although new, the entire VNX family shares the tradition of and builds on EMC’s years of expertise with the world’s most popular SAN and NAS platforms—CLARiiON and Celerra. Everything EMC has learned about high performance and high reliability culminates with the VNX family.EMC Unisphere provides a common unified management capability for EMC’s VNX family and CLARiiON and Celerra products. And as you will see, Unisphere also provides a way to simplify and automate other common storage management tasks, such as replication and backup reporting.VNXe3150VNXe3300VNX5100VNX5300VNX5500VNX5700VNX7500Affordable. Simple. Efficient. Powerful.
4 Simple. Efficient. Affordable. Configure in secondsGet more capacity at less costRest easy, rock solidStarting under $10KA new way to think about shared storageThe efficiency and operational benefits of networked storage are generally understood—easier information sharing, higher capacity utilization, centralized data management and protection, better scaling, etc.—yet many companies still make wide use of laptop shares and application servers doubling as file servers. As a result, data risks, server sprawl, inefficient direct-attached storage, increasingly complicated multi-server backup processes, and limited storage expertise is becoming a challenge. This is especially true for small-to-medium businesses where many of their IT operations want to realize the benefits of server virtualization in combination with storage consolidation to drive even greater economy and efficiency.Note to Presenter: Click now in Slide Show mode for animation.There’s a new way to think about shared storage.What if you could configure storage for your virtual applications in seconds, without being a storage expert?What if your system could automatically double your effective storage efficiency using advanced technologies like deduplication and thin provisioning?What if the system could protect data better by having very high availability in combination with rock-solid RAID and replication?What if you could do all of this within a modest budget?Introducing the VNXe series. Let’s see what it looks like…
5 Storage.Click.Done.Note to Presenter: View in Slide Show mode for animation.The VNXe series has a clean, compact, rock-solid, and high-availability design.EMC took all it has learned about storage and built this new platform for the entry-level storage company. It had to be simple and easy to use. It had to be efficient and not take up more space than needed. And it had be affordable to everyone.It is all there. The VNXe series delivers a great user experience. It is highly efficient and compact, and it starts at less than $9,300.Storage. Click. Done.
6 More Storage. Less Effort. The easiest, most-efficient array everStorage.Click.Done.Next-generation architectureAdvanced multi-core platformNo-single-point-of-failure designMulti-protocol: CIFS, NFS, and iSCSIPowerful 6 Gb/s SAS back endCompact 2U and 3U designStreamlined for ease of useDesigned for the IT generalistApplication-optimized wizardsConfigurable with just a few clicksBuilt-in snapshots and replicationPerfect for small or remote officesNote to Presenter: View in Slide Show mode for animation.It is all about making life easier—more storage with less effort. To deliver on that promise, VNXe series makes use of the latest Intel Xeon 5600 multi-core processors. And the VNXe series’ no single-point-of-failure, yet compact 2U and 3U design, provides needed reliability and data protection.Disk attachment is handled by four lanes of fully redundant 6 Gb/s SAS (serial-attached SCSI) for high performance and bandwidth. Front-end connectivity supports both 1 and 10 Gigabit Ethernet with full unified support for Windows CIFS and UNIX NFS file sharing as well as iSCSI SAN.The VNXe series is streamlined for ease of use. It provides all the benefits of advanced shared storage without the complexity. Common tasks are easily handled using application-specific wizards. Provision and protect new storage for Microsoft Exchange, VMware, Microsoft Hyper-V, file sharing, and volumes—all guided by the system with a few clicks of confirmation.The VNXe series uses advanced storage technology to reduce initial capacity requirements and improve storage efficiency, including Thin Provisioning, and File Deduplication with Compression. This technology reduces average storage capacity requirements by up to 50 percent, saving on your hardware investment and operating expenses. Capacity is added over time as users and applications consume it, rather than reserving it up-front.Finally, the VNXe series has unified replication support. Files and block replication provides disaster recovery for both NAS and SAN environments. VNXe can even replicate to the larger VNX series, which makes it perfect for small or remote offices.
7 Instant ExpertiseBest-practice wizards configure storage with just a few clicksWizardSet up hundreds of Exchange mailboxes in fewer than 10 clicksVMware WizardSet up 1 TB VMware datastore in 10 minutesHyper-V WizardSet up 1 TB Hyper-V datastore in 10 minutesNote to Presenter: View in Slide Show mode for animation.The VNXe series was specifically designed to integrate with your server and application environments. The wizards for provisioning new storage do it in the context of the application, rather than as just generic capacity. There’s no need to be a RAID expert. Simply let the wizard be your guide. Gain instant expertise and let the system do the hard work.For example, the Microsoft Exchange wizard asks you for your Exchange version, how many mailboxes are needed, and what size, and then automatically creates the storage volumes needed for the data and log files.The wizards take the guesswork out provisioning storage and embodies years of EMC experience and best practices. If you are creating volumes using iSCSI, or file shares for CIFS or NFS, the appropriate wizard will create the storage, set up appropriate access, and enable snapshots and even external replication, all in less than 10 clicks. Simply confirm your selections or accept the defaults, and it is all done for you.With the VNXe series, storage is now easier than ever.Share WizardSet up NFS and CIFS shares in minutesVolume WizardSet up iSCSI volumes in minutes
8 Instant Insight with Unisphere Note to Presenter: View in Slide Show mode for animation.The Unisphere user interface for VNXe series is clean and intuitive.The Dashboard view provides a snapshot of system resource use and open alerts, as well as direct access to essential storage and system task options.Sliding the cursor over any of the tabs at the top of the screen activates a pop-up of related activities and information drill-down options.Navigate all common tasks through an easy dashboard
9 Instant Insight with Unisphere Note to Presenter: View in Slide Show mode for animation.The VNXe series not only provides up-to-date information on your storage resources, it continues the application-driven theme by displaying how capacity is used across applications, file and generic iSCSI volumes, and reserves for data protection. In addition to the current system view, Unisphere also tracks and presents storage growth over time, allowing you to track the pace of growth and anticipate future needs based on historical trends.View detailed reports of how resources are being used
10 Instant Insight with Unisphere Note to Presenter: View in Slide Show mode for animation.Monitoring storage system health is a key part of maintaining data availability and performance. System status is always available on the System view. Unisphere displays any system alerts through the Dashboard, and clicking on a hardware-related notification brings up the System view and identifies the specific component affected.The detailed visuals provided in Unisphere make it simple for operators to relate directly to system components. This is especially critical when there are multiple and redundant components present, such as controllers and power supplies, as well as several enclosures of disk drives.Selecting any of the system enclosures expands it to show the drives and controllers within. Select an item such as a individual controller or disk drive to see it highlighted in the diagram.Visually drill down to individual components
11 Instant Insight with Unisphere Note to Presenter: View in Slide Show mode for animation.Finally, using the Support view, you can access a variety of types and sources of information to help get the most from the VNXe series. Unisphere provides a single window into all of these resources, known as the VNXe ecosystem.Online user resources include:How-to videos: Familiarize yourself with a product expansion, enhancement, or self- service task by first watching how it’s done.Online documentation: Find hardware, software, installation, and operation guides.Online training: Expand your knowledge of storage and best practices.Online community access: Search questions and answers from the VNXe community of users and product experts, or post a question of your own.Online support: Log a warranty support call, open a live chat session, or even order a replacement part if needed. Search EMC’s extensive repository of product, compatibility, and application information.Enlist community help when solving problems
12 VNXe3150: New, Entry-Level Model The ideal choice for SMB, ROBO, and FederalImproved density and performanceQuad core processor for improved performance2U DPEs and DAEsChoice of 2.5-inch or 3.5-inch hard drivesSupports 2.5-inch solid state drivesUp to 100 drivesOptional 10G-BaseT I/O moduleEMC has introduced the new VNXe3150 with quad-core performance at no additional cost, and support for Flash dives. Flash delivers high transactional performance for a defined set of data, like VDI boot volumes. In addition, EMC has added 2.5-inch drive support which yield a 50 percent improvement in capacity and performance density.10G-BaseT connectivity is now optional for improved network bandwidth.Note to Presenter: VNXe3150 is targeted for shipment in QQuad-core performance at no additional costReduces OpEx (power/cooling) by 33%50% more disk performance and capacity per rack U
13 VNXe3300: New Scalability and Features Improved density and performanceNew 2.5-inch DPE and DAE300 GB, 600 GB, and 900 GB 2.5-inch 10K SAS100/200 GB 2.5-inch solid state drives (SSD)New 3.5-inch drives900 GB 10K SASUp to 150 drives*Optional 10G-BaseT I/O module*Supports two I/O modules per controller*The VNXe3300 (which already supported Flash) has increased the maximum number of drives from 120 to 150. In addition, EMC has added 2.5-inch drive support to all the VNXe systems. This yields a 50 percent density improvement. With support for the 3 TB NL-SAS drive, users can save up to 33 percent in rack space, cooling, and power requirements.Note to Presenter: VNXe3300 enhancements are targeted for shipment in QReduces OpEx (power/cooling) by 33%50% more disk performance and capacity per rack U* Requires VNXe OE 2.3 SP1
14 VNXe Series Models Simple. Efficient. Affordable. VNXe3150 VNXe3300 Form factor2U3UStorage processors (SPs)1 or 22Backend Disk ports per SP1 x 6 Gb/s x4 SASMaximum drives50 or 100150Drive types3.5” 300 GB/600 GB 15K SAS 1 TB/2 TB/3 TB NL-SAS2.5” – 100, 200 GB SSD 300GB, 600GB, 900GB 10K3.5” GB SSD 300 GB/600 GB 15K SAS 1 TB/2 TB/3 TB NL-SAS2.5” – 100 GB/200 GB SSD 300GB/600GB/900GB 10KProtocolsNFS, CIFS, iSCSIEmbedded I/O ports per SP2 x 1 Gb/s Ethernet4 x 1 Gb/s EthernetConfigurable I/O slots per SP1Optional I/O ports per SP4 x 1GBaseT Ethernet2 x 10GBaseT Ethernet2 x 10G EthernetSystem memory4 or 16 GB24 GBTwo system models—one management environment. The VNXe series provides a choice of hardware platforms and capabilities to meet you specific needs.Common capabilities include:6 Gb/s SAS disk interface delivers the highest bandwidth for high performance and full redundancy for high availabilitySupport for enterprise Flash, 15k rpm high-performance and high-capacity 7,200 rpm near-line disks.Standard 1 Gigabit Ethernet ports for shared iSCSI and NAS connectivityOptional 10 Gig Base-T Ethernet ports for shared iSCSI and NAS connectivityI/O expansion slotsManagement and protocols: Unisphere, CIFS, NFS, iSCSIAdvanced functionality: Thin provisioning, and File Deduplication with CompressionChoose:VNXe3150 for compact, highly integrated solutions with class-leading features and the option of single or dual controllers to achieve the right combination of price, performance, and availability.VNXe3300 for greater storage scalability and performance, and two I/O slots per controller for added expandability.
15 Key Considerations: Comparing Drive Technologies Disk drives are for more than capacity:Drives are needed for performance, as a general rule of thumbFlash drive provides approximately 3500* IOPS per drivePerformance SAS drives provide approximately:180* IOPS per 15,000 RPM drive140* IOPS per 10,000 RPM driveNL-SAS drives provide approximately 90* IOPS per 7,200 RPM driveHigher drive count always has a positive effect on data services (snapshots and replication)The right type of drive matters:Drive performance lends itself to different types of applications.Flash Drives for VDIPerformance SAS drives for Databases and VMDKsNL -SAS drives for file services and archiving
16 Total Data Management and Protection Peace of mind without the complexitySoftware PacksTotal ProtectionPackTotal ValuePackSoftware SuitesVNXe3300VNXe3150 VNXe3100Security and Compliance Suite Version integrity and audit readinessAs you respond to requirements for data protection, application availability, and business continuity and compliance, the VNXe series offers incremental capabilities in the form of system and integration software. These titles can be purchased in suites to meet specific needs, or in value-priced packs to provide even greater value.Security and Compliance Suite: Manage the file retention and tracking process for compliance and file-level anti-virus protection.Local Protection Suite: Snapshots for block and file, integrated with the Unisphere application and virtualization storage wizards. (Note to Presenter: For the VNXe3150 and VNXe3100, this suite is included in the VNXe3100 base software.)Remote Protection Suite: External IP replication for file and block to other VNXe, VNX file/unified, or EMC Celerra platforms.Application Protection Suite: Provides consistent multi-volume replication and integration with messaging, database, and backup applications.Local Protection Suite File and block snapshots*Remote Protection Suite Disaster recovery and business continuityApplication Protection Suite Application-driven data protection* Included in the VNXe3150/3100 base software
17 VNXe Solution Packages Seven VNXe3150-based application solutions with streamlined orderingSimple, pre-packaged configurationsEasy to orderRapid time to productivityENTRY SystemVNXe Entry SystemVNXe TBVNXe3150, Single Processor6 x 300GB SAS DrivesENTRY SolutionsVNXe Capacity SolutionVNXe Application SolutionVNXe Consolidation SolutionVNXe TB Capacity SolutionVNXe3150, Dual Processor6 x 2TB NL SAS DrivesVNXe TB Application Solution6 x 600GB 15K RPM SAS DrivesVNXe TB Consolidation SolutionENHANCED SolutionsVNXe Database SolutionVNXe3150 – 24TB Capacity Solution12 x 2TB NL SAS DrivesVNXe3150 – 7.2TB Application Solution12 x 600GB 15K RPM SAS DrivesVNXe3150 – 7.5TB Database Solution25 x 300GB 10K RPM SAS DrivesEMC is offering six new packages designed to target the most common entry storage deployment environments, including storage consolidation, application deployment and application optimized database solutions. A dedicated, streamlined order interface means that partners can quickly deliver the right VNXe configuration and focus more of their errors on meeting customer needs for integration, deployment, and support.
18 Typical Customer Environment Server-based, direct-attached storageFile serversChallengesIslands of direct-attached storageSeparate storageSeparate managementInefficient use of storage capacityOver-provisioningCan’t share capacityData availability and protection is a major challengeStorage growth is expensive and complex to manageAdding capacity means adding serversEmployee desktopsMany organizations continue to rely on direct-attached or internal server storage as they scale their operations. However, growth exposes the issues with server and storage sprawl, and this approach does not prepare users to adopt and leverage the value of server virtualization.Note to Presenter: Click now in Slide Show mode for animation.Creating islands of direct-attached storage results in capacity that cannot be shared or quickly redeployed. Data is also stranded behind individual servers in case of a local outage.Since each application has its own storage resources, growth of each of these must be monitored and a separate buffer of spare capacity maintained.These organizations also continue to identify backup and data availability as two of their biggest headaches. Distributed backup resources or complex network backup schemes are needed to ensure coverage across distributed application storage.Ultimately, relying on server-based storage can actually contribute to greater server sprawl, as organizations exceed the disk capacity of individual servers and are forced to purchase additional servers to meet storage growth.WebserverSQLServerApplicationserverExchangeServerApplication servers
19 Typical Customer Environment Storage consolidationVNXe series benefitsFile serversConsolidate storage and managementPerformance and capacity poolsCentralize free capacityEliminate file serversReduce average capacity requirementsThin provisioning and file deduplication save up to 50 percentStreamline backup and recoveryAutomate centralized backupApplication-intelligent snapshot managementApplication best practicesEmployee desktopsNote to Presenter: View in Slide Show mode for animation.VNXe series unified storage provides users with the maximum degree of consolidation and protection for their business-critical storage assets.Capacity is easy to deploy, manage, and redeploy. Users can create pools of performance- and capacity-specific storage to meet their application needs, and manage them as shared resources across multiple environments.Note to Presenter: Click now in Slide Show mode for animation.With VNXe series’ unified storage model and centralized management of block and file storage, organizations can also eliminate the use of separate file servers and storage, with the added benefit of eliminating the administrative workload associated with managing and patching these additional systems.VNXe supports multiple advanced technologies for reducing capacity provisioning requirements, including thin provisioning, file-level deduplication, and data compression. These technologies can reduce the average capacity requirements over time by up to 50 percent when compared with server-based storage.With file and block data centralized on the VNXe series, organizations can gain control over data protection and backup, eliminating distributed tape devices and the network load associated with backup operations. In addition, system-based snapshots simplify backup of live applications and provide far more granular restore options.Backup and external replication is also built into the storage provisioning wizards, along with default best practices options, streamlining the task of setting up routine data protection and improving coverage.WebserverSQLServerApplicationserverExchangeServerApplication servers
20 Solution Packages Title Month Year These four bundles are a great way to quickly help customers reduce cost and complexity and can serve as the their foundation for the future.Print this slide to have ready for a discussion with customers and your sales team.SwitchSwitchSwitchSwitch
21 VNXe Series – General Positioning General ConsiderationsLikeliest choiceIT generalist, high ease of use, self service, IP only storage for Small & medium size businessesExtremely price sensitive, high availability is not an issueVNXe3150single SPPrice sensitive, small form factor, balanced performance, lower storage capacity requirementslow price, small form factor & HARobust & demanding applications, OLTP & users, highest performance per “u” of rack space, 100+ drives, >connectivity, 500+ UsersVNXe3300highest performance and scalabilityDesign Point
22 Make Your Storage Easier Simple. Efficient. Affordable.Designed for the IT generalistWizard-driven provisioningStreamlined ease of useIdeal for small and remote officesStarting at less than $9,300In summary, VNXe series unified systems are easy to deploy and use. They’re exceedingly robust, scalable, and solid, yet simple, efficient, and affordable.
23 COMPLIANCE and DISCOVERY IT Pressures20090.8 Zettabytes202035.2 ZettabytesDATA DELUGEBUDGET DILEMMAReaffirming last year’s IDC study is the fact that in 2010 there were 1.2 zetabytes of information. That’s a trillion billion—that’s a massive amount of information to be managed. It is estimated that by 2020, there will be 35 zetabytes of information. Data deluge will grow 44 times this decade. This growth is putting a strain on backup windows, storage costs, and management.IT organizations are not dissatisfied with the amount of money they’re spending on their IT, but rather with the fact that nearly 3 quarters, or 73%, goes to maintaining existing legacy systems, both infrastructure and applications. Customers complain about the complexity of their backup solutions. The feeling is that too much time, expertise and effort is spent keeping the current recovery system(s) afloat. They want the backup process to be less costly, need less supervision and administrative attention.Infrastructure is the foundation of IT so that’s what changes first. We saw in 2009, an important trend happen when virtual servers began being delivered in greater numbers than physical servers , i.e. we have reached a tipping point. Server virtualization changes the environment in both subtle and profound ways. In many shops today backup methods have been barely altered to accommodate this new virtual infrastructure so it’s important to lay the proper data protection foundation early on.And now customer must ensure compliance with regulatory and litigation requirements and can no longer rely on past traditional practice of simply keeping tape backups around for years and years.TransformationINFRASTRUCTURE SHIFTCOMPLIANCE and DISCOVERY
24 EMC DATA DOMAIN OVERVIEW Note to Presenter: Present to customers and prospects to provide them with an overview of EMC Data Domain.
25 Ongoing Evolution Backup Software Backup Storage DR Storage Archive TransformationalTraditionalBackupSoftwareTapeTapeTape/OpticalNote to Presenter: This is a build slide, advance the animation at the designation.So given the pressure on backup it is not surprising that a transition has been underway. First from tape to disk and then disk enabled deduplication backup solutions. Both new deduplication storage systems that work with existing backup software applications and, more increasingly, new end to end solutions that deliver new software and storage for backup and recovery. In fact, 48% of large enterprises have implemented deduplication according to a 2011 storage study by TheInfoPro.Archiving has undergone a significant transformation as well. Early archive solutions were based on tape and optical solutions, providing cost effective long-term storage of data, albeit with limited performance. Archiving played an increasing role in helping organizations reduce backup windows, with customers “archiving before backup” to help them reduce the amount of data to be backed up. Deduplication has effectively eliminated that requirement but archiving remains a valuable tool for addressing compliance and discovery requirements.BackupSoftwareVTLVTL/TapeDisk(Archive Before Backup)Backup SoftwareDeduplication StorageComplianceandDiscoveryDeduplication Backup Software and System
26 EMC Data Domain: Leadership and Innovation A history of industry firsts2003200420052006200720082009201020112012First long-term retention system for backup and archiveFirst deduplication NASFirst deduplication virtual tape libraryLargest deduplicationarrayFastest backupcontrollerAs you can see here, Data Domain systems have a history of leadership and innovation in the deduplication storage category—starting with the first deduplicated NAS storage system back in and spanning to 2012 when EMC introduced the first inline deduplication storage system for compliant archiving.First deduplicationvolume replicationFirst deduplicationdirectory replicationCascaded replicationFirst deduplication nearline storageFirst distributed processingFirst inline deduplication for compliant archiving
27 Deduplication Dramatically Reduces Storage Capacity Requirements 10–30 times less data stored versus fulls + incrementals with typical retention policies1020301515Weeks in UseData StoredDeduplication storageTraditional storageBackup can be an inefficient process that involves repetitively moving mostly the same data again and again. Deduplication dramatically reduces the amount of redundancy in backup storage and is defined as “the process of finding and eliminating duplication within sets of data.” The deduplication process uses well understood concepts such as cryptographic hashes and content-addressed storage. Only unique segments are stored along with metadata needed to reconstitute the original dataset.This chart gives you an indication of why nine out of 10 respondents to TheInfoPro Wave 15 Storage Study already have, or have plans for, deduplicated backup, and shows one angle on how to look at its impact.There are two points that are important to note here:First, the effect grows over time. The more redundant data that is stored, the greater the degree of deduplication effect between the amount stored by the backup software—the light blue area—and the amount of capacity used, which is the dark blue area on the bottom.Second, these numbers are based on a typical backup policy schedule of a full backup on a weekly basis. The amount of data reduction varies primarily on the basis of that policy and how long that data is kept. So the retention policy will guide the degree of deduplication more than any other factor.One thing is clear—the impact is significant.Note to Presenter: Details of the May 2011 release of TheInfoPro Wave 15 Storage Study can be found at this URL: theinfopro-f1000-enterprises-2011-storage-spend-continues-at-a-strong-pace/.TheInfoPro’s “Technology Heat Index” is widely regarded as effective measure of user “demand” for a technology, and from a vendor’s perspective, a good indicator of the relative size of the market opportunity.
28 With Data Domain Deduplication Storage Systems, You Can… Retain longerKeep backups onsite longer with less disk for fast, reliable restores, and eliminate the use of tape for operational recoveryReplicate smarterMove only deduplicated data over existing networks with up to 99% bandwidth efficiency for cost-effective disaster recoveryRecover reliablyContinuous fault detection and self- healing ensure data recoverability to meet service level agreementsNote to Presenter: View in Slide Show mode for animation.Let’s look at what kind of transformational advantages you’ll get from Data Domain.You’ll be able to:Retain backups longer. By reducing data amounts by 10 to 30 times, you can keep backups onsite longer using less disk for fast, reliable restores, and eliminate the use of tape for operational recovery.Replicate smarter. Move only deduplicated data over existing networks for up to 99 percent bandwidth efficiency and cost-effective disaster recovery.Recovery reliably from disk. With continuous fault detection and system self-healing, you can ensure that data is recoverable and easily meet service level agreements.WAN
29 Data Domain Basics Easy integration with existing environment Control TierBackup and Archive ApplicationsEMCSymantecCommVaultIBMHPVeeamQuestTarget TierDisaster Recovery TierCIFS, NFS,NDMP, DD BoostEthernetVirtual TapeLibrary (VTL) overFibre ChannelNow I’ll introduce you to the Data Domain storage system and move from the outside in.This is a picture of what you would see in a Data Domain deployment. A Data Domain appliance is a storage system with shelves of disks and a controller. It’s optimized, first to back up and second to archive applications, and supports most of the industry-leading backup and archiving applications.I’ll talk primarily about backup in this discussion, and get to archiving later in the presentation. The list on the left is composed primarily of leading backup applications—not only EMC’s offerings with EMC NetWorker, but also Symantec, CommVault, and so on…even niche vendors like Veeam for VMware.On the way into the storage system, data can pass through either Ethernet or Fibre Channel. With Ethernet, it can use mass protocols and NFS or CIFS; it can also use optimized protocols or products, such as Data Domain Boost, a custom integration with leading backup applications.After the data is stored and it’s deduplicated during the storage process, it can replicate for disaster recovery. Only the compressed deduplicated unique data segments that have been filtered out through the right process on the target tier are replicated.ReplicationDD890 applianceDD890 appliance
30 Data Deduplication: Technology Overview Store more backups in a smaller footprintFriday Full BackupABCDEFGBackup EstimatedData Logical Reduction PhysicalFRIDAY FULL 1 TB 2–4x 250 GBMon IncrementalABHMonday Incremental 50 GB 7–10x 5 GBTues IncrementalCBIA technology overview of data deduplication will help illustrate how you can store more backups in a smaller footprint with Data Domain.Note to Presenter: Click now in Slide Show mode for animation.On Friday, the backup application initiates the first full backup of 1 TB, but only 250 GB is stored on Data Domain. This occurs because as the data stream is coming into Data Domain, the system is deduplicating before storing data to disk. On average this results in a two- to four-times reduction in data on a first full backup.Over the course of the week, 50 GB daily incremental backups result in a seven- to 10-times reduction and only require 5 GB to be stored. As the graphic on the left shows, during the week incremental backups contain data that was already protected from the first full backup.Finally, on the second Friday, the second full backup contains almost all redundant data. Therefore of the 1 TB backup dataset, only 18 GB needed to be stored.In total over the course of a week, 2.2 TB of data was backed up to Data Domain, but the system only required 288 GB of capacity to protect this dataset. Overall, this resulted in a 7.6- times reduction in one week.Tuesday Incremental 50 GB 7–10x 5 GBWeds IncrementalEGJWednesday Incremental 50 GB 7–10x 5 GBThurs IncrementalACKThursday Incremental 50 GB 7–10x 5 GBSecond Friday Full BackupBCDEFLGHSecond FRIDAY FULL 1 TB 50–60x 18 GBTOTAL 2.2 TB 7.6x 288 GBABCDEFGHIJKL
31 Retain: Store More for Longer with Less Over one year of retention in 3U of Data Domain deduplication storageBackup Cumulative Estimated PhysicalData Logical ReductionFirst Full 1 TB 4x 250 GBWeek 1April TB 8x 288 GBNote to Presenter: View in Slide Show mode for animation.If you extend this scenario out to four months of backups, you’ll see how you could retain more backups longer with less disk by eliminating redundant data from your backup stream and reduce the necessary amount of backup storage. By doing this, you’ll be able change the economics of using disk, eliminating or minimizing the use of tape for operational recovery.This chart shows the dramatic reduction in storage required for backups. Just like the previous slide, the first column is the type of backup data—the first full backup, full backups accumulated after week one, week two, and all the way through to month four in a four-month retention policy.The cumulative logical column is next and shows you how much data has been protected and would be stored without deduplication. Then there’s the estimated reduction from deduplication in the third column, with the last column representing the actual physical storage used with Data Domain.As you can see, at the end of three months, you’ve protected the equivalent of 15.4 TB of backups but only used 706 GB of disk—a 21-times reduction. Or viewed differently, the three- month deduplicated total is 50 percent less than the single week total using non-deduplicated storage. This dramatic impact shows you why so many companies have redesigned their backup around disk-optimized storage.Week 2April TB 10x 326 GBWeek 3April TB 13x 364 GBMonth 1April TB 14x 402 GBMonth 2May TB 19x 554 GBMonth 3June TB 21x 706 GBTOTAL 15.4 TB 21x GB
32 DD160 Appliance For small enterprise data centers and remote offices Up to 4.6-times more capacity than DD140Up to 195 TB logical capacityUp to 3.98 TB usable capacityUp to 2.2-times faster than DD140*Up to 1.1 TB/hr aggregate write throughputSupport for DD VTLSame stream counts as DD140Up to 16 backup write and 4 backup read streamsUp to 15 source and 20 destination replication streamsData Domain Replicator included in system price* Using DD Boost
33 DD160 Appliance Features Single socket, dual-core Xeon processor Two capacity configurations7x 500 GB HDDs: 1.6 TB usable12x 500 GB HDDs: 3.98 TB usableField upgrade from 7 HDD to 12 HDDTwo I/O slots for data access connectivityUp to two dual-port 1 GbE NICs, copper or opticalUp to two quad-port 1 GbE NICs, copperUp to one dual-port 4 Gb Fibre Channel VTL HBASimultaneous usage of NIC and VTL cardsSingle 1 GB NVRAM card
35 DD Boost SoftwareDistributes parts of deduplication process to backup server or application clientsSpeeds backups by up to 50 percentEnables more efficient resource utilizationProvides application control of Data Domain replication processSupports majority of backup software market and native utilities in industry leading databasesEMC Avamar and NetWorkerSymantec NetBackup and Backup ExecEMC Greenplum and Oracle RMANQuest vRangerDDBoostIn the traditional backup world, backup software is backup software, and storage is storage. DD Boost software distributes part of the deduplication process out of the Data Domain system and onto the backup server or application clients. This makes the backup network more efficient, makes Data Domain systems 50 percent faster, and makes the whole aggregate system more manageable. It works across the entire Data Domain product line and supports the majority of the backup market and now it also supports native utilities in industry leading databases.
36 Additional Data Domain Software Options Data Domain Retention LockSecure data retention for file and archive dataSatisfies internal governance and compliance regulationsData Domain ReplicatorNetwork-efficient and encryptedConsolidate up to 270 remote sites into a single systemData Domain Virtual Tape LibraryEasily integrates with Fibre ChannelSupports open systems and IBM i operating environmentsData Domain Extended RetentionLong-term retention of backup dataUp to 65 PB logical capacityIn addition to DD Boost, EMC offers four additional Data Domain software options that can enhance the value of a Data Domain system in your environment.Note to Presenter: Click now in Slide Show mode for animation.The first is DD Retention Lock software enables you to easily implement deduplication with file locking to satisfy IT governance and compliance standards including SEC 17a-4(f) for archive data.Next is DD Replicator software, which provides fast, network-efficient , encrypted replication for disaster recovery, remote office data protection, multi-site tape consolidation, and long-term offsite retention. DD Replicator asynchronously transfers only the compressed, deduplicated data over the WAN, making network-based replication cost-effective, fast, and reliable. In addition, you can replicate up to 270 remote sites into a single Data Domain system for consolidated protection of your distributed enterprise.Next, DD Virtual Tape Library software, which eliminates tape-related failures by enabling all Data Domain systems to emulate multiple tape devices over a Fibre Channel interface. This software option provides easy integration of deduplication storage in open systems and IBM i environments.Next is DD Extended Retention software, which enables long-term retention of backup data on the DD860 or DD990 with up to 65 PB of logical capacity.Finally, DD Encryption software protects backup and archive data stored on Data Domain systems with encryption that is performed inline—before the data is written to disk. Encrypting data at rest satisfies internal governance rules and compliance regulations and protects against theft or loss of a physical system. The combination of inline encryption and deduplication provides the most secure data-at-rest encryption solution available.Data Domain EncryptionInline encryption of data at restProtects against theft or loss of a physical system
37 Industry’s Most Scalable Inline Deduplication Systems Data Domain Software OptionsLarge EnterpriseDD BoostDD EncryptionDD Extended RetentionDD ReplicatorDD Retention LockDD Virtual Tape LibraryMidsize EnterpriseSmall Enter./ROBOHere’s a look at the latest Data Domain product family including the new DD990. The capabilities previously available in a DD Archiver are now only available with the ‘DD Extended Retention software option’ on two platforms – as you can see the capacity supported for the DD860 and DD990 now includes a line dedicated to DD Extended Retention.DD160DD620DD640DD670DD860DD890DD990Speed (DD Boost)1.1 TB/hr2.4 TB/hr3.4 TB/hr5.4 TB/hr9.8 TB/hr14.7 TB/hr31.0 TB/hrSpeed (other)667 GB/hr2.3 TB/hr3.6 TB/hr5.1 TB/hr8.1 TB/hr15.0 TB/hrLogical capacity40–195 TB83–415 TB0.32–1.6 PB0.6–2.7 PB1.4–7.1 PB 5.7–28.5 PB12.9–14.2 PB5.7–28.5 PB 13– 65 PB1Usable capacityUp to 3.98 TBUp to 8.3 TBUp to 32.2 TBUp to 55.9 TBUp to 142 TB Up to 570 TB1Up to 285 TBUp to 570 TB Up to 1.3 PB11 With DD Extended Retention software option
38 Why Data Domain? Less disk to resource, less to manage CPU-centric deduplicationInline deduplicationSimple, mature, and flexibleSimple, mature applianceAny fabric, any software, backup or archive applicationsResilience and disaster recoveryStorage of last resortFast time-to-disaster recovery (DR) readinessCross-site global compressionData center or remote officeWhy Data Domain?To summarize, it starts from economics. There’s less disk to resource and less to manage. The CPU-centric deduplication approach of SISL Scaling Architecture allows the system to be simpler to manage as well as easier to provision and green.In addition, Data Domain is more mature and flexible than most of its competitors. Data Domain has been sold longer, and all the problems that most of EMC’s competitors are just starting to discover have been fixed. It works as advertised, and that alone is highly differentiated in this particular category.Finally, because of their resilience and replication flexibility, Data Domain systems not only work as advertised but work reliably.
40 What is deduplicationData deduplication (often called "intelligent compression") is a method of reducing storage needs by eliminating redundant data.Only one unique instance of the data is actually retained on storage media, such as disk or tape.Redundant data is replaced with a pointer to the unique data copy.Data deduplication can generally operate at the file, block, and even the bit level.
41 Data deduplication: HOW Data Domain OverviewSeptember 2009Data deduplication: HOWGranularity:File level = Single Instance StoreEMC CenteraEMC CelerraSub-File level = Block levelFixed block lengthe.g. Some EMC CompetitorsVariable block length (dynamic)EMC AvamarEMC Data Domain
42 Data de-duplication: WHERE Data Domain OverviewSeptember 2009Data de-duplication: WHERESourceClient software agents identify repeated sub-file data segments at the sourceOnly new, unique segments are transferred across the network and stored to disk during backup operationsBenefits include shorter backup window and lower bandwidth requirementsTargetBackup application sends native data to a target storage deviceData is de-duplicated once it reaches the targetDe-duplication can happen during or after backupTransparency to backup application offers users a “plug and play” experienceBackup de-duplication can occur in two main places – at the data source or the backup target. The right de-duplication technology and strategy will depend on several factors, including the use case, service level requirements, and what’s currently implemented in the environment.With source-based de-duplication, data is de-duplicated as the backup process begins and before the data is sent over the network to be stored. This provides the benefit of shorter backup windows and lowered bandwidth requirements, making it ideal for remote or WAN- based backup, VMware, large file servers, and other environments where the backup process is hampered by network or other resource bottlenecks. Typically, these are environments where traditional backup software is unable to meet business or technical objectives.For target de-duplication offerings the main challenge being addressed is the growth of back-end storage. The backup application sends data to the target storage device and the data is de-duplicated at the device, either immediately or at a scheduled time. It is found in VTLs and LAN backup to disk appliances or platforms and provides the benefit of plug and play with existing backup applications. This is ideal for customers generally satisfied with their backup software and how it performs and are not experiencing bottlenecks in getting the data to the backup storage device.There is a clear need for both offerings to address the range of customer requirements – and only EMC can offer both. More on this later.DE-DUPLICATION AT SOURCEDE-DUPLICATION AT TARGETNetworkNetwork
43 Data de-duplication: WHEN Data Domain OverviewSeptember 2009Data de-duplication: WHENWhile the backup is Running – InlineContent is de-duplicated while backup is happeningOnly possible if de-duplication is fast enoughIdeal for capacity optimizationAfter all or some backup is complete – ScheduledContent is initially stored in original format, then de-duplicated at a later timeDe-duplication can start as soon as a subset of the backup completes, or once the whole backup finishesNeeds careful planning of activitiesNote to Presenter: This slide highlights an EMC Disk Library data de-duplication differentiator – having the choice to de-duplicate immediately or scheduled.For Target De-duplication technologies, there can be different times when the de-duplication can occur – as the backup is happening, or immediate, or after all or some of the backup is complete, or scheduled. Immediate data de-duplication is good for when the backup window isn’t a limiting factor and is good because it requires the least amount of storage capacity.Scheduled de-duplication allows data to be ingested first, then de-duplicated at a later time – either as soon as a subset of the data is backed up or when the whole backup finishes. This method is ideal for situations where backup windows are tight and optimal, fast backup to disk is required.Why is this important to understand? Because most target de-duplication solutions only offer one or the other. Wouldn’t it be great to have both and choices based on the various requirements? Well EMC backup to disk solutions can provide both – and we’ll also get to this soon as well.INLINE DE-DUPLICATIONSCHEDULED DE-DUPLICATIONDeduplicationStoreDeduplication
44 De-Duplication Ratio vs Disk Space Savings Data Domain OverviewSeptember 2009De-Duplication Ratio vs Disk Space SavingsIt doesn’t take a very high de-duplication ratio to save a lot of diskDedup RatioDisk Savings1:10%2:150%3:167%4:175%5:180%6:183%7:186%8:187%9:189%10:190%20:195%50:198%100:199.0%500:199.8%
45 How do I determine if data will dedupe well? Factors that affect dedupe ratios:Dedupes wellDoes NOT Dedupe wellRetention Policy**> 2 Weeks Retention< 2 Weeks RetentionData TypeFile systems, , databasesDatabase logs, Scientific data, video streamsClient side data modificationNo Encryption / No Compression /No MultiplexingEncryption / Compression / Multiplexing** NOTE: Increasing retention policy may enable a higher dedupe ratio