Demystifying Deduplication


1 Demystifying Deduplication

2 Deduplication eliminates redundant copies of data
What is deduplication?
Deduplication eliminates redundant copies of data by using pointers that map duplicate files or blocks to a single stored object.
Approach:
  Eliminate redundant data
  Start with the backup environment as the first phase
  Maintain references to single instances of data across the data store
Deduplication can decrease disk capacity requirements by up to 98% and decrease bandwidth requirements for data transfer by up to 50 times.
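The pointer idea on this slide can be sketched in a few lines. This is a minimal illustration, not any vendor's implementation: the `DedupStore` class, the file names, and the 4-byte block size are all hypothetical, chosen only to keep the example small.

```python
import hashlib


class DedupStore:
    """Toy content-addressed store (hypothetical): each unique block is kept once."""

    def __init__(self, block_size=4):
        self.block_size = block_size
        self.blocks = {}   # digest -> block bytes (the single stored instance)
        self.recipes = {}  # object name -> list of digests (the "pointers")

    def put(self, name, data):
        digests = []
        for i in range(0, len(data), self.block_size):
            block = data[i:i + self.block_size]
            key = hashlib.sha256(block).hexdigest()
            # Store the block only if it has not been seen before.
            self.blocks.setdefault(key, block)
            digests.append(key)
        self.recipes[name] = digests

    def get(self, name):
        # Rebuild the object by following its pointers.
        return b"".join(self.blocks[key] for key in self.recipes[name])

    def logical_bytes(self):
        return sum(len(self.get(name)) for name in self.recipes)

    def physical_bytes(self):
        return sum(len(block) for block in self.blocks.values())


store = DedupStore(block_size=4)
store.put("monday.bak", b"AAAABBBBCCCCDDDDEEEE")   # blocks A B C D E
store.put("tuesday.bak", b"AAAABBBBCCCCDDDDFFFF")  # blocks A B C D F

ratio = store.logical_bytes() / store.physical_bytes()
saved = 1 - 1 / ratio
print(f"logical={store.logical_bytes()} physical={store.physical_bytes()} "
      f"ratio={ratio:.2f}:1 saved={saved:.0%}")
```

The same formula, saved = 1 - 1/ratio, links the two figures on the slide: a 50:1 ratio corresponds to 98% less capacity.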

3 Dell’s point of view on deduplication
Data deduplication is a capacity optimization feature, not a capacity optimization solution
Need to understand what problem you are trying to fix
Dell can help find the right solution to your storage challenges
As deduplication matures it will be ubiquitous across a wide range of storage products
Deduplication integrated into software functionality provides the greatest benefits
Deduplication technology will expand beyond backup to include static archive data and inactive primary data

4 Deduplication – Confusion abounds
Different architectures, different technologies:
  Source
  Target
  Single Instance Storage
  VTL
  File
  Block
  Sub-block
  Inline Processing
  Post Processing

5 Unique data saved to disk
Types of deduplication
[Figure: Data object #1 and Data object #2 are made up of blocks A to F; only the unique data is saved to disk. Axis label: Disk Capacity Required.]
Data deduplication eliminates common data at a file, block, or sub-block level.
File (aka Single Instance Store or SIS)
  Typically primary storage
Block (aka sub-file or fixed block)
  Typically secondary storage
  Better dedupe ratios
Sub-block (aka variable block or byte-level)
  Best dedupe ratios
  Most processor intensive
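To make the three granularities concrete, the sketch below stores the same three objects at file, fixed-block, and variable-block level and counts the bytes kept. Everything here is an assumption for illustration: the sample data, the 4-byte block size, and especially the boundary rule in `variable_blocks`, which cuts after a `;` byte as a stand-in for the rolling-hash boundary test that real sub-block products typically use.

```python
import hashlib


def whole_file(data):
    # File level (SIS): the whole object is the unit of comparison.
    return [data]


def fixed_blocks(data, size=4):
    # Block level: cut at fixed offsets.
    return [data[i:i + size] for i in range(0, len(data), size)]


def variable_blocks(data, marker=b";"):
    # Sub-block level (toy rule): cut wherever the content says so,
    # so boundaries move with the data instead of with the offset.
    chunks, start = [], 0
    for i in range(len(data)):
        if data[i:i + 1] == marker:
            chunks.append(data[start:i + 1])
            start = i + 1
    if start < len(data):
        chunks.append(data[start:])
    return chunks


def stored_bytes(objects, chunker):
    # Keep one copy of each distinct chunk and total the sizes.
    unique = {}
    for data in objects:
        for chunk in chunker(data):
            unique[hashlib.sha256(chunk).hexdigest()] = len(chunk)
    return sum(unique.values())


original = b"alpha;bravo;charlie;delta;echo;"
appended = original + b"foxtrot;"  # new data added at the end
shifted = b"X" + original          # one byte inserted at the front
objects = [original, appended, shifted]

file_level = stored_bytes(objects, whole_file)
fixed_level = stored_bytes(objects, fixed_blocks)
variable_level = stored_bytes(objects, variable_blocks)
print(file_level, fixed_level, variable_level)  # prints: 102 74 46
```

Fixed blocks recover the shared prefix of the appended object, but the one-byte insertion shifts every later block boundary, so nothing in `shifted` matches. Content-defined boundaries survive that shift, which is the reason sub-block methods reach the best ratios at a higher processing cost.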

6 Deduplication Enables Cost Effective Disk To Disk Backup
Shorten the backup window and restore faster with backup to disk (B2D), at a cost similar to tape
Reduce storage capacity, power, cooling and space requirements
Centralize data protection and archive, reducing the burden on offices not staffed to manage it
Enable cost-effective disaster recovery (DR)
[Diagram: Backup to Disk with Dedupe. Labels: Primary Disk, Deduplication, Replication, Backup, Archive, Secondary Disk.]

7 Why optimize disk-based backup with deduplication?
Example: 20 TB of data growing 20% per year

                                           B2D without      B2D with deduplication
                                           deduplication    (10:1 ratio)
Total capacity needed after 3 years (TB)   34.6             3.46
Number of drives needed                    36               5
Total storage cost                         ~$34k            ~$8k

How long until the deduplicated storage capacity required equals the 3-year capacity without dedupe? Just over 15 years.
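The capacity figures on this slide can be checked with a short calculation. The sketch assumes compound growth of 20% per year and a constant 10:1 ratio, so the deduplicated footprint starts at 2 TB and grows at the same rate; the variable names are illustrative only. The drive counts and costs depend on drive sizes and prices that the slide does not state, so they are not reproduced here.

```python
import math

initial_tb = 20.0    # starting data set, from the slide
growth = 0.20        # 20% growth per year, from the slide
dedupe_ratio = 10.0  # 10:1 ratio, from the slide

# Capacity after 3 years without and with deduplication.
raw_after_3y = initial_tb * (1 + growth) ** 3
dedupe_after_3y = raw_after_3y / dedupe_ratio

# Years until the deduplicated footprint reaches the 3-year raw capacity:
# (initial_tb / dedupe_ratio) * (1 + growth) ** n = raw_after_3y
years = math.log(raw_after_3y / (initial_tb / dedupe_ratio)) / math.log(1 + growth)

print(f"without dedupe: {raw_after_3y:.2f} TB")
print(f"with dedupe:    {dedupe_after_3y:.3f} TB")
print(f"break-even:     {years:.1f} years")
```

Under these assumptions the break-even point comes out between 15 and 16 years, consistent with the slide's "just over 15 years".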

8 How deduplication fits into the backup environment
[Diagram: Application Servers, Backup Server, then either a Deduplication Appliance or JBOD/NAS/SAN. Deduplication can happen at either of two points, each marked "Deduplication here".]
Server-based (Source) & Integrated (Hybrid)
  Advantages:
    Common management
    Ease of use
    Can be a less expensive solution (lower TCO)
    Reduces network traffic
    Global deduplication opportunity
Appliance-based (Target)
  Advantages:
    Ease of implementation
    Works with a variety of backup software
  Disadvantages:
    Can be a more expensive solution
    Replication target restrictions
    Greater network traffic overhead
    Often on proprietary hardware
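The network-traffic difference between the two placements can be illustrated with a toy model. This is a sketch under stated assumptions, not a description of any product's protocol: the `DedupTarget` class, the 4096-byte block size, and the "send digests first, then only missing blocks" exchange are all hypothetical.

```python
import hashlib

BLOCK_SIZE = 4096   # assumed block size
DIGEST_SIZE = 32    # SHA-256 digest length in bytes


def split(data):
    return [data[i:i + BLOCK_SIZE] for i in range(0, len(data), BLOCK_SIZE)]


class DedupTarget:
    """Hypothetical deduplicating store at the backup target."""

    def __init__(self):
        self.blocks = {}

    def missing(self, digests):
        return [d for d in digests if d not in self.blocks]

    def store(self, block):
        self.blocks[hashlib.sha256(block).digest()] = block


def target_side_backup(data, target):
    """Ship everything; the appliance dedupes on arrival. Returns bytes sent."""
    for block in split(data):
        target.store(block)
    return len(data)


def source_side_backup(data, target):
    """Ship digests first, then only the blocks the target lacks. Returns bytes sent."""
    by_digest = {hashlib.sha256(b).digest(): b for b in split(data)}
    sent = DIGEST_SIZE * len(by_digest)
    for d in target.missing(list(by_digest)):
        target.store(by_digest[d])
        sent += len(by_digest[d])
    return sent


night1 = b"A" * 4096 + b"B" * 4096 + b"C" * 4096
night2 = night1 + b"D" * 4096  # one new block since the previous night

appliance, server_based = DedupTarget(), DedupTarget()
target_traffic = sum(target_side_backup(n, appliance) for n in (night1, night2))
source_traffic = sum(source_side_backup(n, server_based) for n in (night1, night2))
print(target_traffic, source_traffic)  # prints: 28672 16608
```

Both targets end up storing the same four unique blocks. The difference is only in what crosses the network, which is the "reduces network traffic" advantage listed for the server-based approach.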

