Data & Storage Services CERN IT Department CH-1211 Genève 23 Switzerland www.cern.ch/i t DSS Bit preservation cost outlook: Cost for 10, 20, 30 years archive.

Slides:



Advertisements
Similar presentations
TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Advertisements

You have been given a mission and a code. Use the code to complete the mission and you will save the world from obliteration…
Advanced Piloting Cruise Plot.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Working with MS-ACCESS IS 240 – Database Management Lecture #2 – Assoc. Prof. M. E. Kabay, PhD, CISSP Norwich University
Chapter 1 The Study of Body Function Image PowerPoint
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 1 Embedded Computing.
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
Author: Julia Richards and R. Scott Hawley
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
UNITED NATIONS Shipment Details Report – January 2006.
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
Summary of Convergence Tests for Series and Solved Problems
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Addition Facts
Year 6 mental test 10 second questions
Data & Storage Services CERN IT Department CH-1211 Genève 23 Switzerland t DSS CASTOR Status Alberto Pace.
Negative Numbers What do you understand by this?.
2010 fotografiert von Jürgen Roßberg © Fr 1 Sa 2 So 3 Mo 4 Di 5 Mi 6 Do 7 Fr 8 Sa 9 So 10 Mo 11 Di 12 Mi 13 Do 14 Fr 15 Sa 16 So 17 Mo 18 Di 19.
ZMQS ZMQS
1 Copyright Copyright 2012.
Solve Multi-step Equations
REVIEW: Arthropod ID. 1. Name the subphylum. 2. Name the subphylum. 3. Name the order.
1 Disks Introduction ***-. 2 Disks: summary / overview / abstract The following gives an introduction to external memory for computers, focusing mainly.
ABC Technology Project
EU Market Situation for Eggs and Poultry Management Committee 21 June 2012.
Green Eggs and Ham.
VOORBLAD.
15. Oktober Oktober Oktober 2012.
1 Breadth First Search s s Undiscovered Discovered Finished Queue: s Top of queue 2 1 Shortest path from s.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
BIOLOGY AUGUST 2013 OPENING ASSIGNMENTS. AUGUST 7, 2013  Question goes here!
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
Squares and Square Root WALK. Solve each problem REVIEW:
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
© 2012 National Heart Foundation of Australia. Slide 2.
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
Addition 1’s to 20.
Model and Relationships 6 M 1 M M M M M M M M M M M M M M M M
25 seconds left…...
Subtraction: Adding UP
Equal or Not. Equal or Not
Slippery Slope
H to shape fully developed personality to shape fully developed personality for successful application in life for successful.
Januar MDMDFSSMDMDFSSS
Week 1.
Analyzing Genes and Genomes
We will resume in: 25 Minutes.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Intracellular Compartments and Transport
A SMALL TRUTH TO MAKE LIFE 100%
PSSA Preparation.
Essential Cell Biology
1 Chapter 13 Nuclear Magnetic Resonance Spectroscopy.
Energy Generation in Mitochondria and Chlorplasts
CpSc 3220 Designing a Database
CERN IT Department CH-1211 Genève 23 Switzerland t The Wigner Data Centre An Extension to the CERN Data Centre.
HEPiX bit-preservation WG update – Spring 2014 Dmitry Ozerov/DESY Germán Cancio/CERN HEPiX Spring 2014, Annecy.
Presentation transcript:

Data & Storage Services CERN IT Department CH-1211 Genève 23 Switzerland t DSS Bit preservation cost outlook: Cost for 10, 20, 30 years archive DPHEP Topical Workshop on "Full Costs of Curation Jan 13/ Germán Cancio Tapes, Archives and Backup CERN IT-DSS

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 2 Overview What is the approximate cost of a data archive over 10, 20 and 30 years? Generic archive (not necessarily at CERN) Start from scratch in terms of HW/media, with some initial data to be added Consider hardware, media, maintenance and electrical power costs 3 base scenarios a) 10 PB initially, 50PB / year b) 10 PB initially, 50PB +15% / year c) 100 PB initially, no further data (stable large archive preservation)

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 3 Assumptions / limitations (1) Archive is tape based with a disk cache front-end –Single copy of data on tape Archived data is not compressible / deduplicable, tapes working at 100% capacity Access patterns: –write w/o high deletion –read of ~30% of archive/year, high latency for non-cached data Model based on 3 year cycles (10 cycles = 30 years) –Corresponding to HW generations and warranty lifetime –After each cycle, all disk cache servers and tape drives are replaced by new generation equipment Tape media is kept for 2 cycles –Enterprise-class equipment (IBM or Oracle, LTO discarded) –All media repacked to higher density on second cycle Disk cache capacity for 10% of the archive –No replication (JBOD or RAID0) –Disk cache used for data influx, reading, repacking Duty cycle of 30% for both disk and tape servers –Relevant for power consumption

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 4 Assumptions / limitations (2) Technology evolution forecast risky for 30 years –Model assumes no architecture paradigm shift (tape/disk) –Forecasts may hold true for 5 years, but longer-term extrapolation is risky –Will cloud storage affect storage capacity/pricing evolution? But, assuming similar storage capacity growth rates as over the last 30 years, archive cost becomes almost insignificant after 20 years Example: TODAY, CERNs 100PB archive requires 11.7K new- generation tapes 8.5TB each) With 11.7K tapes, what were we able to store in the past? –10 years ago (2004): 200GB -> 2.4 PB -> 277 of todays tapes –20 years ago (1994): 20 GB -> 235 TB -> 28 –30 years ago (1984): 200MB -> 2.35 TB -> less than one of todays tapes!!!

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 5 Pricing mostly based on USD prices for a public US contracting alliance –Including educational discount Manpower costs not included –Estimations: 1FTE (engineer) + 0.5FTE (technician) for disk; 2 FTE (engineer) FTE (technician) for tape Software development / licensing costs not included General DC operations / floor space cost not included No assumptions on HW/media resale –Outdated / redundant HW/media is just decommissioned No inflation / interest rates; payments done upfront Assumptions / limitations (3)

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 6 Technology evolution Assuming –+20% yearly disk capacity per constant $ –+30% yearly tape capacity per constant $ (Note: log scale)

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 7 Technology evolution(2) Assuming –+20% yearly disk capacity per constant $ –+30% yearly tape capacity per constant $ 2014: 4TB 2024: 20TB 2034: 106TB2044: 550TB 2014: 8TB 2024: 85TB 2034: 900TB 2044: 9.5PB(!)

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 8 XLS spreadsheet Available on Indico page (link)link 1 tab for global parameters 1 tab for each scenario –Including graphs (scrolling down) Green fields == input data Please try it out and feed back

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 9 Global parameters

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 10 Case A) steady growth Start with 10PB, then +50PB/year (150PB / 3y period)

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 11 Case A) steady growth

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 12 Case A) steady growth Total cost: ~31.6M$ (~1M$ / year)

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 13 Case A) steady growth

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 14 Case B) increasing archive growth Start with 10PB, then +50PB/year, then +50% every 3y (or +15% / year)

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 15 Case B) increasing archive growth

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 16 Total cost: ~59.9M$ (~2M$ / year) Case B) increasing archive growth

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 17 Case B) increasing archive growth

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 18 Case C) stable large archive Start with 100PB, do not add any data

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 19 Case C) stable large archive

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 20 Total cost: ~12.3M$ (400K$ / year) Case C) stable large archive

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 21 Case C) stable large archive

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 22 References INSIC Tape Roadmap, A TCO analysis for Tape and Disk, The Clipper Group, Enterprise Tape for Archival Storage?, The Clipper Group, Year Archive Requirements Survey, Bit Preservation: A Solved Problem?, D. Rosenthal, Stanford University, Oracle - Western States Contracting Alliance price list html html

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 23 Discussion

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 24 Reserve material

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 25

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 26

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 27

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services DSS 28