22 September 2017, ESA/ESRIN - Frascati

Slides:



Advertisements
Similar presentations
Data Storage Solutions Module 1.2. Data Storage Solutions Upon completion of this module, you will be able to: List the common storage media and solutions.
Advertisements

XenData SX-520 LTO Archive Servers A series of archive servers based on IT standards, designed for the demanding requirements of the media and entertainment.
XenData SX-10 LTO Archive Appliance An Archive Appliance based on IT standards, designed for the demanding requirements of the media and entertainment.
OCLC Digital Archive: Creating Long Term Access to Digital Masters Roberta Gebhardt, Montana Historical Society Research Center Sarah McHugh, Montana State.
Sony Pictures Digital Backbone From the lens to the screen.
DRS 2 one in a series of periodic updates Harvard University Library Andrea Goethals October 21, 2009 DRS = Digital Repository Service.
SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA, SAN DIEGO Disk and Tape Storage Cost Models Richard Moore & David Minor San Diego Supercomputer.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
11© 2011 Hitachi Data Systems. All rights reserved. HITACHI DATA DISCOVERY FOR MICROSOFT® SHAREPOINT ® SOLUTION SCALING YOUR SHAREPOINT ENVIRONMENT PRESENTER.
Chronopolis: Preserving Our Digital Heritage David Minor UC San Diego San Diego Supercomputer Center.
Evolution of Enterprise Services in the Statistics Canada IT Environment Silver Buckler Chief, Managed Storage Section Informatics Technology Services.
Servers Redundant Array of Inexpensive Disks (RAID) –A group of hard disks is called a disk array FIGURE Server with redundant NICs.
Harvard’s Digital Repository Service (DRS) Architecture Harvard University Library (HUL) Andrea Goethals, Randy Stern December 10, 2009.
XenData Digital Archives Simplify your video archive workflow XenData LTO Video Archive Solutions Overview © Copyright 2013 XenData Limited.
Costing. Life 2 Model Research (JISC Project) Application to whole operation of Library Attempts to cost common factors Does not take into account certain.
© 2011 IBM Corporation Smarter Software for a Smarter Planet The Capabilities of IBM Software Borislav Borissov SWG Manager, IBM.
Johannes Spitzbart Phonogrammarchiv, Austrian Academy of Sciences Österreichische Tage der Digitalen Geisteswissenschaften save the data - workshop on.
Meeting the Data Protection Demands of a 24x7 Economy Steve Morihiro VP, Programs & Technology Quantum Storage Solutions Group
1 © 2010 Overland Storage, Inc. © 2012 Overland Storage, Inc. Overland Storage The Storage Conundrum Neil Cogger Pre-Sales Manager.
Access Across Time: How the NAA Preserves Digital Records Andrew Wilson Assistant Director, Preservation.
Agenda 11:00 am (3D Tech Center, Capra 116) Introduction to 3D at Sony and The Digital Backbone Project Chris Cookson President, Sony Pictures Technologies.
GStore: GSI Mass Storage ITEE-Palaver GSI Horst Göringer, Matthias Feyerabend, Sergei Sedykh
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan, Florida Center for Library Automation DCC Workshop on Long-term Curation within Digital Repositories.
ASI-Eumetsat Meeting Matera, 4-5 Feb CNM Context Matera, February 4-5, 20092ASI-Eumetsat Meeting.
Archival Workshop on Ingest, Identification, and Certification Standards Certification (Best Practices) Checklist Does the archive have a written plan.
CLASS Information Management Presented at NOAATECH Conference 2006 Presented by Pat Schafer (CLASS-WV Development Lead)
Persistent Digital Archives and Library System (PeDALS)
OAIS: From Requirements to Reality at OCLC FLICC / CENDI Symposium, Dec Pam Kircher Product Manager, Digital Archive OCLC Digital & Preservation.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Enterprise Solutions Chapter 10 – Enterprise Content Management.
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan Florida Center for Library Automation (FCLA)
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
ESA Report to DAI/IPR WG Gian Maria Pinna DAI/IPR meeting Toulouse 2-5 November 2004.
Fedora and the Preservation of University Electronic Records Project NHPRC Electronic Records Research Grant Kevin L. Glick Manuscripts and Archives, Yale.
Preserving Electronic Mailing Lists as Scholarly Resources: The H-Net Archives Lisa M. Schmidt
DAITSS and the Florida Digital Archive Priscilla Caplan Florida Center for Library Automation iPRES 2006.
Digital Production Pipeline
August 28, 2003APAN, Logistical Networking WS DiDaS Distributed Data Storage Ludek Matyska Masaryk University, Institute of Comp. Sci. and CESNET, z.s.p.o.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
© 2012 IBM Corporation IBM Linear Tape File System (LTFS) Overview and Demo.
EGEE is a project funded by the European Union under contract IST Generic Applications Requirements Roberto Barbera NA4 Generic Applications.
Katherine Skinner, Martin Halbert & Matt Schultz Educopia Institute and MetaArchive Cooperative NDSA Infrastructure Committee
Research and Service Support Resources for EO data exploitation RSS Team, ESRIN, 23/01/2013 Requirements for a Federated Infrastructure.
Compute and Storage For the Farm at Jlab
XenData SX-10 LTO Archive Appliance
Open-E Data Storage Software (DSS V6)
Data Stewardship Interest Group WGISS-43 Meeting
DAITSS: Dark Archive in the Sunshine State
DAITSS and the Florida Digital Archive
Intelligent Archiving for Media & Entertainment
Technology for Long-Term Digital Preservation
Digital Archiving & Preservation : How to compare and contrast
Avid Integration.
Joseph JaJa, Mike Smorul, and Sangchul Song
INTA ESA-ESRIN 1st LTDP+ Workshop Canaries Space Centre
ESA Report to DAI/IPR WG
Ákos Frohner EGEE'08 September 2008
Computing Infrastructure for DAQ, DM and SC
Research Data Archive - technology
Technology for Long Term Digital Preservation Workshop ESA 22/09/2017
XenData SX-550 LTO Archive Servers
The VITO Earth Observation LTDA Facility
Airbus Archive Technologies
Storage Trends: DoITT Enterprise Storage
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System Zaihua Ji Doug Schuster Steven Worley Computational.
Sony Pictures Digital Backbone
DATS International Portfolio.
Robin Dale RLG OAIS Functionality Robin Dale RLG
ESA EO Data Preservation System: CBA Infrastructure and its future
Successful Data Curation for Large Data Archives
Presentation transcript:

22 September 2017, ESA/ESRIN - Frascati Technology for Long-Term Digital Preservation - First workshop Gilbert Barrot, ACRI-ST Gaston Briot, adwaïsEO 22 September 2017, ESA/ESRIN - Frascati

One significant example: EODAS ESA project ACRI-ST and its subsidiary adwaisEO are involved in several projects involving data storage One significant example: EODAS ESA project

EO Data Archiving Service (EODAS) Implements the ESA Master Archive through a dedicated service: Contract Kicked-off in August 2016 Two copies of the data held > 200 Km apart Connection to the ESA EO WAN Checks of the data at all stages from ingestion to delivery Scalability required for growth of contents Management of the data information (MetaData) in fully redundant Databases

EODAS Consortium The industrial consortium is made of ACRI-ST (FR), adwäisEO (LU) and KSAT (NO), with the following distribution of roles: ACRI-ST: Data Archiving (and delivery) Service provider – Prime contractor adwäisEO: data archiving (and delivery) operation provider with its leading edge IT and connected secured infrastructure in Luxembourg KSAT: data knowledge provider and overall ingestion validation

EODAS drivers Strong drivers from H/Div: Requirements limited to “Archival and bulk extraction” of non Copernicus data No operational use of the data Data extracted as it was ingested – no transformation / conversion No tight integration to the PDGS of the missions; not for end-user dissemination Added requirements for Data Ingestion / Evolution / Alignment / Management Added requirements for Service Management and Reporting Out of scope: transcription from historical media, consolidation, transformation, conversion

Tape transfer to Backup Archive Data retrieval 8 LTO7 drives 500 TB cache 11 PB capacity (LTO7) Data ingestion Data evolution Data alignment DMS Tape transfer to Backup Archive Data retrieval

NAS HDD tapes ftp access listings Data ingestion Data evolution Data alignment DMS EODAS central archive Tiering solution (Quantum) Disk cache (XCellis) Tape library (i6000 - ANTF) Weekly tapes transfer or real-time FTP transfer (LM) Data retrieval (Quantum) Tiering solution (Quantum) Disk cache (M440) Tape library (i6000 - ANTF) Tape storage (vault) Archive backup

Product status evolution during the ingestion process Data ingestion / archiving process Ingestion of historical + Live missions (Cryosat-2 and SMOS already operational) data Products/files from source media to scratch storage Packaging (AIP) Copy to archive (disk  tape) Transfer to backup center (tape or FTP) Copy in backup archive + DMS update Dire que l’EODAS est une archive bande Product status evolution during the ingestion process aaaaa

Data packaging process (AIP) One single internal package format to simplify the internal process Distributed data = same format as original

Data integrity Data integrity verified after each process: copy/packaging/transfer zip integrity or MD5 checksum

Ingestion chain Ingestion chain is based on ACRI-ST DPMC system Data and Processing Management Core system  DMS + PMS + orchestrator Most of EODAS operations are performed in parallel on several processing nodes (up to 136 simultaneous processing threads)

DELL (servers) + Quantum (storage/library) + Cisco (network) Infrastructure DELL (servers) + Quantum (storage/library) + Cisco (network)

Quantum tapes tower + Synology NAS (from ESA) Infrastructure Quantum tapes tower + Synology NAS (from ESA)

http://www.eodas.info/

On-line access archives Disk or tapes? Volume Full-tape Hybrid (e.g. HSM) On-line access archives Cold archives EODAS Full-disk Performance

Disk vs tapes: the criteria Disk or tapes? Disk vs tapes: the criteria Bulk or random-access ? Volume ? Short or long-term data archive ? Cost ? Versioning ?

Disk or tapes?

Many thanks for listening. Q&A ?