PRACE-2IP WP10 - iRODS workshop iRODS CINES Gerard GIL (CINES) – (Linkoping 26-28 September 2012)



2 CINES presentation; CINES projects using iRODS (ADONIS, Archives Replication, ISAAC)

3 CINES is supervised and funded by the Ministry of Higher Education and Research.
- CINES is located in Montpellier (South of France).
- 30 years serving national academic research.
- CINES provides the French public research community with computing resources and services.
- 50 staff (technicians, engineers and administrative staff).
- 2 missions: digital preservation and high performance computing.

4 Digital preservation
- In 2004, CINES was given the mandate to provide long-term preservation capabilities for digital objects related to scientific and technical information:
  - Electronic PhD theses
  - Digitized publications
  - Multimedia teaching materials
  - Scientific datasets
- International certification process: CINES is one of the four pilot sites (with UKDA, DNB and DANS) testing the European Certification Framework for long-term preservation (supported by the European Commission); ISO certification.
- National agreement (Dec 2010): CINES has received the agreement of the French Institution of Archives for its Digital Archive Repository (PAC platform).
- Member of EUDAT.

5 High Performance Computing
- Scalar, vector and parallel processing
- French T1 system: JADE
  - SGI Altix ICE 8200
  - Intel core, 267 Tflops
  - 600 TB Lustre filesystem
- Accelerators: GPU, CELL, FPGA, ...
- PRACE member on behalf of GENCI
- National HPC center since 1980

6 iRODS in CINES projects
- ADONIS
- Archives Replication
- ISAAC

7 ADONIS
- The ADONIS TGE (Very Large Infrastructure) is in charge of building a digital infrastructure for unified access to the digital documents and data produced by the human and social sciences.
- CRDO (Resource Center for Oral Data): pilot project selected to validate options for oral data (2010).

8 ADONIS
[Diagram] Labels: Archive Platform; Transfer; Synchronization; Format conversions; Dissemination System; deposit; replication/synchronization; iRODS; iRODS?
iRODS capabilities listed on the slide:
- Fault tolerance
- Integrity control
- Resource management rules
- High-speed data transfers

9 ARCHIVES REPLICATION
- To complete its national agreement for digital archiving, CINES has to provide a distant copy of the archives it manages. The Archives Replication project has been defined to reach this goal.
- Several solutions were studied: Arcsys, iRODS, iSCSI, …
- iRODS was selected for these reasons:
  - Open solution
  - Metadata description
  - Resource management rules
  - Fault tolerance
  - Integrity control
  - Authorization management
  - …

10 ARCHIVES REPLICATION (cross-replication)
[Diagram] Labels: CINES zone (CINES archive system, CINES iRODS server); other zone (other archive system, other iRODS server); replication; CINES distant storage resource; other distant storage resource.
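The distant-copy requirement can be pictured as a comparison between the checksum manifest of a local archive and that of its remote replica. The following is a minimal Python sketch under stated assumptions: the manifest layout (logical path mapped to a hex digest), the `ReplicaReport` type and the `compare_manifests` function are hypothetical names for illustration only, and nothing here is taken from the CINES deployment or from the iRODS API.

```python
from typing import Dict, List, NamedTuple


class ReplicaReport(NamedTuple):
    missing: List[str]     # present locally, absent from the distant copy
    mismatched: List[str]  # present on both sides, but digests differ
    extra: List[str]       # present only on the distant copy


def compare_manifests(local: Dict[str, str], remote: Dict[str, str]) -> ReplicaReport:
    """Compare two {logical_path: digest} manifests (hypothetical layout)."""
    missing = sorted(p for p in local if p not in remote)
    mismatched = sorted(p for p in local if p in remote and local[p] != remote[p])
    extra = sorted(p for p in remote if p not in local)
    return ReplicaReport(missing, mismatched, extra)
```

In a cross-replication arrangement each site would run such a comparison against the copy held by its partner; an empty `missing` and `mismatched` list would indicate that the distant copy is complete and intact for the files listed.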

11 ISAAC (Information Scientifique Archivée Au CINES): mid-term preservation of scientific data
- Objectives:
  - Preserve/archive data for 3 to 4 years.
  - Give the researcher additional time to appraise the relevance and importance of the information.
  - Put in place processes for the valorization and preservation of scientific data.
- This goes well beyond simple storage or backup.
- At the end of this 3-4 year period, there are two options:
  - Migration onto the long-term preservation platform (PAC)
  - Restitution to the producer/owner
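The mid-term period described above can be expressed as simple date arithmetic. This is only an illustrative Python sketch: it assumes that "3 to 4 years" means a decision window that opens 3 years after ingest and closes 4 years after ingest, and the function names and status strings are hypothetical, not part of ISAAC.

```python
from datetime import date
from typing import Tuple


def add_years(start: date, years: int) -> date:
    """Return the same calendar day `years` later."""
    try:
        return start.replace(year=start.year + years)
    except ValueError:
        # 29 February with a non-leap target year: fall back to 28 February.
        return start.replace(year=start.year + years, day=28)


def review_window(ingest_date: date) -> Tuple[date, date]:
    """Assumed decision window: opens after 3 years, closes after 4 years."""
    return add_years(ingest_date, 3), add_years(ingest_date, 4)


def status(ingest_date: date, today: date) -> str:
    """Classify a dataset relative to its assumed decision window."""
    opens, closes = review_window(ingest_date)
    if today < opens:
        return "preserved"
    if today <= closes:
        return "decision window"  # migrate to PAC or return to the producer
    return "decision overdue"
```

For a dataset ingested on 26 September 2012, `review_window` returns 26 September 2015 and 26 September 2016 under this assumption.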

12 ISAAC
[Diagram] Labels:
- Actors: data producer; data user (producer, authorized user, communities); experts group; thematic committee
- TRANSFER
- INGEST: document audit, format validation, metadata input, unique persistent identifier, additional checks, rights management
- STORAGE: fixity checks, replication, event logging
- ACCESS: search on metadata, files catalog, rights management, file download
- Preservation context definition and service level agreement: metadata, file formats, knowledge base, etc.
- iRODS: distributed data, resource management rules, metadata description, versatile management for Big Data, authorization management, fault tolerance, integrity checks, high-speed data transfer, …
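A fixity check, as named in the STORAGE stage, usually means recomputing a file's digest and comparing it with the value recorded at ingest. The sketch below shows that idea in Python with the standard `hashlib` module. The choice of SHA-256 and the function names `sha256_of` and `fixity_ok` are assumptions made for illustration; the slides do not say which algorithm or tooling ISAAC uses.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fixity_ok(path: Path, recorded_digest: str) -> bool:
    """Return True if the file still matches the digest recorded at ingest."""
    return sha256_of(path) == recorded_digest.lower()
```

Reading in chunks keeps memory use flat for large scientific files; the digest recorded at ingest would be stored with the object's metadata and rechecked periodically.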

13 ISAAC
[Diagram] Labels:
- WEB INTERFACE
- DATA USER: producer, authorized user, communities
- Ingest API: format validation, metadata management, integrity checks, rights management, …
- Storage abstraction layer
- STORAGE

14 ISAAC (Information Scientifique Archivée Au CINES)
- A generic platform with a national/European scope.
- A prototype is being put in place for the PRECCINSTA datasets (Prediction and Control of Combustion Instabilities for industrial gas turbines): potentially 2 TB of data (up to 10 TB, June 2012); formats HDF5, NetCDF, XDMF.
- Developments based on standard, open technologies: iRODS, Java, PostgreSQL, OpenLDAP

15 Questions?
Information: Digital Archive Dept.