Rosetta at ETH Zurich – routes into the digital archive

Slides:



Advertisements
Similar presentations
Current design issues for digital archives Robert Munro (presented by David Nathan) Endangered Languages Archive (ELAR), School of Oriental and African.
Advertisements

Mirror Mirror on the wall does your repository reflect it all? Peter West and Timothy Miles-Board EPrints Services University of Southampton Southampton,
Implementation of a Validated Statistical Computing Environment Presented by Jeff Schumack, Associate Director – Drug Development Information September.
28 April 2004Second Nordic Conference on Scholarly Communication 1 Citation Analysis for the Free, Online Literature Tim Brody Intelligence, Agents, Multimedia.
Engaging repository policy with preservation Steve Hitchcock and Neil Jefferies* Preserv 2 Project School of Electronics and Computer Science (ECS), Southampton.
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
National Diet Library Digital Archive Portal - PORTA - Gateway to digital information in Japan April 3, 2008 Hideki Takeuchi Planning.
Cultural Heritage in REGional NETworks REGNET Project Meeting Content Group
E-Content Service Group Virtual Meeting Digital Preservation: How to Get Started.
DRIVER Long Term Preservation for Enhanced Publications in the DRIVER Infrastructure 1 WePreserve Workshop, October 2008 Dale Peters, Scientific Technical.
Kristīne Pabērza Ministry of Culture State Agency Culture Information Systems Latvia Member States' Expert Group on Digitization and Digital Preservation.
Open Access Niamh Brennan Trinity College Dublin DRIVER Summit, Goettingen, January 17th 2008 Local Integration, National Federation TCD-RSS, TARA, IReL-Open,
Michigan Electronic Grants System Plus
Open Scholarship 2006 Bielefeld Academic Search Engine a Scientific Search Service for Institutional Repositories Open Scholarship 2006 New Challenges.
A Future for UK theses, University of London, Senate House, 22-Jan-2004 E-thesis submission workflow issues Simon J. Bevan Information Systems Manager.
Opening access and closing the risk: delivering the mandate for e-theses deposit 10 th International Symposium on Electronic Theses and Dissertations Uppsala.
Metadata for preservation: the Cedars perspective
Pure Silver Reusing and Repurposing Bibliographic Data in a Current Research Information System and Institutional Repository 15 September.
A centre of expertise in data curation and preservation DigCCur2007 Symposium, Chapel Hill, N.C., April 18-20, 2007 Co-operation for digital preservation.
Seite 1 Heinz Nixdorf Zentrum für Informationsmanagement in der Max-Planck-Gesellschaft 17 Oct 2002 Gerhard Beier MPG eDocument Server Building an institutional.
Digital Preservation A Matter of Trust. Context * As of March 5, 2011.
Preserving and Sharing Digital Data Greg Colati, Director, Archives and Special Collections May 11, 2012.
Niagara Portal Introduction January 2007 Scott Muench - Technical Sales Manager.
HE in FE: The Higher Education Academy and its Subject Centres Ian Lindsay Academic Advisor HE in FE.
Software change management
OOAD – Dr. A. Alghamdi Mastering Object-Oriented Analysis and Design with UML Module 3: Requirements Overview Module 3 - Requirements Overview.
Collections and services in the information environment JISC Collection/Service Description Workshop, London, 11 July 2002 Pete Johnston UKOLN, University.
12 June 2014 Library & IT Services 1 Renovating the Library: Creating Learning Spaces and Moving to E-Only Hans Geleijnse Library Strategy Consultant Tilburg.
Building repositories Iryna Kuchma, eIFL Open Access program manager, eIFL.net Presented at “Open Access: Maximising Research Impact” workshop, May 25.
Digital Futures International Forum - Tuesday 18th September 1 Digital Futures International Forum The Digitisation Standard: Back & Forth Stephen Clarke.
Distributed search for complex heterogeneous media Werner Bailer, José-Manuel López-Cobo, Guillermo Álvaro, Georg Thallinger Search Computing Workshop.
1. 2 Captaris Workflow Microsoft SharePoint User Group 16 May 2006.
A Virtual Research Environment for the Study of Documents and Manuscripts 1 1 Research administration Resource discovery Data creation, use and analysis.
Klaus Kempf Long Term Preservation: Needs and Activities at the Bavarian State Library (BSB) „ALA 2010 Washington“
The future’s so bright…. DAITSS DIGITAL PRESERVATION SYSTEM: RE-ARCHITECTED, RE- WRITTEN, AND OPEN SOURCE Priscilla Caplan Florida Center for Library Automation.
How the University Library can help you with your term paper
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
1 What is the Internet Archive We are a Digital Library Mission Statement: Universal access to human knowledge Founded in 1996 by Brewster Kahle in San.
Fedora Users’ Conference Rutgers University May 14, 2005 Researching Fedora's Ability to Serve as a Preservation System for Electronic University Records.
PubMed Central Mahyar Ahmadpour-B. Kowsar Publicatin Corp. Kowsar Editorial Meeting 1 September 19th, 2013 Tehran, Iran.
OPEN RESEARCH DATA, EPFL, 28 October 2014, M. Töwe, M. Bärlocher docuteam packer: viewer and editor for file structures and metadata.
Building of the Digital library of Brno University of Technology Barbara Šímová /
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
Fedora 3.0 and METS: A Partnership for the Organization, Presentation and Preservation of Digital Objects Open Repositories Georgia Tech, Atlanta,
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
Bibliography in the Digital Age - IFLA Satellite Meeting Warsaw, 9 August Online materials published in Austria collecting, archiving and metadata.
Open Archives for Library and Information Science: an international experience Antonella de Robbio and Paula Sequeiros IV EBIB Conference: Open Access.
Archiving and Presenting Journals with Rosetta Matthias Groß, Bavarian State Library, Munich, Germany 10th IGeLU Conference, Budapest, September 2 nd 2015.
Libra: Thesis and Dissertation Submission. What is Libra? UVA’s institutional repository, providing online archiving and access for the scholarly output.
Electronic Theses at Rhodes University presented by Irene Vermaak Rhodes University Library National ETD Project CHELSA Stakeholder Workshop 5 November.
The DiVA System: Current Status and Ongoing Development Uwe Klosa Electronic Publishing Centre, Uppsala University, Sweden Eva Müller.
BMC Open Access Colloquium, 8 February Morgan: "Open Access Repositories"
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
Digital Commons & Open Access Repositories Johanna Bristow, Strategic Marketing Manager APBSLG Libraries: September 2006.
| Ingest Levels and Persistent Identification | October Ingest Levels and Persistent Identification Services for R & D and heritage organisations.
Tsinghua University Library Yang Zhao & Airong Jiang Tsinghua University Library, Beijing China 4 June, 2004 Electronic Thesis and Dissertation System.
Selene Dalecky March 20, 2007 FDsys: GPO’s Digital Content System.
Managing Access at the University of Oregon : a Case Study of Scholars’ Bank by Carol Hixson Head, Metadata and Digital Library Services
SEDAC Long-Term Archive Development Robert R. Downs Socioeconomic Data and Applications Center Center for International Earth Science Information Network.
Managing live digital content with DuraSpace services Bill Branan PASIG Spring 2015.
Preservation Functionality in a Digital Archive Erik Oltmans Koninklijke Bibliotheek Raymond J. van Diessen IBM Business Consulting Services Hilde van.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Moving on : Repository Services after the RAE
Building A Repository for Digital Objects
An Introduction to Tessella and The Safety Deposit Box Platform
Sophia Lafferty-hess | research data manager
Jisc Research Data Shared Service (RDSS)
ArchivesSpace – Archivematica – DSpace Workflow Integration
Presentation transcript:

Rosetta at ETH Zurich – routes into the digital archive IGeLU 7th Conference Zurich, 11th to 13th September 2012 Dr. Matthias Töwe

Outline ETH Zurich and ETH-Bibliothek Background and objectives Integration of Rosetta Approach for local data management Ongoing work 12.09.2012 M. Töwe

ETH Zurich ETH Zurich (2011) Swiss Federal Institute of Technology Zurich Founded in 1855 University for technology and the natural sciences More than 17‘ooo students from 80 countries, 3‘700 among them are doctoral candidates More than 450 professors in engineering, architecture, mathematics, natural sciences, system-oriented sciences, and management and social sciences 12.09.2012 M. Töwe

ETH-Bibliothek Central university library for ETH Zurich with four special libraries Swiss centre for technical and scientific information Special collections including ETH Archives, Image Archive, Map Collection, ETH‘s Collection of Prints and Drawings and others Hosting nationwide services for university libraries, e.g. NEBIS library network (http://www.nebis.ch) Consortium of Swiss Academic Libraries (http://lib.consortium.ch) Swiss electronic library e-lib.ch (http://www.e-lib.ch) National digitization projects with more than 5.5 mio. pages: http://www.e-rara.ch (rare books) http://retro.seals.ch (Swiss journals) 12.09.2012 M. Töwe

Observations since (at least) 2002 Library had turned digital to a degree where ensuring long- term preservation and access became major issues «Sooner or later we will need an OAIS-compliant system to support long-term preservation» No product in the market No chance to do full development ourselves Since then: increasing pressure Digital deposits on the horizon for ETH Archives Research data gaining attention within ETH Zurich 12.09.2012 M. Töwe

Background Research Data Challenges Research process as a whole relies on digital data Good scientific practice requires retention of data in usable form Funding organisations require data management plans (NSF, DFG) Re-use of data increasingly important and should be facilitated Data which cannot easily be reproduced and has permanent relevance must remain available Published or referenced supplementary material must be citable and remain available 12.09.2012 M. Töwe

Objectives Facilitate re-use of data Accountability, re-assessment during a limited period Citability of data referred to in publications  DOI-registration already operational Re-use by own group, by granting access to known colleagues or as fully Open Data Preserve the scientific and historic heritage of ETH Zurich Administrative records from ETH Archives Library materials (born digital theses, digitization masters; excluding licensed / acquired e-journals and e-books) 12.09.2012 M. Töwe

Why Rosetta ? Concept in accord with OAIS reference model (Open Archival Information System, ISO 14721) Developed with partners from the library community  including side effects Supports relevant functions in preservation Makes use of available tools from the wider LTP community Includes a format library which interacts with PRONOM 12.09.2012 M. Töwe

Why Rosetta ? Opportunity for a development partnership… …based on an «out of the box» product Scalability in both object numbers and data volume Operation on virtual servers corresponding to ETH’s IT strategy Integration with existing system environment at ETH-Bibliothek using available know-how Strategic preference for vendor supported products 12.09.2012 M. Töwe

Vision 12.09.2012 M. Töwe

Systems’ view OAIS 12.09.2012 M. Töwe Final report on second phase „Pilot Langzeitarchivierung“, S. 23f; Aliesch, P. et al., 2007: Projekt „Pilot Langzeitarchivierung“. Internal. 12.09.2012 M. Töwe

Options for Submission Manual: Web dialog for upload and metadata entry by staff users Semi-automatic: Batch-upload of files with existing metadata in CSV-spreadsheet Automated: Submission application packages structured batches of files with existing metadata in XML-format and submits them Submission application may also interface directly with existing source applications 12.09.2012 M. Töwe

The «simple» case: Library materials Born digitals, digitization masters Unique to ETH’s heritage Dark archive, online-access on dedicated platform Metadata and files are available in a structured way from well- defined sources Example use case: ETH E-Collection (institutional repository) 12.09.2012 M. Töwe

Rosetta’s Role: Use case «E-Collection» MD or fulltext query MD query Primo Link to «external resource» Files Submission Application SIP Rosetta MD MD NEBIS (Aleph) 12.09.2012 M. Töwe

Functional LEvels What? Why? Who? Data Curation Ensure intellectual re-usability Data Producers Content Preservation Ensure technical re-usability ETH-Bibliothek Bitstream Preservation Ensure technical stability IT-Services ETH Zurich Adapted after Jens Ludwig, Wissgrid 12.09.2012 M. Töwe

Concerns with research data Keep it simple – at least for researchers Digital curation cannot «improve» data retroactively: «garbage in – garbage out» Researchers need to contribute (e.g. structure, metadata, documentation) Can we actually «raise awareness» with researchers?  begin with those with high level of awareness and interest Who decides about data when the producer is no longer available? Public access must not be a prerequisite for preservation Researchers should be provided with a convincing added value service which makes their lives easier 12.09.2012 M. Töwe

Data management and OAIS ? Researchers’ requirements only partly concern functions within the OAIS-frame High degree of flexibility required in Pre-Ingest or even earlier Consequences for the role of Rosetta (or any LTP-system): Little use in further increasing complexity of LTP-system with new functionality which has little to do with LTP Where available, transfer data from existing upstream applications Achieve flexibility in local data management as needed, not in central application 12.09.2012 M. Töwe

Local Data management SIP Package Handler (Docupack) Viewer and editor for structure and metadata Data and metadata remain on each research group’s local storage as long as desired Creates Submission Information Package (SIP) of selected data for ingest into Rosetta when archiving is initiated Origin in archival sector no coincidence: requirements are similar 12.09.2012 M. Töwe

Research Data Management  Preservation User Library Rosetta Point of time X: selection for archiving and «Submit» incl. access rights in archive Structuring and metadata-capture, DOI-generation Ingest, DOI-registration Delivery via DOI, if rights ok Assigned storage (local PC, share); permissions via file system functionality 12.09.2012 M. Töwe

Workflow for ETH Archives Organisational unit of ETH ETH Archives Library Rosetta Offer of deposit Ingest, DOI-registration Appraisal, selection, description embargo period? DOI-generation Appraisal ok? Full MD and structure Full text delivery via DOI Appraisal approved Primo MD MD query Deliver full MD and structure Archival Information System Assigned storage, temporary access for depositing unit 12.09.2012 M. Töwe

Rosetta: Commissioned Developments Collections API • Create/delete collection via submission of an externally produced METS-file • Update collection metadata and location of an existing collection • Get collection by ID or path Retention Period Functionality • Manage Retention Policies based on date or period • Assign Retention Period through task or via METS • Reports on IEs to be deleted and job for deletion Collection Viewer • Display of collections’ context (tree structure) • «Explorer-like» navigation 12.09.2012 M. Töwe

Work so far DOI-registration by ETH Zurich as member of DataCite operational Full survey of research groups (Profs.) at ETH Zurich Identification of pilot partners and their requirements Check and update inventory of data hosted by the library Submission application for institutional repository (Productive operation planned to start 09/2012) Gap analysis and decision on development Migration to version 3.0, now 3.0.1 12.09.2012 M. Töwe

Work To Do and in progress Commissioned developments Rosetta Specification Development (Beta-version 10/2012) Testing (10-11/2012) Development and testing for local data management Development (beta versions between 07-10/2012) Testing (Continuously between 07-10/2012) Ongoing work with respect to service provision (2012) Definition of service levels Analysis of formats in pilot groups Definition of white list for supported formats Drafting of a data policy to support researchers 12.09.2012 M. Töwe

Thank you very much! Questions? Dr. Matthias Töwe Head Digital Curation ETH-Bibliothek Rämistrasse 101 8092 Zürich Switzerland +41 (0)44 632 60 32 matthias.toewe@library.ethz.ch http://www.library.ethz.ch 12.09.2012 M. Töwe