OPEN RESEARCH DATA, EPFL, 28 October 2014, M. Töwe, M. Bärlocher docuteam packer: viewer and editor for file structures and metadata.

Slides:



Advertisements
Similar presentations
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Advertisements

Rosetta at ETH Zurich – routes into the digital archive
Long-Term Preservation. Technical Approaches to Long-Term Preservation the challenge is to interpret formats a similar development: sound carriers From.
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
WHY CMS? WHY NOW? CONTENT MANAGEMENT SYSTEM. CMS OVERVIEW Why CMS? What is it? What are the benefits and how can it help me? Centralia College web content.
HP Quality Center Overview.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
US GPO AIP Independence Test CS 496A – Senior Design Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong Faculty advisor: Dr. Russ.
Michael Donovan, River Campus Libraries – 12/03 DocuShare Overview and Training.
Producer-Archive Workflow Network (PAWN) Goals Consistent with the Open Archival Information System (OAIS) model Use of web/grid technologies and platform.
US GPO AIP Independence Test CS 496A – Senior Design Fall 2010 Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong.
Ingest and Loading DigiTool Version 3.0. Ingest and Loading 2 Ingest Agenda Ingest Overview and Introduction Ingest activity steps Transformers Task Chains.
Supporting Customized Archival Practices Using the Producer-Archive Workflow Network (PAWN) Mike Smorul, Mike McGann, Joseph JaJa.
Brief Overview of Major Enhancements to PAWN. Producer – Archive Workflow Network (PAWN) Distributed and secure ingestion of digital objects into the.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
AIP Archival Information Package – Defines how digital objects and its associated metadata are packaged using XML based files. METS (binding file) MODS.
Fundamentals of Information Systems, Second Edition
US GPO AIP Independence Test CS 496A – Senior Design Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong Faculty advisor: Dr. Russ.
Installing software on personal computer
Persistent Digital Archives and Library System (PeDALS) A Guide for Wisconsin State Agencies.
THE NEW WAY TO WORK TOGETHER Share 4 CreateControlProtect Create and organize content easily with the help of relevant discovered information Manage.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Metadata standards, tools and processes for audio preservation at the British Library: An overview of new systems for audio description, preservation and.
RMIS - Building a Research Management Information System at the University of Glamorgan Leanne Beevers & Neil Williams.
Good practice in Research Data Management Module 6: Tools, training and support.
Statewide Digitization and the FCLA Digital Archive Priscilla Caplan, Florida Center for Library Automation Statewide Digitization Planners Meeting OCLC,
Metadata Harvesting The Hague, 13 & 14 January 2009 Julie Verleyen Scientific Coordinator, Europeana Office EuropeanaLocal Knowledge Sharing Workshop.
5-7 November 2014 DR Workflow Practical Digital Content Management from Digital Libraries & Archives Perspective.
Human Resource Management Lecture 27 MGT 350. Last Lecture What is change. why do we require change. You have to be comfortable with the change before.
DSpace. TM 2 Agenda  Introduction to DSpace  DSpace community  Institutional Repository  Easy to add/find content in DSpace  Building Online Communities.
Fundamentals of XML Management Greg Alexopoulos Systems Engineer Documentum.
Lead Management Tool Partner User Guide March 15, 2013
State Records Office of Western Australia.NET Proof of Concept Project Slideshow: Prototype Online Disposal Authority/Recordkeeping Plan System Project.
UVa Library Research Data Services
Preserving Digital Culture: Tools & Strategies for Building Web Archives : Tools and Strategies for Building Web Archives Internet Librarian 2009 Tracy.
Data Management Practices for Early Career Scientists: Closing Robert Cook Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN.
Implementing a Data Publishing Service via DSpace Jon W. Dunn, Randall Floyd, Garett Montanez, Kurt Seiffert May 20, 2009.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
Overview of the SAS® Management Console
| Ingest Levels and Persistent Identification | October Ingest Levels and Persistent Identification Services for R & D and heritage organisations.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Series 2013 Data Management at the National Climate Change and Wildlife Science Center.
WGISS /09/2015 DATA PRESERVATION – CNES APPROACH B. Chausserie-Laprée.
THE NEW WAY TO WORK TOGETHER Share 4 CreateControlProtect Create and organize content easily with the help of relevant discovered information Manage.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
CSUN eCommons Submitting Learning Objects to CSUN eCommons: A Preliminary Guide February 7, 2008.
L.T.E :: Learning Through Experimenting Using google-svn for MtM Docs Development Denis Thibault Version 3.2 Mar 12 th, 2009.
Implementing PREMIS in DigiTool Michael Kaplan ALA 2007 Update.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Preservation Functionality in a Digital Archive Erik Oltmans Koninklijke Bibliotheek Raymond J. van Diessen IBM Business Consulting Services Hilde van.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Setting up long term digital preservation DLM Forum Member Meeting Luxembourg 14-15th October 2015.
International Planetary Data Alliance Registry Project Update September 16, 2011.
MANAGEMENT OF STATISTICAL PRODUCTION PROCESS METADATA IN ISIS
Moving on : Repository Services after the RAE
An Overview of Data-PASS Shared Catalog
Exercise: understanding authenticity evidence
Statewide Digitization and the FCLA Digital Archive
Active Data Management in Space 20m DG
SowiDataNet - A User-Driven Repository for Data Sharing and Centralizing Research Data from the Social and Economic Sciences in Germany Monika Linne, 30.
Reviewing Items in the Campus Digital Archive (CDA)
Exercise: understanding authenticity evidence
Institutional role in supporting open access, open science, open data
11/16/2018.
Márton Németh – László Drótos How to catalogue a web archive?
Presentation transcript:

OPEN RESEARCH DATA, EPFL, 28 October 2014, M. Töwe, M. Bärlocher docuteam packer: viewer and editor for file structures and metadata

||  Aim of the Workshop  docuteam packer  Purposes and limitations  Use cases  Configuration  Demo  Testing (Link to download: M. Töwe, M. Bärlocher2 Overview

||  M. Töwe, M. Bärlocher3 Download Optional Open Office, for file preview only Manual in German Manual in English Application

|| M. Töwe, M. Bärlocher4 Aims of the workshop  Aims of using docuteam packer are clear  Participants have a rough understanding of the tool’s strengths and limitations  Technical prerequisites for using the tool are known

|| M. Töwe, M. Bärlocher5 Workflow «Small Data» Filesystem (networked, local) Selection for archiving Structuring and DOI-generation* Sub- mission Point of time X: «Submit» a SIP ETH Data Archive Delivery via DOI docuteam packer Library Metadata capture Researchers Access via Knowledge Portal IT-Services Server Storage Network Bitstream preservation Curation and usabilitySelection and documentation of context docuteam feeder

|| M. Töwe, M. Bärlocher6 What is docuteam packer? For users  Viewer and editor for local preparation of archival packages for transfer to ETH Data Archive  Create and edit folder structure, as it should be reflected in ETH Data Archive  Enter and edit metadata  DOI-creation (Digital Object Identifier; to be registered by ETH Data Archive)  Assign access rights and retention periods to be enforced by ETH Data Archive In the background  Create a Submission Information Package (SIP) or Archival Information Package (AIP) of metadata + structure (METS-format, Metadata Encoding and Transmission Standard) and data

|| M. Töwe, M. Bärlocher7 What docuteam packer is not!  No comprehensive data management solution  No records management solution  No collaboration platform  No data repository  No long term archive - but a tool to prepare for and submit to archive  No solution for local rights management  Not tied to use with Rosetta as the only long term archive  Consider alternative approaches where these are more appropriate  Be careful with using the tool without submitting to an archive

|| M. Töwe, M. Bärlocher8 Example Use Cases Research groups  Data belonging to a manuscript are collected, submitted to the long term archive and made accessible via DOI for reviewers and readers  Research group has a structured filing without metadata; it should be edited and submitted into the long term archive  PhD students of a group are presented with a filing structure they should follow when managing their data … Administrative staff within ETH  Delivers structured data to ETH Zurichs university archives…  …archives’ staff appraises and selects content and adds metadata

|| M. Töwe, M. Bärlocher9 The GUI and its elements (1) Tree view of folders and files Statistics per element Technical metadata Events Event details

|| M. Töwe, M. Bärlocher10 The GUI and its elements (2) Tree view of folders and files Descriptive metadata Preview functions

|| M. Töwe, M. Bärlocher11 Practical use 1.Import an existing file structure (drag and drop) and add metadata afterwards 2.Import single files into a unified structural template for members of a group: «We always put files of type N into folder Y of our structure» 3.Build a structure from scratch with defined hierarchical elements with predefined metadata fields

|| M. Töwe, M. Bärlocher12 Why the effort?  «Local data management light»: Data are structured and described locally within the group  Group retains full control, but important work is already done to facilitate long term preservation  Metadata can be configured – within reasonable limits  Structure and metadata in METS-XML can be submitted to ETH Data Archive automatically (via docuteam feeder as Submission Application)  DOI are generated and can be used in citations (registration follows later in ETH Data Archive)  Selection of retention periods and access rights to be enforced in ETH Data Archive

|| M. Töwe, M. Bärlocher13 Issues to observe  Configuration is flexible, but must remain consistent with MD schema  If flexibility of configuration is exploited, effort for maintenance strongly increases and the approach will not scale well  Early discussion with the Digital Curation Office is important!  No installation, but it must be possible to run docuteam packer locally  Users can get themselves into trouble by manipulations on the file system  If data remains on local storage for years, problems with respect to long term preservation can occur once data is submitted to long term archive

|| M. Töwe, M. Bärlocher14 Complete process into the archive

|| M. Töwe, M. Bärlocher15 Questions? Dr. Matthias Töwe Head Digital Curation ETH-Bibliothek Rämistrasse Zurich Martin Bärlocher Library IT Services ETH-Bibliothek Rämistrasse Zurich

|| M. Töwe, M. Bärlocher16 Use Case Research Data – «Small Data»  Distinct from «Big Data»  Structured data in discrete files; produced everywhere – even in projects which actually deal with «Big Data» as their research topic  Interface between data management  Long term preservation  Facilitate compliance with accountability and verifiability  Ensure citability of data  DOI-registration  Support producer’s own re-use, access by colleagues or Open Data  From Restricted Access to Open Access  Retention period at least 10 years  We expect increasing requirements by funders and universities for data management

|| M. Töwe, M. Bärlocher17 Submission Process

|| M. Töwe, M. Bärlocher18 (Close to) ideal approach for Research Data Researchers Inter- face ETH Data Archive (Rosetta) LibraryIT Services Ingest Server Storage Network Access via DOI Processing and documentation of context Preservation of usability Bit stream preservation Existing data management or workflow solution Examples: LIMS Digital Lab Notebook Data Mgmt. Platform (e.g. openBIS) Virtual Research Environment Researchers Inter- face ETH Data Archive (Rosetta) Library Ingest Server Storage Network Access via DOI Processing and documentation of context Preservation of usability Existing data management or workflow solution Examples: LIMS Digital Lab Notebook Data Mgmt. Platform (e.g. openBIS) Virtual Research Environment