PRESERV: a JISC 4/04 project. Bid conditionally accepted Friday 24th September. Steve Hitchcock, Intelligence Agents Multimedia Group, School of Electronics and Computer Science (ECS), Southampton University.


PRESERV: a JISC 4/04 project
Bid conditionally accepted Friday 24th September
Steve Hitchcock, Intelligence Agents Multimedia Group, School of Electronics and Computer Science (ECS), Southampton University
These slides were prepared for the TARDis Project Review Meeting on September 28, 2004, Southampton

PRESERV: PReservation Eprint SERVices
JISC 4/04: Supporting Institutional Digital Preservation and Asset Management
iii. Institutional repository infrastructure development
PRESERV is planned as a two-year project, running to September 2006.

PRESERV project partners
Southampton University (IAM, Eprints): lead site
The National Archives (PRONOM software)
The British Library
Oxford University

Why preservation based on Eprints?
It is important to build the concept of preservation from the outset (JISC Circular 4/04, note 10). In the digital era, the outset for most new research and educational materials will be the institutional archive, or repository. The most widely used software for building institutional archives is Eprints (Crow 2004), developed at Southampton University and now used in over 130 archives in all regions of the world.
Eprints is thus an established, flexible infrastructure for collecting and managing user-defined metadata, and can therefore be seen as contributing to a critical component of the widely accepted digital preservation reference model, the Open Archival Information System (OAIS). Specifically, it forms part of the process that the OAIS refers to as ingest.

OAIS functional entities
SIP = Submission Information Package
AIP = Archival Information Package
DIP = Dissemination Information Package
[Figure: OAIS functional model, showing Producer, Consumer and Management around the functional entities Ingest, Data Management, Archival Storage, Access and Administration, with SIP, AIP, DIP, Descriptive Info., queries, orders and result sets as the flows between them.]
Ack. Don Sawyer, October Ingest/Ingest%20Plenary%20Pre.PPT

Open Archival Information System (OAIS): Ingest
The set of processes responsible for accepting information submitted by Producers and preparing it for inclusion in the archival store. Specific functions performed by Ingest include:
receipt of information transferred to the OAIS by a Producer;
validation that the information received is uncorrupted and complete;
transformation of the submitted information into a form suitable for storage and management within the archival system;
extraction and/or creation of descriptive metadata to support the OAIS's search and retrieval tools and finding aids;
transfer of the submitted information and its associated metadata to the archival store.
In short, the Ingest function serves as the OAIS's external interface with Producers, managing the entire process of accepting custody of submitted information and preparing it for archival retention. (Lavoie 2004)
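The ingest functions listed above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Eprints or PRESERV implementation: the `SubmissionPackage` and `ArchivalPackage` classes, the `ingest` function, the use of a SHA-256 checksum for validation, and the in-memory dictionary standing in for the archival store are all hypothetical.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class SubmissionPackage:
    """Hypothetical SIP: content plus the checksum and metadata the Producer supplied."""
    identifier: str
    content: bytes
    declared_sha256: str
    metadata: dict = field(default_factory=dict)


@dataclass
class ArchivalPackage:
    """Hypothetical AIP: content plus the metadata the archive keeps with it."""
    identifier: str
    content: bytes
    sha256: str
    descriptive_metadata: dict


def validate(sip: SubmissionPackage) -> str:
    """Validation: check the received bytes against the Producer's declared checksum."""
    actual = hashlib.sha256(sip.content).hexdigest()
    if actual != sip.declared_sha256:
        raise ValueError(f"checksum mismatch for {sip.identifier}")
    return actual


def ingest(sip: SubmissionPackage, archival_store: dict) -> ArchivalPackage:
    """Receive a SIP, validate it, build an AIP and transfer it to the store."""
    checksum = validate(sip)
    # Extraction/creation of descriptive metadata (a deliberately tiny example).
    descriptive = {
        "title": sip.metadata.get("title", "untitled"),
        "size_bytes": len(sip.content),
    }
    # Transformation into the archive's own package form.
    aip = ArchivalPackage(sip.identifier, sip.content, checksum, descriptive)
    # Transfer to the archival store (here an in-memory dict, as an assumption).
    archival_store[aip.identifier] = aip
    return aip
```

A deposit whose bytes do not match the declared checksum is rejected before anything reaches the store, which mirrors the "uncorrupted and complete" check in the list above.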

PRESERV view of OAIS ingest
Accords closely with that of Wheatley (2004). Emphasises the need to automate, and to provide modular tools for, the potentially high-effort, high-cost function of capturing metadata, including the capture of Representation Information (RI).
RI is metadata that describes how the bytestream of a digital object can be turned into a human-readable representation, and will play a crucial role in achieving long-term digital preservation and data curation. In preservation metadata terms, RI is what RLG-OCLC (2002) refers to as the viability of digital resources.
According to Wheatley, a range of institutional repository ingest functions will need to be developed, including:
Automated extraction of metadata
Automatic identification of file formats
Verification of an object's compliance with a relevant file format specification

Working with the National Archives (PRONOM)
The project will implement an ingest service based on the OAIS reference model for institutional archives built using Eprints software.
Working with the National Archives, the project will link Eprints through a Web service to PRONOM software for identification and verification of file formats, the only such system currently in operational use.
The project will emphasise automation, will provide modular tools for capturing metadata and will enable the identification and verification of file formats.
The project will scope a technology watch service to populate and update PRONOM where full automation is not feasible for file format recognition.

Eprints-PRONOM implementation
As part of its work on PRONOM 4, Tessella, National Archives, will develop and host a file format identification tool which can be deployed:
as free downloadable software, which can be used either as a standalone tool via a Java GUI, or via an exposed programming interface (API) that can be integrated with other software;
as a Web service hosted by TNA.
The tool will use file format signature information stored in PRONOM to perform the identification. Southampton will develop Eprints to allow it to use the tool in one or more of the above configurations. This interface will create an enhanced infrastructure service directly usable by institutional archives.
Critical issue
Full automation of this service is unlikely. It would depend on 100% format coverage in PRONOM; without that, alerts could be the result of outdated information. Instead there will be a manual check stage on all alerts.
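Signature-based identification with a manual check on alerts can be illustrated roughly as follows. This is a hedged sketch, not the PRONOM tool or its API: the `SIGNATURES` table is a hypothetical stand-in for signature information held in a registry, and `identify_format` and `check_deposit` are invented names. The byte patterns themselves are the widely documented leading bytes of PDF, PNG and GIF files.

```python
from typing import Optional

# Hypothetical stand-in for a signature registry:
# (format name, byte offset, signature bytes).
SIGNATURES = [
    ("PDF", 0, b"%PDF-"),
    ("PNG", 0, b"\x89PNG\r\n\x1a\n"),
    ("GIF", 0, b"GIF89a"),
    ("GIF", 0, b"GIF87a"),
]


def identify_format(data: bytes) -> Optional[str]:
    """Return the first format whose signature matches, or None if none does."""
    for name, offset, magic in SIGNATURES:
        if data[offset:offset + len(magic)] == magic:
            return name
    return None


def check_deposit(filename: str, data: bytes, manual_queue: list) -> Optional[str]:
    """Identify a deposited file; unidentified files raise an alert for manual checking."""
    fmt = identify_format(data)
    if fmt is None:
        # The registry may simply lack this format, so a person reviews the alert.
        manual_queue.append(filename)
    return fmt
```

Routing unidentified files to a queue rather than rejecting them reflects the point made above: a miss may mean the registry is incomplete, not that the file is invalid.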

Southampton and Oxford University archives
This ingest service will be integrated into the Eprints deposit process for two existing institutional archives, subject to prior satisfactory testing on pilot archives:
The institutional archive exemplar at Southampton produced by the TARDis project
Oxford University Eprints service
Critical issue
Judging the moment to transfer an Eprints-PRONOM enabled service from pilot archives to full working institutional archives. Pilot archives are a limited version of real archives, circumscribed in terms of users and content. This project will work with substantial real archives, but by this stage in their development it can be anticipated that these archives will be reaching levels of activity that make administrators wary of changes to interfaces and key services without convincing evidence of the reliability and integrity of the new services.

Trusted digital repositories
A trusted digital repository is one whose mission is to provide reliable, long-term access to managed digital resources to its designated community, now and in the future.
Some institutions … may choose to manage the logical and intellectual aspects of a repository while contracting with a third-party provider for digital file storage and maintenance. (RLG-OCLC 2002)

Working with the British Library
The project will build and test an exemplar OAI-based preservation service based on the digital preservation policies and practices of the British Library, a trusted digital repository.
This exemplar will use metadata harvested from preservation-participating institutional archives, and will be independent of the software used to build the archive, which could in principle be based on Eprints, DSpace, or other software.
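Harvesting metadata over OAI is what makes such a service independent of the repository software. The sketch below is a minimal illustration that assumes the archive exposes an OAI-PMH 2.0 interface; the function names are hypothetical, no network request is made, and the base URL is supplied by the caller. The `ListRecords` verb, the `oai_dc` metadata prefix and the XML namespace come from the OAI-PMH specification.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespace defined by OAI-PMH 2.0 for protocol responses.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}


def list_records_url(base_url: str, metadata_prefix: str = "oai_dc") -> str:
    """Build a ListRecords request URL for an archive's OAI-PMH base URL."""
    query = urlencode({"verb": "ListRecords", "metadataPrefix": metadata_prefix})
    return f"{base_url}?{query}"


def record_identifiers(response_xml: str) -> list:
    """Pull the record identifiers out of a ListRecords response document."""
    root = ET.fromstring(response_xml)
    return [el.text for el in root.findall(".//oai:header/oai:identifier", OAI_NS)]
```

The same two functions would work against any compliant archive, whether built on Eprints, DSpace or other software, because only the protocol is assumed.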

Future implications
The project will work with other JISC-approved projects in the JISC 4/04 programme, and in other JISC programmes, to create institutional responsibility for preservation planning, data management, archival storage and administration, effectively building a network of distributed and cooperating services based on the OAIS digital preservation reference model.

Conclusions
Preservation is about people. In an institutional archive, based on author self-archiving, preservation begins with the author.
Preservation will become an important component of Eprints, but Eprints will be only one component in a network of distributed and cooperating services based on the OAIS digital preservation reference model.
Eprints is well suited to this role: by conforming with OAI it can be part of a network of OAI-based preservation services that would make preservation an external service to institutional archives, as proposed by James et al. (2003) and others.
There may be tensions between the needs of eprints services and preservation requirements: different pace, timescales, chronology, and different selection criteria.
Institutional archives require immediacy and access. What matters for institutional archives is preservation of access.

Footnotes
Until the project has a Web site ( this presentation will be found from OR

References
Crow, R. (2004) "A Guide to Institutional Repository Software". Open Society Institute, v. 2.0, January
James, H., et al. (2003) Feasibility and Requirements Study on Preservation of E-Prints. JISC, October 29
Lavoie, B. F. (2004) Introduction to OAIS. Digital Preservation Coalition, Technology Watch Series Report 04-01, January
RLG-OCLC (2002) Trusted Digital Repositories: Attributes and Responsibilities. May
Wheatley, P. (2004) Institutional Repositories in the Context of Digital Preservation. Digital Preservation Coalition, Technology Watch Series Report 04-02, March

Credits
Southampton University: Les Carr, Jessie Hey, Steve Hitchcock, Pauline Simpson
National Archives: David Ryan, Adrian Brown
British Library: Richard Boulderstone
Oxford University: David Price