Developing a digital repository infrastructure for King’s College London RSP Training Day, 22 nd January 2009 Gareth Knight Centre for e-Research.

Slides:



Advertisements
Similar presentations
Preserv Preservation Eprint Services Simple Preservation Services – towards Proactive Support for the Institutional Repository.
Advertisements

IRRA DSpace April 2006 Claire Knowles University of Edinburgh.
Preserv: Preservation architecture and interface A brief overview of ideas wrt to the project plan For Preserv partners meeting, BL, London, 18th November.
1 Metadata Tools for JISC Digitisation Projects of still images and text Ed Fay BOPCRIS, Hartley Library University of Southampton.
Joint Information Systems Committee 11/03/07 | | Slide 1 Joint Information Systems CommitteeSupporting education and research JISC Conference 2007 Managing.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
Vital Implementation Update Vital Implementation Update 11 th January 2006 Paul Bevan – Glen Robson –
Web design Most digitisation projects are made available through Websites Effective Access depends on good web design Identify users and their information.
Interoperability and Preservation with the Hub and Spoke (HandS) Matt Cordial, Tom Habing, Bill Ingram, Robert Manaster University of Illinois Urbana-Champaign.
Interoperability and Preservation with the Hub and Spoke (HandS) Tom Habing, Bill Ingram, Robert Manaster University of Illinois Urbana-Champaign
Technical Framework Charl Roberts University of the Witwatersrand Source: Repositories Support Project (JISC)
Funded by: © AHDS Sherpa DP – a Technical Architecture for a Disaggregated Preservation Service Mark Hedges Arts and Humanities Data Service King’s College.
Depositing e-material to The National Library of Sweden.
R.Jantz, August 31, Two-day forum on PREMIS Preservation Metadata and the Trusted Digital Repositories August 31, September 1 National Library of.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
SOAPI: a flexible toolkit for implementing ingest and preservation workflows Mark Hedges Centre for e-Research, King’s College London Arts and Humanities.
Teula Morgan The Adaptable Repository: Swinburne Online Journals.
1 CS 502: Computing Methods for Digital Libraries Lecture 22 Repositories.
Durable Digital Repositories: The DSpace Project Bill Jordan University Libraries.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Demonstration of repositories Fedora (Flexible Extensible Digital Object Repository Architecture) Marie Lagerwall MIDESS Partners Meeting February 9, 2007.
OU Digital Library development project Liz Mallett – Project Manager James Alexander – Project Developer 25 January 2012.
Digital Asset Management for All? Visualising a Flexible DAMS Solution for Small and Medium Scale Institutions Paul Bevan Llyfrgell Genedlaethol Cymru.
Good practice in Research Data Management Module 6: Tools, training and support.
Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales
Long-term preservation aspects in the eSciDoc project Natasa Bulatovic Max-Planck Digital Library
Geoff Payne ARROW Project Manager 1 April Genesis Monash University information management perspective Desire to integrate initiatives such as electronic.
The DSpace Course Module – An introduction to DSpace.
Australian Partnership for Sustainable Repositories University of Sydney practices and test-bed projects, sustainability in a distributed.
Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Paul Bevan DAMS Implementation Manager
Supporting further and higher education The UK FAIR Programme: OAI in context Chris Awre OAI3, CERN, February 2004.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
IUScholarWorks is a set of services to make the work of IU scholars freely available. Allows IU departments, institutes, centers and research units to.
A disaggregated model for preservation of E-Prints Gareth Knight SHERPA DP Project Arts and Humanities Data Service.
EPrints 10 Years of Digital Preservation. What is EPrints For?  EPrints offers a safe, open and useful place to store, share and manage material in the.
HUB AND SPOKE TOOL SUITE PREMIS Implementation Fair – 7 October 2009 Bill Ingram Visiting Research Programmer University of Illinois at Urbana-Champaign.
DAMS Implementation at NLW DAMS Implementation at NLW 20 th February 2007 Paul Bevan
Implementing PTFS ArchivalWare at York St John University: a project under the JISC Repositories Start-up and Enhancement (SUE) strand Helen Westmancoat.
VIVO and Scholarly Repositories: Synergistic Opportunities.
Digital initiatives Digital Initiatives at the National Library of Wales 19 th April 2007 Paul Bevan
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
ARROW Institutional Repositories for Managing e-Theses Presentation to ETD September 2005 Geoff Payne, ARROW Project Manager.
A Fedora 3 to 4 Migration Case Study for UNSW Australia Library Fedora 4 Training Workshop, eResearch Australasia 2015, Brisbane UNSW Library Arif Shaon,
A Fedora 3 to 4 Migration Case Study for UNSW Australia Library Fedora 4 Training Workshop, eResearch Australasia 2015, Brisbane UNSW Library Arif Shaon,
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Research Data Management At the Smithsonian Using Sidora CNI December 10, 2013.
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
The NLW Digital Asset Management System Paul Bevan DAMS Implementation Manager
DSpace - Digital Library Software
Stellenbosch University Research Material Submitter Training Paulette Talliard Library and Information Service.
Digital Repositories: Concepts and Issues By Devendra. S. Gobbur (Sr) Assistant Librarian, Gulbarga University, Gulbarga. 10 NOV, NOV, 2009.
Significant Properties - where next?. 2 Curatorial role in SP Object analysis will enumerate technical properties and identify the purpose for each Stakeholder.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
Data Wrangling: Developing Local Best Practice for Born Digital Metadata Tracy Popp, Digital Preservation Coordinator Ayla Stein, Metadata Librarian University.
Eliot Wilczek University Records Manager Digital Collections and Archives Tufts University Institutional Repositories: Models & Approaches A NELINET Seminar.
Managing live digital content with DuraSpace services Bill Branan PASIG Spring 2015.
Meeting of the Member States Expert Group on Digitisation and Digital Preservation , Luxembourg European Archival Records and Knowledge Preservation.
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Metadata & Repositories Jackie Knowles RSP Support Officer.
Breeda Herlihy, IR Manager, UCC Library. UCC selected DSpace in 2008 Software selection group Staff from Library IT, Computer Centre, Special Collections,
Digital Library Storage using iRODS Data Grids Mark Hedges, Tobias Blanke Centre for e-Research, King’s College London Arts and Humanities Data Service.
Building Digital Archives Mark Phillips Cathy Hartman June 6, 2008.
Overview: Fedora Architecture and Software Features
VI-SEEM Data Repository
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
VI-SEEM Data Repository
York Digital Library – Images Case Study
Presentation transcript:

Developing a digital repository infrastructure for King’s College London RSP Training Day, 22 nd January 2009 Gareth Knight Centre for e-Research

2 Approach 1. Analyse existing practices & limitations of current system 2. Establish requirements for Information management & access 3. Investigate alternative approaches (software choice, extensibility, applicability to your data, use by others) 4. Prototype – smaller projects and experiments

3 Centre for e-Research CeRch ( is: A R&D department in Information Services and Systems (ISS) that performs: Management and preservation of research outputs from KCL researchers in all disciplines Research, teaching and consultancy on e-infrastructure, data curation and preservation and others. Formerly Arts & Humanities Data Service: Executive Management and preservation of research outputs from UK researchers in arts and humanities

4 Context: Existing approach Formal, but manual ingest procedures ‘Bespoke’ repository for data management Not scaleable – code could not easily be reapplied to other projects. Functional limitations Preservation, provenance metadata Limited delivery systems Collection-level identifiers (mostly) Diverse, semi-structured data

5 Requirements Persistent identifiers down to the level of individual datastreams, accommodating compound content models Versioning of content and metadata Automated processing and user input Able to integrate specialised third-party tools (e.g. format conversion) Preservation metadata management Audit trail/provenance metadata Standard distribution methods for specific content types (Disseminators)

6 What do we use Fedora for? Digital repository King’s Research Archive – An institutional repository for open access research papers written by King’s College London staff Virtual Research Environment (VRE) – supporting research management EIDER Project – Demonstrator for enhanced deposit and ingest Preservation Services: SOAPI – an architecture for (partially) automating preservation and ingest workflows in digital repositories SHERPA DP2 – developing preservation services for content located in disparate locations. Digitisation projects: Historical Hansard - Digitisation project scanning and markup of 50 years of debates from the Upper Chamber of the Northern Ireland Parliament from 1921 to 1972 East London Theatre Archive - Digitisation of 15,000 performing arts resources, from playbills and programmes to press cuttings and photographs from East London theatres

7 Capture & Ingest workflow Activities performed during Ingest

8 Metadata (1): Descriptive Each project has specific descriptive MD requirements: Scholarly Works Application Profile (SWAP) – created schema for IR Metadata Object Description Schema (MODS) – ELTA and SHERPA DP2 MarcXML – SHERPA DP2 Simple DC (various)

9 Metadata (2): SWAP

10 Metadata (3): Preservation Preservation: PREMIS Object PREMIS Event (forthcoming) Generated by DROID, JHOVE & others Rights: Rights MD Provided by Sherpa-Romeo

11 Metadata (4): Preservation Rights metadata provided by Sherpa Romeo Technical metadata provided by JHOVE

12 Data Capture (1): King’s research data Collection of King’s research data: Web interface for deposit Deposit via SWORD from desktop/web client Capture of metadata from Research Gateway, Web of Science and other sources.

13 Data Capture (2): Archiving services SHERPA DP2 provides archiving and preservation services for varied software repositories and web resources Content providers supported: Repositories: Fedora, CDS Invenio, DSpace, EPrints, DigiTool Website: Large dynamic sites (through Subversion), static sites. Capture methods OAI-PMH for metadata capture Data capture over HTTP/FTP and VPN.

14 Digitisation (1): East London Theatre Archive 15,000 digital objects – playbills, programmes, press cuttings and photographs. Object model representing 2 layers: Performance venue Item (3 manifestations of each image (high-quality, distribution, thumbnail) Each will contain MODS metadata Accessible through browse, search & Google maps-style UI

15 Digitisation (2): Historical Hansard 50 years of debates from the Upper Chamber of the Northern Ireland Parliament from 1921 to Separated into collection and volume. 45,100 items containing: Page images (3 manifestations of each image (high-quality, distribution, thumbnail) OCR’d text stored as XML Relationship MD UI: Experiment with Fez, Muradora, Vital, Existing Stormont

16 Lessons we have learnt… Understand your needs No one-size-fits-all approach Match requirements to functionality, not visa versa Implementation of a Fedora repository requires time No out-of-box solution, though likely to change in the near future Consider a long-term development plan. Some customisation may be required Consider future expansion plans Where do you want to be tomorrow? Don’t be intimidated Lots of features, but don’t need to use them all Possible to break implementation into well-defined stages Avoid reinventing the wheel Examine existing Fedora projects that may save development time. Develop code that can be repurposed to other project

17 Thank you! Gareth Knight Centre for e-Research