Presentation is loading. Please wait.

Presentation is loading. Please wait.

Richard Jones, Systems Developer Technical Issues for Repository Software Theses Alive! Edinburgh University Library SHERPA Nottingham.

Similar presentations


Presentation on theme: "Richard Jones, Systems Developer Technical Issues for Repository Software Theses Alive! Edinburgh University Library SHERPA Nottingham."— Presentation transcript:

1 Richard Jones, Systems Developer Technical Issues for Repository Software Theses Alive! Edinburgh University Library SHERPA Nottingham University Funded by JISC (Joint Information Systems Committee)

2 Richard Jones, Systems Developer Technical Issues for Repository Software My Role Within These Projects Evaluate, adapt and develop an open source package for use across the UK Produce an OAI-compliant E-Thesis repository Develop a pilot national service with the aim of supporting E-Theses creation and management for UK universities

3 Richard Jones, Systems Developer Technical Issues for Repository Software This Presentation What is an Institutional Repository? Common Popular Open-Source Packages Generic Software Issues Specific Repository Software Issues Final Remarks

4 Richard Jones, Systems Developer Technical Issues for Repository Software What is an Institutional Repository? A set of services that a university offers to the members of its community for the management and dissemination of digital materials created by the institution and its community members. Clifford Lynch Executive Director, Coalition for Networked Information (CNI)

5 Richard Jones, Systems Developer Technical Issues for Repository Software Common Popular Open-Source Packages DSpace (http://www.dspace.org/) MIT, HP, DSpace Federation EPrints.org (http://www.eprints.org/) University of Southampton Fedora (http://www.fedora.info/) University of Virginia, Cornell Univeristy ETD-db (http://scholar.lib.vt.edu/ETD-db/) Virginia Tech Endorsed by NDLTD for E-Theses

6 Richard Jones, Systems Developer Technical Issues for Repository Software Generic Software Issues (1) Support & Development Support from authors Documentation essential, mailing lists etc. Continued development Bug fixes, feature requests, minimal local development

7 Richard Jones, Systems Developer Technical Issues for Repository Software Generic Software Issues (2) System Architecture Modular architecture Easy to upgrade, develop and customise Appropriate programming languages Stable and appropriate database system Easy to integrate into current web services Templates and styles, using language standards (e.g. HTML/CSS, XML/XSLT)

8 Richard Jones, Systems Developer Technical Issues for Repository Software Generic Software Issues (3) System Security Authentication methods Authorisation methods Authenticate-able content Secure supporting systems Well-known, open security systems and coherent standard architectures

9 Richard Jones, Systems Developer Technical Issues for Repository Software Generic Software Issues (4) System Administration Coherent user administration Different types of user and user groups Granular, distributable administration Delegate areas of the system to different administrators Access policies

10 Richard Jones, Systems Developer Technical Issues for Repository Software Generic Software Issues (5) Additional Functionality Public API (Application Programming Interface) Providing additional services from the same code base Coherent internal data structuring

11 Richard Jones, Systems Developer Technical Issues for Repository Software Specific Repository Software Issues (1) System Architecture Web services protocols for data retrieval OAI-PMH, Z39.50, SRW/U, SOAP, OpenURL Appropriate database system PostgreSQL (open-source), Oracle (proprietary)

12 Richard Jones, Systems Developer Technical Issues for Repository Software Specific Repository Software Issues (2) System Security Authentication methods Most importantly: the one you use at your institution, with the option to insert your own Authorisation methods Integrate-able into current institutional information systems such as staff, student or course lists Authenticate-able content Provenance metadata, paper-trails, data checksums (e.g. MD5)

13 Richard Jones, Systems Developer Technical Issues for Repository Software Specific Repository Software Issues (3) System Administration Coherent user administration Granular administration system Possible administrator types: Collection admin, User admin, User Group admin, Structure admin, Database Content admin, System Administrator Licensing System Related to access policies, with separate submitter, institution and user licences, ideally with a time- dependent facility Access Policies Possible requirements: domain restrictions, time- dependent restrictions, partial restrictions

14 Richard Jones, Systems Developer Technical Issues for Repository Software Specific Repository Software Issues (4) Record Handling (1) Metadata Capture What metadata do you need? Flexible, appropriate schema (e.g. Qualified DC, ETD-MS (E-Theses), MARC21) Customisable Submission System Collects relevant metadata, and can be modified conditionally on the fly Ingest Methods Standard submission, batch import, harvesting (e.g. OAI-PMH (metadata only)), customised insert using native API

15 Richard Jones, Systems Developer Technical Issues for Repository Software Specific Repository Software Issues (5) Record Handling (2) Extract Methods Native viewing system, batch export, metadata cross- walk, harvest (e.g. OAI-PMH (metadata only)), customised extract using API Item Wrappers Multiple files, multiple metadata records/schemas, internal structure mapping (e.g. METS, DIDL)

16 Richard Jones, Systems Developer Technical Issues for Repository Software Specific Repository Software Issues (6) Digital Preservation (1) Persistent Identifiers Some available systems: Handle, PURL, URN, DOI, ARK Migration On Ingest (migrate submission to open format), or on request (preserve migration tool) Viewers Tools to render the format are preserved Emulation The original viewer is emulated in the new system

17 Richard Jones, Systems Developer Technical Issues for Repository Software Specific Repository Software Issues (7) Digital Preservation (2) Universal Virtual Computer (UVC) On Ingest (migrate submission to open format), or on request (preserve migration tool) Representation Information Metadata regarding the representation of the file format Global Digital Format Registry (GDFR) Typed Object Model (TOM) Wheatley, P A way forward for developments in the digital preservation functions of DSpace: options, issues and recommendations (http://dspace.org/news/readings.html)

18 Richard Jones, Systems Developer Technical Issues for Repository Software Specific Repository Software Issues (8) Additional Functionality Coherent data structuring An internal structure that can represent your institution in one or more overlaying schemas Native Browse Hierarchical browsing, filtering by structure and metadata; aids indexing by search engines Native Search Constrained search locations, using browse functionality to display results Full Text Indexing Public API (Application Programming Interface) Creating Portal-like services within the institution

19 Richard Jones, Systems Developer Technical Issues for Repository Software Final Remarks No systems yet deal with all issues Some good development work ongoing with the various packages Not all issues need to be solved: To provide an Institutional Repository For your institution The Institutional Repository is still in its infancy, and may not mature for another 10 years There are significant policy and community issues that also need to be addressed.

20 Richard Jones, Systems Developer Technical Issues for Repository Software Thanks for Listening Richard Jones JISC: This presentation:


Download ppt "Richard Jones, Systems Developer Technical Issues for Repository Software Theses Alive! Edinburgh University Library SHERPA Nottingham."

Similar presentations


Ads by Google