1 Metadata Tools for JISC Digitisation Projects of still images and text Ed Fay BOPCRIS, Hartley Library University of Southampton.

Slides:



Advertisements
Similar presentations
Richard Jones, Systems Developer Technical Issues for Repository Software Theses Alive! Edinburgh University Library SHERPA Nottingham.
Advertisements

IRRA DSpace April 2006 Claire Knowles University of Edinburgh.
Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew.
METS Awareness Training An Introduction to METS Digital libraries – where are we now? Digitisation technology now well established and well-understood.
UKOLN is supported by: Put functionality Augmenting interoperability across scholarly repositories 20/21 April 2006 Rachel Heery, UKOLN, University of.
Configuration management
Configuration management
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
Vital Implementation Update Vital Implementation Update 11 th January 2006 Paul Bevan – Glen Robson –
Standards showcase: MODS, METS, MARCXML ALA Annual 2006 Rebecca Guenther and Jackie Radebaugh Network Development and MARC Standards Office Library of.
Information Professionals and Learning Object Repositories … more than just metadata quality … Sarah Currier Stòr Cùram Project Librarian JISC X4L Repository.
Robert Sharpe, Operations Director METS in heterogeneous digital repositories.
Services Digitisation & Content Management. 600 People – India.
DRS 2 Metadata Migration June 25, Agenda Introduction Preliminary results - content analysis Metadata options Next steps Questions.
Interoperability and Preservation with the Hub and Spoke (HandS) Matt Cordial, Tom Habing, Bill Ingram, Robert Manaster University of Illinois Urbana-Champaign.
Interoperability and Preservation with the Hub and Spoke (HandS) Tom Habing, Bill Ingram, Robert Manaster University of Illinois Urbana-Champaign
DIGITIZATION OF LOCAL HISTORY COLLECTIONS IN PUBLIC LIBRARY “VLADISLAV PETKOVIC DIS” IN CHACHAK: DIGITIZATION OF THE NEWSPAPER “THE VOICE OF CHACHAK” Bogdan.
Joachim Bauer Senior System Engineer, CCS
1 Uppsala University Library Eva Müller Peter Hansson Stefan Andersson Uwe Klosa Electronic Publishing Centre Krister Östlund Waller project.
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
R.Jantz, August 31, Two-day forum on PREMIS Preservation Metadata and the Trusted Digital Repositories August 31, September 1 National Library of.
Merrilee Proffitt e(X)literature / Digital Cultures Project April 2003 News from the Digital Library The Metadata Encoding and Transmission Standard; the.
PREMIS What is PREMIS? o Preservation Metadata Implementation Strategies When is PREMIS use? o PREMIS is used for “repository design, evaluation, and archived.
AIP Archival Information Package – Defines how digital objects and its associated metadata are packaged using XML based files. METS (binding file) MODS.
WMS: Democratizing Data
THE RUTGERS WORKFLOW MANAGEMENT SYSTEM Mary Beth Weber Cataloging and Metadata Services Rutgers University Libraries August 3, 2007.
Archival Prototypes and Lessons Learned Mike Smorul UMIACS.
Incompatible or Interoperable? A METS bridge for a small gap between two digital preservation software packages Lucas Mak Metadata & CatalogLibrarian
The British Library’s METS Experience The Cost of METS Carl Wilson
OCLC Online Computer Library Center OCLC’s Digital Archive – Disseminating with METS Jay Goodkin Software Engineer Digital Collection and Preservation.
Port Townsend Leader Historical Newspaper Archive Keith Darrock.
Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
1 The Universal Object Format - A METS Profile for an archiving and exchange format for digital objects.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
The DigiTool to FDA Program Lydia Motyka Florida Center for Library Automation.
Mass digitisation? Astrid Verheusen Projectmanager Research & Development Division National library of the Netherlands LIBER-EBLIDA Workshop on Digitisation.
Metadata Considerations Implementing Administrative and Descriptive Metadata for your digital images 1.
An Introduction to METS Morgan Cundiff Network Development and MARC Standards Office Library of Congress Metadata Encoding and Transmission Standard.
Image Workflow Processes Elspeth Haston, Robert Cubey, Martin Pullan & David J Harris.
Preservation Audio Using METS: The Sound Directions Project Robin Wendler Harvard University Library 7 May 2007.
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
The FCLA Digital Archive Joint Meeting of CSUL Committees, 2005.
International Seminary on Digitisation: Experience and Technology 11 th May 2004 | National Library | Lisbon – Portugal DIGITAL ARCHIVE OF PORTUGUESE ART.
Selene Dalecky March 20, 2007 FDsys: GPO’s Digital Content System.
A Multi-Tiered Architecture for Distributed Data Collection and Centralized Data Delivery Stacy Kowalczyk and James Halliday April 28, 2008.
National Library of Finland Metadata in the Digitisation Process Cultural unity and diversity of the Baltic Sea Region – common history, different languages,
Feb 21-25, 2005ICM 2005 Mumbai1 Converting Existing Corpus to an OAI Compliant Repository J. Tang, K. Maly, and M. Zubair Department of Computer Science.
DAITSS and the Florida Digital Archive Priscilla Caplan Florida Center for Library Automation iPRES 2006.
NLW. Object Classes Class 1  1 MARC Record  1 Image  No METS Class 2  1 MARC Record  Many images  No METS Class 3  1 MARC Record  Many.
Santi Thompson - Metadata Coordinator Annie Wu - Head, Metadata and Bibliographic Services 2013 TCDL Conference Austin, TX.
Lifecycle Metadata for Digital Objects The Final Curtain December 4, 2006.
Repository-specific Spoke Scripts Content Repository JSR-170/283 Content Repository for Java Technology API Normalized H&S METS Files METS Import/ExportMETS.
Automation Living in a Paper Oriented World and The Steps to Automation.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
Meeting of the Member States Expert Group on Digitisation and Digital Preservation , Luxembourg European Archival Records and Knowledge Preservation.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Joint Meeting of CSUL Committees,
Jim Tuttle North Carolina State University Libraries
DAITSS and the Florida Digital Archive
Joseph JaJa, Mike Smorul, and Sangchul Song
Integrating PREMIS and METS
Library Technology Conference: Building Exhibits
Storage Basic recommendations:
DIGITAL ARCHIVES Into the Light
Office Edition Overview (Dec. 2018).
Presentation transcript:

1 Metadata Tools for JISC Digitisation Projects of still images and text Ed Fay BOPCRIS, Hartley Library University of Southampton

2 Overview: BOPCRIS today Move to work natively with standards Interoperability Preservation Design project procedures from ground up with metadata in mind File-naming and directory structuring Metadata capture processes Production workflow that automates where possible Minimize possibility for human error / subjectivity Final package of digital object that records preservation information on the digital shelf and aims for maximum interoperability between systems, all in one place

3 Overview: technical details File-naming / directory structure Incorporating project-specific unique ids Final package (digital object) Internally consistent tarball [*.TAR] Relative path-naming conventions METS wrapper Extension formats for metadata: descriptive (MODS); technical (MIX); process (PREMIS) Production workflow Automated production of final package Metadata recording Dynamic input by scanner operators

4 History Eighteenth Century Parliamentary Papers Project under Phase 1 of JISC Digitization Programme Proprietary system and data formats (Agora) Manual input of metadata o Descriptive and Structural Advantages and Disadvantages

5 History: Advantages Proprietary system with advanced functionality: OCR workflow Web presentation Highly customizable Metadata fields specified and modified at will

6 History: Disadvantages Non-standard metadata fields No mapping to standard formats difficulties: interoperability; metadata harvesting Translation Between systems, or between use and archive formats introduces possibility of versioning issues No scope for preservation metadata Separation between workflow / presentation system and preservation strategy Resulted in disparate collection of scripts and tools to manage data

7 Present: Metadata Standards Bibliographic database export File-system level Directory structure File-naming conventions Scanning level TIFF headers Additional descriptive metadata METS profile Tailored to project needs Extension formats (MODS, MIX, PREMIS) Checksums (MD5)

8 Present: Metadata Origins Scanned Images TIFF headers METS OCR (Agora / ABBYY) MIX (Z39.87) File-naming Directory structure (TAR) Other metadata Process Additional descriptive PREMIS Bibliographic Metadata MARC21 / MODS / etc. File formats TIFF master / Derived JPEG Flat text (TXT) & Word-co-ordinated OCR Custom dmdSec PRECURSORS GENERATED

9 Future One tool for entire process, from scanned images to METS Tool would: Extract technical metadata Include descriptive metadata Build flat-structure METS Tool would require: File-naming, directory-structuring conventions Image file sources

10 Future: Advantages Abstraction = standardization All digitization projects will produce metadata in similar formats interoperability Certain technical base-standards will be present preservation Any centrally developed preservation or presentation systems would be able to ingest output from any project Saves wasted effort developing similar solutions many times, when one solution can be developed once and adapted

11 Future: Questions… Usefulness of such a tool? Relevance to your project? Problems / obstacles? How much flexibility is necessary? Manual input / editing? Main points: Abstraction, functionality, flexibility

12 Further information Ed Fay, Software Developer BOPCRIS, Hartley Library University of Southampton