BitstreamFormat Renovation: DSpace Gets Real Technical Metadata.


Benefits (Why Should I Care?)
The new format identifier corrected or fixed unidentified data formats of 858 Bitstreams in
How many are mis-identified in your repository?
Accurate MIME-types improve delivery to Web clients
Quality preservation requires accurate data format knowledge
Interoperability with internal and external tools relies on correct technical metadata in commonly-recognized standards
Without automated tools, maintenance of format technical metadata is a tedious manual job for repository managers
BitstreamFormat Renovation Prototype

Data Formats
A “Data Format” is defined as: Technical Metadata that describes how abstract information is encoded and structured in a digital document.
“Abstract Information” refers to the actual intellectual content contained in the digital object.

Problems with Current Format Technical Metadata
Formats are identified with arbitrary names, which hinders interoperability
There is no means to collect additional format technical metadata, e.g. format specification documents
Identifying formats only by filename extension is imprecise and unreliable
The current internal format model is inflexible
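
To illustrate why identification by filename extension alone is unreliable, here is a minimal sketch that compares an extension lookup with a check of the file's leading bytes. The lookup tables and function names are hypothetical and exist only for this example; they are not part of DSpace, DROID, or JHOVE, and a real registry holds far more signatures.

```python
import os

# Hypothetical lookup tables for illustration only.
EXTENSION_TO_MIME = {
    ".pdf": "application/pdf",
    ".png": "image/png",
    ".txt": "text/plain",
}

# Leading "magic" bytes of two well-known formats.
SIGNATURE_TO_MIME = [
    (b"%PDF-", "application/pdf"),
    (b"\x89PNG\r\n\x1a\n", "image/png"),
]


def identify_by_extension(filename):
    """Guess a MIME type from the filename extension only."""
    _, ext = os.path.splitext(filename.lower())
    return EXTENSION_TO_MIME.get(ext)


def identify_by_signature(data):
    """Guess a MIME type from the first bytes of the content."""
    for magic, mime in SIGNATURE_TO_MIME:
        if data.startswith(magic):
            return mime
    return None


# A PDF that was uploaded under a misleading name.
mislabeled_name = "report.txt"
mislabeled_bytes = b"%PDF-1.4\n"

print(identify_by_extension(mislabeled_name))   # text/plain (wrong)
print(identify_by_signature(mislabeled_bytes))  # application/pdf
```

The extension says "plain text" while the content is a PDF, which is the kind of mismatch the slide describes.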

Terms
Format Registry
–PRONOM
–GDFR
Identification Plugins
–DROID
–JHOVE
Interoperable Format Identifiers
–MIME Type
–PUID (PRONOM Unique IDentifier)

Object-Model Architecture Connected to PRONOM

Identification of Two BitstreamFormat Types

The Local “DSpace” and Provisional Registries

Interface to External Registries
Get Synonyms
–Returns a list of identifiers that are bound to the same format record
Import
–Turns an external format description into a new BitstreamFormat entry, initializing its metadata fields from the external registry
Update
–Refreshes the metadata fields of a BitstreamFormat to keep up with changes
ConformsTo
–Tests whether the format described by one identifier “conforms to” or is a sub-type of another format
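
The four operations on this slide can be sketched as a small in-memory registry. This is an illustration under stated assumptions, not the prototype's actual code (DSpace itself is written in Java): the class names, method names, record layout, and the "sample/…" identifier values are all hypothetical, and the sample values were not taken from PRONOM.

```python
from dataclasses import dataclass, field


@dataclass
class BitstreamFormat:
    """Hypothetical local format entry; identifiers are (namespace, value) pairs."""
    name: str
    description: str = ""
    identifiers: set = field(default_factory=set)


class InMemoryRegistry:
    """Toy stand-in for an external format registry."""

    def __init__(self, records):
        # records: key -> {"name", "description", "identifiers", "parent"}
        self._records = records

    def _key_for(self, identifier):
        for key, rec in self._records.items():
            if identifier in rec["identifiers"]:
                return key
        raise KeyError(identifier)

    def get_synonyms(self, identifier):
        """Other identifiers bound to the same format record."""
        rec = self._records[self._key_for(identifier)]
        return sorted(rec["identifiers"] - {identifier})

    def import_format(self, identifier):
        """Create a new local entry initialized from the registry record."""
        rec = self._records[self._key_for(identifier)]
        return BitstreamFormat(rec["name"], rec["description"], set(rec["identifiers"]))

    def update(self, fmt):
        """Refresh a local entry (it must carry at least one identifier)."""
        rec = self._records[self._key_for(next(iter(fmt.identifiers)))]
        fmt.name, fmt.description = rec["name"], rec["description"]
        fmt.identifiers = set(rec["identifiers"])
        return fmt

    def conforms_to(self, identifier, other):
        """True if `identifier` names the same format as `other` or a sub-type of it."""
        key, target = self._key_for(identifier), self._key_for(other)
        while key is not None:
            if key == target:
                return True
            key = self._records[key]["parent"]
        return False


registry = InMemoryRegistry({
    "pdf": {"name": "PDF", "description": "PDF, any version",
            "identifiers": {("MIME", "application/pdf"), ("PUID", "sample/1")},
            "parent": None},
    "pdf-1.4": {"name": "PDF 1.4", "description": "PDF version 1.4",
                "identifiers": {("PUID", "sample/2")},
                "parent": "pdf"},
})

fmt = registry.import_format(("PUID", "sample/2"))
print(fmt.name)                                                # PDF 1.4
print(registry.get_synonyms(("MIME", "application/pdf")))      # [('PUID', 'sample/1')]
print(registry.conforms_to(("PUID", "sample/2"), ("MIME", "application/pdf")))  # True
```

Modelling identifiers as namespace and value pairs lets a MIME type and a PUID point at the same record, which is what makes the synonym lookup possible.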

The Bitstream Format Metadata Admin Panel

Importing New Bitstream Formats

Editing BitstreamFormat Metadata

Digital Preservation Strategies
Pluggable architecture allows for access to external identification and technical metadata tools
Access and preservation rely on accurate format identification
Migration / Obsolescence tools are only effective with correct and precise identification, because format versions matter
The creation of derivatives (e.g. thumbnails or delivery versions) via MediaFilter will also rely on accurate identification
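
One way to picture a pluggable identification architecture is a chain of plugins tried in order, with the first confident answer winning. The sketch below is an assumption about the general pattern, not the prototype's design: the plugin functions are toy stand-ins written for this example and do not call DROID or JHOVE.

```python
from typing import Callable, Optional, Sequence

# A plugin takes (filename, content bytes) and returns a format name or None.
IdentifierPlugin = Callable[[str, bytes], Optional[str]]


def signature_plugin(name: str, data: bytes) -> Optional[str]:
    """Toy content-based plugin: recognizes PDF and reports its version."""
    if data.startswith(b"%PDF-"):
        # The header looks like b"%PDF-1.4"; bytes 5..7 hold the version.
        version = data[5:8].decode("ascii", errors="replace")
        return f"PDF {version}"
    return None


def extension_plugin(name: str, data: bytes) -> Optional[str]:
    """Toy fallback plugin: trusts the filename extension."""
    if name.lower().endswith(".txt"):
        return "Plain Text"
    return None


def identify(name: str, data: bytes, plugins: Sequence[IdentifierPlugin]) -> str:
    """Ask each plugin in order; the first non-None answer wins."""
    for plugin in plugins:
        result = plugin(name, data)
        if result is not None:
            return result
    return "Unknown"


plugins = [signature_plugin, extension_plugin]  # most precise plugin first
print(identify("thesis.txt", b"%PDF-1.4\n", plugins))  # PDF 1.4
print(identify("notes.txt", b"hello", plugins))        # Plain Text
print(identify("blob.bin", b"\x00\x01", plugins))      # Unknown
```

Ordering the plugins from most to least precise keeps version information (here "1.4") whenever a content-based plugin can supply it, which matters for the migration tools mentioned on the slide.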

Interoperability Benefits
Avoids platform lock-in
Reliable delivery functionality
Consistent object description semantics (ORE)
Interoperability with digital preservation services

Quantitative Results
Before:
–1,020 Unidentified (0.65%)
After:
–162 Unidentified (0.104%)
(155,000 Bitstreams)
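
The figures on this slide can be cross-checked with a few lines of arithmetic. The variable names are chosen for this example; the three input numbers come from the slide.

```python
total_bitstreams = 155_000
unidentified_before = 1_020
unidentified_after = 162

# Bitstreams whose format was newly identified.
newly_identified = unidentified_before - unidentified_after

pct_before = 100 * unidentified_before / total_bitstreams
pct_after = 100 * unidentified_after / total_bitstreams

print(newly_identified)                          # 858
print(f"{pct_before:.3f}% -> {pct_after:.3f}%")  # 0.658% -> 0.105%
```

The difference of 858 matches the count quoted under Benefits, and the slide's 0.65% and 0.104% appear to be these percentages truncated rather than rounded.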
