PREMIS Update Rebecca Guenther Library of Congress PREMIS Implementation Fair Vienna, Austria 22 September 2010.

Slides:



Advertisements
Similar presentations
Applying preservation metadata to repositories For JISC KeepIt course on Digital Preservation Tools for Repository Managers Module 3, Primer on preservation.
Advertisements

The PREMIS Data Dictionary Michael Day Digital Curation Centre UKOLN, University of Bath JORUM, JISC and DCC.
United Nations Statistics Division
Implementing PREMIS in Container Formats Rebecca Guenther, Library of Congress Zhiwu Xie, Los Alamos National Laboratory IS&T’s.
TIPR: Repository Exchange Package Use Cases and Best Practices Joseph Pawletko and Priscilla Caplan IS&T Archiving 2011.
PREMIS Conformance Brian Lavoie Research Scientist OCLC PREMIS Implementation Fair San Francisco, CA October 7, 2009.
PREMIS Conformance. Agenda 1.NLNZ and NLB conformance exercise 2.History of PREMIS Conformance 3.Current status 4.Mapping to functionality.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
Understanding and Implementing the PREMIS Data Dictionary for Preservation Metadata Rebecca Guenther, Library of Congress Digital Preservation Partners’
PREMIS in Thought: Data Center for LC Digital Holdings Ardys Kozbial, Arwen Hutt, David Minor February 11, 2008.
Mark Evans, Tessella Digital Preservation Boot Camp – PASIG meeting, Washington DC, 22 nd May 2013 PREMIS Practical Strategies For Preservation Metadata.
3. Technical and administrative metadata standards Metadata Standards and Applications.
InterPARES Project Joanne Evans, School of Information Management and Systems, Monash University Description Cross-domain Description Cross Domain - Metadata.
Merrilee Proffitt e(X)literature / Digital Cultures Project April 2003 News from the Digital Library The Metadata Encoding and Transmission Standard; the.
US GPO AIP Independence Test CS 496A – Senior Design Fall 2010 Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong.
Automatic Evaluation of Migration Quality in Distributed Networks of Converters Miguel Ferreira Supervisors Ana Alice Baptista.
PREMIS What is PREMIS? o Preservation Metadata Implementation Strategies When is PREMIS use? o PREMIS is used for “repository design, evaluation, and archived.
AIP Archival Information Package – Defines how digital objects and its associated metadata are packaged using XML based files. METS (binding file) MODS.
Descriptive Metadata o When will mods.xml be used by METS (aip.xml) ?  METS will use the mods.xml to encode descriptive metadata. Information that describes,
A Registry for controlled vocabularies at the Library of Congress
Metadata : Setting the Scene or a Basic Introduction Wendy Duff University of Toronto, Faculty of Information Studies.
US GPO AIP Independence Test CS 496A – Senior Design Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong Faculty advisor: Dr. Russ.
Rebecca Guenther Library of Congress
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
Profiling Metadata Specifications David Massart, EUN Budapest, Hungary – Nov. 2, 2009.
How to build your own Dark Archive (in your spare time) Priscilla Caplan FCLA.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan, Florida Center for Library Automation DCC Workshop on Long-term Curation within Digital Repositories.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
Moving from a locally-developed data model to a standard conceptual model Jenn Riley Metadata Librarian Indiana University Digital Library Program.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
The FCLA Digital Archive Joint Meeting of CSUL Committees, 2005.
Implementation of PREMIS in METS Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
PREMIS Controlled vocabularies Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
Habing1 Integrating PREMIS and METS PREMIS Tutorial Implementers’ Panel June 21, 2007, 9:00-5:30 Library of Congress, Jefferson Building, Whittall.
OCLC Online Computer Library Center Preservation Metadata Standards PREMIS & METS Taylor Surface, OCLC.
Conceptual Data Modelling for Digital Preservation Planets and PREMIS Angela Dappert.
The State of PREMIS Brian Lavoie Research Scientist OCLC PREMIS Implementation Fair San Francisco, CA October 7, 2009.
METS Application Profiles Morgan Cundiff Network Development and MARC Standards Office Library of Congress.
PREMIS at the British Library Markus Enders, The British Library PREMIS Implementation Fair, San Fransisco, CA 07 October 2009.
RECORDKEEPING METADATA STANDARDS: THE INTERNATIONAL CONTEXT Barbara Reed, Director, Recordkeeping Innovation.
PREMIS Data Dictionary and the Future of Preservation Metadata Brian Lavoie Research Scientist OCLC Research Society of American Archivists.
AGENTS, RIGHTS, EVENTS. Agents  The Agent entity aggregates information about agents (persons, organizations, or software) associated with rights management.
Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra.
DAITSS and the Florida Digital Archive Priscilla Caplan Florida Center for Library Automation iPRES 2006.
PREMIS Controlled vocabularies Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair Vienna,
Florida Digital Archive PREMIS and DAITSS. Florida Digital Archive.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
An Introduction to PREMIS Jenn Riley Metadata Librarian IU Digital Library Program.
RSC Strategy and RDA Internationalization Gordon Dunsire, Chair, RDA Steering Committee Presented at Selmathon 2, 10 May 2016, Stockholm, Sweden.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
1 The XMSF Profile Overlay to the FEDEP Dr. Katherine L. Morse, SAIC Mr. Robert Lutz, JHU APL
Joint Meeting of CSUL Committees,
RSC Strategy Gordon Dunsire, Chair, RDA Steering Committee
DAITSS: Dark Archive in the Sunshine State
DAITSS and the Florida Digital Archive
RECORDKEEPING METADATA STANDARDS: THE INTERNATIONAL CONTEXT
Module 6: Preparing for RDA ...
Integrating PREMIS and METS
PREMIS Tools and Services
Accommodating local cataloguing traditions in a global context
Metadata in Digital Preservation: Setting the Scene
A Tale of Two Archives: Notes from the Dark Side
The new RDA: resource description in libraries and beyond
Presentation transcript:

PREMIS Update Rebecca Guenther Library of Congress PREMIS Implementation Fair Vienna, Austria 22 September 2010

Overview  Editorial Committee membership  What's new since the last PREMIS Implementation Fair (iPRES 2009)  PREMIS Data Dictionary and schema revision process  Changes to the Data Dictionary in process Schema changes for extensibility Data Dictionary version 2.1  PREMIS conformance  Today’s agenda

PREMIS timeline PREMIS Data Dictionary released Maintenance Activity formed PREMIS Working Group formed Metadata Framework For Digital Preservation PREMIS Editorial Committee formed PREMIS 2.0 released PREMIS Implementation Fairs 2010

The State of PREMIS  de facto standard for preservation metadata; in some countries mandated for cultural heritage repositories  PREMIS implementations are appearing in many places, many contexts, many forms  Some experimentation is leading to changes in the data dictionary and schema  PREMIS Implementation fairs: attempts to consolidate implementation experiences, issues, best practices,

PREMIS Editorial Committee membership  Rebecca Guenther, Chair (Library of Congress)  Yair Brama (ExLibris)  Karin Bredenberg (Riksarkivet, Swedish National Archives)  Priscilla Caplan (Florida Center for Library Automation)  Angela Dappert (British Library)  Angela Di Iorio (Fondazione Rinascimento Digitale)  Markus Enders (British Library)  Noreen Hill (Library and Archives Canada)  Karsten Huth (Sächsisches Staatsarchiv)  David Lake (US National Archives and Records Administration)  Brian Lavoie (OCLC)  Sally Vermaaten (Statistics New Zealand)  Robert Wolfe (MIT/DSpace)  Kate Zwaard (US Government Printing Office)

PREMIS Implementation Fair at iPres 2009  State of PREMIS  Tools PREMIS in METS Toolkit Univ. of Illinois Hub and Spoke toolkit Statistics New Zealand toolkit  Systems ExLibris Rosetta DAITSS  Potential data model changes  Case studies: implementations  Discussion How to store environment information Storing auxiliary files Exchange

What’s new: PREMIS activities  Integration with other standards and efforts Survey of PREMIS in METS profiles (DLib magazine Sept 2010) Extensibility: Add elements about extensions as in METS US intelligence community extending for security classification  PREMIS Documentation Understanding PREMIS: Priscilla Caplan (2009) Gentle introduction to the PREMIS standard Spanish, German and Italian translations PREMIS Data Dictionary for Preservation Metadata version 2.0: translation in Japanese  Workflows and registries PREMIS Tools to facilitate automated workflows: PREMIS in METS toolkit made available as open source PREMIS controlled vocabularies in id.loc.gov

PREMIS Data Dictionary and Schema Revision Process  Send change request for consideration by the PREMIS Editorial Committee via Web form or on pigpen wiki  Non-substantive changes will be documented on change page on PREMIS website  Substantive changes will be brought to the PREMIS Implementers’ group  Editorial Committee will discuss within 2 months  Decisions made Changes made no more than twice a year Published as addendum to Data Dictionary and/or in revision of XML schema Community will be informed about changes with reasons made

Changes to Data Dictionary in process (version 2.1)  Correct links  Add linking semantic units from Agent Entity to Events and Rights: linkingEventIdentifier linkingRightsStatementIdentifier  Corrections of errors, clarify ambiguous areas  Make storage optional  New agent semantic units  Revision of extension element notes to indicate new attributes  New Agent semantic units: agentNote, agentExtension

Schema changes for extensibility  Add information about extension points modeled after METS Allow for wrapping or reference of PREMIS metadata Other attributes: CREATED, STATUS, ID, CHECKSUM, Location type  Include information about metadata type MDTYPE, OTHERMDTYPE, MDTYPEURI  Additional work Coordinate with METS Editorial Board Define controlled values in id.loc.gov Revise PREMIS in METS guidelines Revise notes in Data Dictionary  Draft schema ready to go out for review

Intellectual entities  Has been out of scope and only described by an identifier in PREMIS 1.0 and 2.0  Development of use cases for giving information about intellectual entities  Consideration of how to implement: as another level of object or a separate entity?

Use cases for describing intellectual entities  Represent a collection, FRBR work, FRBR expression, fonds, series, files (in the archival sense) in order to capture descriptive metadata to have business requirements associated with them or to be referenced in business requirements (such as significant characteristics, risk definitions, guidelines for preservation actions, etc.) structural and derivative relationships rIghts information events and agents  Capture versioning information and metadata update events for intellectual lEntities like articles and issues

Adding semantic units for Intellectual Entities  Will be added as another level of object  Advantages to this approach: Data dictionary will be more compact Simplify the dictionary by dropping links such as linkingIntellectualIdentifier Could directly attach to events, agents and indirectly rights to intellectual entities  Next steps Present to PREMIS Implementers’ Group for review Revise Data Dictionary and schema

PREMIS conformance  Experience in implementation, managing, and using PREMIS semantic units growing Corresponding need to cultivate deeper understanding of what it means to be “PREMIS conformant”  Need new conformance statement that is more detailed and more actionable Detailed: precise definition of what conformance means in light of emerging use cases; Actionable: of practical use as resource for assessing conformance of a given PREMIS implementation  Subgroup within PREMIS Editorial Committee formed Brian Lavoie, Rebecca Guenther, Priscilla Caplan, Angela Dappert, Sally Vermaaten, Yair Brama

Some “use cases” for PREMIS conformance  Inter-repository data exchange e.g., TIPR project  Repository certification e.g., TRAC  Shared Registries e.g., PRONOM, Unifed Digital Formats Registry  Automated workflows/reusable tools e.g., SIP/AIP processing  Vendor support e.g., ExLibris Rosetta

New PREMIS conformance statement  Establish conditions required for conformance: Articulate what implementers must do to assert PREMIS conformance  Describe “degrees of freedom” associated with conformance: Identify areas of implementation decision-making where implementers are free to make their own choices while still remaining conformant

1. Establish conditions required for conformance  Organize, amplify, and extend conformance conditions set forth in Data Dictionary v1.0 and v2.0  Define conformance from multiple perspectives: Level of semantic unit Level of Data Dictionary Internal to repository Inter-repository exchange (import and export)  Provide examples of conformance & non- conformance

Examples of conformance: semantic unit  Conformant: A repository uses a relational database system with an Objekteigenschaften table and establishes in the system documentation that Objekteigenschaften shares the definition of the PREMIS semantic unit objectCharacteristics.  Non-conformant: A repository implements a metadata element objectCategory that records information defined in PREMIS semantic units objectCategory and preservationLevel.

Examples of conformance: Data Dictionary  Conformant: A repository that is conformant in regard to Objects also wants to record information about Events; therefore, it implements metadata elements that, at the minimum, capture all of the information specified in the semantic units eventIdentifier, eventType, and eventDateTime.  Non-conformant: The information a repository records about Events does not include information that corresponds to the PREMIS semantic unit eventType

Internal and external conformance  Internal: A repository that satisfies the Principles of Use at both the semantic unit and Data Dictionary levels is considered internally conformant.  External (import): A repository that is import conformant must be able to accept PREMIS-conformant information in the form provided by another repository, parse it, and allocate the information to its corresponding metadata elements in the local repository system, as well as associate it with the appropriate Entities.  External (export): A repository that is export conformant must be able to extract PREMIS-conformant information from its local system, and provide it to another repository in an agreed-upon form, and associate it with its appropriate Entity.

2. Degrees of freedom  Naming Repository is free to implement semantic units using names different from those defined in Data Dictionary  Granularity Repository is free to distribute information defined in a semantic unit across as many metadata elements as it chooses  Level of Detail Repository is free to record more detailed information for a semantic unit than what is defined in Data Dictionary  Explicit Recording of Information Repository is not required to explicitly record information for an implemented semantic unit (but information must be recoverable in some way when needed)  Use of Controlled Vocabularies Repository is free to use (or not use) controlled vocabularies. If repository uses controlled vocabularies, it can use either internally-defined or external/standardized vocabularies

Next steps for conformance  Collect feedback on draft conformance statement from PIG List & PREMIS Implementation Fair participants  Finalize draft for approval by PREMIS Editorial Committee  Post final version on Maintenance Activity Web site

Today’s topics  Data modeling Comparison between PREMIS and PLANETS data models PREMIS OWL ontology  PREMIS in interchange Towards Interoperable Preservation Repositories (TIPR) (Priscilla Caplan, Florida Center for Library Automation) ARTAT (Angela Di Iorio, Fondazione Rinascimento Digitale)  PREMIS controlled vocabularies PREMIS vocabulary service PREMIS events in HathiTrust  Open discussion