Presentation is loading. Please wait.

Presentation is loading. Please wait.

Implementation of PREMIS in METS Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.

Similar presentations


Presentation on theme: "Implementation of PREMIS in METS Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San."— Presentation transcript:

1 Implementation of PREMIS in METS Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress rgue@loc.gov PREMIS Implementation Fair San Francisco, CA October 7, 2009

2  METS records the (possibly hierarchical) structure of digital objects, the names and locations of the files that comprise those objects, and the associated metadata  A METS document may be a unit of storage (e.g. OAIS AIP) or a transmission format (e.g. OAIS SIP or DIP)  METS is extensible and modular  METS uses the XML Schema facility for combining vocabularies from different Namespaces  The METS Editorial Board has endorsed PREMIS as an extension schema  Many institutions trying to use PREMIS within the METS context

3 Structure of a METS file

4 Archival Information Package Descriptive Information Content Information described by derived from delimited by identifies further described by Representation Information Data Object Semantics Provenance Information Reference Information Fixity Information Context Information Preservation Description Information Packaging Information Structure described by premis:event MODS MARCXML DC premis:object metsRights premis:rights File formatspremis:object textMD MIX OAIS, METS and PREMIS Legend Black Arial = OAIS Red Times New Roman = METS Primary Schema Blue Times New Roman Italics = Extension Schema

5 METS extension schemas  “wrappers” or “sockets” where elements from other schemas can be plugged in  Provides extensibility  Uses the XML Schema facility for combining vocabularies from different Namespaces  Endorsed extension schemas: Descriptive: MODS, DC, MARCXML Technical metadata: MIX (image); textMD (text) Preservation related: PREMIS

6 Why do we need guidelines for using PREMIS with METS?  Contents of each information package may vary depending on its function within a repository  Need to determine how to include representation metadata and associate it with package components  PREMIS data entities (objects, events, rights, agents) do not map perfectly to METS categories for representation metadata (techMD, digiProvMD, rightsMD, sourceMD)  There are redundant elements between the two standards  Both have extensibility mechanisms  Flexibility of both standards requires implementation choices

7 Development of Guidelines for Using PREMIS with METS for Exchange  PREMIS in METS Guidelines Working Group Consists of PREMIS and METS experts Focuses on the METS document as a mechanism of exchange of digital objects and their metadata (SIP or DIP) Facilitates communication when internal requirements and technical environments vary  Tension between flexibility and being prescriptive to facilitate interoperability Consider usage scenarios If a SIP it may get unwrapped and stored in different structures If a DIP it is converted from internal structures to PREMIS A more liberal approach is possible for a SIP than a DIP  Establishing guidelines, a METS profile, and examples http://www.loc.gov/standards/premis/guidelines-premismets.pdf

8 Implementation issues in using PREMIS with METS  Location of PREMIS metadata within METS documents  Whether to record elements redundantly if they occur in both PREMIS and METS  Relationship of different structural metadata mechanisms in PREMIS and METS  How to record PREMIS Agent entities in METS documents  Use of identifiers to link elements in PREMIS and METS  How to record elements that are also part of a format specific technical metadata schema (e.g. MIX)

9 Some recommendations from Guidelines  METS sections Use Object in techMD or digiProvMD Use Event in digiProvMD Use Rights in rightsMD Use Agent in digiProvMD or rightsMD  PREMIS Container -- use only if keeping all PREMIS metadata together. Do not use if separating PREMIS metadata into different amdSec subelements  PREMIS and METS redundancies -- Choosing which options to use is an implementation decision, document in profile e.g. METS element attributes and subelements of in PREMIS

10 Recommendations (cont.)  Structural relationship elements -- use the METS structMap to record structural relationships, use PREMIS relationship elements to record preservation and derivation relationships and structural if desired  ID/IDREF and PREMIS identifier elements -- use METS ID/IDREF mechanisms, best practices for using these ID/IDREF mechanisms apply  Use PREMIS extensibility mechanism for format specific technical metadata  Document decisions in METS profiles

11 SHA-1 4638bc65c5b9715557d09ad373eefd147382ecbf EchoDep/messageDigestOriginator> 184302 Elements defined in both METS and PREMIS: METS: Checksum, Checksumtype attribute of not repeatable  PREMIS: fixity also includes messageDigestOriginator allows multiples

12 <file ID="FID1" ADMID="TMD1PREMIS DP1EVENT DP1AGENT“ MIMETYPE="image/jpeg" <techMD ID="TMD1PREMIS“ image/jpeg 1.02 Elements defined both in METS and PREMIS: METS: MIMETYPE attribute of optional  PREMIS: more granular; includes name and version (although name may be MIMETYPE) mandatory

13 ECHODEP Hub Event echo12345 ECHODEP Hub Event echo12345 ingestion 2006-05-02T15:12:53 Elements defined both in METS and PREMIS  METS ID/Idref: used to associate metadata in different sections and for different files  PREMIS identifiers: explicit linking between entity types

14 structural is sibling of UCB FID2 1 Elements defined both in METS and PREMIS:  METS: structMap details structural relationships and is the heart of the METS document hierarchical, so may be more expressive than PREMIS semantic units links the elements of the structure to content files and metadata  PREMIS: details all kinds of relationships, including structural data dictionary says that implementations may record by other means

15 Some METS profiles with PREMIS  UCSD simple and complex object  UC Berkeley  ECHO Dep Generic METS Profile for Preservation and Digital Repository Interoperability  LC Profile for Recorded Events  Australian METS Profile  TIPR  … many others

16 Additional changes to Guidelines  Make extensibility mechanism consistent with METS significantPropertiesExtension objectCharacteristicsExtension creatingApplicationExtension environmentExtension signatureInformationExtension eventOutcomeDetailExtension rightsExtension

17 Additional changes to Guidelines (cont.)  Add the same elements and attributes as in METS to PREMIS extension elements in schema and data dictionary mdRef, mdWrap binData, xmlData Attributes: ID, LABEL, MDTYPE, MIMETYPE, SIZE, CREATED, CHECKSUM, CHECKSUMTYPE  Allow URI or string for MDTYPE  Add use cases/examples to illustrate choices made  Clarify structural relationships

18 Implementing an Exchange Standard  PREMIS Implementation Tool Some tools documented on the PREMIS website http://www.loc.gov/standards/premis/tools_for_premis. php http://www.loc.gov/standards/premis/tools_for_premis. php PiM tool developed by Florida Center for Library Automation PiM Further work to generate metadata from digital files in PREMIS elements


Download ppt "Implementation of PREMIS in METS Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San."

Similar presentations


Ads by Google