Presentation is loading. Please wait.

Presentation is loading. Please wait.

PREMIS What is PREMIS? o Preservation Metadata Implementation Strategies When is PREMIS use? o PREMIS is used for “repository design, evaluation, and archived.

Similar presentations


Presentation on theme: "PREMIS What is PREMIS? o Preservation Metadata Implementation Strategies When is PREMIS use? o PREMIS is used for “repository design, evaluation, and archived."— Presentation transcript:

1 PREMIS What is PREMIS? o Preservation Metadata Implementation Strategies When is PREMIS use? o PREMIS is used for “repository design, evaluation, and archived information packaged among preservation repositories” How is PREMIS use? o PREMIS Data Dictionary provides guidelines regarding “the information a repository uses to support the digital preservation process” What is PREMIS? o Preservation Metadata Implementation Strategies When is PREMIS use? o PREMIS is used for “repository design, evaluation, and archived information packaged among preservation repositories” How is PREMIS use? o PREMIS Data Dictionary provides guidelines regarding “the information a repository uses to support the digital preservation process”

2 PREMIS What is “preservation metadata” referring to? o “It is information that supports and documents the digital preservation process”. Which includes information such as:  Provenance – refers to who has ownership of the digital object  Authenticity – refers to the claim of the digital object  Preservation activity – refers to the activities that have been carried out to preserve the digital object  Technical environment – refers to the tasks required to interpret and use the digital object  Rights management – refers the intellectual property rights that must be declared What is “preservation metadata” referring to? o “It is information that supports and documents the digital preservation process”. Which includes information such as:  Provenance – refers to who has ownership of the digital object  Authenticity – refers to the claim of the digital object  Preservation activity – refers to the activities that have been carried out to preserve the digital object  Technical environment – refers to the tasks required to interpret and use the digital object  Rights management – refers the intellectual property rights that must be declared

3 PREMIS Data Dictionary Conventions for each entry in the PREMIS Data Dictionary o Name of the semantic units : a descriptive name that refers to a piece of information or knowledge  Example: objectIdentifer under the o Semantic components: refers to sub-units held within a container. o Definition: refers to the meaning of the semantic unit. o Rationale: explains why the semantic unit is needed o Data constraint: indicates how the semantic unit should be encoded.  Containers: refers to a xml tag that have no value rather serve to group related semantic units  None: indicates that the semantic unit can be a value of any form  Value should be taken from a controlled vocabulary: “PREMIS Data Dictionary does not specify what this authority list of values should be, and it is assumed that different repositories will use different vocabularies”.  Extension containers: are containers that are designed to give a place for non-PREMIS metadata Conventions for each entry in the PREMIS Data Dictionary o Name of the semantic units : a descriptive name that refers to a piece of information or knowledge  Example: objectIdentifer under the o Semantic components: refers to sub-units held within a container. o Definition: refers to the meaning of the semantic unit. o Rationale: explains why the semantic unit is needed o Data constraint: indicates how the semantic unit should be encoded.  Containers: refers to a xml tag that have no value rather serve to group related semantic units  None: indicates that the semantic unit can be a value of any form  Value should be taken from a controlled vocabulary: “PREMIS Data Dictionary does not specify what this authority list of values should be, and it is assumed that different repositories will use different vocabularies”.  Extension containers: are containers that are designed to give a place for non-PREMIS metadata

4 PREMIS Data Dictionary o Object category: specifies the object to which the semantic unit applies to (a representation, file, or bitstream). o Applicability: indicates whether the semantic unit applies to the category of the object. o Example: sample values that the semantic unit may use o Repeatability: indicates that a semantic unit can take multiple values o Obligation: indicates whether the value of the semantic unit is mandatory, meaning a repository must know this information o Creation/Maintenance note: further detail regarding how the values are created and or updated o Usages notes: provides information regarding the use of the semantic unit. o Object category: specifies the object to which the semantic unit applies to (a representation, file, or bitstream). o Applicability: indicates whether the semantic unit applies to the category of the object. o Example: sample values that the semantic unit may use o Repeatability: indicates that a semantic unit can take multiple values o Obligation: indicates whether the value of the semantic unit is mandatory, meaning a repository must know this information o Creation/Maintenance note: further detail regarding how the values are created and or updated o Usages notes: provides information regarding the use of the semantic unit.

5 PREMIS Data Dictionary Mandatory Semantic Units objectIdentifier * objectCategory objectCharacteristics * format * storage * eventIdentifier eventType eventDateTime agentIdentifier * Note: * indicates semantic units that are repeatable objectIdentifier * objectCategory objectCharacteristics * format * storage * eventIdentifier eventType eventDateTime agentIdentifier * Note: * indicates semantic units that are repeatable

6 PREMIS Data Model

7 PREMIS Intellectual Entity Intellectual Entity – refer to content that can be describe as a unit (e.g. books, maps, articles)

8 PREMIS Object Entity Objects – refer to units of information in digital form. PREMIS defines different kinds of objects it can an a file, bitstream or representation o File – it is a computer file, such as a pdf, txt or JPEG o Bitstream – refer to data bits within a file that contain common properties for preservation purposes Objects – refer to units of information in digital form. PREMIS defines different kinds of objects it can an a file, bitstream or representation o File – it is a computer file, such as a pdf, txt or JPEG o Bitstream – refer to data bits within a file that contain common properties for preservation purposes

9 PREMIS Object Entity o Representation – refer to a set of files, that includes structural metadata, required to be identified, stored and maintained in order to assemble a complete rendition of an Intellectual unit.  For example, text files and images files of a magazine are required to form a representation. o Representation – refer to a set of files, that includes structural metadata, required to be identified, stored and maintained in order to assemble a complete rendition of an Intellectual unit.  For example, text files and images files of a magazine are required to form a representation.

10 PREMIS Object Entity Sample syntax The units of information that can be recorded includes: o Type of object (file, bitstream, or representation) o A unique identifier for the object under  Stores information such as the type and value. The type refers to the classification of the domain that creates the object identifier. The value of the object identifier. For example, FDsys ACP R0b002ee180b003b0 This particular segment states that this object is a representation (that is a set of files, this representation has a unique identifier) Sample syntax The units of information that can be recorded includes: o Type of object (file, bitstream, or representation) o A unique identifier for the object under  Stores information such as the type and value. The type refers to the classification of the domain that creates the object identifier. The value of the object identifier. For example, FDsys ACP R0b002ee180b003b0 This particular segment states that this object is a representation (that is a set of files, this representation has a unique identifier)

11 PREMIS Object Example Other units of information that can be recorded includes: o “Information indicating the policy on the set of preservation functions to be applied to an object” under the FDsys ACP D09002ee180b003a9 full Other units of information that can be recorded includes: o “Information indicating the policy on the set of preservation functions to be applied to an object” under the FDsys ACP D09002ee180b003a9 full

12 PREMIS Object Entity Other units of information that can be recorded includes: o Information indicating if the object is subject to one or more processes of decoding or unbundling under o information used to verify if an object has been changed in an undocumented or unauthorized way under  Information contained within the refers to the algorithm used to produce the message digest for the digital object.  Information contained within the refers to the “output of the message digest algorithm”  Information contained within the refers to the agent that generated the original message digest that will be compared to the fixity check. Other units of information that can be recorded includes: o Information indicating if the object is subject to one or more processes of decoding or unbundling under o information used to verify if an object has been changed in an undocumented or unauthorized way under  Information contained within the refers to the algorithm used to produce the message digest for the digital object.  Information contained within the refers to the “output of the message digest algorithm”  Information contained within the refers to the agent that generated the original message digest that will be compared to the fixity check.

13 PREMIS Object Entity o The size of the object under o The format of the object under  refers to the “identification of the format of the object” Information contained within classifies the format of the file or bitstream.  identifies additional information about the format by using a entry in a format registry. Information contained within identifies the format registry that was used. Information contained within refers to the “unique key used to reference an entry for this format in a format registry”  contains additional information about the format For example o The size of the object under o The format of the object under  refers to the “identification of the format of the object” Information contained within classifies the format of the file or bitstream.  identifies additional information about the format by using a entry in a format registry. Information contained within identifies the format registry that was used. Information contained within refers to the “unique key used to reference an entry for this format in a format registry”  contains additional information about the format For example

14 PREMIS Object Example 0 SHA-256 4977070b92f0bb2642c6be368ad68a8d1d1c5dbbb3310544db781f56a860b0a1 FDsys 9326 text/plain PRONOM x-fmt/111 Plain Text File 0 SHA-256 4977070b92f0bb2642c6be368ad68a8d1d1c5dbbb3310544db781f56a860b0a1 FDsys 9326 text/plain PRONOM x-fmt/111 Plain Text File

15 PREMIS Object Entity Other units of information that can be recorded includes: o The original name of the object (prior to being named by the repository) under o Information about where and how a files are stored in the repository under  stores information needed to retrieve a file from a storage system. Information contained within refers to the way of accessing the location of the content. Information contained within refers to the “location of the content used by the storage system”.  The medium on which an object is stored is contained within Other units of information that can be recorded includes: o The original name of the object (prior to being named by the repository) under o Information about where and how a files are stored in the repository under  stores information needed to retrieve a file from a storage system. Information contained within refers to the way of accessing the location of the content. Information contained within refers to the “location of the content used by the storage system”.  The medium on which an object is stored is contained within

16 PREMIS Object Entity o Information describing a relationship between an object and one or more objects.  classifies the nature of the relationship.  characterizes the nature of the relationship.  refers to “the identifier of the related resource”.  Information contained within refers to the classification of the domain that creates the identifier.  Information contained within refers to “the value of the identifier”. o Information describing a relationship between an object and one or more objects.  classifies the nature of the relationship.  characterizes the nature of the relationship.  refers to “the identifier of the related resource”.  Information contained within refers to the classification of the domain that creates the identifier.  Information contained within refers to “the value of the identifier”.

17 PREMIS Object Example S3880IS.txt URI file:/u02/app/emc/documentum/data/fdsysprod1/fdsysprod1/content_storage_0 1/00002ee1/80/55/b0/48.txt hard disk structural is part of FDsys ACP R0b002ee180b003b0 S3880IS.txt URI file:/u02/app/emc/documentum/data/fdsysprod1/fdsysprod1/content_storage_0 1/00002ee1/80/55/b0/48.txt hard disk structural is part of FDsys ACP R0b002ee180b003b0

18 PREMIS Event Entity Events – refers to actions that involve an object and an agent known to the system o Events are critical for maintaining the digital provenance of an object (helps demonstrates the authenticity of the object) Examples of Events: o modifying an document o actions that create new relationships  Object could be related to another object as a result of a particular event, for instance if a program takes file 1 and generates a different version known as file 2 o Actions that check the validity and integrity of the objects (i.e. virus scan) Events – refers to actions that involve an object and an agent known to the system o Events are critical for maintaining the digital provenance of an object (helps demonstrates the authenticity of the object) Examples of Events: o modifying an document o actions that create new relationships  Object could be related to another object as a result of a particular event, for instance if a program takes file 1 and generates a different version known as file 2 o Actions that check the validity and integrity of the objects (i.e. virus scan)

19 PREMIS Event Entity Sample syntax The information that can be recorded under event includes: o A unique identifier for the event under  The refers to the classification of the domain that creates the event identifier. The <eventIdentiferValue refers to the value of the event identifier. o The type of event under  Classifies the nature of the event. o Date, time and type of event under Sample syntax The information that can be recorded under event includes: o A unique identifier for the event under  The refers to the classification of the domain that creates the event identifier. The <eventIdentiferValue refers to the value of the event identifier. o The type of event under  Classifies the nature of the event. o Date, time and type of event under

20 PREMIS Event Entity Additional information that can be recorded under event includes: o Detail description of the event under o The outcome of the event under  Indicates if the event was a success, partial success, or failure. o Agents involved in the event and their specific roles under  The refers to the classification of the domain that creates the linking agent identifier. The refers to the “value of the linking agent identifier”. The indicates the role of the agent associated to the event.  Agents role are defined here because agents can perform different roles in different events o Objects involved in the event and their specific roles under  The refers to the classification of the domain that creates the linking object identifier. The refers to the “value of the linking object identifier”. The indicates the role of the object associated to the event. Additional information that can be recorded under event includes: o Detail description of the event under o The outcome of the event under  Indicates if the event was a success, partial success, or failure. o Agents involved in the event and their specific roles under  The refers to the classification of the domain that creates the linking agent identifier. The refers to the “value of the linking agent identifier”. The indicates the role of the agent associated to the event.  Agents role are defined here because agents can perform different roles in different events o Objects involved in the event and their specific roles under  The refers to the classification of the domain that creates the linking object identifier. The refers to the “value of the linking object identifier”. The indicates the role of the object associated to the event.

21 PREMIS Event Example FDsys:event 1cdd2b6c-5a2d-449b-b386-ebb15eb4af11 Rendition Submitted 2010-10-06T19:38:47-04:00 Rendition R0b002ee180b003b0, uploaded by hotfolderadmin, was submitted in the Submission Information package P0b002ee180b003af Success FDsys:agent hotfolderadmin implementer FDsys R0b002ee180b003b0 outcome FDsys:event 1cdd2b6c-5a2d-449b-b386-ebb15eb4af11 Rendition Submitted 2010-10-06T19:38:47-04:00 Rendition R0b002ee180b003b0, uploaded by hotfolderadmin, was submitted in the Submission Information package P0b002ee180b003af Success FDsys:agent hotfolderadmin implementer FDsys R0b002ee180b003b0 outcome

22 PREMIS Agent Entity Agents – refer to people, organizations, or software associated with events, more specifically preservation events, of an object o In the data model diagram, there is no arrow from Agent entity to the Object entity, that is because Agents influence Objects indirectly through Events. Agents – refer to people, organizations, or software associated with events, more specifically preservation events, of an object o In the data model diagram, there is no arrow from Agent entity to the Object entity, that is because Agents influence Objects indirectly through Events.

23 PREMIS Agent Entity Sample syntax The information that can be recorded under agent includes: o A unique identifier for the agent under  Information contained within refers to the classification of the domain that creates the agent identifier.  Information contained within refers “value of the agent identifier”. o The agent’s name under o The type of agent (people, organization or software) under Sample syntax The information that can be recorded under agent includes: o A unique identifier for the agent under  Information contained within refers to the classification of the domain that creates the agent identifier.  Information contained within refers “value of the agent identifier”. o The agent’s name under o The type of agent (people, organization or software) under

24 PREMIS Agent Example FDsys:agent hotfolderadmin hotfolderadmin Person FDsys:agent hotfolderadmin hotfolderadmin Person

25 PREMIS Rights Entity Rights – refers to the rights and permission that are directly relevant to preserving objects Sample syntax The information that can be recorded under right includes: o A unique identifier for the rights statement o The action(s) that the rights statement allows o The object(s) to which the statement applies o The agents involved in the rights statements and their roles Note: Keep in mind that FDsys doesn’t use Rights – refers to the rights and permission that are directly relevant to preserving objects Sample syntax The information that can be recorded under right includes: o A unique identifier for the rights statement o The action(s) that the rights statement allows o The object(s) to which the statement applies o The agents involved in the rights statements and their roles Note: Keep in mind that FDsys doesn’t use

26 premis.xml Header xmlns - refers to a namespace, which is a unique value (Note: the xml parser does not use the namespace URI to look up information) xmlns:xsi – indicates to the XML parser that this document should be validated against a schema xsi:schemaLocation: the first value refers to the namespace that will be used and the second value refers to the location of the schema that will be used, in this case it is the MODS XML schema. version: refers to the PREMIS version Example: <premis xmlns="info:lc/xmlns/premis-v2" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="info:lc/xmlns/premis-v2 http://www.loc.gov/standards/premis/premis.xsd" version="2.0"> xmlns - refers to a namespace, which is a unique value (Note: the xml parser does not use the namespace URI to look up information) xmlns:xsi – indicates to the XML parser that this document should be validated against a schema xsi:schemaLocation: the first value refers to the namespace that will be used and the second value refers to the location of the schema that will be used, in this case it is the MODS XML schema. version: refers to the PREMIS version Example: <premis xmlns="info:lc/xmlns/premis-v2" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="info:lc/xmlns/premis-v2 http://www.loc.gov/standards/premis/premis.xsd" version="2.0">

27 Additional Information On Using premis.xml When will premis.xml be used by METS (aip.xml) ? o PREMIS digital object that requires its components to be organized so that the integrity is preserved, this is known as structural metadata, will use METS to accomplished this. o METS uses a pointer to the metadata that is located outside of the METS document. More specifically, it uses a xlink:href to indicate the location of such file. o Example code from aip.xml: When will premis.xml be used by METS (aip.xml) ? o PREMIS digital object that requires its components to be organized so that the integrity is preserved, this is known as structural metadata, will use METS to accomplished this. o METS uses a pointer to the metadata that is located outside of the METS document. More specifically, it uses a xlink:href to indicate the location of such file. o Example code from aip.xml:

28 Additional Information On Using premis.xml o Mets will use a struct map to organize the components of the premis object. o Example code from aip.xml: o Mets will use a struct map to organize the components of the premis object. o Example code from aip.xml:

29 References Understanding PREMIS o http://www.loc.gov/standards/premis/understand ing-premis.pdf http://www.loc.gov/standards/premis/understand ing-premis.pdf Data Dictionary for Preservation Metadata o http://www.oclc.org/research/activities/past/orpr ojects/pmwg/premis-final.pdf http://www.oclc.org/research/activities/past/orpr ojects/pmwg/premis-final.pdf W3C Schools o http://www.w3schools.com/xml/default.asp http://www.w3schools.com/xml/default.asp Understanding PREMIS o http://www.loc.gov/standards/premis/understand ing-premis.pdf http://www.loc.gov/standards/premis/understand ing-premis.pdf Data Dictionary for Preservation Metadata o http://www.oclc.org/research/activities/past/orpr ojects/pmwg/premis-final.pdf http://www.oclc.org/research/activities/past/orpr ojects/pmwg/premis-final.pdf W3C Schools o http://www.w3schools.com/xml/default.asp http://www.w3schools.com/xml/default.asp


Download ppt "PREMIS What is PREMIS? o Preservation Metadata Implementation Strategies When is PREMIS use? o PREMIS is used for “repository design, evaluation, and archived."

Similar presentations


Ads by Google