Presentation is loading. Please wait.

Presentation is loading. Please wait.

ICS-FORTH May 23, 2009 1 An Ontological Approach to Digital Preservation Metadata Martin Doerr Foundation for Research and Technology - Hellas Institute.

Similar presentations


Presentation on theme: "ICS-FORTH May 23, 2009 1 An Ontological Approach to Digital Preservation Metadata Martin Doerr Foundation for Research and Technology - Hellas Institute."— Presentation transcript:

1 ICS-FORTH May 23, 2009 1 An Ontological Approach to Digital Preservation Metadata Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Prague, Czechia May 23, 2009 Center for Cultural Informatics

2 ICS-FORTH May 23, 2009 2  Cultural and scientific data cannot be understood without knowledge about the meaning of the data and the ways and circumstances of their creation  We use Metadata to assess  encoding (used formats, tools, instruments)  meaning (context of creation, experimental setups, background knowledge, etc. ),  relevance (described things, their status, their conditions),  quality (credibility, authenticity, calibration, tolerances, possible errors),  possibilities of Improvement and Reprocessing.  From generation to use, permanent storage, reuse (life-cycle)  No standards yet! Digital Preservation Metadata

3 ICS-FORTH May 23, 2009 3  Required: Reliable interoperable registration of the creation and modification processes and contextual conditions – “provenance metadata”, through time.  Solution: a common core ontology to explain the meaning of various data structures describing highly specialized processes.  Idea:  Metadata and scientific data and are historical records!  Tool-mediated creation and machine-supported processing is initiated, on behalf of and controlled by human activity.  Things, data, people, times and places are causally related by events.  Other relations are either deductions from events or found by observation events.  The CIDOC CRM (ISO21127) can be used as core model! Digital Preservation Metadata

4 ICS-FORTH May 23, 2009 4  Three applications so far:  For www.c-h-i.org: A completely CRM-based model for provenance (scientific workflow) metadata for generating RTI images. (combines up to 2000 individual shots).www.c-h-i.org  For the European Integrated Project CASPAR on Digital Preservation: — Could explicate OAIS PDI Type “Provenance Information” and authenticity as a queries to the CRM.  European IP 3D-COFORM: Digital Provenance of 3D Models.  We have added 10 classes and some properties under the CRM:  Relation of human action and machine action.  Digitization as a measurement and information object creation  Formal derivation: feature preservation between input and output The CRM Digital Extended applications – Digital Provenance

5 ICS-FORTH May 23, 2009 5 C2 Digitization Process E7 Activity E65 CreationE16 Measurement C11 Digital Measurement Event C7 Digital Machine Event C10 Software Execution C3 Formal Derivation E5 Event C12 Data Transfer Event E11 Modification The CRM Digital Digital Events

6 ICS-FORTH May 23, 2009 6 C1 Digital Object E54 Dimension C9 Data Object E73 Information Object E70 Thing C8 Digital Device E22 Man-Made Object E84 Information Carrier C13 Digital Information Carrier The CRM Digital Digital Things

7 ICS-FORTH May 23, 2009 7 C7 Digital Machine Event E7 Activity E65 Creation E70 Thing P16 used specific object (was used for) E28 Conceptual Object C8 Digital Device C1 Digital Object S10 had input (was input of) C1 Digital Object E5 Event P9 consists of (forms part of) E73 Information Object E22 Man-Made Object S11 had output (was output of) P94 has created (was created by) deduction E4 PeriodE19 Physical Object P8 took place on or within (witnessed) S12 happened on device (was device for) The CRM Digital Human creation by machine events

8 ICS-FORTH May 23, 2009 8 C7 Digital Machine Event C8 Digital Device C1 Digital Object S10 had input (was input of) C1 Digital Object S11 had output (was output of) S12 happened on device (was device for) C10 Software Execution C3 Formal Derivation S2 used as source (was source for) C1 Digital Object S13 used parameters (parameters for) C1 Digital Object The CRM Digital Software Execution

9 ICS-FORTH May 23, 2009 9 E11 Modification E7 Activity E65 Creation P125 used object of type (was type of object used in) E55 Type C11 Digital Measurement Event S15 measured thing of type (was type of thing measured by) C9 Data Object E54 Dimension P40 observed dimension (was observed in) The CRM Digital Digital Measurement (Activity view) C7 Digital Machine Event S20 has created (was created by) E16 Measurement

10 ICS-FORTH May 23, 2009 10 C2 Digitization Process E24 Physical Man-Made Thing E11 Modification P31 has modified (was modified by) E65 Creation E28 Conceptual Object C1 Data Object E73 Information Object E70 Thing P128 carries (is carried by) E18 Physical Thing P94 has created (was created by) S20 has created (was created by) S1 digitized (was digitized by) C11 Digital Measurement Event C13 Digital Information Carrier S18 has modified (was modified by) S19 stores (is stored on) C1 Digital Object S15 measured thing of type (was type of thing measured by) The CRM Digital Digitization = feature transfer physical-digital

11 ICS-FORTH May 23, 2009 11 C7 Digital Machine Event C8 Digital Device C1 Digital Object S11 had output (was output of) S12 happened on device (was device for) C12 Data Transfer Event S10 had input (was input of) C1 Digital Object E11 Modification P31 has modified (was modified by) E84 Information Carrier C1 Digital Object S14 transferred (was transferred by) C8 Digital Device S16 has receiver (was receiver for) S15 has sender (was sender for) The CRM Digital Unreliable transfer

12 ICS-FORTH May 23, 2009 12 P29custody received by (received custody through) E39 Actor Vincent van Gogh Foundation P28 custody surrendered by (surrendered custody through) E39 Actor Vincent Willem van Gogh P29custody received by (received custody through) P28 custody surrendered by (surrendered custody through) P29custody received by (received custody through) P50 has current keeper (is current keeper of) P30 transferred custody of (custody transferred through) E73 Information ObjectE10 Transfer of Custody The custody passing to Theo's widow E10 Transfer of Custody The custody passing to the van Gogh Foundation E39 Actor Theo van Gogh E39 Actor Johanna van Gogh-Bonger E10 Transfer of Custody The custody passing to Johanna's son Preservation Metadata history of physical objects

13 ICS-FORTH May 23, 2009 13 P14.1 in the role of P131 is identified by (identifies) E39 Actor “the creator of ADT music” E82 Actor Appellation “Georges Aperghis” E55 Type “Composer” P14 carried out by (performed) E65 Creation “The conception of ADT” P14.1 in the role of P131 is identified by (identifies) E39 Actor “the creator of ADT libretto” E82 Actor Appellation “Peter Szendy” P131 is identified by (identifies) E55 Type “Writer” P94 has created (was created by) E28 Conceptual Object “ADT” Preservation Metadata creation of born-digital objects

14 ICS-FORTH May 23, 2009 14 C1 Digital Object CreteSmall.png C1 Digital Object Crete.jpg S2 used as source (was source for) C1 Digital Object Crete.png C3 Formal Derivation JPG2PNG conversion C3 Formal Derivation Reduce png resolution P94 has created (was created by) P94 has created (was created by) S2 used as source (was source for) E29 Design or Procedure JPG2PNG Algorithm X P33 used specific technique (was used by) P32 used general technique (was technique of) E55 Type JPG2PNG E55 Type Software P16 used specific object (was used for) E28 Conceptual Object Adobe Photoshop CS2 P2 has type (is type of) P2 has type (is type of) E55 Type JPG P2 has type (is type of) E55 Type PNG P2 has type (is type of) P2 has type (is type of) C1 Digital Object color depth=24 resolution = 600 compression level = 5 S13 used parameters (parameters for) Preservation Metadata transformation of digital objects

15 ICS-FORTH May 23, 2009 15  Authenticity can be defined on Object History: Given: Man-Made Object O1, “was present at” Event E1 (typically creation or publication) Man-Made Object O2, “was present at” Event E2 (typically ingestion or validation) Information Object X1 “is carried by” O1 (historical carrier) Information Object X2 “is carried by” O2 (current carrier) O2 is “authentic” if O2 = O1, or X1 = X2  Reasoning on completeness/security of curation and carrier transfer chain and/or comparison of multiple assumed current carriers. Preservation Metadata Authenticity

16 ICS-FORTH May 23, 2009 16 The Open Provenance Model  An annotated causality graph defined as a record of a past (or current) execution  Three node types  Artifact - Immutable piece of state, which may have a physical embodiment in a physical object, or a digital representation in a computer system.  Process - Action or series of actions performed on or caused by artifacts, and resulting in new artifacts.  Agent - Contextual entity acting as a catalyst of a process, enabling, facilitating, controlling, affecting its execution.  Nodes can be annotated with properties  Processes operate in one or more Roles (R)

17 ICS-FORTH May 23, 2009 17  Nodes are connected by edges  used(R)  wasGeneratedBy(R)  wasControlledBy(R)  wasTriggeredBy  wasDerivedFrom Ag P A P P PP A A A used(R) wasGeneratedBy(R) wasControlledBy(R) wasTriggeredBy wasDerivedFrom The Open Provenance Model

18 ICS-FORTH May 23, 2009 18  Does not distinguish between material and immaterial objects  Does not explicitly model the concept of an Event, a concept of prominent importance.  Without the notion of event and also of physical objects that are carriers (devices) of information, it is not possible for example, to describe adequately the conditions under which a photograph was taken  the way OPM treats Processes resembles events, however the corresponding ontological structure of OPM is not rich enough.  provenance information recorded according to CRMdig can be mapped to an OPM-based view, but not the other way around The Open Provenance Model

19 ICS-FORTH May 23, 2009 19 The CIDOC CRM Conclusions  The CIDOC model and a suitable extension allow for representing all provenance related preservation metadata.  Specific tools need more models of specific parameter sets, that do not influence the integration of and reasoning on the provenance chain.  There is no competitive generic model that consistently describes material and digital objects and their related history.  Relationship between human and machine action still needs refinement: Using OWL we can avoid the ambiguity of multiple IsA.


Download ppt "ICS-FORTH May 23, 2009 1 An Ontological Approach to Digital Preservation Metadata Martin Doerr Foundation for Research and Technology - Hellas Institute."

Similar presentations


Ads by Google