Presentation is loading. Please wait.

Presentation is loading. Please wait.

ISO 16642 - a tutorial Part 2: Representing data categories TMF - Terminological Markup Framework Laurent Romary - Laboratoire Loria.

Similar presentations


Presentation on theme: "ISO 16642 - a tutorial Part 2: Representing data categories TMF - Terminological Markup Framework Laurent Romary - Laboratoire Loria."— Presentation transcript:

1 ISO 16642 - a tutorial Part 2: Representing data categories TMF - Terminological Markup Framework Laurent Romary - Laboratoire Loria

2 Why formalizing DatCats? 4 Systematizing data category description: –Notion of Data Category Registry (DCR) I need a data category: is it there? –Query by name, definition etc. 4 Automatizing processes: –Format control of TMLs –Filters from one TML to GMT

3 Which model for DatCats? 4 Using XML: –Coherence with TMF principles –Using stylesheet to generate schemas and filters 4 Using RDF (Resource Description Framework) –Intended format for representing meta-data: Description of a DatCat is meta-data with regards TMF

4 RDF - a quick presentation Cf. other file

5 Data Categories A Formal Description

6 Data Category Registry dcsd:DataCategory rdf:about Data Category DCRegistry Description VersionNumber dcsd:VersionNumber

7 Data Category description DCDefinition DCName Content dcsd:DCDefinition dcsd:DCName dcsd:Content dcsd:DCIdentifier dcsd:Level DCType (S, C) dcsd:DCType Salt 2000-11-08/SEW dcsd:DCAdmin DCComment dcsd:DCComment Data Category Locus DCAdmin DCIdentifier DCParent dcsd:DCParent DCExample dcsd:DCExample

8 Simple and complex DatCats 4 Complex data categories –shall serve as field identifiers (not names) in databases and can have content. The datatype for this content shall be declared for each data category and can commonly take the form of different categories of text, defined data types (such as dates), and specified data domains, e.g., picklists comprising standardized permissible instances. »Example: /Part of Speech/ 4 Simple data categories – shall serve as the content of complex data categories. »Example: /Noun/, /Verb/, /Adjective/ etc.

9 Levels and content Content DataType TargetType Ref to other datcat(s) dcsd:DataType dcsd:TargetType rdf:Alt rdf:li List of References Ref to other datcats rdf:Alt rdf:li Level/Loci rdf:Alt Ref to other datcat(s) rdf:li List of References

10 Administrative properties dcsd:DCAdmin Data Category DCAdmin Status dcsd:Status StatusDate dcsd:StatusDate StatusNote dcsd:StatusNote EditionDate dcsd:EditionDate ShortFormAdmittedNameForbiddenName Source dcsd:Source VariantNames dcsd:VariantNames Dcsd:ShortForm Dcsd:AdmittedName Dcsd:ForbiddenName

11 RDF Representation

12 /term/ - RDF description (1) <dcsd:DataCategory dcsd:DCIdentifier="ISO12620A01" dcsd:DCName="term" dcsd:position="A.01" dcsd:DCType="C"> A verbal designation of a general concept in a specific subject field For definition of related term, see ISO 1087-1, 3.4.3. Terms can consist of single words or be composed of multiword strings… "radix" in annex C, figure C.1. A.1

13 /term/ - RDF description (2) TL TC <dcsd:DCAdmindcsd:OrgSource="ISO TC 37" dcsd:DocSource="ISO12620:1999" dcsd:subDate="2000-10-20 SEW" dcsd:registryComment="Prepared 2000-10-20" dcsd:Status="Accepted"/>

14 /term type/ - RDF description (1) <dcsd:DataCategorydcsd:DCIdentifier="ISO12620A0201" dcsd:DCName="term type" dcsd:position="A.02.01" dcsd:DCType="C"> An attribute assigned to a term A.2.1 ISO12620A020101 ISO12620A020102 ISO12620A020119

15 /term type/ - RDF description (2) TL TC <dcsd:DCAdmindcsd:OrgSource="ISO TC 37" dcsd:DocSource="ISO12620:1999" dcsd:subDate="2000-10-20 SEW" dcsd:registryComment="Prepared 2000-10-20" dcsd:Status="Accepted"/>

16 Actualizing a DatCat TMF specific properties

17 Styling properties dcsd:Style Data Category Style StyleName dcsd:StyleName ElementName dcsd:ElementName AttributeName dcsd:AttributeName TypeValue dcsd:TypeValue Simple Element Attribute TypedElement ValuedElement TVElement Value dcsd:Value For ‘ Simple ’ AnchorInfo dcsd:Anchor AnchorLevel

18 Attribute style description dcsd:StyleName="Attribute" –Conditions of use: Not valid for annotations –Required properties dcsd:AttributeName –Example: dcsd:AttributeName="id" …

19 Element style description dcsd:StyleName="Element" –Required properties dcsd:ElementName –Example: dcsd: ElementName ="definition" …

20 TypedElement style description dcsd:StyleName="TypedElement" –Required properties dcsd:ElementName, dcsd:TypeValue –Example: dcsd:ElementName ="termNote" dcsd:TypeValue="partOfSpeech" N

21 ValuedElement style description dcsd:StyleName="ValuedElement" –Conditions of use: Not valid for annotations –Required properties dcsd:ElementName –Example: dcsd:ElementName ="pos"

22 TVElement style description dcsd:StyleName="TVElement" –Conditions of use: Not valid for annotations –Required properties dcsd:ElementName, dcsd:TypeValue –Example: dcsd:ElementName ="free" dcsd:TypeValue="pos"

23 Simple style description dcsd:StyleName="Simple" –Conditions of use: Express the value of simple data categories –Required properties: dcsd:Value –Example: dcsd:Value ="Nom" Nom

24 Dealing with languages

25 Two types of languages 4 Working language The language used at a given place in a document, along the XML hierarchy Representation: xml:lang 4 Object language The language about which you speak at a given place in your terminological entry (e.g. describes the Language Section level) Representation: as a data category "language", with a narrow scope

26 Example — DXLT Une valeur entre 0 et 1 utilisée... alpha smoothing factor fullForm

27 Example — GMT en Une valeur entre 0 et 1 utilisée... alpha smoothing factor fullForm

28 Conclusion –A general model for analysing and representing terminological data collection –An underlying formalism expressed in XML,RDF –Associated tools (Salt project) DCSEditor, DCSBrowser, Automatic generation of XSLT filters and XML schemas from a given TML specification

29 Useful pointers 4 SALT project –http://www.loria.fr/projets/SALT –http://www.ttt.org/ 4 The TMF site –http://www.loria.fr/projets/TMF


Download ppt "ISO 16642 - a tutorial Part 2: Representing data categories TMF - Terminological Markup Framework Laurent Romary - Laboratoire Loria."

Similar presentations


Ads by Google