ISO 16642 - a tutorial Part 2: Representing data categories TMF - Terminological Markup Framework Laurent Romary - Laboratoire Loria.



Why formalize DatCats?
- Systematizing data category description:
  – Notion of a Data Category Registry (DCR): "I need a data category: is it there?"
  – Query by name, definition, etc.
- Automating processes:
  – Format control of TMLs
  – Filters from one TML to GMT

Which model for DatCats?
- Using XML:
  – Coherence with TMF principles
  – Using stylesheets to generate schemas and filters
- Using RDF (Resource Description Framework):
  – Intended format for representing metadata: the description of a DatCat is metadata with regard to TMF

RDF - a quick presentation
- Cf. other file

Data Categories: A Formal Description

Data Category Registry
[Diagram: RDF graph. Node labels: DCRegistry, Description, VersionNumber, Data Category. Arc and attribute labels: dcsd:DataCategory, rdf:about, dcsd:VersionNumber.]

Data Category description
[Diagram: RDF graph centred on a Data Category node. Slide label: Salt / SEW.]
- Node labels: DCIdentifier, DCName, DCDefinition, DCType (S, C), Content, Locus, DCParent, DCExample, DCComment, DCAdmin
- Arc labels: dcsd:DCIdentifier, dcsd:DCName, dcsd:DCDefinition, dcsd:DCType, dcsd:Content, dcsd:Level, dcsd:DCParent, dcsd:DCExample, dcsd:DCComment, dcsd:DCAdmin
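The description properties above, together with the "query by name, definition" idea behind a registry, can be sketched in a few lines of Python. This is only an illustration, not the TMF or Salt implementation: the class names, field names, method names and the version number "0.1" are hypothetical; the identifiers, names and definitions are the ones used for /term/ and /term type/ in this tutorial.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class DataCategory:
    """One registry record; field names loosely mirror the dcsd: properties."""
    identifier: str                 # dcsd:DCIdentifier
    name: str                       # dcsd:DCName
    definition: str                 # dcsd:DCDefinition
    dc_type: str                    # dcsd:DCType, "S" (simple) or "C" (complex)
    parent: Optional[str] = None    # dcsd:DCParent
    comment: str = ""               # dcsd:DCComment
    example: str = ""               # dcsd:DCExample


@dataclass
class DCRegistry:
    """Hypothetical in-memory stand-in for a Data Category Registry."""
    version_number: str
    categories: List[DataCategory] = field(default_factory=list)

    def find_by_name(self, name: str) -> List[DataCategory]:
        # Case-insensitive exact match on the data category name.
        wanted = name.strip().lower()
        return [dc for dc in self.categories if dc.name.lower() == wanted]

    def search_definitions(self, keyword: str) -> List[DataCategory]:
        # Case-insensitive substring match on the definition text.
        needle = keyword.lower()
        return [dc for dc in self.categories if needle in dc.definition.lower()]


registry = DCRegistry(version_number="0.1")  # version number is made up
registry.categories.append(DataCategory(
    "ISO12620A01", "term",
    "A verbal designation of a general concept in a specific subject field",
    "C"))
registry.categories.append(DataCategory(
    "ISO12620A0201", "term type",
    "An attribute assigned to a term",
    "C"))
```

Calling `registry.find_by_name("term")` answers the "is it there?" question; `registry.search_definitions(...)` covers lookup by definition.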

Simple and complex DatCats
- Complex data categories
  – shall serve as field identifiers (not names) in databases and can have content. The datatype for this content shall be declared for each data category and can commonly take the form of different categories of text, defined data types (such as dates), and specified data domains, e.g., picklists comprising standardized permissible instances.
  » Example: /Part of Speech/
- Simple data categories
  – shall serve as the content of complex data categories.
  » Example: /Noun/, /Verb/, /Adjective/, etc.
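The relation between the two kinds can be illustrated with a small Python sketch in which a complex data category owns a picklist of simple data categories. The class names and the `accepts` method are assumptions made for this illustration; only the example categories come from the slide.

```python
from dataclasses import dataclass
from typing import FrozenSet


@dataclass(frozen=True)
class SimpleDatCat:
    """A simple data category: usable only as a value, e.g. /Noun/."""
    name: str


@dataclass(frozen=True)
class ComplexDatCat:
    """A complex data category whose content is restricted to a picklist."""
    name: str
    picklist: FrozenSet[SimpleDatCat]

    def accepts(self, value: SimpleDatCat) -> bool:
        # A value is permissible only if it is one of the declared instances.
        return value in self.picklist


NOUN = SimpleDatCat("Noun")
VERB = SimpleDatCat("Verb")
ADJECTIVE = SimpleDatCat("Adjective")

PART_OF_SPEECH = ComplexDatCat("Part of Speech",
                               frozenset({NOUN, VERB, ADJECTIVE}))
```

A picklist is only one of the content types the slide mentions; free text or typed values such as dates would need a different check.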

Levels and content
[Diagram: Content carries DataType (dcsd:DataType) and TargetType (dcsd:TargetType). Lists of references to other datcats, shown for TargetType and for Level/Loci, are drawn as rdf:Alt containers with rdf:li items.]

Administrative properties
[Diagram: Data Category linked to DCAdmin through dcsd:DCAdmin. DCAdmin properties:]
- Status (dcsd:Status)
- StatusDate (dcsd:StatusDate)
- StatusNote (dcsd:StatusNote)
- EditionDate (dcsd:EditionDate)
- Source (dcsd:Source)
- VariantNames (dcsd:VariantNames)
  – ShortForm (dcsd:ShortForm)
  – AdmittedName (dcsd:AdmittedName)
  – ForbiddenName (dcsd:ForbiddenName)

RDF Representation

/term/ - RDF description (1)
<dcsd:DataCategory dcsd:DCIdentifier="ISO12620A01" dcsd:DCName="term" dcsd:position="A.01" dcsd:DCType="C">
[Child element markup lost in extraction; surviving text content:]
A verbal designation of a general concept in a specific subject field
For definition of related term, see ISO , Terms can consist of single words or be composed of multiword strings…
"radix" in annex C, figure C.1.
A.1

/term/ - RDF description (2)
[Element markup lost in extraction; surviving text content:]
TL TC
<dcsd:DCAdmin dcsd:OrgSource="ISO TC 37" dcsd:DocSource="ISO12620:1999" dcsd:subDate=" SEW" dcsd:registryComment="Prepared " dcsd:Status="Accepted"/>

/term type/ - RDF description (1)
<dcsd:DataCategory dcsd:DCIdentifier="ISO12620A0201" dcsd:DCName="term type" dcsd:position="A.02.01" dcsd:DCType="C">
[Child element markup lost in extraction; surviving text content:]
An attribute assigned to a term
A.2.1
ISO12620A ISO12620A ISO12620A020119

/term type/ - RDF description (2)
[Element markup lost in extraction; surviving text content:]
TL TC
<dcsd:DCAdmin dcsd:OrgSource="ISO TC 37" dcsd:DocSource="ISO12620:1999" dcsd:subDate=" SEW" dcsd:registryComment="Prepared " dcsd:Status="Accepted"/>
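To show how such a description could be consumed by a tool, here is a minimal Python sketch that reads the attributes of a dcsd:DataCategory element with the standard library. Several things are assumptions: the namespace URI bound to the dcsd prefix is a placeholder (the slides do not give it), dcsd:DCAdmin is placed as a direct child for simplicity, and the attributes whose values are incomplete in the slides (dcsd:subDate, dcsd:registryComment) are left out of the sample.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URI: the real dcsd namespace is not given in the slides.
DCSD_NS = "http://example.org/dcsd#"

SAMPLE = f"""<dcsd:DataCategory xmlns:dcsd="{DCSD_NS}"
    dcsd:DCIdentifier="ISO12620A0201"
    dcsd:DCName="term type"
    dcsd:position="A.02.01"
    dcsd:DCType="C">
  <dcsd:DCAdmin dcsd:OrgSource="ISO TC 37"
      dcsd:DocSource="ISO12620:1999"
      dcsd:Status="Accepted"/>
</dcsd:DataCategory>"""


def dcsd(local_name: str) -> str:
    """Build the namespace-qualified name ElementTree uses internally."""
    return f"{{{DCSD_NS}}}{local_name}"


def read_data_category(xml_text: str) -> dict:
    """Extract a few descriptive and administrative properties."""
    root = ET.fromstring(xml_text)
    admin = root.find(dcsd("DCAdmin"))
    return {
        "identifier": root.get(dcsd("DCIdentifier")),
        "name": root.get(dcsd("DCName")),
        "position": root.get(dcsd("position")),
        "type": root.get(dcsd("DCType")),
        # Administrative status is optional in this sketch.
        "status": admin.get(dcsd("Status")) if admin is not None else None,
    }


info = read_data_category(SAMPLE)
```

A full RDF toolkit would be the more natural choice for real registry data; plain XML parsing is used here only to keep the sketch dependency-free.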

Actualizing a DatCat: TMF-specific properties

Styling properties
[Diagram: Data Category linked to Style through dcsd:Style. Style properties:]
- StyleName (dcsd:StyleName): Simple, Element, Attribute, TypedElement, ValuedElement, TVElement
- ElementName (dcsd:ElementName)
- AttributeName (dcsd:AttributeName)
- TypeValue (dcsd:TypeValue)
- Value (dcsd:Value), for 'Simple'
- AnchorInfo (dcsd:Anchor): AnchorLevel

Attribute style description
- dcsd:StyleName="Attribute"
  – Conditions of use: not valid for annotations
  – Required properties: dcsd:AttributeName
  – Example: dcsd:AttributeName="id" …

Element style description
- dcsd:StyleName="Element"
  – Required properties: dcsd:ElementName
  – Example: dcsd:ElementName="definition" …

TypedElement style description
- dcsd:StyleName="TypedElement"
  – Required properties: dcsd:ElementName, dcsd:TypeValue
  – Example: dcsd:ElementName="termNote" dcsd:TypeValue="partOfSpeech"
    [Rendered markup lost in extraction; surviving content: N]

ValuedElement style description
- dcsd:StyleName="ValuedElement"
  – Conditions of use: not valid for annotations
  – Required properties: dcsd:ElementName
  – Example: dcsd:ElementName="pos"

TVElement style description
- dcsd:StyleName="TVElement"
  – Conditions of use: not valid for annotations
  – Required properties: dcsd:ElementName, dcsd:TypeValue
  – Example: dcsd:ElementName="free" dcsd:TypeValue="pos"

Simple style description
- dcsd:StyleName="Simple"
  – Conditions of use: expresses the value of simple data categories
  – Required properties: dcsd:Value
  – Example: dcsd:Value="Nom"
    [Rendered markup lost in extraction; surviving content: Nom]
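The style descriptions above can be read as instructions for serializing a data category value into XML. The Python sketch below shows one possible reading. It is heavily assumption-laden: the slides name the required properties of each style but the rendered markup did not survive, so the XML attribute names `type` and `value`, the `termEntry` wrapper element, the `render` and `simple_value` function names, and the exact shapes produced for ValuedElement and TVElement are guesses, not the TMF definition.

```python
import xml.etree.ElementTree as ET


def render(parent: ET.Element, style: dict, value: str) -> ET.Element:
    """Attach `value` to `parent` according to a style description (sketch)."""
    name = style["StyleName"]
    if name == "Attribute":
        # The value becomes an XML attribute of the parent element.
        parent.set(style["AttributeName"], value)
        return parent
    if name == "Element":
        # The value becomes the text content of a dedicated child element.
        child = ET.SubElement(parent, style["ElementName"])
        child.text = value
        return child
    if name == "TypedElement":
        # Generic child element, specialised by an assumed "type" attribute.
        child = ET.SubElement(parent, style["ElementName"],
                              {"type": style["TypeValue"]})
        child.text = value
        return child
    if name == "ValuedElement":
        # Assumed shape: the value is carried by a "value" attribute.
        return ET.SubElement(parent, style["ElementName"], {"value": value})
    if name == "TVElement":
        # Assumed shape: both "type" and "value" are attributes.
        return ET.SubElement(parent, style["ElementName"],
                             {"type": style["TypeValue"], "value": value})
    raise ValueError(f"unsupported style: {name}")


def simple_value(style: dict) -> str:
    """A 'Simple' style only supplies the string for a simple data category."""
    if style["StyleName"] != "Simple":
        raise ValueError("not a Simple style")
    return style["Value"]


entry = ET.Element("termEntry")  # hypothetical wrapper element
render(entry, {"StyleName": "Attribute", "AttributeName": "id"}, "e1")
note = render(entry,
              {"StyleName": "TypedElement", "ElementName": "termNote",
               "TypeValue": "partOfSpeech"},
              "N")
pos = render(entry, {"StyleName": "ValuedElement", "ElementName": "pos"}, "N")
noun_label = simple_value({"StyleName": "Simple", "Value": "Nom"})
```

Under this reading, the three styles marked "not valid for annotations" are exactly the ones that place the value in an XML attribute, which cannot hold nested annotation markup.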

Dealing with languages

Two types of languages
- Working language
  – The language used at a given place in a document, along the XML hierarchy
  – Representation: xml:lang
- Object language
  – The language you are speaking about at a given place in your terminological entry (e.g. it describes the Language Section level)
  – Representation: as a data category "language", with a narrow scope
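Because the working language follows the XML hierarchy, an element without its own xml:lang takes the value of its nearest ancestor that has one. The Python sketch below computes that effective value for every element. The element names in the sample (`entry`, `definition`, `term`) are hypothetical and are not taken from DXLT or GMT; the text values are the ones used in the examples of this tutorial.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes xml:lang under the reserved XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def working_languages(elem, inherited=None, out=None):
    """Return (tag, effective xml:lang) pairs in document order."""
    if out is None:
        out = []
    # An explicit xml:lang overrides the value inherited from the ancestors.
    lang = elem.get(XML_LANG, inherited)
    out.append((elem.tag, lang))
    for child in elem:
        working_languages(child, lang, out)
    return out


SAMPLE = """<entry xml:lang="en">
  <definition xml:lang="fr">Une valeur entre 0 et 1 utilisée...</definition>
  <term>alpha smoothing factor</term>
</entry>"""

languages = working_languages(ET.fromstring(SAMPLE))
```

The object language is not handled here: it is ordinary entry content (a "language" data category), so it would be read like any other data category value rather than inherited.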

Example: DXLT
[XML markup lost in extraction; surviving content:]
Une valeur entre 0 et 1 utilisée...
alpha smoothing factor
fullForm

Example: GMT
[XML markup lost in extraction; surviving content:]
en
Une valeur entre 0 et 1 utilisée...
alpha smoothing factor
fullForm

Conclusion
– A general model for analysing and representing terminological data collections
– An underlying formalism expressed in XML and RDF
– Associated tools (Salt project): DCSEditor, DCSBrowser, automatic generation of XSLT filters and XML schemas from a given TML specification

Useful pointers
- SALT project
- The TMF site