Interchange using TBX 8 th Metadata conference Berlin April 2005 Alan K. Melby Brigham Young University, Provo campus.

Slides:



Advertisements
Similar presentations
OLIF V2 Gr. Thurmair April OLIF April 2000 OLIF: Overview Rationale Principles Entries Descriptions Header Examples Status.
Advertisements

Using OLIF, The Open Lexicon Interchange Format Susan McCormick OLIF2 Consortium October 1, 2004.
Can I Use It, and If so, How? Christian Lieske SAP AG – MultiLingual Technology Discussion of Consortium Proposal for OLIF2 File Header.
XML Schema Heewon Lee. Contents 1. Introduction 2. Concepts 3. Example 4. Conclusion.
Health IT Workforce Curriculum Version 1.0 Fall Networking and Health Information Exchange Unit 4e Basic Health Data Standards Component 9/Unit.
Interoperability of Distributed Component Systems Bryan Bentz, Jason Hayden, Upsorn Praphamontripong, Paul Vandal.
LREC 2000 Athens; Gerhard Budin and Alan Melby Accessibility of Multilingual Terminological Resources Current Problems and Prospects for the Future Gerhard.
MLIF: A Metamodel to Represent and Exchange Multilingual Textual Information ISO TC37 SC4 WG Samuel Cruz-Lara, Gil Francopoulo, Laurent Romary,
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Chapter 2 Database Environment.
XML e X tensible M arkup L anguage 1/23.
15 November Review Introduction to Databases. Take Home: Hand In.
XML Prashant Karmarkar Brendan Nolan Alexander Roda.
Chapter9 Introduction to Information Technology Turban, Rainer and Potter John Wiley & Sons, Inc. Copyright 2005.
XML Introduction What is XML –XML is the eXtensible Markup Language –Became a W3C Recommendation in 1998 –Tag-based syntax, like HTML –You get to make.
XML(EXtensible Markup Language). XML XML stands for EXtensible Markup Language. XML is a markup language much like HTML. XML was designed to describe.
Jennifer Widom XML Data XML Schema. Jennifer Widom XML Schema “Valid” XML Adheres to basic structural requirements  Also adheres to content-specific.
4/20/2017.
Introduction to Geospatial Metadata – FGDC CSDGM National Coastal Data Development Center A division of the National Oceanographic Data Center Please .
TMF - a tutorial TMF - Terminological Markup Framework Laurent Romary - Laboratoire Loria.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Database Environment 1.  Purpose of three-level database architecture.  Contents of external, conceptual, and internal levels.  Purpose of external/conceptual.
Training Course 2 User Module Training Course 3 Data Administration Module Session 1 Orientation Session 2 User Interface Session 3 Database Administration.
Provo, 16 Aug 2007 LMF meeting 1 Lexical Markup Framework: ISO Provo meeting Gil Francopoulo.
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
CPS120: Introduction to Computer Science The World Wide Web Nell Dale John Lewis.
Why XML ? Problems with HTML HTML design - HTML is intended for presentation of information as Web pages. - HTML contains a fixed set of markup tags. This.
“Integrating Standards in Practice” 10th Open Forum on Metadata Registries July 9-11, 2007 New York City, NY USA An international conference to share and.
Experiments with ODD outside the TEI framework Laurent Romary & Piotr Banski The ISO-TEI connection.
1 © Netskills Quality Internet Training, University of Newcastle Introducing XML © Netskills, Quality Internet Training University.
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
9/14/2012ISC329 Isabelle Bichindaritz1 Database System Life Cycle.
XML BIS4430 – unit 10. XML Origins Extensible Markup Language (XML) 1998 Inspired by Standard Generalized Markup Language (SGML) and HTML. SGML defines.
What is XML?  XML stands for EXtensible Markup Language  XML is a markup language much like HTML  XML was designed to carry data, not to display data.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. TBX TermBase Exchange Format.
Chapter 27 The World Wide Web and XML. Copyright © 2004 Pearson Addison-Wesley. All rights reserved.27-2 Topics in this Chapter The Web and the Internet.
Computer Science 101 Database Concepts. Database Collection of related data Models real world “universe” Reflects changes Specific purposes and audience.
CountryData Technologies for Data Exchange SDMX Information Model: An Introduction.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
XML A web enabled data description language 4/22/2001 By Mark Lawson & Edward Ryan L’Herault.
WEB BASED DATA TRANSFORMATION USING XML, JAVA Group members: Darius Balarashti & Matt Smith.
ISO a tutorial Part 2: Representing data categories TMF - Terminological Markup Framework Laurent Romary - Laboratoire Loria.
XLIFF 2.0 GLOSSARY MODULE / TBX-BASIC Facilitating Interoperability and Compatibility.
Chapter 27 The World Wide Web and XML. Copyright © 2004 Pearson Addison-Wesley. All rights reserved.27-2 Topics in this Chapter The Web and the Internet.
XML Introduction. What is XML? XML stands for eXtensible Markup Language XML stands for eXtensible Markup Language XML is a markup language much like.
Part4 Methodology of Database Design Chapter 07- Overview of Conceptual Database Design Lu Wei College of Software and Microelectronics Northwestern Polytechnical.
© Copyright 2013 STI INNSBRUCK “How to put an annotation in HTML?” Ioannis Stavrakantonakis.
DATABASE MANAGEMENT SYSTEM ARCHITECTURE
CS 157B: Database Management Systems II February 11 Class Meeting Department of Computer Science San Jose State University Spring 2013 Instructor: Ron.
TMF - Terminological Markup Framework Laurent Romary Laboratoire LORIA (CNRS, INRIA, Universités de Nancy) ISO meeting London, 14 August 2000.
Internet & World Wide Web How to Program, 5/e. © by Pearson Education, Inc. All Rights Reserved.2.
ISO CD Editorial and technical comments. Contact Mailing list Subject: sub FirstName LastName.
ISO TC 37/CLARIN SEMANTIC DATA REGISTRY WORKSHOP UTRECHT, DECEMBER ISOcat: Metadata Registry SUE ELLEN WRIGHT DECEMBER 2013.
Dictionary based interchanges for iSURF -An Interoperability Service Utility for Collaborative Supply Chain Planning across Multiple Domains David Webber.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Developing OLIF, Version 2 Susan M. McCormick Christian Lieske OLIF2 Consortium SAP/Walldorf, Germany.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
26 January 2016CountrySTAT Training for the Philippines Introduction to FAOSTAT and CountrySTAT 1 Overview of the FAOSTAT and CountrySTAT Candido J. Astrologo,
ISO TMF - Terminological Markup Framework Laurent Romary - Laboratoire Loria.
Statistical Data and Metadata Exchange SDMX Metadata Common Vocabulary Status of project and issues ( ) Marco Pellegrino Eurostat
Introduction to XML Kanda Runapongsa Dept. of Computer Engineering Khon Kaen University.
SEMI-STRUCTURED DATA (XML) 1. SEMI-STRUCTURED DATA ER, Relational, ODL data models are all based on schema Structure of data is rigid and known is advance.
Metadata Michael J. Watts
The Object-Oriented Thought Process Chapter 11
ADAC-Practical Applications: ADAC XML Data Transfer Implementation in Trimble Business Centre Chris Power.
Eugenia Fernandez IUPUI
Microsoft Access 2003 Illustrated Complete
Chapter 1 Database Systems
Developing a Data Model
What is TBX? TermBase eXchange
Presentation transcript:

Interchange using TBX 8 th Metadata conference Berlin April 2005 Alan K. Melby Brigham Young University, Provo campus

Open Forum Berlin 2005 © Alan K. Melby of 18 Outline  (A) Introduction to TBX  (B) One Use for TBX: interchange  (C) Checking for TBX compliance

Open Forum Berlin 2005 © Alan K. Melby of 18 A: Introduction to TBX  TBX® stands for TermBase eXchange  TBX is a Terminological Markup Framework (TMF) markup language  TMF is an ISO standard (16642)  TBX is consistent with ISO (MARTIF)  TBX is maintained by OSCAR (  The TBX specification is free  But why should I care about TBX?

Open Forum Berlin 2005 © Alan K. Melby of 18 Who Should Care about TBX?  If you don’t care about terminological consistency in your translation/localization work, then you have no reason to care about TBX  If you only need a simple bilingual list of terms (source term and target term) with no additional information, then you don’t need TBX; just use a two-column spreadsheet for your list

Open Forum Berlin 2005 © Alan K. Melby of 18 On the other hand …  If you do care about terminological consistency and you maintain one or more terminology databases (termbases), then you should be interested in TBX, unless you want your termbase to be locked into the terminology management software you are currently using.  Portability of complex terminological data is the key benefit of TBX

Open Forum Berlin 2005 © Alan K. Melby of 18 What does TBX look like?  A TBX file is an XML document  A TBX file consists of:  A header that describes the file  A set of entries, one per concept in the termbase  For each concept, a set of terms, grouped by language, that designate the concept  A terminological concept entry (termEntry)  Can be multilingual  Can be monolingual

Open Forum Berlin 2005 © Alan K. Melby of 18 Example of a TBX file  [+ ref to DTD/schema]   [ global info ]   [concept entries] 

Open Forum Berlin 2005 © Alan K. Melby of 18 TBX header   from AKM Berlin   SYSTEM "TBXDCSv05c.xml" 

Open Forum Berlin 2005 © Alan K. Melby of 18 Parts of a Paintbrush - Handle - Ferrule (with crimp to attach it to handle) - Brush tip (with adhesive to attach to ferrule)

Open Forum Berlin 2005 © Alan K. Melby of 18 TBX body   [concept: brush]   [concept: ferrule] 

Open Forum Berlin 2005 © Alan K. Melby of 18 Concept Entry for “brush”   art  An instrument composed of bristles, or other like material, set in a suitable handle and used for various purposes, as in laying on colors   brush   pinceau 

Open Forum Berlin 2005 © Alan K. Melby of 18 Concept Entry for “ferrule”   art  the metal tube connecting the brush to the handle   ferrule   virole 

Open Forum Berlin 2005 © Alan K. Melby of 18 Relationship between TBX and TMF  TBX is a TML (Terminological Markup Language) of TMF (ISO 16642) ( see Annex B )  TBX maps to the TMF meta-model  A TBX file is a TDC (terminological data collection)  martifHeader provides GI (global information)  termEntry: TE (terminological entry)  langSet: LS (language section)  tig/ntig: TS (term section)  A TMF DCS (Data Category Selection) in TBX is in XCS (eXtensible Constraint Specification) format

Open Forum Berlin 2005 © Alan K. Melby of 18 TMF and lexical resources  In general, a terminological resource is organized into concept entries, each of which includes one or more terms designating a particular concept  In general, a lexical resource is organized into lexical entries, each of which includes one or more senses of a particular lexical item (a word or phrase)  A concept entry containing multiple terms can be split into multiple lexical entries, one per term, and multiple lexical entries associated with the same concept can be combined into one concept entry

Open Forum Berlin 2005 © Alan K. Melby of 18 B: One use for TBX: interchange  A termbase is a collection of concept entries, each associated with a subject field  There are many different data models for termbases  Information is not directly transferable between termbases with different data models  TBX facilitates interchange among termbases

Open Forum Berlin 2005 © Alan K. Melby of 18 Some obstacles to interchange  Not all termbases include the same data categories  e.g. one allows geographicalUsage; another not  Some data categories are required in one but optional in another  Some data categories appear at different levels of the entry structure in different termbases  e.g. definition is at concept vs. language level

Open Forum Berlin 2005 © Alan K. Melby of 18 When could TBX be used for exchange?  An organization requires a translation and supplies terminology for that translation. If the organization’s terminology is in TBX and the translation/localization tool supports TBX, then terminology can be imported.  A document production system includes multiple terminology-aware software components. TBX can be used as the interchange format between components.  TBX facilitates information interchange among termbases with different data models

Open Forum Berlin 2005 © Alan K. Melby of 18 C: Checking for TBX compliance  TBX compliance has three requirements:  Well-formedness (an XML notion that can be checked with any XML parser)  Validity (an XML notion that can be check using any validating XML parser and the TBX core DTD or core XSD schema)  Adherence to the DCS named in the martifHeader Option one: use the open source TBX checker Option two: generate an elaborated XSD schema

Open Forum Berlin 2005 © Alan K. Melby of 18 For More Information TBX webpage: or Alan K. Melby Translation Research Group Brigham Young University Department of Linguistics Provo, Utah USA