ITS 2.0 in XLIFF 2 FEISGILTT Dublin June 2014 Yves Savourel ENLASO Corporation This presentation was made possible by.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Meta Data Larry, Stirling md on data access – data types, domain meta-data discovery Scott, Ohio State – caBIG md driven architecture semantic md Alexander.
Bringing Procedural Knowledge to XLIFF Prof. Dr. Klemens Waldhör TAUS Labs & FOM University of Applied Science FEISGILTT 16 October 2012 Seattle, USA.
Table, List, Blocks, Inline Style
Introduction to HTML & CSS
HTML Overview - Cascading Style Sheets (CSS). Before We Begin Make a copy of one of your HTML file you have previously created Make a copy of one of your.
1 XSLT – eXtensible Stylesheet Language Transformations Modified Slides from Dr. Sagiv.
ISO DSDL ISO – Document Schema Definition Languages (DSDL) Martin Bryan Convenor, JTC1/SC18 WG1.
SPECIAL TOPIC XML. Introducing XML XML (eXtensible Markup Language) ◦A language used to create structured documents XML vs HTML ◦XML is designed to transport.
 Fundamentals of Web Design.  Describe the history and theory of XHTML  Understand the rules for creating valid XHTML documents  Apply a DTD to an.
Helping people find content … preparing content to be found Enabling the Semantic Web Joseph Busch.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
Lecture17 SMIL - II. Layouts Dividing Space into Regions Arranging media in smil is done (usually) as a two-step process. First, a region is created,
Tutorial 9 Working with XHTML. XP Objectives Describe the history and theory of XHTML Understand the rules for creating valid XHTML documents Apply a.
Creating a Well-Formed Valid Document. 2 Objectives Introducing XHTML Creating a Well-Formed Document Creating a Valid Document Creating an XHTML Document.
Data Modeling Entity - Relationship Models. Models Used to represent unstructured problems A model is a representation of reality Logical models  show.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
(C) 2013 Logrus International Practical Visualization of ITS 2.0 Categories for Real World Localization Process Part of the Multilingual Web-LT Program.
XP New Perspectives on XML Tutorial 4 1 XML Schema Tutorial – Carey ISBN Working with Namespaces and Schemas.
Cascading Style Sheets (CSS) Instructor: Mr. Ahmed Al Astal ITGD4104 Department Requirement for senior student University of Palestine Faculty of IT.
Cascading Style Sheets Dreamweaver. Styles Determine how the HTML code will display Determine how the HTML code will display Gives designers much more.
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
XML-to-Relational Schema Mapping Algorithm ODTDMap Speaker: Artem Chebotko* Wayne State University Joint work with Mustafa Atay,
XP Tutorial 9New Perspectives on Creating Web Pages with HTML, XHTML, and XML 1 Working with XHTML Creating a Well-Formed Valid Document Tutorial 9.
Chapter 10: XML.
XML CPSC 315 – Programming Studio Fall 2008 Project 3, Lecture 1.
Chapter 1 XHTML: Part I The Web Warrior Guide to Web Design Technologies.
Website Development & Management Creating Web Pages CIT Fall Instructor: John Seydel, Ph.D.
XML Syntax - Writing XML and Designing DTD's
1 CIS336 Website design, implementation and management (also Semester 2 of CIS219, CIS221 and IT226) Lecture 6 XSLT (Based on Møller and Schwartzbach,
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
Processing of structured documents Spring 2002, Part 2 Helena Ahonen-Myka.
Tyler Snow Brigham Young University Translation Research Group.
1 Chapter 10: XML What is XML What is XML Basic Components of XML Basic Components of XML XPath XPath XQuery XQuery.
XP Tutorial 9 1 Working with XHTML. XP SGML 2 Standard Generalized Markup Language (SGML) A standard for specifying markup languages. Large, complex standard.
Cascading Style Sheets Dreamweaver. Styles Determine how the HTML code will display Determine how the HTML code will display Gives designers much more.
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
1 CIS336 Website design, implementation and management (also Semester 2 of CIS219, CIS221 and IT226) Lecture 5 XML Schema (Based on Møller and Schwartzbach,
1 XML eXtensible Markup Language. 2 XML vs. HTML HTML is a HyperText Markup language HTML is a HyperText Markup language Designed for a specific application,
XML 2nd EDITION Tutorial 4 Working With Schemas. XP Schemas A schema is an XML document that defines the content and structure of one or more XML documents.
XLIFF 2.0 vs XLIFF 1.2 FEISGILTT Dublin June 2014 Yves Savourel ENLASO Corporation This presentation was made possible by.
INT222 - Internet Fundamentals Shi, Yue (Sunny) Office: T2095 SENECA COLLEGE.
FEISGILTT Dublin 2014 Yves Savourel ENLASO Corporation QuEst Integration in Okapi This presentation was made possible by This project is sponsored by the.
Internet & World Wide Web How to Program, 5/e. © by Pearson Education, Inc. All Rights Reserved.2.
Dictionary based interchanges for iSURF -An Interoperability Service Utility for Collaborative Supply Chain Planning across Multiple Domains David Webber.
Machine Translate Post Edit Quality Check Extract Content I18N Text Analysis Curate Corpora Workflow Analysis Segment Identify Terms Translate Provenance.
Introduction to XML XML – Extensible Markup Language.
Microsoft ® Official Course Structuring and Publishing Websites for All Users Microsoft SharePoint 2013 SharePoint Practice.
Games: XML Presented by: Idham bin Mat Desa Mohd Sharizal bin Hamzah Mohd Radzuan bin Mohd Shaari Shukor bin Nordin.
XP Tutorial 9New Perspectives on HTML and XHTML, Comprehensive 1 Working with XHTML Creating a Well-Formed Valid Document Tutorial 9.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
XPath --XML Path Language Motivation of XPath Data Model and Data Types Node Types Location Steps Functions XPath 2.0 Additional Functionality and its.
Experience with XML Schema Ashok Malhotra Schema Usage  Mapping XML Schema and XML documents controlled by the Schema to object classes and instances.
Tutorial 9 Working with XHTML. New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition 2 Objectives Describe the history and theory of XHTML.
Jackson, Web Technologies: A Computer Science Perspective, © 2007 Prentice-Hall, Inc. All rights reserved Chapter 7 Representing Web Data:
Tutorial 9 Working with XHTML. XP Objectives Describe the history and theory of XHTML Understand the rules for creating valid XHTML documents Apply a.
SNU OOPSLA Lab. A Tour of XML © copyright 2001 SNU OOPSLA Lab.
A report by Olaf-Michael Stefanov to the JIAMCATT community
Why XLIFF and Why XLIFF 2 ? David Filip OASIS XLIFF OMOS TC Chair
Unit 4 Representing Web Data: XML
Creating a Well-Formed Valid Document
Tutorial 9 Working with XHTML
Data Modeling II XML Schema & JAXB Marc Dumontier May 4, 2004
Tutorial 9 Working with XHTML
Part of the Multilingual Web-LT Program
Part of the Multilingual Web-LT Program
CSE591: Data Mining by H. Liu
Use Cases Simple Machine Translation (using Rainbow)
Linked Data Reuse in the Language Services Industry
New Perspectives on XML
Presentation transcript:

ITS 2.0 in XLIFF 2 FEISGILTT Dublin June 2014 Yves Savourel ENLASO Corporation This presentation was made possible by

Why the mapping? ITS 2.0 provides many data categories that match or complete XLIFF metadata. ITS 2.0 has a mapping to XLIFF 1.2 Having a mapping for XLIFF 2 make sense Mapping done by the ITS Interest Group Goal is to create a new XLIFF 2 module

Types of mapping Data categories not used directly in XLIFF (typically non-metadata data categories e.g. Id Value) Use existing XLIFF metadata: e.g. Translate Use ITS markup directly: e.g. Text Analysis Use a mixed mapping: e.g. Terminology

Marker type Use type="its:any" in most cases Mix of data categories can share one annotation of type its:any Any data category that can use its:any can use other marker types too Exceptions: – Terminology (only type: term or its:term-no ) – Localization Note (only type: comment )

Translate The XLIFF translate attribute has the exact same syntax and semantics as in ITS. In,, and / elements. Example

Localization Note with either a value attribute or a ref attribute. Note that ref attribute must point to an internal within the unit priority="1" is locNoteType="alert", other priority values map to "description"

Terminology with ref mapping to its:termInfoRef istxlf:termConfidence for its:termConfidence Use type="its:term-no" for its:term="no" Challenging to implement because mix of Core + ITS features (3 different namespaces) Example

Directionality Not mapped yet XLIFF 2.0 has srcDir, trgDir, dir (values: ltr, rtl or auto ) Inside content: uses Unicode control characters

Language Information In element: – Use for the source language – Use for the target language Inline: – Use xml:lang in

Elements Within Text Not used directly in XLIFF, but it drives what XLIFF element is used when extracting: withinText="no" : go to withinText="yes" : go to,, or withinText="nested" : go to separate. With subFlows attribute in parent.

Domain Use istxlf:domains attribute. In and elements Example

Text Analysis Use ITS native attributes. In element. Example

Locale Filter Use ITS native attributes Add translate="yes|no" if the annotation is generated when the target language of the document is known. In and elements.

Provenance Use ITS native attributes and elements. Stand-off elements at the level. Or should it be at the level? Applies to the target content: Single instance or reference to stand-off list in element. Example

External Resource Not mapped yet Mapping would likely be related to the Resource Data module.

Target Pointer Target of the original document is in the elements. selector="//xlf:unit/xlf:source" targetPointer="../xlf:target"

Id Value Use the name attribute of the element to store the original ID values Using Id Value data category on XLIFF is not really useful as there are no document-wide unique IDs.

Preserve Space Use xml:space like ITS In and elements Note that is by default xml:space="preserve" while other elements inherit from parents (and default is xml:space="default"

Localization Quality Issue Use ITS native attributes In elements, with stand-off notation at the level. Example

Localization Quality Rating Not mapped yet In some aspects similar to ’s matchQuality (which is mapped to MT Confidence) But has two representations: a score and a number of votes, so using the native ITS attributes may be simpler

MT Confidence In the Translation Candidates module: – Use matchQuality (scaled to ) – In element Normal inline content: – Use ITS native attributes – In element Example

Allowed Characters Use ITS native attributes In element Example

Storage Size Not mapped yet To map with the Size and Length Restriction module

A few links ITS 2.0 Specification XLIFF 2.0 Specification ITS 2.0 mapping for XLIFF 2: Okapi XLIFF Toolkit (implements the mapping):