(C) 2013 Logrus International Practical Visualization of ITS 2.0 Categories for Real World Localization Process Part of the Multilingual Web-LT Program.

Slides:



Advertisements
Similar presentations
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Advertisements

What is XML? a meta language that allows you to create and format your own document markups a method for putting structured data into a text file; these.
HTML5 and CSS3 Illustrated Unit B: Getting Started with HTML
SRDC Ltd. 1. Problem  Solutions  Various standardization efforts ◦ Document models addressing a broad range of requirements vs Industry Specific Document.
XML Unit 6 October 31. XML, review XML is used to markup data Used to describe information Uses tags like HTML –But all tags are user-defined –Must be.
Unit 2, cont. September 14 HTML,Validating your pages, Publishing your site.
Exploring Microsoft® Office Grauer and Barber 1 Committed to Shaping the Next Generation of IT Experts. Robert Grauer and Maryann Barber Using.
Introducing HTML & XHTML:. Goals  Understand hyperlinking  Understand how tags are formed and used.  Understand HTML as a markup language  Understand.
Tutorial 1: Getting Started with HTML5
ECA 228 Internet/Intranet Design I Intro to XSL. ECA 228 Internet/Intranet Design I XSL basics W3C standards for stylesheets – CSS – XSL: Extensible Markup.
Chapter 12 Creating and Using XML Documents HTML5 AND CSS Seventh Edition.
Basics of HTML.
 Using Microsoft Expression Web you can: › Create Web pages and Web sites › Set what you site will look like as you design it › Add text, images, multimedia.
Pemrograman Berbasis WEB XML part 2 -Aurelio Rahmadian- Sumber: w3cschools.com.
Overview of Previous Lesson(s) Over View  ASP.NET Pages  Modular in nature and divided into the core sections  Page directives  Code Section  Page.
Lesson 4: Using HTML5 Markup.  The distinguishing characteristics of HTML5 syntax  The new HTML5 sectioning elements  Adding support for HTML5 elements.
Introduction to XML cs3505. References –I got most of this presentation from this site –O’reilly tutorials.
XP 1 CREATING AN XML DOCUMENT. XP 2 INTRODUCING XML XML stands for Extensible Markup Language. A markup language specifies the structure and content of.
An Introduction to XML Presented by Scott Nemec at the UniForum Chicago meeting on 7/25/2006.
HTML (HyperText Markup Language)
Tutorial 1 Developing a Basic Web Page. New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition Objectives – Lesson 1 Introduction to the.
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
Miscellaneous Excel Combining Excel and Access. – Importing, exporting and linking Parsing and manipulating data. 1.
NASRULLAH KHAN.  Lecturer : Nasrullah   Website :
Section 4.1 Format HTML tags Identify HTML guidelines Section 4.2 Organize Web site files and folder Use a text editor Use HTML tags and attributes Create.
HTML 4 Foundation Level Course HyperText Markup Language Most common language used in creating Web documents. You can use HTML to create cross-platform.
XP Dreamweaver 8.0 Tutorial 3 1 Adding Text and Formatting Text with CSS Styles.
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
© Copyright 2008 STI INNSBRUCK NLP Interchange Format José M. García.
Session 1 SESSION 1 Working with Dreamweaver 8.0.
Chapter 13. Applets and HTML HTML Applets Computer Programming with JAVA.
XML 2nd EDITION Tutorial 1 Creating An Xml Document.
HTML: Hyptertext Markup Language Doman’s Sections.
(C) 2014 Logrus International Visualizing ITS 2.0 Categories for the localization process.
1 Introduction  Extensible Markup Language (XML) –Uses tags to describe the structure of a document –Simplifies the process of sharing information –Extensible.
Introduction to XML This presentation covers introductory features of XML. What XML is and what it is not? What does it do? Put different related technologies.
Session: 1. © Aptech Ltd. 2Introduction to the Web / Session 1  Explain the evolution of HTML  Explain the page structure used by HTML  List the drawbacks.
Accessing Data Using XML CHAPTER NINE Matakuliah: T0063 – Pemrograman Visual Tahun: 2009.
Translation Memory System (TMS)1 Translation Memory Systems Presentation by1 Melina Takanen & Julianna Ekert CAT Prof. Thorsten Trippel University.
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
© Copyright 2013 STI INNSBRUCK “How to put an annotation in HTML?” Ioannis Stavrakantonakis.
DMED1100 InDesign Advanced Class 8. Agenda  Scripting  Introduction to XML 2.
Tutorial 3 Adding and Formatting Text with CSS Styles.
Unit 3 — Advanced Internet Technologies Lesson 11 — Introduction to XSL.
ITS 2.0 in XLIFF 2 FEISGILTT Dublin June 2014 Yves Savourel ENLASO Corporation This presentation was made possible by.
HTML Basics. HTML Coding HTML Hypertext markup language The code used to create web pages.
XML CSC1310 Fall HTML (TIM BERNERS-LEE) HyperText Markup Language  HTML (HyperText Markup Language): December  Markup  Markup is a symbol.
Microsoft Expression Web 3 – Illustrated Unit D: Structuring and Styling Text.
Chapter 1 Introduction to JavaScript JavaScript, Third Edition.
Basic HTML Document Structure. Slide 2 Goals (XHTML HTML5) XHTML Separate document structure and content from document formatting HTML 5 Create a formal.
Chapter 29. Copyright 2003, Paradigm Publishing Inc. CHAPTER 29 BACKNEXTEND 29-2 LINKS TO OBJECTIVES Attach an XML Schema Attach an XML Schema Load XML.
CHAPTER NINE Accessing Data Using XML. McGraw Hill/Irwin ©2002 by The McGraw-Hill Companies, Inc. All rights reserved Introduction The eXtensible.
Connecting to External Data. Financial data can be obtained from a number of different data sources.
Customizing Share Document Previews Will Abson Senior Integrations Engineer and Share Extras Project Lead
XML Notes taken from w3schools. What is XML? XML stands for EXtensible Markup Language. XML was designed to store and transport data. XML was designed.
HTML5 and CSS3 Illustrated Unit B: Getting Started with HTML.
C Copyright © 2009, Oracle. All rights reserved. Using SQL Developer.
Making the Most of Search in SharePoint 2013 Christina Wheeler.
Getting Started with CSS
XML QUESTIONS AND ANSWERS
Open Source CAT Tool.
Basic HTML Document Structure
Introducing HTML & XHTML:
Part of the Multilingual Web-LT Program
XML Data Introduction, Well-formed XML.
Part of the Multilingual Web-LT Program
CSE591: Data Mining by H. Liu
Use Cases Simple Machine Translation (using Rainbow)
Extensible Markup Language (XML)
HTML5 and CSS3 Illustrated Unit B: Getting Started with HTML
Presentation transcript:

(C) 2013 Logrus International Practical Visualization of ITS 2.0 Categories for Real World Localization Process Part of the Multilingual Web-LT Program

(C) 2013 Logrus International WHAT IS ITS AND WHY IT’S SO IMPORTANT  The Internationalization Tag Set (ITS) is a set of attributes and elements designed to provide internationalization and localization support in XML and HTML documents. It also defines implementations of these conceptsInternationalization Tag Set (ITS)  XML developers can use this namespace to integrate internationalization features directly into their own XML schemas and documents  The set is currently almost ready/frozen  We believe that this is a one of the key standards for localization industry  The set includes a number of categories of crucial importance to translators:  Terminology note and Localization Note metadata  Translate (yes/no) metadata to mark non-translatable text  ITS metadata make it possible to include various instructions for translators into documents, add terminology and comments, and mark non-translatable segments  Will reduce inconsistency in adding translation instructions to documents  Provides a universal interface for transferring translation metadata between tools

(C) 2013 Logrus International WHY ARE WE DOING THIS: DETAILS  To make it possible to comment translatable content irrespective of its nature  To make these instructions easily accessible to translators and editors  Including recommendations, instructions, terminology suggestions  Independent from translation tools  Saving time: The text is already marked with context information  One doesn’t have to think whether smth. NEEDS TO BE TRANSLATED or not  One doesn’t have to think whether smth. IS A TERM or not  Key advantages/improvements:  Time (i.e. cost)  Quality (fewer translation errors)  Also very important for machine translation applications (post-editing in context)

(C) 2013 Logrus International WHY ARE WE DOING THIS: WORKFLOW PARADIGM CHANGE  FROM:  Bulk manual translation of “raw” content or post-editing “raw” machine-translation output  When external terminology glossaries, localization instructions and reference data are matched with content in indirect manner mostly in translator’s brain on-the-fly and to the extent of his/her understanding of these instructions and personal skills  TO:  Using natural language processing (NLP) tools and ITS metadata markup to pre- populate content to be translated or post-edited with context-related information  When external terminology glossaries, localization instructions and reference data are matched with content directly through automated process of preliminary linguistic analysis  Pre-processing is controlled by dedicated qualified linguists/terminologists/editors  PROVIDED THAT:  Glossaries, instructions and reference data are converted into format compatible with NLP tools and ITS markup  And corresponding content searching algorithms are created (including fuzzy algorithms)

(C) 2013 Logrus International WHAT IS BEING DEVELOPED  ITS 2.0 implementation project, a part of the Multilingual Web-LT program funded by EU  Developing the ITS Browser Plugin as a building block of future “Work In Context System” (WICS)  Making it possible to view standard ITS (Internationalization Tag Set) translation- related metadata contained in XML, XLIFF, or HTML files  Can be done in parallel with translating using CAT tools or for reviewing materials  The JavaScript plugin would support most popular browsers  For previewing XML or XLIFF, standalone filters for conversion into HTML will be used  Implementation:  Standard-based preview solution: HTML5, Java Script, Web browser  A script located in the same folder as HTML files  The script is started by the browser automatically  It is expected that both scripts and filters will be publicly available

(C) 2013 Logrus International THE PROJECT IDEA  ITS metadata-enriched XML or XLIFF files: what’s inside?  Previewing ITS metadata in Web browser while translating content in any CAT tool  Standard-based preview solution: HTML5, Java Script, Web browser  Next step: ITS metadata as a carrier for localization instructions and any reference data

(C) 2013 Logrus International THE WORK BREAKDOWN: PROJECT COMPONENTS  Visual designs  Java scripts to render and navigate metadata and content  Rich sample files  Content format conversion algorithms:  XML+ITS -> HTML5+ITS*  XLIFF+ITS -> HTML5+ITS*  XML+ITS -> XLIFF+ITS (just an example)  HTML+ITS -> HTML5+ITS* * For the purposes of visualization, some redundant ITS syntax options for HTML are not supported.

(C) 2013 Logrus International THE PROJECT CORE: VISUAL DESIGNS  Screen space limitations in localization process:

(C) 2013 Logrus International THE PROJECT CORE: VISUAL DESIGNS (CONT.)  Collapsed view of metadata

(C) 2013 Logrus International THE PROJECT CORE: VISUAL DESIGNS (CONT.)  Expanded view of metadata

(C) 2013 Logrus International THE PROJECT CORE : VISUAL DESIGNS (CONT.)  Summary view of metadata

(C) 2013 Logrus International THE PROJECT CORE: VISUAL DESIGNS (CONT.)  Color highlighting to indicate metadata linked to content

(C) 2013 Logrus International THE PROJECT CORE: VISUAL DESIGNS (CONT.)  Visual “tags” to indicate metadata linked to content

(C) 2013 Logrus International THE PROJECT CORE: VISUAL DESIGNS (CONT.)  Visual tags to highlight metadata (example)

(C) 2013 Logrus International DEVELOPMENT STATUS  Sample files: to be completed by end of May  File conversion algorithms: to be completed by Sep 30:  XML+ITS -> XLIFF+ITS (July) (sample)  XML+ITS -> HTML5+ITS (August)  HTML+ITS -> HTML5+ITS (August)  XLIFF+ITS -> HTML5+ITS (September)  Visualization scripts: to be completed by end of June

(C) 2013 Logrus International KNOWN ISSUES: FORMAT CONVERSIONS  “Translation” of XPath expressions from source XML to target HTML  XLIFF: MRK element to be used instead of SPAN  Selection between SPAN and DIV elements in output HTML  Merging external ITS rule files into internal list of rules

(C) 2013 Logrus International KNOWN ISSUES: METADATA VISUALIZATION  Parsing local standoff markup along with other rules  Parsing list of merged ITS rules  Hyperlinks embedded in metadata  Static definitions like “Do not translate” for Translate category  Highlighting active ITS item  Displaying summary of all ITS items  Parsing nested ITS metadata  Differences in Java Script implementation between browsers  Navigation through content and ITS items  Fragmentation of content to avoid large pieces of text to be displayed

(C) 2013 Logrus International LIVE DEMO The demo samples are built on the preliminary versions of visual designs and illustrate just a few ITS data categories:  Localization Note  Terminology  Translate

(C) 2013 Logrus International THANK YOU! Questions?