Using Open Standards: Save Money and Meet Customer Needs
John Watkins, President, ENLASO


Standards are Hard…

Agenda
What are Standards
Who uses Standards
How to use Standards

What are Standards – Evolution
"SDL's Trados: SDL's computer-assisted translation software products are the de facto standard in enterprise-wide translation..." [1]
De facto standards arise from market share: influence through prevalence.
Standards may evolve from de facto standards through the cooperation of the industry and a relevant standards body.
TMX is a standard for the exchange of translation memory data (independent of the tool used to create the translation memory).
It arose through cooperation between industry and standards bodies, who agreed upon the specifications for transferring translation memory data among tools (whether proprietary or open source).
Today a wide variety of translation memory and related tools use TMX to ensure interoperability.
[1] Ignacio Garcia & Vivian Stevenson, TRADOS and the Evolution of Language Tools, Multilingual, May

What are Standards – Definitions
Standards
– Remove barriers to performing functions within an industry
– Are approved and maintained by neutral third parties with input from industry, to avoid being locked into a proprietary solution
Open Standards
– Do the above
– With transparency in development and maintenance
– And availability to the public (with accessible rights)
– Synergy with Open Source software
– We are fortunate that the core standards in use in the localization industry are, indeed, Open Standards

What are Standards – Benefits
Open Standards facilitate the interoperability of language services tools
– Freedom to work with a wide variety of tools (many proprietary tools support open standards for compatibility)
– Processes are developed independent of the tools
Customers and providers can work more easily with the various file types
The best linguists can be used regardless of their tool preference
Consequently
– Tools are not constrained
– Workflow is easier
– Projects can be faster, better, and cheaper

Who Uses Standards – Everyone
Customers: Eschew proprietary solutions
Service Providers: Support the wide variety of tools to meet customer and vendor needs
Linguists: Want flexibility with CAT tool selection
Tool Developers: Try to meet everybody's needs

Who Uses Standards – Contribute
Size doesn't matter – we can all contribute
– Mid-Size Localization Company
– Investing in Open Standards
  Member of GALA Open Standards Initiative
  Member of OASIS (XLIFF TC)
  Member of W3C (LT-Web, ITS)
  Multilingual Web-LT Project (European Commission W3C)
– Develop Open Source Tools (Okapi Framework)

Using Standards – Contribute
Direct to the Source
– OSCAR/LISA -> Disbanded
  Standards developed by OSCAR under LISA are now under the Creative Commons Attribution license – See GALA
  European Telecommunications Standards Institute (ETSI) Localization Industry Standards (LIS) Industry Specification Group as the successor for the LISA/OSCAR portfolio (TMX, TBX, SRX…):
– OASIS (XLIFF, DITA…):
– W3C (ITS, MultilingualWeb-LT – ITS 2.0):
– European Commission (LT-Web):

Using Standards – Contribute
GALA Standards Initiative
– OSCAR standards:
– Linport (localization kit standardization)
– Model Service Elements (localization task standardization)
– Coordination and representation
  QT Launchpad (DFKI - translation quality)
  W3C MultilingualWeb-LT (W3C - international web standards)
  ISO TC 37 SC5 SD (ISO translation services)
  OASIS, Unicode Consortium, OpenTM2, more to come
– GALA-Connect (working groups for members):
– Quarterly webinars on standards developments:
There are lots of ways/places to contribute!

Using Standards
1. Look at an example project
2. Identify Standards involved
3. Use Standards to provide localized files

Using Standards – Examples
Example using four standards that are stable and work well
– Translation memories
  TMX: Translation Memory eXchange [1]
  Easy exchange of translation memories among tools
– Segmentation
  SRX: Segmentation Rules eXchange [1]
  Provides a standard method to describe segmentation rules that are being exchanged among tools
– Extracted data
  ITS: Internationalization Tag Set [2]
  Used for XML to support the internationalization and localization of XML schemas and documents
  XLIFF: XML Localisation Interchange File Format [3]
  Stores localizable data and carries it from one step of the localization process to the next, while allowing interoperability among tools
[1] See GALA Open Standards:
[2] See W3C:
[3] See OASIS:

Using Standards – Open Source
Open Standards are a logical fit with Open Source tools. We work with the Okapi Framework Project [1]
– Rainbow: Toolbox with functions for pre/post processing, file conversion, encoding conversion, QA, etc.
– Pensieve: An Okapi TM engine
– Ratel: Segmentation editor
– And much more
[1] See Okapi Framework project site at:

Example Project
– Translation memory from Trados
– Translation memory from Wordfast
– Segmentation rules for the TMs
– New version of the documents to translate (from Excel and FrameMaker)
[Diagram labels: Wordfast TM, TMX 1, TMX 2, Trados TM, SRX Rules, Excel XML File, MIF File]

Means to an End
You can use the Okapi Framework to:
– Manipulate and combine translation memories
– Extract text with appropriate filters
– Edit segmentation rules and apply them to content
– Leverage from TM
– Machine translate unmatched text
– Create the translation package for the linguists
– Rebuild translated files

FrameMaker MIF File

Excel XML File

Three Tasks
1. Consolidate client TMs into a single TM
2. Prepare the translation package to send to the linguist
3. Post-process the files for delivery

Translation Memories – TMX
TMX (Translation Memory eXchange) is the standard way to store source text (segments) and the corresponding translations.
It is supported by most CAT tools.
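To make the TMX idea concrete, here is a minimal sketch in Python of reading source/target pairs from a TMX document. The sample content, the tool name in the header, and the function name `read_tmx` are assumptions made for illustration; they do not come from the presentation or from any particular CAT tool.

```python
import xml.etree.ElementTree as ET

# xml:lang lives in the reserved XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Hypothetical sample content, for illustration only.
SAMPLE_TMX = """<tmx version="1.4">
  <header creationtool="example-tool" creationtoolversion="1.0"
          segtype="sentence" o-tmf="example" adminlang="en"
          srclang="en" datatype="plaintext"/>
  <body>
    <tu>
      <tuv xml:lang="en"><seg>Save money</seg></tuv>
      <tuv xml:lang="fr"><seg>Économisez de l'argent</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="en"><seg>Open file</seg></tuv>
      <tuv xml:lang="fr"><seg>Ouvrir le fichier</seg></tuv>
    </tu>
  </body>
</tmx>"""


def read_tmx(tmx_text, src_lang="en", tgt_lang="fr"):
    """Return a list of (source, target) pairs found in a TMX string."""
    root = ET.fromstring(tmx_text)
    pairs = []
    for tu in root.iter("tu"):
        segs = {}
        for tuv in tu.findall("tuv"):
            seg = tuv.find("seg")
            if seg is not None:
                # itertext() flattens any inline markup inside <seg>.
                segs[tuv.get(XML_LANG, "").lower()] = "".join(seg.itertext())
        # Keep only units that carry both languages we asked for.
        if src_lang in segs and tgt_lang in segs:
            pairs.append((segs[src_lang], segs[tgt_lang]))
    return pairs


if __name__ == "__main__":
    for source, target in read_tmx(SAMPLE_TMX):
        print(source, "->", target)
```

Because the format is tool-neutral, the same reader would in principle work on an export from any TMX-capable tool, which is the point the slide makes.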

Wordfast TM – TMX

Trados TM – TMX

Combine TMs
Combining the two TMs into a single one. Four different tools share data through TMX.
[Diagram labels: Wordfast TM, TMX 1, TMX 2, Trados TM, Rainbow Toolbox, Pensieve TM]
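The consolidation step can be sketched as follows. This is not how Rainbow or Pensieve implement it; it is a simplified Python illustration that assumes each TM has already been read into a list of (source, target) pairs, and that the first TM wins when the same source segment appears twice. The names `merge_memories` and `to_tmx` and the header values are hypothetical.

```python
import xml.etree.ElementTree as ET

XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def merge_memories(*memories):
    """Merge lists of (source, target) pairs; the earliest entry wins."""
    merged = {}
    for memory in memories:
        for source, target in memory:
            merged.setdefault(source, target)
    return merged


def to_tmx(merged, src_lang="en", tgt_lang="fr"):
    """Serialize a {source: target} mapping as a TMX 1.4 string."""
    tmx = ET.Element("tmx", version="1.4")
    # Header values below are placeholders chosen for this sketch.
    ET.SubElement(tmx, "header", {
        "creationtool": "merge-sketch", "creationtoolversion": "0.1",
        "segtype": "sentence", "o-tmf": "unknown", "adminlang": "en",
        "srclang": src_lang, "datatype": "plaintext",
    })
    body = ET.SubElement(tmx, "body")
    for source, target in merged.items():
        tu = ET.SubElement(body, "tu")
        for lang, text in ((src_lang, source), (tgt_lang, target)):
            tuv = ET.SubElement(tu, "tuv", {XML_LANG: lang})
            ET.SubElement(tuv, "seg").text = text
    return ET.tostring(tmx, encoding="unicode")


if __name__ == "__main__":
    wordfast = [("Save money", "Économisez"), ("Open file", "Ouvrir le fichier")]
    trados = [("Open file", "Ouvrir fichier"), ("Close file", "Fermer le fichier")]
    print(to_tmx(merge_memories(wordfast, trados)))
```

A real consolidation would also need a policy for conflicting translations (newest wins, preferred vendor wins, keep both); "first wins" is only the simplest choice.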

Three Tasks
1. Consolidate the client's TM data into a single Pensieve TM (done)
2. Prepare the translation package to send to the linguist
3. Post-process the files for delivery

XML Extraction – ITS
ITS rules can be complex, but they give the owner of the source material a clear way to specify what needs to be translated. ITS-aware tools can process XML documents without guesswork.
[Diagram labels: Pipeline (Driven by Rainbow): Excel File, MIF File, ITS Rules, MIF Filter, XML Filter, Content Extraction]

For XML documents, ITS (Internationalization Tag Set) describes what needs to be extracted and how to extract it.
The W3C MultilingualWeb-LT WG has just started work on the successor to ITS 1.0.
Using ITS rules to identify localizable text in the Excel XML document.

ITS Rules
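As an illustration of what such rules look like, the sketch below embeds a small ITS 1.0 rules document and applies its `its:translateRule` entries to a sample XML file in Python. The sample element names (`workbook`, `sheet`, `cell`) are invented for this example and are not the real Excel XML vocabulary, and the evaluator is a simplification: it handles only the selector subset that Python's ElementTree supports, and it ignores attributes and tail text.

```python
import xml.etree.ElementTree as ET

ITS_NS = "http://www.w3.org/2005/11/its"

# Hypothetical rules: nothing in a sheet is translatable except text cells.
RULES_XML = """<its:rules xmlns:its="http://www.w3.org/2005/11/its" version="1.0">
  <its:translateRule selector="//sheet" translate="no"/>
  <its:translateRule selector="//cell[@type='text']" translate="yes"/>
</its:rules>"""

# Hypothetical document; element names are made up for this sketch.
DOC_XML = """<workbook>
  <sheet name="Prices">
    <cell type="number">42</cell>
    <cell type="text">Save money</cell>
    <cell type="text">Meet customer needs</cell>
  </sheet>
</workbook>"""


def load_translate_rules(rules_xml):
    """Return [(selector, translate_flag), ...] in document order."""
    root = ET.fromstring(rules_xml)
    return [(r.get("selector"), r.get("translate") == "yes")
            for r in root.findall(f"{{{ITS_NS}}}translateRule")]


def translatable_text(doc_xml, rules):
    """Collect element text that the rules mark as translatable."""
    root = ET.fromstring(doc_xml)
    direct = {}
    for selector, flag in rules:
        # ElementTree needs a relative path; "//x" becomes ".//x".
        path = "." + selector if selector.startswith("//") else selector
        for element in root.findall(path):
            direct[element] = flag  # a later rule overrides an earlier one

    found = []

    def walk(element, inherited):
        # A directly selected value beats the value inherited from the parent.
        flag = direct.get(element, inherited)
        if flag and element.text and element.text.strip():
            found.append(element.text.strip())
        for child in element:
            walk(child, flag)

    walk(root, True)  # ITS default: element content is translatable
    return found


if __name__ == "__main__":
    print(translatable_text(DOC_XML, load_translate_rules(RULES_XML)))
```

The design point is that the rules live outside the extraction tool: the content owner states what is translatable, and any ITS-aware filter can apply the same file.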

Segmentation – SRX
Sharing segmentation rules is key to sharing TMs.
[Diagram labels: Pipeline (Driven by Rainbow): Excel File, MIF File, ITS Rules, MIF Filter, XML Filter, Extraction, SRX Rules, Segmentation]

Translation is done at the segment level.
SRX (Segmentation Rules eXchange) describes where to break, or not break, the content into segments.
Having the rules used for the source segments allows better reuse of existing TMs, giving exact matches.
Maintain SRX rules with an SRX editor.

Segmentation – SRX
Don't break a segment after "VS.", "V.S.", "vs." or "v.s."
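The rule on this slide can be sketched in Python. SRX expresses each rule as a break/no-break flag plus a "before break" and an "after break" regular expression, and the first rule that matches at a position decides. The version below follows that model but uses Python's `re` module rather than the regex flavor SRX tools normally use, and the two rules shown are assumptions written for this example, not the presenter's actual SRX file.

```python
import re

# (break?, before-break pattern, after-break pattern); the first match wins.
RULES = [
    # Exception: never break after "VS.", "V.S.", "vs." or "v.s."
    (False, r"\b[Vv]\.?[Ss]\.", r"\s"),
    # Default: break after sentence-ending punctuation followed by whitespace.
    (True, r"[.?!]", r"\s"),
]


def segment(text, rules=RULES):
    """Split text into segments using SRX-style break/no-break rules."""
    compiled = [(brk, re.compile(before + r"$"), re.compile(after))
                for brk, before, after in rules]
    segments, start = [], 0
    for pos in range(1, len(text)):
        for brk, before, after in compiled:
            # The text before pos must end with `before`,
            # and the text at pos must start with `after`.
            if before.search(text[:pos]) and after.match(text, pos):
                if brk:
                    segments.append(text[start:pos].strip())
                    start = pos
                break  # first matching rule decides, as in SRX
    tail = text[start:].strip()
    if tail:
        segments.append(tail)
    return segments


if __name__ == "__main__":
    print(segment("Open vs. closed tools. Pick wisely."))
```

Rule order matters here: if the default break rule came first, the "vs." exception would never be reached, which is why exceptions are listed before general rules.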

Translation Kit – XLIFF, TMX
[Diagram: Pipeline (Driven by Rainbow)]
– Inputs: Excel File, MIF File, ITS Rules, SRX Rules
– Extraction (MIF Filter, XML Filter)
– Segmentation
– Pre-translate from TM (Pensieve TM Connector, Pensieve TM)
– Pre-translate unmatched from MT (Microsoft MT Connector, Microsoft MT)
– Translation Kit Creation
– Translation Kit: Excel XLIFF, MIF XLIFF, TMX, etc.

To flow through the translation cycle, the extracted content needs to be stored in a common format that many tools understand.
XLIFF (XML Localisation Interchange File Format) is a standard way to represent extracted data.
TMX files carry all the translation candidates found.
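A minimal sketch of the kit-creation step, assuming the content has already been extracted and segmented: it writes an XLIFF 1.2 document in Python and pre-fills the target for segments that have an exact match in a TM held as a plain dictionary. The function name `build_xliff`, the file name, and the sample data are hypothetical; a production kit would also carry inline codes, match scores, and a skeleton for rebuilding the original file.

```python
import xml.etree.ElementTree as ET

XLIFF_NS = "urn:oasis:names:tc:xliff:document:1.2"


def build_xliff(segments, tm, original, src_lang="en", tgt_lang="fr"):
    """Build an XLIFF 1.2 string; exact TM matches become <target> text."""
    ET.register_namespace("", XLIFF_NS)  # serialize with a default namespace
    xliff = ET.Element(f"{{{XLIFF_NS}}}xliff", version="1.2")
    file_el = ET.SubElement(xliff, f"{{{XLIFF_NS}}}file", {
        "original": original,
        "source-language": src_lang,
        "target-language": tgt_lang,
        "datatype": "plaintext",
    })
    body = ET.SubElement(file_el, f"{{{XLIFF_NS}}}body")
    for number, text in enumerate(segments, start=1):
        unit = ET.SubElement(body, f"{{{XLIFF_NS}}}trans-unit", id=str(number))
        ET.SubElement(unit, f"{{{XLIFF_NS}}}source").text = text
        if text in tm:
            # Exact match only; fuzzy matching is out of scope for this sketch.
            ET.SubElement(unit, f"{{{XLIFF_NS}}}target").text = tm[text]
    return ET.tostring(xliff, encoding="unicode")


if __name__ == "__main__":
    tm = {"Save money": "Économisez de l'argent"}
    print(build_xliff(["Save money", "Meet customer needs"], tm, "prices.xml"))
```

Segments without a TM match are left with no target, which is where the pipeline in the presentation hands off to machine translation or to the linguist.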

Translation Kit – XLIFF, TMX

Three Tasks
1. Consolidate the client's TM data into a single Pensieve TM (done)
2. Prepare the translation package to send to the linguist (done)
3. Post-process the files for delivery

Post-Processing
[Diagram labels: Pipeline (Driven by Rainbow): Translation Kit (Excel XLIFF, MIF XLIFF, TMX, etc.), Translator Kit Filter, Extraction, Translation Kit Post-Processing, XML Filter, MIF Filter, Excel File, MIF File]
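The first half of post-processing, reading the linguist's work back out of the returned XLIFF, might look like the sketch below. It is an illustration in Python, not the Okapi implementation: it assumes XLIFF 1.2, returns a mapping from trans-unit id to text, and falls back to the source text when a target is missing or empty. Merging that text back into the original Excel or MIF file is the job of the format filters and is not shown.

```python
import xml.etree.ElementTree as ET

NS = {"x": "urn:oasis:names:tc:xliff:document:1.2"}

# Hypothetical translated kit, for illustration only.
SAMPLE_XLIFF = """<xliff xmlns="urn:oasis:names:tc:xliff:document:1.2" version="1.2">
  <file original="prices.xml" source-language="en" target-language="fr" datatype="plaintext">
    <body>
      <trans-unit id="1">
        <source>Save money</source>
        <target>Économisez de l'argent</target>
      </trans-unit>
      <trans-unit id="2">
        <source>Okapi</source>
      </trans-unit>
    </body>
  </file>
</xliff>"""


def read_translations(xliff_text):
    """Map trans-unit id -> translated text (source text if no target)."""
    root = ET.fromstring(xliff_text)
    result = {}
    for unit in root.iterfind(".//x:trans-unit", NS):
        source = unit.find("x:source", NS)
        target = unit.find("x:target", NS)
        if target is not None and "".join(target.itertext()).strip():
            chosen = target
        elif source is not None:
            chosen = source  # untranslated: keep the source text
        else:
            continue  # malformed unit; skip it in this sketch
        result[unit.get("id")] = "".join(chosen.itertext())
    return result


if __name__ == "__main__":
    print(read_translations(SAMPLE_XLIFF))
```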

Translated FrameMaker MIF

Translated Excel XML

Three Tasks
1. Consolidate the client's TM data into a single TM (done)
2. Prepare the translation package to send to the linguist (done)
3. Post-process the files for delivery (done)

Summary
We know
– More about our standards
– Everybody should care about standards
– We can (and do) use them today
Next Steps
– Consider requiring open standards compliance with the tools you use to ensure portability
– Get involved in educational opportunities
– Support standards initiatives through the organization(s) that best fit your needs

References
TMX 1.4b – Translation Memory eXchange
ITS 1.0 – Internationalization Tag Set
SRX 2.0 – Segmentation Rules eXchange
XLIFF 1.2 – XML Localisation Interchange File Format
Okapi Framework (open-source & cross-platform)
Globalization and Localization Association (GALA provides access to various standards projects)

Questions? John Watkins, President, ENLASO