Presentation is loading. Please wait.

Presentation is loading. Please wait.

The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.

Similar presentations


Presentation on theme: "The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in."— Presentation transcript:

1 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. ITS 2.0 Bird’s Eye View & Why 1.0 was not enough & Motivation for Showcase Felix Sasaki (W3C, DFKI), Christian Lieske (SAP AG)

2 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. Multilingual content production Seen from the moon Internationalize Localize Translate Seen from an airplane Create Internationalize Translate/Localize Publish Harvest Analyze Seen from a desktop Specify directionality Mark-up terminology Add links about entities Extract / filter content Segment Run through MT Assess (linguistic) quality Generate translation kit Run post-production 2

3 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. Multilingual content production needs help “Which data elements need to be translated?” 3... images/cancel.gif 12,20 Cancel 60,40 Number of files:

4 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. ITS 2.0 – The help 4 Supports internationalization, translation, localization and other aspects of the multilingual content production cycle Comprehensive Building on W3C ITS 1.0 Standardized data categories, values etc. Meta data

5 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. ITS 2.0 Basic principles Say important things “Do not translate” About specific content “All or selected data elements” In a standard way With agreed upon syntax and values 5

6 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. 1. Say important things: ITS 2.0 “data categories” Translate, Localization Note, Terminology, Directionality, Language Information, Elements Within Text, Domain, Text Analysis, Locale Filter, Provenance, External Resource, Target Pointer, Id Value, Preserve Space, Localization Quality Issue, Localization Quality Rating, MT Confidence, Allowed Characters, Storage Size Definition in proseSelection of content via two approaches 6

7 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. 2. About specific content: Content selection approaches 7 Cancel 60,40... XPath to select markup nodes Selection global ITS local attributes Selection local ITS selection can be compared to CSS global = “style” element local = “style” attribute

8 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. 3. In a standard way (1/2) “Translate”: “yes” or “no” Pre-defined (if appl.) meta data values Elements: translate “yes”, attributes: translate “no” Specific defaults (if appl.) E.g. “alt” attribute default “yes” Specific HTML5 behaviour 8

9 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. 3. In a standard way (2/2) Powerful (e.g. easy combination) Dublin Core, xml Independent /orthogonal Supported ITS 2.0 data categories Supported selection mechanism (local / global) and type of content (HTML / XML) Test suite to guide implementers and users https://github.com/finnle/ITS-2.0-Testsuite/ https://github.com/finnle/ITS-2.0-Testsuite/ Strict conformance clauses 9

10 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. Why ITS 2.0 (1/2) ITS 1.0 = simplified view of multilingual content production Too limited for comprehensive automated content processing/usage scenarios (see http://www.w3.org/TR/mlw- metadata-us-impl/ for various ITS 2.0 usage scenario descriptions)http://www.w3.org/TR/mlw- metadata-us-impl/ Example gap: too few data categories 10

11 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. Why ITS 2.0 (2/2) Coverage for additional types of content: HTML5 Bridge to Web & app content Accommodate relevant HTML5 markup (e.g. HTML5 “translate” attribute behaviour) Easy mapping/conversion to other formats XML Localization Information Markup (XLIFF; status: informal mapping, under discussion) = bridge to localization workflows Natural Language Processing Interchange Format (NIF) = bridge to the Semantic Web and Natural Language Processing 11

12 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. Example: MT Confidence Score from machine translation engineExample for new ITS capability: Tool traceability 12... Dublin is the capital of Ireland.

13 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. Example: Locale Filter Content relevant only for a specific locale 13... Text for Canadian locales. Text for non-Canadian locales....

14 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. Example: Localization Quality Issue For quality assessment 14... <span its-loc-quality-issue-comment="should be 'quality'" its-loc-quality-issue-profile-ref=http://example.org/qaMovel/v1 its-loc-quality-issue-severity=50 its-loc-quality-issue-type=spelling>qulaity...

15 The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. Why a showcase? Demo Yes, we can ;-) Introduce you to ITS 2.0 by example Data categories Usage scenarios Applications Gather your feedback Write your comments on IRC (http://irc.w3.org; channel #mlw-lt)http://irc.w3.org Point you to a place for answers and your contributions: ITS Interest Group http://www.w3.org/International/its/ig/ http://lists.w3.org/Archives/Public/public-i18n-its-ig (public list, free to subscribe)http://lists.w3.org/Archives/Public/public-i18n-its-ig 15


Download ppt "The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in."

Similar presentations


Ads by Google