Automating Translation in the Localisation Factory: An Investigation of Post-Editing Effort. Sharon O’Brien, Dublin City University.

Assumptions about MT: T(MT + PE) < T(Trans), i.e. machine translation plus post-editing is assumed to take less time than translation.

Do we have proof? Dated studies (Pan-American Health Organisation, General Motors, European Union): 3-4 times faster than translation. But: no details given. More recently: average daily throughput for PE of 5,250 words per day. Krings (2001): the only thorough, published empirical data on PE rates.

MT + CL (controlled language). CL: relatively young field of research/implementation. Consequently: little empirical data.

CL improves “translatability”: “The notion of translatability is based on so-called "translatability indicators" where the occurrence of such an indicator in the text is considered to have a negative effect on the quality of machine translation. The fewer translatability indicators, the better suited the text is to translation using MT.” (Underwood and Jongejan, 2001: 363)

Can we prove it empirically? Hypothesis: if CL rules are used to eliminate negative “translatability indicators”, the post-editing effort for MT output will be lower than for output where negative translatability indicators have not been removed.

Experimental Set-Up: Validity! Professional, experienced subjects who are native speakers (German); homogeneous backgrounds and level of experience; familiar text (user guide); familiar working environment; payment for time. However: limited number of subjects.

Framework of Analysis: How do you measure post-editing “effort”? Three dimensions: temporal, technical, cognitive. Two sentence types: “Snti” and “Smin-nti”.

Framework of Analysis: Temporal effort: how much time, in seconds, did it take to post-edit each sentence? Technical effort: how many deletions, insertions, and cut & pastes were made for each sentence? Cognitive effort: combined temporal and technical. Additional measurement: Choice Network Analysis (CNA).
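The temporal and technical measures above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Event` record and its field names are hypothetical and do not reproduce the Translog log format, and temporal effort is approximated here as the span between the first and last logged event for a sentence, which may differ from how time was measured in the study.

```python
from collections import defaultdict
from dataclasses import dataclass

# Actions counted as technical effort on the slide.
TECHNICAL_ACTIONS = ("insert", "delete", "cut", "paste")


@dataclass
class Event:
    """Hypothetical log record; not the real Translog format."""
    sentence_id: int
    timestamp: float  # seconds since the start of the session
    action: str       # e.g. "insert", "delete", "cut", "paste"


def effort_per_sentence(events):
    """Return temporal (seconds) and technical (action counts) effort per sentence."""
    times = defaultdict(list)
    counts = defaultdict(lambda: dict.fromkeys(TECHNICAL_ACTIONS, 0))
    for e in events:
        times[e.sentence_id].append(e.timestamp)
        if e.action in TECHNICAL_ACTIONS:
            counts[e.sentence_id][e.action] += 1
    # Assumption: temporal effort = time between first and last event.
    return {
        sid: {"seconds": max(ts) - min(ts), **counts[sid]}
        for sid, ts in times.items()
    }


if __name__ == "__main__":
    log = [
        Event(1, 0.0, "insert"),
        Event(1, 4.5, "delete"),
        Event(1, 6.0, "insert"),
    ]
    print(effort_per_sentence(log))
```

Keeping the two measures side by side per sentence makes it straightforward to compare them across the two sentence types later on.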

Analysis Tools: IBM WebSphere, Translog, Excel

Translog User Interface

Translog Log File

Results: General Temporal Effort

Temporal Effort: Individual Variation

Temporal Effort by Sentence Type. Processing speed: the total number of source words in each segment divided by the total processing time for that segment.
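As a worked illustration of the processing-speed definition, the following Python sketch computes words per second for each segment and averages the result per sentence type. The function names and the sample figures are hypothetical, and averaging the per-segment speeds is an assumption; the slides do not say how the per-type figures were aggregated.

```python
from collections import defaultdict


def processing_speed(source_words, seconds):
    """Source words in a segment divided by its processing time (words per second)."""
    if seconds <= 0:
        raise ValueError("processing time must be positive")
    return source_words / seconds


def mean_speed_by_type(segments):
    """segments: iterable of (sentence_type, source_words, seconds) tuples."""
    speeds = defaultdict(list)
    for sentence_type, source_words, seconds in segments:
        speeds[sentence_type].append(processing_speed(source_words, seconds))
    # Assumption: per-type figure is the plain mean of per-segment speeds.
    return {t: sum(v) / len(v) for t, v in speeds.items()}


if __name__ == "__main__":
    # Made-up numbers, for illustration only.
    sample = [("Snti", 20, 40.0), ("Snti", 10, 40.0), ("Smin-nti", 15, 30.0)]
    print(mean_speed_by_type(sample))
```

A higher value means the post-editor got through more source words per second, so a faster speed corresponds to lower temporal effort.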

Processing Speed by Sentence Type

Technical Effort by Sentence Type

Technical Effort: Cut & Paste. Very little activity! Entire phrases were retyped rather than cut and pasted. Less effort to retype? Need for training?

Cognitive Effort: On average, eliminating negative translatability indicators (NTIs) appears to reduce PE effort. However, Choice Network Analysis (CNA) shows more edits to some NTIs than to others, and removing NTIs from a sentence does not guarantee zero post-editing.

High PE Effort: gerund (“ing” form of verb); ungrammatical phrase; putting an adjective after the noun; non-finite verb (no tense marked); slang; misspelling; long noun phrase; ellipsis; long sentence (more than 25 words); verbs with particles; use of footnotes; multiple prepositions; short segment (fewer than 4 words).

Medium PE Effort: multiple coordinators; problematic punctuation; passive voice; phrase not syntactically complete; use of personal pronouns; use of slash as a separator; ambiguous coordination; use of brackets; proper nouns; missing “that” in a relative clause.

Low PE Effort: abbreviations; demonstrative pronouns; missing “in order to”; contractions (“Let’s”).
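The three effort tiers above can be encoded as a simple lookup, and the two length-based indicators can be checked mechanically. The Python sketch below is illustrative only: the names `EFFORT_BY_NTI`, `length_ntis` and `highest_effort` are hypothetical, counting words by splitting on whitespace is an assumption, and the other indicators (gerunds, passive voice and so on) would need linguistic analysis that is not attempted here.

```python
# NTI names taken from the three slides, grouped by reported PE effort.
HIGH = [
    "gerund", "ungrammatical phrase", "adjective after the noun",
    "non-finite verb", "slang", "misspelling", "long noun phrase",
    "ellipsis", "long sentence", "verbs with particles",
    "use of footnotes", "multiple prepositions", "short segment",
]
MEDIUM = [
    "multiple coordinators", "problematic punctuation", "passive voice",
    "phrase not syntactically complete", "use of personal pronouns",
    "use of slash as a separator", "ambiguous coordination",
    "use of brackets", "proper nouns", "missing that in a relative clause",
]
LOW = [
    "abbreviations", "demonstrative pronouns",
    "missing in order to", "contractions",
]

EFFORT_BY_NTI = {
    name: tier
    for tier, names in (("high", HIGH), ("medium", MEDIUM), ("low", LOW))
    for name in names
}
TIER_ORDER = {"low": 0, "medium": 1, "high": 2}


def length_ntis(sentence):
    """Flag the two length-based NTIs; words are counted by whitespace (assumption)."""
    n_words = len(sentence.split())
    flags = []
    if n_words > 25:   # long sentence: more than 25 words
        flags.append("long sentence")
    if n_words < 4:    # short segment: fewer than 4 words
        flags.append("short segment")
    return flags


def highest_effort(ntis):
    """Return the highest effort tier among the given NTI names, or None if empty."""
    tiers = [EFFORT_BY_NTI[name] for name in ntis]
    return max(tiers, key=TIER_ORDER.get) if tiers else None


if __name__ == "__main__":
    print(length_ntis("Click OK"))
    print(highest_effort(["abbreviations", "passive voice"]))
```

A lookup like this could help prioritise which CL rules to enforce first, in line with the finding that some NTIs demand more post-editing effort than others.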

Conclusions: Taking into account that no QA was performed on the final texts: on average, post-editing can be faster than translation; there is a high degree of individual variation; on average, removing NTIs reduces PE effort; but some NTIs demand more effort than others.

Conclusions: Even if all known NTIs are removed, sentences may still require PE effort.

Conclusions: Not all CL rules will have equal impact. Even if CL is applied, PE effort will not be removed completely. Post-editors are still human and still translators…