Quantitative Evaluation of Machine Translation Systems: Sentence Level Palmira Marrafa António Ribeiro.


1 Quantitative Evaluation of Machine Translation Systems: Sentence Level Palmira Marrafa António Ribeiro

2 Outline
- Motivation
- ISO Characteristics to Measure
- Draft Proposals
- Future Work

3 Motivation
- Evaluate the Quality of Translated Sentences
- Assumption
  - It is possible to quantify the quality of Translations
- Design Evaluation Measures

4 ISO Characteristics to Measure
2.2 System external characteristics
  1 Functionality
    2 Accuracy...

5 ISO Characteristics to Measure
2 Individual sentence level
  1 Morphology
  2 Syntax (sentence and phrase structure)
  3 Types of errors
    3 Lexical errors
    4 Syntax errors
    5 Stylistic errors

6 Characteristics to Measure
- Types of Errors
  - Lexicon
  - Syntax
  - Morphology

7 MT Systems Used
- Systems
  - S1
  - S2

8 Lexicon
- « «Lexical» errors refer to words or phrases that are inappropriate … »
- Example
  - Input: Foot-and-mouth disease
  - Output: Febre aftosa
  - Systran: *Doença de pé-e-boca
  - Intertran: *Pé-e-boca doença

9 Lexicon: Evaluation
- Collocations
- Fixed Expressions
- Semi-fixed Expressions

10 Lexicon: Evaluation
- 4-point scale
  - 0 wrong (Doença de pé-e-boca)
  - 1 marginal (Febre dos pés e da boca)
  - 2 inadequate (Doença aftosa)
  - 3 correct (Febre aftosa)

11 Lexicon: Evaluation
- Scores
  - Average of individual word scores
  - Order to be included
- Foot-and-mouth disease vs Disease of foot-and-mouth
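The "average of individual word scores" idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `expression_score` and the sample judgements are hypothetical, the 0-3 labels are taken from the 4-point scale on slide 10, and word order is ignored because the slides list it as still to be included.

```python
from statistics import mean

# 4-point scale from slide 10: 0 wrong, 1 marginal, 2 inadequate, 3 correct.
SCALE = {0: "wrong", 1: "marginal", 2: "inadequate", 3: "correct"}


def expression_score(word_scores):
    """Average the per-word scores of one translated expression.

    Hypothetical helper: word order is NOT taken into account.
    """
    if not word_scores:
        raise ValueError("at least one word score is required")
    for score in word_scores:
        if score not in SCALE:
            raise ValueError(f"score {score!r} is not on the 0-3 scale")
    return mean(word_scores)


# Hypothetical judgements for a two-word output: one word correct, one wrong.
print(expression_score([3, 0]))  # 1.5
```

Because the result is a plain mean, a reordered output such as "Disease of foot-and-mouth" would score the same as the original order as long as the individual words receive the same marks, which is the gap the "Order to be included" bullet points at.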

12 Syntax
- « «Syntax» is concerned with grammaticality »
  - Previous work:
    - DARPA: 5-point scale

13 Syntax: Evaluation
- Comprehensive typology of Errors in
  - NPs
  - PPs
  - VPs
  - Predication Level
    - e.g. subject verb agreement

14 Syntax: Evaluation
- NPs
  - Specifiers
    - todos os estudantes
    - all *the students
  - Modifiers
  - Co-occurrence Restrictions
  - Order
  - ...

15 Syntax: Evaluation
- VPs
  - Tense
  - Mode
  - Aspect
  - Modality (should vs might)
  - Negation
  - Complements order
  - Preposition Selection
  - ...
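One possible way to record errors against the NP and VP categories listed on slides 14 and 15 is a simple tally per (phrase level, category) pair. The sketch below is an assumption, not the authors' method: the names `TYPOLOGY` and `tally_errors` and the sample annotations are hypothetical, the category lists are incomplete because the slides end in "...", and no scores are assigned since the slides leave that to future work.

```python
from collections import Counter

# Categories named on slides 14-15; both lists end in "..." on the slides,
# so these sets are deliberately incomplete.
TYPOLOGY = {
    "NP": {"specifiers", "modifiers", "co-occurrence restrictions", "order"},
    "VP": {"tense", "mode", "aspect", "modality", "negation",
           "complements order", "preposition selection"},
}


def tally_errors(annotations):
    """Count annotated errors per (phrase level, category) pair."""
    counts = Counter()
    for level, category in annotations:
        if category not in TYPOLOGY.get(level, set()):
            raise ValueError(f"unknown error type: {level}/{category}")
        counts[(level, category)] += 1
    return counts


# Hypothetical annotations for one translated sentence.
annotations = [("NP", "specifiers"), ("VP", "tense"), ("NP", "specifiers")]
print(tally_errors(annotations))
```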

16 Future Work
- Generalise across the Categories
  - to build the metrics
- Assign scores to each error
- Test the metrics

