Quantitative Evaluation of Machine Translation Systems: Sentence Level Palmira Marrafa António Ribeiro.

Slides:



Advertisements
Similar presentations
The Structure of Sentences Asian 401
Advertisements

Modals can - could will - would may - might - must shall - should.
Computational language: week 10 Lexical Knowledge Representation concluded Syntax-based computational language Sentence structure: syntax Context free.
BBN-ANG-253 Advanced Syntax Lecture Course Autumn, 2014/15
NP Movement Passives, Raising: When NPs are not in their theta positions.
Syntactic analysis using Context Free Grammars. Analysis of language Morphological analysis – Chairs, Part Of Speech (POS) tagging – The/DT man/NN left/VBD.
1 Rules Based Machine Translation Fred Hollowood Consultant RBMT and CL.
Language and Cognition Colombo, June 2011 Day 2 Introduction to Linguistic Theory, Part 4.
Linguistics, Morphology, Syntax, Semantics. Definitions And Terminology.
1 The passive Summary phonological form/shape of the passive in English (the data): be + V-ed The guest was murdered by the chef problem: subject is patient.
Statistical NLP: Lecture 3
Introduction.  “a technique that enables the computer to encode complex grammatical knowledge such as humans use to assemble sentences, recognize errors.
1 LIN 1310B Introduction to Linguistics Prof: Nikolay Slavkov TA: Qinghua Tang CLASS 18, March 13, 2007.
1 VERB PHRASE LAY SENGHOR. 2 What is a verb phrase? Definition: A verb phrase is composed of the verbs of the sentence and any modifiers of verbs,including.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
Verbs and Verbals Infinitives, Gerunds, and Participles.
Evaluating an MT French / English System Widad Mustafa El Hadi Ismaïl Timimi Université de Lille III Marianne Dabbadie LexiQuest - Paris.
1 CSC 594 Topics in AI – Applied Natural Language Processing Fall 2009/ Outline of English Syntax.
AuxiliariesAuxiliaries. Auxiliaries A verb used to add a functional or grammatical meaning to a clause in which it appears. Functions in a supporting.
Linguistic Theory Lecture 2 Phrase Structure. What was there before structure? Classical studies: Classical studies: –Languages such as Latin Rich morphology.
Syntax The number of words in a language is finite
GRAMMAR APPROACH By: Katherine Marzán Concepción EDUC 413 Prof. Evelyn Lugo.
Machine Translation Dr. Radhika Mamidi. What is Machine Translation? A sub-field of computational linguistics It investigates the use of computer software.
1 Statistical NLP: Lecture 10 Lexical Acquisition.
Word category and verb-argument structure information in the dynamics of parsing Frisch, Hahne, and Friedericie (2004) Cognition.
Editing for Writers of English as a Second Language presented by Judith M. Davis, Director Writing Technology Laboratory 24 March 2003.
Kyoshiro SUGIYAMA, AHC-Lab., NAIST An Investigation of Machine Translation Evaluation Metrics in Cross-lingual Question Answering Kyoshiro Sugiyama, Masahiro.
Can Controlled Language Rules increase the value of MT? Fred Hollowood & Johann Rotourier Symantec Dublin.
Sensitivity of automated MT evaluation metrics on higher quality MT output Bogdan Babych, Anthony Hartley Centre for Translation.
LING 702 Structure Class 2 Instructor: Bede McCormack.
14/12/2009ICON Dipankar Das and Sivaji Bandyopadhyay Department of Computer Science & Engineering Jadavpur University, Kolkata , India ICON.
Transfer-based MT with Strong Decoding for a Miserly Data Scenario Alon Lavie Language Technologies Institute Carnegie Mellon University Joint work with:
Extract Questions from Sentences. Purpose The behavior of extracting questions from sentences can be regarded as extracting semantics or knowledge from.
Initial Considerations
Error Correction: For Dummies? Ellen Pratt, PhD. UPR Mayaguez.
1 LIN 1310B Introduction to Linguistics Prof: Nikolay Slavkov TA: Qinghua Tang CLASS 12, Feb 13, 2007.
Ideas for 100K Word Data Set for Human and Machine Learning Lori Levin Alon Lavie Jaime Carbonell Language Technologies Institute Carnegie Mellon University.
What you have learned and how you can use it : Grammars and Lexicons Parts I-III.
C HAPTER 11 Grammar Fundamentals. T HE P ARTS OF S PEECH AND T HEIR F UNCTIONS Nouns name people, places things, qualities, or conditions Subject of a.
Leonid Iomdin Institute for Information Transmission Problems, Russian Academy of Sciences
SYNTAX.
◦ Process of describing the structure of phrases and sentences Chapter 8 - Phrases and sentences: grammar1.
Error Analysis of Two Types of Grammar for the purpose of Automatic Rule Refinement Ariadna Font Llitjós, Katharina Probst, Jaime Carbonell Language Technologies.
Discriminative Modeling extraction Sets for Machine Translation Author John DeNero and Dan KleinUC Berkeley Presenter Justin Chiu.
Word classes and part of speech tagging. Slide 1 Outline Why part of speech tagging? Word classes Tag sets and problem definition Automatic approaches.
Avenue Architecture Learning Module Learned Transfer Rules Lexical Resources Run Time Transfer System Decoder Translation Correction Tool Word- Aligned.
3.3 A More Detailed Look At Transformations Inversion (revised): Move Infl to C. Do Insertion: Insert interrogative do into an empty.
Subject Complements p Subject Complements Complements are words that complete the meaning or action of verbs. Complements always come in the.
Category 2 Category 6 Category 3.
Auxiliaries in simple past How to work with “did” and “was-were”
How do I begin to prepare ????. Why Praxis ?? 35 states in US currently use Praxis for assessment purposes in CT - used for assessing skills of future.
Learning to Generate Complex Morphology for Machine Translation Einat Minkov †, Kristina Toutanova* and Hisami Suzuki* *Microsoft Research † Carnegie Mellon.
AUXILIARY AND MODAL VERBS ETSI de Telecomunicación INGLÉS.
The World of Verbs.
Lecture – VIII Monojit Choudhury RS, CSE, IIT Kharagpur
Lecture 3 Krisztina Szécsényi
Statistical NLP: Lecture 3
Ross Chapter 10 p The strong verb (modified from Ross because we will use the root.
DGP – Sentence 1 Sentence Parts.
Translation Problems.
Initial Considerations
Language Variations: Japanese and English
PREPOSITIONAL PHRASES
Software metrics.
Linguistic Essentials
Lecture 5 Krisztina Szécsényi
Traditional Grammar VS. Generative Grammar
The SAT Writing and Language Test
Statistical NLP: Lecture 10
Presentation transcript:

Quantitative Evaluation of Machine Translation Systems: Sentence Level Palmira Marrafa António Ribeiro

Outline n Motivation n ISO Characteristics to Measure n Draft Proposals n Future Work

Motivation n Evaluate the Quality of Translated Sentences n Assumption –It is possible to quantify the quality of Translations n Design Evaluation Measures

ISO Characteristics to Measure 2.2 System external characteristics 1 Functionality 2 Accuracy...

ISO Characteristics to Measure 2 Individual sentence level 1 Morphology 2 Syntax (sentence and phrase structure) 3 Types of errors 3 Lexical errors 4 Syntax errors 5 Stylistic errors

Characteristics to Measure n n Types of Errors – –Lexicon – –Syntax – –Morphology

MT Systems Used n Systems –S1 –S2

Lexicon n « «Lexical» errors refer words or phrases that are inappropriate … » n Example –Input: Foot-and-mouth disease –Output: Febre aftosa –Systran: *Doenca de pé-e-boca –Intertran: *Pé-e-boca doenca

Lexicon: Evaluation n Collocations n Fixed Expressions n Semi-fixed Expressions

Lexicon: Evaluation n 4-point scale –0 wrong (Doenca de pé-e-boca) –1 marginal (Febre dos pés e da boca) –2 inadequate (Doenca aftosa) –3 correct (Febre aftosa)

Lexicon: Evaluation n Scores –Average of individual word scores –Order to be included n Foot-and-mouth disease vs n Disease of foot-and-mouth

Syntax n ««Syntax» is concerned with grammaticality» –Previous work: n DARPA: 5-point scale

Syntax: Evaluation n Comprehensive typology of Errors in –NPs –PPs –VPs –Predication Level n e.g. subject verb agreement

Syntax: Evaluation n NPs –Specifiers n todos os estudantes n all *the students –Modifiers –Co-occurrence Restrictions –Order –...

Syntax: Evaluation n VPs –Tense –Mode –Aspect –Modality (should vs might) –Negation –Complements order –Preposition Selection –...

Future Work n Generalise across the Categories –to build the metrics n Assign scores to each error n Test the metrics