Applying Machine Translation Metrics to Student-Written Translations Lisa N. Michaud Computer Science Department Merrimack College North Andover, Massachusetts,

Presentation transcript:

Applying Machine Translation Metrics to Student-Written Translations
Lisa N. Michaud, Computer Science Department, Merrimack College, North Andover, Massachusetts, USA
Patricia Ann McCoy, Language Department, Universidad de las Americas Puebla, Puebla, Mexico

Evaluating learner translation
Translating ancient texts and doing what MT cannot do yet
Michaud and McCoy

Criteria for judging translations
• fluency (is it well-formed?)
• fidelity (does it convey original meaning?)
(Hovy et al., 2002)

Multiplicity of translations
• In each one of these jobs the professor could have agreed to work 6 hours a day and therefore would not be surpassing the working day hour limit.
• In each one of these jobs the teacher could have agreed to work 6 hours per day and therefore he wouldn't be bound by the limits of the working day.
• In each of these examples the teaching could have been arranged so that he/she works six hours a day and would not be affected by any workday limitations.
• In both of these jobs the professor could have agreed to work six hours daily and therefore he wouldn't be affecting his work shift limit.


BLEU
[Figure: a hypothesis compared against multiple references]
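A minimal sketch of the idea behind BLEU, for readers who want to see how one hypothesis is scored against several references at once. It is not the authors' scoring setup: the function names are hypothetical, tokens are assumed to be pre-split, and `max_n=2` is chosen here only to keep short sentences from scoring zero (BLEU is usually reported with n-grams up to 4).

```python
from collections import Counter
import math


def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(hypothesis, references, n):
    """Clipped n-gram precision: each hypothesis n-gram is credited at most
    as many times as it appears in any single reference."""
    hyp_counts = Counter(ngrams(hypothesis, n))
    if not hyp_counts:
        return 0.0
    max_ref_counts = Counter()
    for ref in references:
        for gram, count in Counter(ngrams(ref, n)).items():
            max_ref_counts[gram] = max(max_ref_counts[gram], count)
    clipped = sum(min(count, max_ref_counts[gram])
                  for gram, count in hyp_counts.items())
    return clipped / sum(hyp_counts.values())


def sentence_bleu(hypothesis, references, max_n=2):
    """Geometric mean of clipped precisions times a brevity penalty.
    max_n=2 is an illustrative choice, not the standard setting."""
    precisions = [modified_precision(hypothesis, references, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0.0:
        return 0.0
    # Use the reference whose length is closest to the hypothesis length.
    ref_len = min((abs(len(r) - len(hypothesis)), len(r)) for r in references)[1]
    if len(hypothesis) > ref_len:
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1 - ref_len / len(hypothesis))
    return brevity_penalty * math.exp(sum(math.log(p) for p in precisions) / max_n)
```

Because the counts are clipped against the best-matching reference, a student wording that appears in any one of the acceptable translations is rewarded, which is why BLEU benefits from multiple references.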

TERp
[Figure: a hypothesis aligned to a single reference, with edit labels PHRASAL EQUIVALENCE or SYNONYM, SHIFT, SUBSTITUTION, INSERTION, SAME STEM]
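The edit-rate idea underlying TERp can be sketched with a plain word-level edit distance against a single reference. This simplified version is an assumption-laden stand-in, not TERp itself: it counts only insertions, deletions, and substitutions, and it omits the shift, same-stem, synonym, and phrasal-equivalence operations named on the slide. The function name is hypothetical.

```python
def word_edit_rate(hypothesis, reference):
    """Word-level edit distance (insert, delete, substitute) divided by the
    reference length. Lower is better; 0.0 means an exact match.
    Simplification: no shifts, stem matches, synonyms, or paraphrases."""
    if not reference:
        raise ValueError("reference must contain at least one token")
    # prev[j] holds the edit distance between the hypothesis prefix
    # processed so far and the first j reference tokens.
    prev = list(range(len(reference) + 1))
    for i in range(1, len(hypothesis) + 1):
        curr = [i]
        for j in range(1, len(reference) + 1):
            cost = 0 if hypothesis[i - 1] == reference[j - 1] else 1
            curr.append(min(prev[j] + 1,          # delete a hypothesis word
                            curr[j - 1] + 1,      # insert a reference word
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1] / len(reference)
```

In this sketch "teacher" for "professor" costs a full substitution; TERp's synonym and stem matching exists to soften exactly that kind of penalty.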

TERp alignment and tags

Student translation corpus

Sentence boundaries
EXPERT: Due to the fact that previous studies have shown that POMC neurons are linked to obesity, the scientists believe the findings might also lead to developing treatments to control obesity and other metabolic disorders.
STUDENT: Past studies have shown that the POMC neurons are involved with obesity. The scientists believe this finding could also help develop treatments for the obese and other metabolic disorders.
ORIGINAL: Estudios en el pasado han demostrado que las neuronas POMC están involucradas en la obesidad, por lo tanto los científicos creen que el hallazgo también ofrece la posibilidad de desarrollar tratamientos para controlar la obesidad y otros trastornos metabólicos.

Does TERp agree with an expert?
Instructor Scores vs Inverted TERp
650 sentences (22%)
Pearson Correlation r =
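A small sketch of how such a comparison could be computed. The slide does not say how TERp was inverted, so negation is assumed here; because Pearson's r is unchanged by any positive linear rescaling, negating the score and subtracting it from a constant give the same correlation. Function names are hypothetical, and the sketch takes no position on the value of r for the corpus.

```python
import math


def invert_terp(terp_scores):
    """Assumption: 'inverted' means flipping direction so that higher is
    better, matching the instructor scale. Negation is one way to do that."""
    return [-score for score in terp_scores]


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)
```

Usage would pair each sentence's instructor score with its inverted TERp score, for example `pearson_r(instructor_scores, invert_terp(terp_scores))`.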

Score distribution

Instructor rubric (original)
10   Excellent
9    Good
8    Satisfactory
0-7  Deficient

TERp scores over the semester

Evaluating TERp tags (pilot)

Future work

Instructor rubric (revised)
      Excellent
90    Good
80    Satisfactory
0-70  Deficient

Modifying the TERp Score
[Figure: a hypothesis aligned to a single reference, with edit labels PHRASAL EQUIVALENCE or SYNONYM, SHIFT, SUBSTITUTION, INSERTION, SAME STEM]

Recognizing false cognates
[Figure: Source: cínico; Hypothesis: cynical; Single Reference: brazen; edit label: SUBSTITUTION]
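One way the cínico / cynical / brazen case might be flagged automatically, offered as a rough sketch rather than the authors' method: treat a substitution as a possible false cognate when the student's word looks like the source word but the reference word does not. The spelling-similarity measure (Python's `difflib.SequenceMatcher`), the 0.5 threshold, and both function names are assumptions made for illustration.

```python
import difflib
import unicodedata


def strip_accents(word):
    """Lowercase a word and drop combining accent marks (cínico -> cinico)."""
    decomposed = unicodedata.normalize("NFD", word.lower())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def looks_like(source_word, english_word, threshold=0.5):
    """True if the two words are orthographically similar.
    The threshold is an illustrative guess, not a tuned value."""
    ratio = difflib.SequenceMatcher(
        None, strip_accents(source_word), strip_accents(english_word)).ratio()
    return ratio >= threshold


def is_possible_false_cognate(source_word, hypothesis_word, reference_word,
                              threshold=0.5):
    """Flag a substitution where the student's word resembles the source
    word in spelling but the reference translation does not."""
    if hypothesis_word.lower() == reference_word.lower():
        return False  # no substitution, so nothing to flag
    return (looks_like(source_word, hypothesis_word, threshold)
            and not looks_like(source_word, reference_word, threshold))
```

A check like this would only nominate candidates; a true cognate such as zona / zone is never flagged because the student's word and the reference agree.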

Extracting mistranslation pairs
[Figure: SPANISH DICTIONARY and ENGLISH DICTIONARY lookups; example pairs: cínico / cynical, cynical / brazen, zona / zone]