Machine Translation: Interlingual Methods Thanks to Les Sikos Bonnie J. Dorr, Eduard H. Hovy, Lori S. Levin.

Slides:



Advertisements
Similar presentations
SWG Strategy (C) Copyright IBM Corp. 2006, All Rights Reserved. P4 Task 2 Fact Extraction using a CNL Current Status David Mott, Dave Braines, ETS,
Advertisements

The Chinese Room: Understanding and Correcting Machine Translation This work has been supported by NSF Grants IIS Solution: The Chinese Room Conclusions.
A Semantic Template for Light Verb Constructions Karine Megerdoomian University of California, San Diego
Prague, Dec, 2006 Computational Lexical Semantics Martha Palmer Vilem Mathesius Lecture Series 21 Charles University, Prague December, 2006.
OntoSem2OWL. Plan of the talk ● OntoSem Overview ● Features of OntoSem Ontology ● Mapping OntoSem2OWL ● Motivation ● Possible Application Scenarios.
Statistical NLP: Lecture 3
For Friday No reading Homework –Chapter 23, exercises 1, 13, 14, 19 –Not as bad as it sounds –Do them IN ORDER – do not read ahead here.
For Monday Read Chapter 23, sections 3-4 Homework –Chapter 23, exercises 1, 6, 14, 19 –Do them in order. Do NOT read ahead.
C SC 620 Advanced Topics in Natural Language Processing Lecture 22 4/15.
Speech Translation on a PDA By: Santan Challa Instructor Dr. Christel Kemke.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
C SC 620 Advanced Topics in Natural Language Processing Lecture 20 4/8.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
LCS and Approximate Interlingua at UMD Semantic Annotation Planning Meeting April 14, 2004 Bonnie J. Dorr University of Maryland.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University
Translation Divergence LING 580MT Fei Xia 1/10/06.
Methodology Conceptual Database Design
Chapter 2: Algorithm Discovery and Design
Machine Translation Challenges and Language Divergences Alon Lavie Language Technologies Institute Carnegie Mellon University : Machine Translation.
Machine Translation History of Machine Translation Difficulties in Machine Translation Structure of Machine Translation System Research methods for Machine.
March 1, 2009 Dr. Muhammed Al-Mulhem 1 ICS 482 Natural Language Processing INTRODUCTION Muhammed Al-Mulhem March 1, 2009.
ANSWERING CONTROLLED NATURAL LANGUAGE QUERIES USING ANSWER SET PROGRAMMING Syeed Ibn Faiz.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Created By: Benjamin J. Van Someren.  Natural Language Translation – Translating one natural language such as German to another natural language such.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Communicative Language Teaching Vocabulary
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
An Introduction to Software Architecture
For Friday Finish chapter 23 Homework: –Chapter 22, exercise 9.
Week 9: resources for globalisation Finish spell checkers Machine Translation (MT) The ‘decoding’ paradigm Ambiguity Translation models Interlingua and.
Querying Across Languages: A Dictionary-Based Approach to Multilingual Information Retrieval Doctorate Course Web Information Retrieval Speaker Gaia Trecarichi.
Globalisation and machine translation Machine Translation (MT) The ‘decoding’ paradigm Ambiguity Translation models Interlingua and First Order Predicate.
Interpreting Dictionary Definitions Dan Tecuci May 2002.
Reading. How do you think we read? -memorizing words on the page -extracting just the meanings of the words -playing a mental movie in our heads of what.
Natural Language Processing Introduction. 2 Natural Language Processing We’re going to study what goes into getting computers to perform useful and interesting.
© Copyright 2013 ABBYY NLP PLATFORM FOR EU-LINGUAL DIGITAL SINGLE MARKET Alexander Rylov LTi Summit 2013 Confidential.
TRANSLATION THEORY Dr. Mashadi Said
Terminology and documentation*  Object of the study of terminology:  analysis and description of the units representing specialized knowledge in specialized.
Chapter 10 Analysis and Design Discipline. 2 Purpose The purpose is to translate the requirements into a specification that describes how to implement.
MT with an Interlingua Lori Levin April 13, 2009.
CSA2050 Introduction to Computational Linguistics Lecture 1 Overview.
Artificial Intelligence Research Center Pereslavl-Zalessky, Russia Program Systems Institute, RAS.
Programming Languages and Design Lecture 3 Semantic Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
Jan 2005CSA4050 Machine Translation II1 CSA4050: Advanced Techniques in NLP Machine Translation II Direct MT Transfer MT Interlingual MT.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
CS 4705 Lecture 17 Semantic Analysis: Robust Semantics.
Machine Translation Divergences: A Formal Description and Proposed Solution Bonnie J. Dorr University of Maryland Presented by: Soobia Afroz.
Natural Language Processing Group Computer Sc. & Engg. Department JADAVPUR UNIVERSITY KOLKATA – , INDIA. Professor Sivaji Bandyopadhyay
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Using Semantic Relations to Improve Information Retrieval
NATURAL LANGUAGE PROCESSING
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
Jan 2012MT Architectures1 Human Language Technology Machine Translation Architectures Direct MT Transfer MT Interlingual MT.
Learning to Generate Complex Morphology for Machine Translation Einat Minkov †, Kristina Toutanova* and Hisami Suzuki* *Microsoft Research † Carnegie Mellon.
LingWear Language Technology for the Information Warrior Alex Waibel, Lori Levin Alon Lavie, Robert Frederking Carnegie Mellon University.
TYPES OF TRANSLATION.
Approaches to Machine Translation
Statistical NLP: Lecture 3
Complexity Time: 2 Hours.
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Representation of Actions as an Interlingua
Statistical NLP: Lecture 13
CSc4730/6730 Scientific Visualization
Approaches to Machine Translation
Information Retrieval
Presentation transcript:

Machine Translation: Interlingual Methods Thanks to Les Sikos Bonnie J. Dorr, Eduard H. Hovy, Lori S. Levin

Overview What is Machine Translation (MT)? –Automated system –Analyzes text from Source Language (SL) –Produces equivalent text in Target Language (TL) –Ideally without human intervention Source Language Target Language

Overview Three main methodologies for Machine Translation –Direct –Transfer –Interlingual

Overview Three main methodologies for Machine Translation –Direct –Transfer –Interlingual

Overview Three main methodologies for Machine Translation –Direct –Transfer –Interlingual

Overview Three main methodologies for Machine Translation –Direct –Transfer –Interlingual

Overview Interlingua –Single underlying representation for both SL and TL which ideally Abstracts away from language-specific characteristics Creates a language-neutral representation Can be used as a pivot representation in the translation

Overview Cost/Benefit analysis of moving up the triangle –Benefit Reduces the amount of work required to traverse the gap between languages –Cost Increases amount of analysis –Convert the source input into a suitable pre-transfer representation Increases amount of synthesis –Convert the post-transfer representation into the final target surface form

Overview Two major advantages of Interlingua method 1.The more target languages there are, the more valuable an Interlingua becomes Source Language TL 1 TL 2 TL 3 TL 4 TL 5 TL 6 Inter- Lingua

Overview Two major advantages of Interlingua method 2.Interlingual representations can also be used by NLP systems for other multilingual applications

Overview Sounds great, but…due to many complexities –Only one interlingual MT system has ever been made operational in a commercial setting KANT (Nyberg and Mitamura, 1992, 2000; Lonsdale et al., 1995) –Only a few have been taken beyond research prototype

Current Efforts KANT system (Nyberg and Mitamura, 1992) –Only interlingual MT system that has ever been made operational in a commercial setting Caterpillar document workflow (mid-90s) –Knowledge-based system –Designed for translation of technical documents written in Caterpillar Technical English (CTE) to French, Spanish, and German –Controlled English – no pronouns, conjunctions,...

Issues Loss of Stylistic Elements –Because representation is independent of syntax Generated target text reads more like a paraphrase Style and emphasis of the original text are lost –Not so much a failure of Interlingua as incompleteness Caused by a lack of understanding of discourse and pragmatic elements required to recognize and appropriately reproduce style and emphasis In some cases it may be an advantage to ignore the authors style –Outside the field of artistic texts (poetry and fiction) syntactic form of source text is superfluous

Issues Loss of Stylistic Elements –Current state of the art It is only possible to produce reliable interlinguas between language groups (e.g., Japanese – Western European) within specialized domains

Issues Linguistic Divergences –Structural differences between languages Categorical Divergence –Translation of words in one language into words that have different parts of speech in another language »To be jealous »Tener celos (To have jealousy)

Issues Linguistic Divergences Conflational Divergence –Translation of two or more words in one language into one word in another language »To kick »Dar una patada (Give a kick)

Issues Linguistic Divergences Structural Divergence –Realization of verb arguments in different syntactic configurations in different languages »To enter the house »Entrar en la casa (Enter in the house)

Issues Linguistic Divergences Head-Swapping Divergence –Inversion of a structural-dominance relation between two semantically equivalent words »To run in »Entrar corriendo (Enter running)

Issues Linguistic Divergences Thematic Divergence –Realization of verb arguments that reflect different thematic to syntactic mapping orders »I like grapes »Me gustan uvas (To-me please grapes)

Issues Linguistic Divergences may be the norm rather than the exception –Differences in MT architecture (direct, transfer, interlingual) are crucial for resolution of cross-language divergences Interlingua approach takes advantage of the compositionality of basic units of meaning to resolve divergences

Issues For example: To kick – Dar una patada (Give a kick) –Conflational divergence can be resolved by mapping English kick into two components before translating into in Spanish Motional component (movement of the leg) Manner component (a kicking motion)

Current Efforts Pangloss project (Frederking et al., 1994) –Ambitious attempt to build rich interlingual expressions –Uses humans to augment system analysis –Representation includes a set of frames for representing semantic components, each of which Are headed by a unique identifier And have a separate frame with aspectual information (duration, telicity, etc.) –Some modifiers are treated as scalars and represented by numerical values

Current Efforts Mikrokosmos (Mahesh and Nirenburg, 1995) / OntoSem (Nirenburg and Raskin, 2004) –Focus is to produce semantically rich Text- Meaning Representations (TMRs) of text –TMRs use a language-independent metalanguage also used for static knowledge resources –TMRs aimed at the most difficult problems of NLP; Disambiguation, reference resolution –Goal is to populate a fact repository with TMRs as a language-independent search space for question-answering and knowledge-extraction

Current Efforts PRINCITRAN (Dorr & Voss, 1996) –Approach assumes an interlingua derived from lexical semantics and predicate decomposition Jackendoff 1983, 1990; Levin & Rappaport-Hovav 1995a, 1995b –Has not complicated, but rather facilitated, the identification and construction of systematic relations at the interface between each level

Current Efforts Motivation for Non-Uniform Approach German: Der Berg liegt im Suden der Stadt –Ambiguous in English: The mountain lies in the south of the city The mountain lies to the south of the city –In other words, the German phrase maps to two distinct representations

Current Efforts Using Default knowledge in the KR –Mountains are physical entities, typically distinct and external to cities –System chooses second translation The mountain lies to the south of the city Using specific facts in the KR –A particular mountain is in the city –System overrides default knowledge and chooses first translation The mountain lies in the south of the city

Current Efforts The need to translate such sentences accurately is a clear case of where general as well as specific real-world knowledge should assist in eliminating inappropriate translations –Knowledge Representational level, not the Interlingual level, provides this capability in this model

Current Efforts Lexical Conceptual Structure (LCS) –Used as part of many MT language pairs including ChinMT (Habash et al., 2003a) Chinese-English –Also been used for other natural language applications Cross-language information retrieval

Current Efforts Lexical Conceptual Structure (LCS) –Approach focuses on linguistic divergences –For example – Conflational divergence Arabic: The reporter caused the to go to Al-Jazeera in a sending manner. English: The reporter ed Al-Jazeera.

Current Efforts LCS representation (event cause (thing[agent] reporter+) (go loc (thing[theme] +) (path to loc (thing +) (position at loc (thing +) (thing[goal] aljazeera+))) (manner send+ingly)))

Current Efforts LCS representation (event cause (thing[agent] reporter+) (go loc (thing[theme] +) (path to loc (thing +) (position at loc (thing +) (thing[goal] aljazeera+))) (manner send+ingly))) –Primary components of meaning are the top- level conceptual nodes cause and go

Current Efforts LCS representation (event cause (thing[agent] reporter+) (go loc (thing[theme] +) (path to loc (thing +) (position at loc (thing +) (thing[goal] aljazeera+))) (manner send+ingly))) –Primary components of meaning are the top- level conceptual nodes cause and go –These are taken together with their arguments Each identified by a semantic role (agent, theme, goal) –And a modifier (manner) send+ingly

LCS as an interlingua? Jackendoff wasnt trying to capture all of meaning – just the semantics that corresponds to syntactic generalizations Ch-of-loc, causation, states,... are very fundamental. If we dont get anything else, we should get at least these LCS highlights just these relations – not bad for an interlingua, but what about those stylistic things, etc?

Current Efforts Approximate Interlingua (Dorr and Habash, 2002) –Depth of knowledge-based systems is approximated –Taps into the richness of resources in one language (often English) –This information is used to map the source- language input to the target-language output

Current Efforts Approximate Interlingua (Dorr and Habash, 2002) –Focus on linguistic divergences but with fewer knowledge-intensive components than in LCS –Key feature Coupling of basic argument-structure information with some, but not all, components the LCS representation Only the top-level primitives and semantic roles are retained –This new representation provides the basis for generation of multiple sentences that are statistically pared down – ranked by TL constraints

Current Efforts Approximate Interlingua representation: –Check top-level conceptual nodes for matches –Check unmatched thematic roles for conflatability Cases where semantic roles are absorbed into other predicate positions –Here there is a relation between the conflated argument N and V