1 Machine Translation Dai Xinyu 2006-10-27. 2 Outline  Introduction  Architecture of MT  Rule-Based MT vs. Data-Driven MT  Evaluation of MT  Development.

Slides:



Advertisements
Similar presentations
1 Statistical Machine Translation Bonnie Dorr Christof Monz CMSC 723: Introduction to Computational Linguistics Lecture 8 October 27, 2004.
Advertisements

The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser Institute for Natural Language Processing University of Stuttgart
Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser ICL, U. Heidelberg CIS, LMU München Statistical Machine Translation.
Statistical Machine Translation Part II – Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.
Improving Machine Translation Quality via Hybrid Systems and Refined Evaluation Methods Andreas Eisele DFKI GmbH and Saarland University Helsinki, November.
What is a corpus?* A corpus is defined in terms of  form  purpose The word corpus is used to describe a collection of examples of language collected.
January 12, Statistical NLP: Lecture 2 Introduction to Statistical NLP.
Machine Translation (Level 2) Anna Sågvall Hein GSLT Course, September 2004.
Natural Language and Speech Processing Creation of computational models of the understanding and the generation of natural language. Different fields coming.
Machine Translation Anna Sågvall Hein Mösg F
Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.
The current status of Chinese-English EBMT research -where are we now Joy, Ralf Brown, Robert Frederking, Erik Peterson Aug 2001.
Machine Translation Prof. Alexandros Potamianos Dept. of Electrical & Computer Engineering Technical University of Crete, Greece May 2003.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
Course Summary LING 575 Fei Xia 03/06/07. Outline Introduction to MT: 1 Major approaches –SMT: 3 –Transfer-based MT: 2 –Hybrid systems: 2 Other topics.
C SC 620 Advanced Topics in Natural Language Processing Lecture 24 4/22.
Semi-Automatic Learning of Transfer Rules for Machine Translation of Low-Density Languages Katharina Probst April 5, 2002.
MT Summit VIII, Language Technologies Institute School of Computer Science Carnegie Mellon University Pre-processing of Bilingual Corpora for Mandarin-English.
Application of RNNs to Language Processing Andrey Malinin, Shixiang Gu CUED Division F Speech Group.
1 The Web as a Parallel Corpus  Parallel corpora are useful  Training data for statistical MT  Lexical correspondences for cross-lingual IR  Early.
1 Statistical NLP: Lecture 13 Statistical Alignment and Machine Translation.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Natural Language Understanding
Natural Language Processing Expectation Maximization.
An Automatic Segmentation Method Combined with Length Descending and String Frequency Statistics for Chinese Shaohua Jiang, Yanzhong Dang Institute of.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Natural Language Processing Lab Northeastern University, China Feiliang Ren EBMT Based on Finite Automata State Transfer Generation Feiliang Ren.
Machine translation Context-based approach Lucia Otoyo.
Statistical Alignment and Machine Translation
Machine Translation Dr. Radhika Mamidi. What is Machine Translation? A sub-field of computational linguistics It investigates the use of computer software.
English-Persian SMT Reza Saeedi 1 WTLAB Wednesday, May 25, 2011.
Chapter 1 Introduction Samuel College of Computer Science & Technology Harbin Engineering University.
Evaluation of the Statistical Machine Translation Service for Croatian-English Marija Brkić Department of Informatics, University of Rijeka
An Integrated Approach for Arabic-English Named Entity Translation Hany Hassan IBM Cairo Technology Development Center Jeffrey Sorensen IBM T.J. Watson.
Profile The METIS Approach Future Work Evaluation METIS II Architecture METIS II, the continuation of the successful assessment project METIS I, is an.
Advanced Signal Processing 05/06 Reinisch Bernhard Statistical Machine Translation Phrase Based Model.
Chapter 10 Language and Computer English Linguistics: An Introduction.
Machine Translation  Machine translation is of one of the earliest uses of AI  Two approaches:  Traditional approach using grammars, rewrite rules,
NUDT Machine Translation System for IWSLT2007 Presenter: Boxing Chen Authors: Wen-Han Chao & Zhou-Jun Li National University of Defense Technology, China.
IRCS/CCN Summer Workshop June 2003 Speech Recognition.
Approaches to Machine Translation CSC 5930 Machine Translation Fall 2012 Dr. Tom Way.
1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of.
1 Statistical NLP: Lecture 7 Collocations. 2 Introduction 4 Collocations are characterized by limited compositionality. 4 Large overlap between the concepts.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Machine Translation (Level 2) Anna Sågvall Hein GSLT Course, January 2003.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
LREC 2008 Marrakech 29 May Caroline Lavecchia, Kamel Smaïli and David Langlois LORIA / Groupe Parole, Vandoeuvre-Lès-Nancy, France Phrase-Based Machine.
Auckland 2012Kilgarriff: NLP and Corpus Processing1 The contribution of NLP: corpus processing.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
The Unreasonable Effectiveness of Data
Statistical Machine Translation Part II: Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
Approaching a New Language in Machine Translation Anna Sågvall Hein, Per Weijnitz.
October 10, 2003BLTS Kickoff Meeting1 Transfer with Strong Decoding Learning Module Transfer Rules {PP,4894} ;;Score: PP::PP [NP POSTP] -> [PREP.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
Review: Review: Translating without in-domain corpus: Machine translation post-editing with online learning techniques Antonio L. Lagarda, Daniel Ortiz-Martínez,
Introduction to Machine Translation
LingWear Language Technology for the Information Warrior Alex Waibel, Lori Levin Alon Lavie, Robert Frederking Carnegie Mellon University.
Neural Machine Translation
Statistical Machine Translation Part II: Word Alignments and EM
Approaches to Machine Translation
Introduction to Machine Translation
Statistical NLP: Lecture 13
Approaches to Machine Translation
Natural Language Processing
Introduction to Machine Translation
Machine Translation(MT)
Information Retrieval
Presentation transcript:

1 Machine Translation Dai Xinyu

2 Outline  Introduction  Architecture of MT  Rule-Based MT vs. Data-Driven MT  Evaluation of MT  Development of MT  MT problems in general  Some Thinking about MT from recognition

3 Introduction machine translation - the use of computers to translate from one language to another The classic acid test for natural language processing. Requires capabilities in both interpretation and generation. About $10 billion spent annually on human translation. "I have a text in front of me which is written in Russian but I am going to pretend that it is really written in English and that it has been coded in some strange symbols. All I need do is strip off the code in order to retrieve the information contained in the text"

4 Introdution - MT past and present  mid-1950's : Great expectations  The dark ages for MT: Academic research projects  1980's 's: Successful specialized applications  1990's: Human-machine cooperative translation  1990's - now: Statistical-based MT Hybrid-strategies MT  Future prospects: ???

5 Interest in MT  Commercial interest: U.S. has invested in MT for intelligence purposes MT is popular on the web — it is the most used of Google ’ s special features EU spends more than $1 billion on translation costs each year. (Semi-)automated translation could lead to huge savings

6 Interest in MT  Academic interest: One of the most challenging problems in NLP research Requires knowledge from many NLP sub-areas, e.g., lexical semantics, parsing, morphological analysis, statistical modeling, … Being able to establish links between two languages allows for transferring resources from one language to another

7 Related Area to MT  Linguistics  Computer Science AI Compile Formal Semantics …  Mathematics Probability Statistics …  Informatics  Recognition

8 Architecture of MT -- (Levers of Transfer)

9 Rule-Based MT vs. Data-Driven MT  Rule-Based MT  Data-Driven MT Example-Based MT Statistics-Based MT

10 Rule-Based MT 翻译系统 规则 语言学 语义学 认知科学 人工智能 写规则 自然语言输入 翻译结果

11 Rule-Based MT

12 Hmm, every time he sees “banco”, he either types “bank” or “bench” … but if he sees “banco de…”, he always types “bank”, never “bench”… Man, this is so boring. Translated documents

13 Example-Based MT  origins: Nagao (1981)  first motivation: collocations, bilingual differences of syntactic structures  basic idea: human translators search for analogies (similar phrases) in previous translations MT should seek matching fragment in bilingual database, extract translations  aim to have less complex dictionaries, grammars, and procedures  improved generation (using actual examples of TL sentences)

14 EBMT still going  Bi-lingual corpus Collection  Store  Searching and matching  …

15 Statistical MT Basics  Based on assumption that translations observed statistical regularities origins: Warren Weaver (1949) Shannon ’ s information theory  core process is the probabilistic ‘ translation model ’ taking SL words or phrases as input, and producing TL words or phrases as output  succeeding stage involves a probabilistic ‘ language model ’ which synthesizes TL words as ‘ meaningful ’ TL sentences

16 Statistical MT 学习系统 预测系统 概率模型 统计学习 建立模型 自然语言输入 预测

17 Statistical MT schema

18 Statistical MT processes  Bilingual corpora: original and translation  little or no linguistic ‘ knowledge ’, based on word co- occurrences in SL and TL texts (of a corpus), relative positions of words within sentences, length of sentences  Alignment: sentences aligned statistically (according to sentence length and position)  Decoding: compute probability that a TL string is the translation of a SL string ( ‘ translation model ’ ), based on: frequency of co-occurrence in aligned texts of corpus position of SL words in SL string  Adjustment: compute probability that a TL string is a valid TL sentence (based on a ‘ language model ’ of allowable bigrams and trigrams)  search for TL string that maximizes these probabilities argmax e P(e/f) = argmax e P (f/e) P (e)

19 Language Modeling  Determines the probability of some English sequence of length l  P(e) is normally approximated as: where m is size of the context, i.e. number of previous words that are considered, m=1, bi-gram language model m=2, tri-gram language model

20 Translation Modeling  Determines the probability that the foreign word f is a translation of the English word e  How to compute P(f | e) from a parallel corpus?  Statistical approaches rely on the co- occurrence of e and f in the parallel data: If e and f tend to co-occur in parallel sentence pairs, they are likely to be translations of one another

21 SMT issues  ignores previous MT research (new start, new ‘ paradigm ’ ) basically ‘ direct ’ approach:  replaces SL word by most probable TL word,  reorders TL words decoding is effectively kind of ‘ back translation ’  originally wholly word-based (IBM ‘ Candide ’ 1988) ; now predominantly phrase-based (i.e. alignment of word groups); some research on syntax-based  mathematically simple, but huge amount of training (large databases)  problems for SMT: translation is not just selecting the most frequent ‘ equivalent ’ (wider context) no quality control of corpora lack of monolingual data for some languages insufficient bilingual data (Internet as resource) lack of structure information of language  merit of SMT: evaluation as integral process of system development

22 Rule-Based MT & SMT  SMT black box: no way of finding how it works in particular cases, why it succeeds sometimes and not others  RBMT: rules and procedures can be examined  RBMT and SMT are apparent polar opposites, but gradually ‘ rules ’ incorporated in SMT models first, morphology (even in versions of first IBM model) then, ‘ phrases ’ (with some similarity to linguistic phrases) now also, syntactic parsing

23 Rule-Based MT & SMT  Comparison from following perspectives: Theory background Knowledge expression Knowledge discovery Robust Extension Development Cycle

24 Evaluation of MT  Manual: Precise / fluency / integrality 信 达 雅  Automatically evaluation: BLEU: percentage of word sequences (n-grams) occurring in reference texts NIST

25 Development of MT - MT System

26 Knowledge Acquisition Strategy Knowledge Representation Strategy All manual Deep/ Complex Shallow/ Simple Fully automated Learn from un- annotated data Phrase tables Word-based only Learn from annotated data Example-based MT Original statistical MT Typical transfer system Classic interlingual system Original direct approach Syntactic Constituent Structure Interlingua New Research Goes Here! Semantic analysis Hand-built by non-experts Hand-built by experts Electronic dictionaries MT Development - Research

27 MT problems in general  Characters of language Ambiguous Dynamic Flexible  Knowledge How to express How to discovery How to use

28 Some Thinking about MT from recognition  Human Cerebra Memory Progress - Learning Model Pattern  Translation by human …  Translation by machine …

29 Further Reading  Arturo Trujillo, Translation Engines: Techniques for Machine Translation, Springer-Verlag London Limited 1999  P.F. Brown, et al., A Statistical Approach to MT, Computational Linguistics, 1990,16(2)  P.F. Brown, et al., The Mathematics of Statistical Machine Translation: Parameter Estimation, Computational Linguistics, 1993, 19(2)  Bonnie J. Dorr, et al, Survey of Current Paradigms in Machine Translation  Makoto Nagao, A Framework of a Mechanical Translation between Japanese and English by Analog Principle, In A. Elithorn and R. Banerji(Eds.), Artificial and Human Intelligence. NATO Publications, 1984  Hutchins WJ, Machine Translation: Past, Present, Future. Chichester: Ellis Horwood, 1986  Daniel Jurafsky & James H. Martin, Speech and Language Processing, Prentice-Hall, 2000  Christopher D. Manning & Hinrich Schutze, Foundations of Statistical Natural Langugae Processing, Massachusetts Institute of Technology, 1999  James Allen, Natural Language Understanding, The Benjamin/Cummings Publishing Company, Inc. 1987