Statistical Machine Translation Raghav Bashyal. Statistical Machine Translation Uses pre-translated text (copora) Compare translated text to original.

Slides:



Advertisements
Similar presentations
Statistical Machine Translation
Advertisements

1 CS 388: Natural Language Processing: N-Gram Language Models Raymond J. Mooney University of Texas at Austin.
Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser Institute for Natural Language Processing University of Stuttgart
Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser ICL, U. Heidelberg CIS, LMU München Statistical Machine Translation.
Translator Architecture Code Generator ParserTokenizer string of characters (source code) string of tokens abstract program string of integers (object.
Statistical Machine Translation Part II – Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
NATURAL LANGUAGE PROCESSING. Applications  Classification ( spam )  Clustering ( news stories, twitter )  Input correction ( spell checking )  Sentiment.
Sarah Reonomy OSCON 2014 ANALYZING DATA WITH PYTHON.
1 An Introduction to Statistical Machine Translation Dept. of CSIE, NCKU Yao-Sheng Chang Date:
Descriptive Statistics In SAS Exploring Your Data.
1 Duluth Word Alignment System Bridget Thomson McInnes Ted Pedersen University of Minnesota Duluth Computer Science Department 31 May 2003.
1 Statistical NLP: Lecture 13 Statistical Alignment and Machine Translation.
A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora Benjamin Arai Computer Science and Engineering Department.
Natural Language Processing Expectation Maximization.
Multi-Style Language Model for Web Scale Information Retrieval Kuansan Wang, Xiaolong Li and Jianfeng Gao SIGIR 2010 Min-Hsuan Lai Department of Computer.
Multiple testing correction
Text Analysis Everything Data CompSci Spring 2014.
Statistical Machine Translation Part III – Phrase-based SMT / Decoding Alex Fraser Institute for Natural Language Processing University of Stuttgart
April 2005CSA2050:NLTK1 CSA2050: Introduction to Computational Linguistics NLTK.
Query Rewriting Using Monolingual Statistical Machine Translation Stefan Riezler Yi Liu Google 2010 Association for Computational Linguistics.
Statistical Machine Translation Part IV – Log-Linear Models Alex Fraser Institute for Natural Language Processing University of Stuttgart Seminar:
Intro to NLP - J. Eisner1 Finite-State and the Noisy Channel.
CSC 338: Compiler design and implementation
Systems Development Lifecycle Testing and Documentation.
Active Learning for Statistical Phrase-based Machine Translation Gholamreza Haffari Joint work with: Maxim Roy, Anoop Sarkar Simon Fraser University NAACL.
Python From the book “Think Python”
Retrieval Models for Question and Answer Archives Xiaobing Xue, Jiwoon Jeon, W. Bruce Croft Computer Science Department University of Massachusetts, Google,
Machine Translation  Machine translation is of one of the earliest uses of AI  Two approaches:  Traditional approach using grammars, rewrite rules,
Problem Solving using the Science of Computing MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way.
An Investigation of Statistical Machine Translation (Spanish to English) Raghav Bashyal.
11 Chapter 14 Part 1 Statistical Parsing Based on slides by Ray Mooney.
Martin KayTranslation—Meaning1 Martin Kay Stanford University with thanks to Kevin Knight.
Compact WFSA based Language Model and Its Application in Statistical Machine Translation Xiaoyin Fu, Wei Wei, Shixiang Lu, Dengfeng Ke, Bo Xu Interactive.
Statistical Machine Translation Part III – Phrase-based SMT / Decoding Alexander Fraser Institute for Natural Language Processing Universität Stuttgart.
For Wednesday No reading Homework –Chapter 23, exercise 15 –Process: 1.Create 5 sentences 2.Select a language 3.Translate each sentence into that language.
LING 388: Language and Computers Sandiway Fong Lecture 27: 12/6.
Lecture 4 Ngrams Smoothing
TEXT ANALYTICS - LABS Maha Althobaiti Udo Kruschwitz Massimo Poesio.
Chinese Word Segmentation Adaptation for Statistical Machine Translation Hailong Cao, Masao Utiyama and Eiichiro Sumita Language Translation Group NICT&ATR.
OCR GCSE Computing © Hodder Education 2013 Slide 1 OCR GCSE Computing Python programming 1: Introduction.
Naïve Bayes Classifier Christina Wallin, Period 3 Computer Systems Research Lab Christina Wallin, Period 3 Computer Systems Research Lab
1 Minimum Error Rate Training in Statistical Machine Translation Franz Josef Och Information Sciences Institute University of Southern California ACL 2003.
Naïve Bayes Classification Christina Wallin Computer Systems Research Lab
Statistical Machine Translation Part II: Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
Part-of-Speech Tagging with Limited Training Corpora Robert Staubs Period 1.
Intro to NLP - J. Eisner1 Finite-State and the Noisy Channel.
23.3 Information Extraction More complicated than an IR (Information Retrieval) system. Requires a limited notion of syntax and semantics.
An Improved Hierarchical Word Sequence Language Model Using Word Association NARA Institute of Science and Technology Xiaoyi WuYuji Matsumoto.
Software Engineering Algorithms, Compilers, & Lifecycle.
Statistical Machine Translation Part II: Word Alignments and EM
Introduction to Compiler Construction
Tools for Natural Language Processing Applications
Alexander Fraser CIS, LMU München Machine Translation
Natural Language Processing (NLP)
Huffman Coding, Arithmetic Coding, and JBIG2
Statistical NLP: Lecture 13
Compiler Construction
Statistical Machine Translation Part III – Phrase-based SMT / Decoding
Morphoogle - A Multilingual Interface to a Web Search Engine
Statistical Machine Translation Papers from COLING 2004
Statistical n-gram David ling.
Machine Translation(MT)
Improving IBM Word-Alignment Model 1(Robert C. MOORE)
Lecture 12: Machine Translation (II) November 4, 2004 Dan Jurafsky
Natural Language Processing (NLP)
CSA2050: Introduction to Computational Linguistics
Statistical Machine Translation Part VI – Phrase-based Decoding
CS224N Section 2: PA2 & EM Shrey Gupta January 21,2011.
Statistical Machine Translation
Natural Language Processing (NLP)
Presentation transcript:

Statistical Machine Translation Raghav Bashyal

Statistical Machine Translation Uses pre-translated text (copora) Compare translated text to original Notice patterns, associate words

SMT Process Knight – A Statistical Translation Workbook Basic probabilities – P(word) Conditional probabilities – P(word | word) … Pick the most probable translation

SMT process

Project Translate basic text from Spanish to English Test effectiveness with/without hard-coded components (syntax) Specific procedures/algorithms that add speed

Literature Guides on Statistical Machine Translation Most research project follow the same procedure as outlined by Knight “state of the art” implementation – Google

Literature NLTK – Christina Wallin UC Berkeley – Modifications – Larger corpora more useful Syntax based – hard-code – Higher translation quality when used with SMT

Procedure NLTK – Natural Language ToolKit Python Made from Natural Language processing projects Current procedure – read the SMT worksheet Code along with worksheet

Development Create corpora Tokenization – Clean string Probability – P(word) in corpora

Smoothing Coefficients used to modify probability – Large coefficients for trigrams – Small for bigrams and single words Normalizes the weight of all the words/phrases – Trigrams are more valuable

Algorithm For translation, IMB Model 3 is used: 1. For each English word ei indexed by i = 1, 2,..., 1, choose fertility phi-i with probability n(phi-i | ei) 2. Choose the number phi-0 of "spurious" French words to be generated from e0 = NULL, using probability p1 and the sum of fertilities from step 1 3. Let m be the sum of fertilities for all words, including NULL 4. For each i = 0, 1, 2,...., 1, and each k = 1, 2,..., phi-i, choose a French word tau-ik with probability t(tau-ik | ei) 5. For each i = 1, 2,..., 1, and each k = 1, 2,..., phi-i, choose target French position pi-ik with probability d(pi-ik | i, l, m) 6. For each k = 1, 2,..., phi-0, choose a position pi-0k from the phi-0 - k + 1 remaining vacant positions in 1, 2,...m, for a total probability of 1/phi-0! 7. Output the French sentence with words tau-ik in positions pi-ik (0<=i<=1, 1<=k<phi-i)

Expected Results Probably will be very basic translation Usually perform better with “sample” text than “real” text Highlighted errors Program should use reference data to find some errors Error frequency plots for certain words Test the effectiveness of adjustments Hard coding, other algorithms

GUI