Flow Network Models for Sub-Sentential Alignment Ying Zhang (Joy) Advisor: Ralf Brown Dec 18 th, 2001.

Slides:

Advertisements

Similar presentations

Statistical Machine Translation

Advertisements

The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China

Maximum Flow and Minimum Cut Problems In this handout: Duality theory Upper bounds for maximum flow value Minimum Cut Problem Relationship between Maximum.

Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser Institute for Natural Language Processing University of Stuttgart

Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser ICL, U. Heidelberg CIS, LMU München Statistical Machine Translation.

Statistical Machine Translation Part II – Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart

Huffman code and ID3 Prof. Sin-Min Lee Department of Computer Science.

Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.

Crew Scheduling Housos Efthymios, Professor Computer Systems Laboratory (CSL) Electrical & Computer Engineering University of Patras.

A Maximum Coherence Model for Dictionary-based Cross-language Information Retrieval Yi Liu, Rong Jin, Joyce Y. Chai Dept. of Computer Science and Engineering.

Chinese Word Segmentation Method for Domain-Special Machine Translation Su Chen; Zhang Yujie; Guo Zhen; Xu Jin’an Beijing Jiaotong University.

Unsupervised Turkish Morphological Segmentation for Statistical Machine Translation Coskun Mermer and Murat Saraclar Workshop on Machine Translation and.

Re-ranking for NP-Chunking: Maximum-Entropy Framework By: Mona Vajihollahi.

Carnegie Mellon 1 Maximum Likelihood Estimation for Information Thresholding Yi Zhang & Jamie Callan Carnegie Mellon University

Advanced Topics in Algorithms and Data Structures 1 Lecture 4 : Accelerated Cascading and Parallel List Ranking We will first discuss a technique called.

The current status of Chinese- English EBMT -where are we now Joy (Ying Zhang) Ralf Brown, Robert Frederking, Erik Peterson Aug 2001.

EBMT1 Example Based Machine Translation as used in the Pangloss system at Carnegie Mellon University Dave Inman.

The current status of Chinese-English EBMT research -where are we now Joy, Ralf Brown, Robert Frederking, Erik Peterson Aug 2001.

ACL 2005 WORKSHOP ON BUILDING AND USING PARALLEL TEXTS (WPT-05), Ann Arbor, MI. June Competitive Grouping in Integrated Segmentation and Alignment.

Machine Translation A Presentation by: Julie Conlonova, Rob Chase, and Eric Pomerleau.

Symmetric Probabilistic Alignment Jae Dong Kim Committee: Jaime G. Carbonell Ralf D. Brown Peter J. Jansen.

Semi-Automatic Learning of Transfer Rules for Machine Translation of Low-Density Languages Katharina Probst April 5, 2002.

MT Summit VIII, Language Technologies Institute School of Computer Science Carnegie Mellon University Pre-processing of Bilingual Corpora for Mandarin-English.

Parameter estimate in IBM Models: Ling 572 Fei Xia Week ??

ABC--- A Phrase-to-Phrase Alignment Method Integrating monolingual and bilingual information in sub sentential phrase alignment Ying Zhang (Joy)

9/12/2003LTI Student Research Symposium1 An Integrated Phrase Segmentation/Alignment Algorithm for Statistical Machine Translation Joy Advisor: Stephan.

1 The Web as a Parallel Corpus  Parallel corpora are useful  Training data for statistical MT  Lexical correspondences for cross-lingual IR  Early.

LEARNING WORD TRANSLATIONS Does syntactic context fare better than positional context? NCLT/CNGL Internal Workshop Ankit Kumar Srivastava 24 July 2008.

A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora Benjamin Arai Computer Science and Engineering Department.

Natural Language Processing Expectation Maximization.

Natural Language Processing Lab Northeastern University, China Feiliang Ren EBMT Based on Finite Automata State Transfer Generation Feiliang Ren.

Machine translation Context-based approach Lucia Otoyo.

English-Persian SMT Reza Saeedi 1 WTLAB Wednesday, May 25, 2011.

METEOR-Ranking & M-BLEU: Flexible Matching & Parameter Tuning for MT Evaluation Alon Lavie and Abhaya Agarwal Language Technologies Institute Carnegie.

Advanced Signal Processing 05/06 Reinisch Bernhard Statistical Machine Translation Phrase Based Model.

Minimum Cost Flows. 2 The Minimum Cost Flow Problem u ij = capacity of arc (i,j). c ij = unit cost of shipping flow from node i to node j on (i,j). x.

Scalable Inference and Training of Context- Rich Syntactic Translation Models Michel Galley, Jonathan Graehl, Keven Knight, Daniel Marcu, Steve DeNeefe.

Péter Schönhofen – Ad Hoc Hungarian → English – CLEF Workshop 20 Sep 2007 Performing Cross-Language Retrieval with Wikipedia Participation report for Ad.

Recent Major MT Developments at CMU Briefing for Joe Olive February 5, 2008 Alon Lavie and Stephan Vogel Language Technologies Institute Carnegie Mellon.

NUDT Machine Translation System for IWSLT2007 Presenter: Boxing Chen Authors: Wen-Han Chao & Zhou-Jun Li National University of Defense Technology, China.

Reordering Model Using Syntactic Information of a Source Tree for Statistical Machine Translation Kei Hashimoto, Hirohumi Yamamoto, Hideo Okuma, Eiichiro.

Why Not Grab a Free Lunch? Mining Large Corpora for Parallel Sentences to Improve Translation Modeling Ferhan Ture and Jimmy Lin University of Maryland,

Carnegie Mellon Goal Recycle non-expert post-editing efforts to: - Refine translation rules automatically - Improve overall translation quality Proposed.

Iterative Translation Disambiguation for Cross Language Information Retrieval Christof Monz and Bonnie J. Dorr Institute for Advanced Computer Studies.

An Iterative Approach to Extract Dictionaries from Wikipedia for Under-resourced Languages G. Rohit Bharadwaj Niket Tandon Vasudeva Varma Search and Information.

LREC 2008 Marrakech 29 May Caroline Lavecchia, Kamel Smaïli and David Langlois LORIA / Groupe Parole, Vandoeuvre-Lès-Nancy, France Phrase-Based Machine.

Improving Named Entity Translation Combining Phonetic and Semantic Similarities Fei Huang, Stephan Vogel, Alex Waibel Language Technologies Institute School.

Mutual bilingual terminology extraction Le An Ha*, Gabriela Fernandez**, Ruslan Mitkov*, Gloria Corpas*** * University of Wolverhampton ** Universidad.

Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Iterative Translation Disambiguation for Cross-Language.

ECE 8443 – Pattern Recognition Objectives: Bayes Rule Mutual Information Conditional Likelihood Mutual Information Estimation (CMLE) Maximum MI Estimation.

Joint Power and Channel Minimization in Topology Control: A Cognitive Network Approach J ORGE M ORI A LEXANDER Y AKOBOVICH M ICHAEL S AHAI L EV F AYNSHTEYN.

A Joint Source-Channel Model for Machine Transliteration Li Haizhou, Zhang Min, Su Jian Institute for Infocomm Research 21 Heng Mui Keng Terrace, Singapore.

Multi-level Bootstrapping for Extracting Parallel Sentence from a Quasi-Comparable Corpus Pascale Fung and Percy Cheung Human Language Technology Center,

1 Minimum Error Rate Training in Statistical Machine Translation Franz Josef Och Information Sciences Institute University of Southern California ACL 2003.

A New Approach for English- Chinese Named Entity Alignment Donghui Feng Yayuan Lv Ming Zhou USC MSR Asia EMNLP-04.

Network Simplex Animations Network Simplex Animations.

Statistical Machine Translation Part II: Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart

Parallel Implementation Of Word Alignment Model: IBM MODEL 1 Professor: Dr.Azimi Fateme Ahmadi-Fakhr Afshin Arefi Saba Jamalian Dept. of Electrical and.

Large Vocabulary Data Driven MT: New Developments in the CMU SMT System Stephan Vogel, Alex Waibel Work done in collaboration with: Ying Zhang, Alicia.

A Simple English-to-Punjabi Translation System By : Shailendra Singh.

Semi-Automatic Learning of Transfer Rules for Machine Translation of Minority Languages Katharina Probst Language Technologies Institute Carnegie Mellon.

ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Bayes Rule Mutual Information Conditional.

LingWear Language Technology for the Information Warrior Alex Waibel, Lori Levin Alon Lavie, Robert Frederking Carnegie Mellon University.

Asynchronous Distributed ADMM for Consensus Optimization Ruiliang Zhang James T. Kwok Department of Computer Science and Engineering, Hong Kong University.

The minimum cost flow problem

James B. Orlin Presented by Tal Kaminker

Expectation-Maximization Algorithm

Improved Word Alignments Using the Web as a Corpus

Boltzmann Machine (BM) (§6.4)

Improving IBM Word-Alignment Model 1(Robert C. MOORE)

Presentation transcript:

Flow Network Models for Sub-Sentential Alignment Ying Zhang (Joy) Advisor: Ralf Brown Dec 18 th, 2001

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 2 Background Sub-sentential alignment problem – Given a bilingual parallel sentence pair, to find the correspondence between source words and target words – One of the major issues in Data-Driven MT (SMT/EBMT) Some approaches – IBM Model 1 – Smooth Injective Map Recognizer [I. Dan Melamed]

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 3 Flow Network A well-studied area since 1950’s Used widely in electrical engineering, computer science, social science and economic problems Any system involving a binary relation can be represented by a network

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 4 Flow Network [Jensen]

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 5 Minimum Cost Flow One of the basic problems in flow network theory For a network G= (V,E), Total cost = Minimum cost problem: given a network, assign the flow for each arc, so that the total cost of the network is minimum

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 6 Alignment as a Flow Network The “abstract concepts” are transformed through this network Flow>=1, if there is alignment between two words, Flow =0. O/W

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 7 Alignment as a Mini-cost Flow Network Assume the alignment probabilities only lies in the possibilities of words across languages. (Alignments between other words do not have impact on this pair), then For word pair (s i,t j ), assign the cost for the arc as -lnp(s i,t j ), then the mini-cost flow network corresponds to the maximum P(a,s,t)

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 8 Pure Mini-Cost Flow Algorithm Primal simplex algorithm

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 9 Pure Mini-cost Flow Algorithm (Cont.) [Jensen]

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 10 Pure Mini-cost Flow Algorithm (Cont.) Two phase algorithm [Jensen]

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 11 Our Model

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 12 Solve the Network – Phase 1 Phase 1: to get the basis solution (solved on paper) BridgeArc

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 13 Solve the Network – Phase 2 If not optimal: – Using bridgeArc to calculate the dual values of tgt nodes – If the minimal dual values of arcs in n_0 < 0 Then make this arc to be the new bridge arc; Insert the previous bridgeArc to n_0; – Else If all dual values in n_0 >0 If min(dual values of n_0) < max(dual values of n_1) make this n_1 arc to be the new bridge arc; insert the previous bridgeArc to n_1; Else optimal, stop

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 14 Update the Cost by E.M. Each sentence pair is represented by a network After the network is solved, update the counts and probability contribution of the word pairs in the solved network Update the probability of word association between source and target language Using Good-Turing smoothing Does not work very well so far! –High frequency words

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 15 Model Training Sentence level aligned bilingual corpus (hknews) Stemmed by Porter stemmer Building a seed statistical dictionary using Ralf’s method [Brown97] Combined with the Chinese-English glossary to get a seed dictionary (coverage != 100%) Solve the network and update the probability until the total cost of the corpus converge

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 16 Performance & Results Fast (align 10,000 sentence pairs in 2 minutes*) * words in sentence <=25 * system ran on PC with 128 RAM Using only the seed dictionary, tested on 30 sentence pairs: – Recall: 53%~61% (because of the coverage of the seed dictionary) – Precision: 74%~76% – Expected to achieve a higher recall when E.M. works properly

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 17 Future Work Currently, the model is no more powerful than IBM model1 We are planning to integrate the monolingual co- occurrence probability and the bilingual association probability into the model

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 18 References Éric Gaussier, “Flow Network Models for Word Alignment and Terminology Extraction from Bilingual Corpora”, Proceedings of the 36th Annual Meeting of the association for Computational Linguistics and the 17th International Conference on Computational Linguistics, COLING-ACL'98. Montreal, Canada Paul A. Jensen and Jonathan F. Bard, “Operations Research Models and Methods”, Ralf D. Brown, "Automated Dictionary Extraction for ``Knowledge- Free'' Example-Based Translation". In Proceedings of the Seventh International Conference on Theoretical and Methodological Issues in Machine Translation, p Santa Fe, July 23-25, 1997.

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 19 Acknowledgement Many thanks to Jian Zhang, Katharina Probst, and Alicia Tribble for their help !

Language Technologies Institute School of Computer Sciences Carnegie Mellon University 20 Questions and Comments?