A Syntax-Driven Bracketing Model for Phrase-Based Translation Deyi Xiong, et al. ACL 2009.

Slides:



Advertisements
Similar presentations
The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
Advertisements

Joint Parsing and Alignment with Weakly Synchronized Grammars David Burkett, John Blitzer, & Dan Klein TexPoint fonts used in EMF. Read the TexPoint manual.
Progress update Lin Ziheng. System overview 2 Components – Connective classifier Features from Pitler and Nenkova (2009): – Connective: because – Self.
Bayesian Learning of Non- Compositional Phrases with Synchronous Parsing Hao Zhang; Chris Quirk; Robert C. Moore; Daniel Gildea Z honghua li Mentor: Jun.
Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.
Word Sense Disambiguation for Machine Translation Han-Bin Chen
Using Percolated Dependencies in PBSMT Ankit K. Srivastava and Andy Way Dublin City University CLUKI XII: April 24, 2009.
1 A Tree Sequence Alignment- based Tree-to-Tree Translation Model Authors: Min Zhang, Hongfei Jiang, Aiti Aw, et al. Reporter: 江欣倩 Professor: 陳嘉平.
A Tree-to-Tree Alignment- based Model for Statistical Machine Translation Authors: Min ZHANG, Hongfei JIANG, Ai Ti AW, Jun SUN, Sheng LI, Chew Lim TAN.
Machine Translation via Dependency Transfer Philip Resnik University of Maryland DoD MURI award in collaboration with JHU: Bootstrapping Out of the Multilingual.
Inducing Information Extraction Systems for New Languages via Cross-Language Projection Ellen Riloff University of Utah Charles Schafer, David Yarowksy.
Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.
Symmetric Probabilistic Alignment Jae Dong Kim Committee: Jaime G. Carbonell Ralf D. Brown Peter J. Jansen.
A Hierarchical Phrase-Based Model for Statistical Machine Translation Author: David Chiang Presented by Achim Ruopp Formulas/illustrations/numbers extracted.
Probabilistic Parsing Ling 571 Fei Xia Week 5: 10/25-10/27/05.
1 Basic Parsing with Context Free Grammars Chapter 13 September/October 2012 Lecture 6.
Does Syntactic Knowledge help English- Hindi SMT ? Avinesh. PVS. K. Taraka Rama, Karthik Gali.
Czech-to-English Translation: MT Marathon 2009 Session Preview Jonathan Clark Greg Hanneman Language Technologies Institute Carnegie Mellon University.
Technical Report of NEUNLPLab System for CWMT08 Xiao Tong, Chen Rushan, Li Tianning, Ren Feiliang, Zhang Zhuyu, Zhu Jingbo, Wang Huizhen
Tree Kernels for Parsing: (Collins & Duffy, 2001) Advanced Statistical Methods in NLP Ling 572 February 28, 2012.
1 Data-Driven Dependency Parsing. 2 Background: Natural Language Parsing Syntactic analysis String to (tree) structure He likes fish S NP VP NP VNPrn.
Advanced Signal Processing 05/06 Reinisch Bernhard Statistical Machine Translation Phrase Based Model.
2010 Failures in Czech-English Phrase-Based MT 2010 Failures in Czech-English Phrase-Based MT Full text, acknowledgement and the list of references in.
Scalable Inference and Training of Context- Rich Syntactic Translation Models Michel Galley, Jonathan Graehl, Keven Knight, Daniel Marcu, Steve DeNeefe.
Evaluation of Context-Dependent Phrasal Translation Lexicons for Statistical Machine Translation M arine C ARPUAT and D ekai W U Human Language Technology.
Recent Major MT Developments at CMU Briefing for Joe Olive February 5, 2008 Alon Lavie and Stephan Vogel Language Technologies Institute Carnegie Mellon.
The ICT Statistical Machine Translation Systems for IWSLT 2007 Zhongjun He, Haitao Mi, Yang Liu, Devi Xiong, Weihua Luo, Yun Huang, Zhixiang Ren, Yajuan.
NUDT Machine Translation System for IWSLT2007 Presenter: Boxing Chen Authors: Wen-Han Chao & Zhou-Jun Li National University of Defense Technology, China.
Reordering Model Using Syntactic Information of a Source Tree for Statistical Machine Translation Kei Hashimoto, Hirohumi Yamamoto, Hideo Okuma, Eiichiro.
Approaches to Machine Translation CSC 5930 Machine Translation Fall 2012 Dr. Tom Way.
11 Chapter 14 Part 1 Statistical Parsing Based on slides by Ray Mooney.
Sentence Compression Based on ILP Decoding Method Hongling Wang, Yonglei Zhang, Guodong Zhou NLP Lab, Soochow University.
A Systematic Exploration of the Feature Space for Relation Extraction Jing Jiang & ChengXiang Zhai Department of Computer Science University of Illinois,
Effective Use of Linguistic and Contextual Information for Statistical Machine Translation Libin Shen and Jinxi Xu and Bing Zhang and Spyros Matsoukas.
What’s in a translation rule? Paper by Galley, Hopkins, Knight & Marcu Presentation By: Behrang Mohit.
INSTITUTE OF COMPUTING TECHNOLOGY Forest-to-String Statistical Translation Rules Yang Liu, Qun Liu, and Shouxun Lin Institute of Computing Technology Chinese.
Yang Liu State Key Laboratory of Intelligent Technology and Systems Tsinghua National Laboratory for Information Science and Technology Department of Computer.
Alignment of Bilingual Named Entities in Parallel Corpora Using Statistical Model Chun-Jen Lee Jason S. Chang Thomas C. Chuang AMTA 2004.
NRC Report Conclusion Tu Zhaopeng NIST06  The Portage System  For Chinese large-track entry, used simple, but carefully- tuned, phrase-based.
CSA2050 Introduction to Computational Linguistics Parsing I.
Soft Syntactic Constraints for Hierarchical Phrase-Based Translation Yuval Marton and Philip Resnik Department of Linguistics and the Laboratory for Computational.
A non-contiguous Tree Sequence Alignment-based Model for Statistical Machine Translation Jun Sun ┼, Min Zhang ╪, Chew Lim Tan ┼ ┼╪
Improving Named Entity Translation Combining Phonetic and Semantic Similarities Fei Huang, Stephan Vogel, Alex Waibel Language Technologies Institute School.
University of Texas at Austin Machine Learning Group Department of Computer Sciences University of Texas at Austin Learning a Compositional Semantic Parser.
Imposing Constraints from the Source Tree on ITG Constraints for SMT Hirofumi Yamamoto, Hideo Okuma, Eiichiro Sumita National Institute of Information.
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
2003 (c) University of Pennsylvania1 Better MT Using Parallel Dependency Trees Yuan Ding University of Pennsylvania.
CPSC 422, Lecture 27Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 27 Nov, 16, 2015.
11 Project, Part 3. Outline Basics of supervised learning using Naïve Bayes (using a simpler example) Features for the project 2.
Statistical Machine Translation Part II: Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
Discriminative Modeling extraction Sets for Machine Translation Author John DeNero and Dan KleinUC Berkeley Presenter Justin Chiu.
Large Vocabulary Data Driven MT: New Developments in the CMU SMT System Stephan Vogel, Alex Waibel Work done in collaboration with: Ying Zhang, Alicia.
October 10, 2003BLTS Kickoff Meeting1 Transfer with Strong Decoding Learning Module Transfer Rules {PP,4894} ;;Score: PP::PP [NP POSTP] -> [PREP.
Soft Syntactic Constraints for Hierarchical Phrase-Based Translation Yuval Marton and Philip Resnik Department of Linguistics and the Laboratory for Computational.
LING 575 Lecture 5 Kristina Toutanova MSR & UW April 27, 2010 With materials borrowed from Philip Koehn, Chris Quirk, David Chiang, Dekai Wu, Aria Haghighi.
Natural Language Processing : Probabilistic Context Free Grammars Updated 8/07.
Natural Language Processing Vasile Rus
Approaches to Machine Translation
Basic Parsing with Context Free Grammars Chapter 13
Urdu-to-English Stat-XFER system for NIST MT Eval 2008
Suggestions for Class Projects
Statistical NLP Spring 2011
Deep Learning based Machine Translation
Eiji Aramaki* Sadao Kurohashi* * University of Tokyo
Approaches to Machine Translation
Statistical Machine Translation Papers from COLING 2004
A Path-based Transfer Model for Machine Translation
Dekai Wu Presented by David Goss-Grubbs
Johns Hopkins 2003 Summer Workshop on Syntax and Statistical Machine Translation Chapters 5-8 Ethan Phelps-Goodman.
Rule Markov Models for Fast Tree-to-String Translation
Presentation transcript:

A Syntax-Driven Bracketing Model for Phrase-Based Translation Deyi Xiong, et al. ACL 2009

把 7 月 11 日 設立 為 航海 節 Introduction Machine Translation –Chinese to English –Chinese 把 7 月 11 日 設立 為 航海 節 An ideal case: to establish July 11 as Sailing Festival day

Wrong Linguistic Structure 航海 節 is a syntactic constituent 把 7 月 11 日 設立 為 航海 節 to set up for navigation on July 11 knots

A Naive Solution Employ syntactic constraints –Fully respect linguistic structures

把 今天 設立 為航海 節 A Naive Solution (2) Unfortunately, it damages the performance –Non-syntactic translations are sometimes useful Sailing Festival dayestablish today as

Syntax-Driven Bracketing Model SDB model Translation unit is more important –Whether it is syntactic or non-syntactic Include but not limited to constituent matching/violation Protect the strength of the phrase-based system

Translation Unit Bracketable source phrase and its corresponding translation Bracketable –A source phrase is bracketable Its translation is contiguous –A pair of neighboring phrases is bracketable Their translations are contiguous after combined

establish today as Translation Unit Examples Bracketable 把 今天 設立 為 establish today as 把 今天 設立為 把 今天 設立 and 為 are bracketable 把 今天 設立 為 is bracketable

把 今天 設立 為 establish today as Translation Unit Examples Unbracketable 設立 and 為 are unbracketable 設立 為 is unbracketable

Bracketing Instances Extraction Extract bracketable and unbracketable instances from training data –Aligned sentence pair + parsed source sentence Estimate whether a source phrase is bracketable at run time

SDB Features

Rule Features Rule Features (RF) –CFG rule –Horizontal context

Rule Features (2) S 1 : ADVP  AD S 2 : VP  VV AS NP S: VP  ADVP VP

Path Features Path features (PF) –Path to roots S1 to the root of S S2 to the root of S S to the root of this tree –Vertical context

Path Features (2) S 1 : ADVP VP S 2 : VP VP S: VP IP

Constituent Boundary Matching Features Constituent Boundary Matching Features (CBMF) –Exact match Source phrase covers the boundaries of its tree –Inside match Source phrase covers a sequence of its tree –Crossing match Source phrase crosses the subtree of its tree

Constituent Boundary Matching Features (3) Exact match Inside match Crossing match

Integration into Phrase-based MT SDB model estimate the probability that a source phrase is bracketable. –Whether it can be translated as a unit Integrated into BTG MT system –Bracketing Transduction Grammar (Wu, 1997) establish today as 把 今天 設立為 as establish today 把 今天 設立為 Straight Inverted

Experiment Comparing models –Baseline: BTG system –XP+ (Marton and Resnik, 2008) NP, VP, PP, ADVP…. Penalize each time when violating the syntactic boundaries. (soft constraint) –UniSDB Only S features –BiSDB S 1, S 2 and S features

Experiment (2) Chinese parser –Lexicalized PCFG parser (Xiong et al., 2005) Parallel corpus –FBIS corpus Word alignment –GIZA++ Four-gram language model –Built with SRILM –Xinhua section of the the English Gigaword corpus Maximum Entropy (ME) Trainer –Zhang 2004

Result SDB receives the largest feature weight –Imply its impact on decoder. Baseline features (Common for phrase-based systems) XP+ and SDB

Result (2) NIST MT-05 test set –Improvement of 1.67 BLEU over baseline –Improvement of 0.59 BLEU over XP+

Result (3) Based on CBMF, adding rule and path feature achieves further improvement BiSDB is constantly better than UniSDB –Inner contexts (S1 and S2) are useful

XP+ and SDB Same –Consider syntactic constituent Different –XP+ only punishes non-syntactic source phrase –SDB is able to encourage non-syntactic if the phrase is bracketable

XP+ and SDB

Conclusion SDM model predict whether a source phrase can be translated as a unit. Appropriate constituent violations are helpful –Because it better inherit the strength of phrase-based approach