CSA4050: Advanced Topics in NLP Example Based MT.



"Man does translation, first, by properly decomposing an input sentence into certain fragmental phrases, then by translating these phrases into other language phrases, and finally by properly composing these fragmental translations into one long sentence." (Nagao, 1984)

Example
Problem: translate "He buys a book on international politics".
Database:
– He buys a notebook
  Kare wa noto o kau
  (gloss: he TOPIC notebook OBJECT buys)
– I read a book on international politics
  Watashi wa kokusai seiji nitsuite kakareta hon o yomu
Answer: Kare wa kokusai seiji nitsuite kakareta hon o kau.

Translation Process
Input: he buys a book on international politics
Identify/translate fragments:
– he buys → kare wa ... o kau
– a book on international politics → kokusai seiji nitsuite kakareta hon
Combine translated fragments: kare wa kokusai seiji nitsuite kakareta hon o kau

Three Step Process
Match: identify relevant source-language examples in the database.
Align: find the corresponding fragments in the target language.
Recombine: assemble the target-language fragments into sentences.
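The three steps can be sketched on the running example. This is a toy illustration only: the two lookup tables are hand-written assumptions standing in for what a real system would extract from its example database, and the names `TEMPLATES`, `FRAGMENTS` and `translate` are hypothetical.

```python
# Hand-aligned fragments (an assumption for illustration; a real EBMT system
# would derive these from the aligned example database).
TEMPLATES = {
    # source-language prefix -> target-language frame with a slot X
    ("he", "buys"): "kare wa {X} o kau",
}
FRAGMENTS = {
    ("a", "notebook"): "noto",
    ("a", "book", "on", "international", "politics"):
        "kokusai seiji nitsuite kakareta hon",
}


def translate(sentence):
    """Return a translation built from stored fragments, or None."""
    words = tuple(sentence.lower().split())
    for prefix, frame in TEMPLATES.items():
        # Match: does a stored example cover the start of the input?
        if words[:len(prefix)] == prefix:
            rest = words[len(prefix):]
            # Align: look up the target fragment for the remaining words.
            if rest in FRAGMENTS:
                # Recombine: slot the fragment into the target frame.
                return frame.format(X=FRAGMENTS[rest])
    return None


print(translate("He buys a book on international politics"))
```

The sketch only handles inputs that split into exactly one stored prefix and one stored fragment; anything else returns None.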

Matching
The nature of the matching process depends on how the database is organised. In the simplest case the database is a bilingual corpus aligned at sentence level, i.e. a collection of sentence pairs. Relevant pairs are then identified by matching the input sentence against the stored source-language sentences.
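A minimal sketch of the simplest case, assuming the database is a Python list of (source, target) sentence pairs. The Dice word-overlap score used for ranking is an assumption chosen for brevity, not a method named in the slides; `dice`, `DATABASE` and `rank_examples` are hypothetical names.

```python
def dice(a, b):
    """Dice coefficient between two collections of words, treated as sets."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return 2 * len(a & b) / (len(a) + len(b))


# The database is a collection of sentence pairs (source, target).
DATABASE = [
    ("He buys a notebook", "Kare wa noto o kau"),
    ("I read a book on international politics",
     "Watashi wa kokusai seiji nitsuite kakareta hon o yomu"),
]


def rank_examples(sentence, database=DATABASE):
    """Return (score, source, target) triples, best match first."""
    words = sentence.lower().split()
    scored = [(dice(words, src.lower().split()), src, tgt)
              for src, tgt in database]
    return sorted(scored, reverse=True)


for score, src, tgt in rank_examples("He buys a book on international politics"):
    print(f"{score:.3f}  {src}  ->  {tgt}")
```

Both stored pairs score above zero here, which is the point of the running example: each one contributes a different fragment of the final translation.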

Matching
Different methods of sentence/sentence matching:
– Character based
– Word based
– Structure based
– Partial

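Character Based
Problem: semantic edit distance versus character edit distance.
– Paper tray A holds up to 400 sheets / Paper tray B holds up to 400 sheets: d=1
– The large paper tray / The small paper tray: d=5
Each pair differs by a single word, yet the character edit distances are 1 and 5, so character distance is a poor guide to how far apart the sentences are in meaning.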
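The two distances can be checked with a standard Levenshtein (unit-cost edit distance) routine. This is a sketch assuming that is the distance intended by d; the function name `edit_distance` is hypothetical. Because the routine accepts any sequence, the same code also gives a word-level distance when passed lists of words.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences, with unit costs."""
    prev = list(range(len(b) + 1))          # distance from "" to b[:j]
    for i, x in enumerate(a, start=1):
        curr = [i]                          # distance from a[:i] to ""
        for j, y in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,                # delete x
                curr[j - 1] + 1,            # insert y
                prev[j - 1] + (x != y),     # substitute, or match at cost 0
            ))
        prev = curr
    return prev[-1]


pairs = [
    ("Paper tray A holds up to 400 sheets", "Paper tray B holds up to 400 sheets"),
    ("The large paper tray", "The small paper tray"),
]
for s, t in pairs:
    # Character-level distance, then word-level distance on the same pair.
    print(edit_distance(s, t), edit_distance(s.split(), t.split()))
```

At character level the pairs come out as 1 and 5; at word level both are 1.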

Word Based
Allows matching between different words on the basis of a similarity metric (e.g. based on semantic features). This allows inexact matches and also solves the problem of which stored translation to choose, as in the following case, which involves different Japanese translations of the word “eat”.
Stored: A man eats vegetables. Acid eats metal.
Example to translate: He eats potatoes
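A minimal sketch of how a word similarity metric could pick the stored example for "He eats potatoes". Everything here is an assumption for illustration: the semantic feature sets are invented, the Jaccard score is one possible metric among many, and each sentence is reduced by hand to a (subject, verb, object) triple.

```python
# Invented semantic features, for illustration only.
FEATURES = {
    "man": {"animate", "human"},
    "he": {"animate", "human"},
    "acid": {"inanimate", "substance"},
    "vegetables": {"inanimate", "food"},
    "potatoes": {"inanimate", "food"},
    "metal": {"inanimate", "substance"},
}

# Stored examples as (subject, verb, object) triples plus the original sentence.
STORED = [
    (("man", "eats", "vegetables"), "A man eats vegetables."),
    (("acid", "eats", "metal"), "Acid eats metal."),
]


def word_sim(w1, w2):
    """1.0 for identical words, else Jaccard overlap of their feature sets."""
    if w1 == w2:
        return 1.0
    f1, f2 = FEATURES.get(w1, set()), FEATURES.get(w2, set())
    if not f1 or not f2:
        return 0.0
    return len(f1 & f2) / len(f1 | f2)


def example_sim(triple, stored_triple):
    """Average word similarity over aligned positions."""
    sims = [word_sim(a, b) for a, b in zip(triple, stored_triple)]
    return sum(sims) / len(sims)


def best_example(triple):
    """Return the stored sentence most similar to the input triple."""
    return max(STORED, key=lambda ex: example_sim(triple, ex[0]))[1]


print(best_example(("he", "eats", "potatoes")))
```

With these features the human subject and food object pull the input towards the "man eats vegetables" example, so its translation of "eat" would be the one reused.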

Matching Fragments
Input: He buys a book on international politics
Stored examples:
– he buys a notebook
– he buys a horse
– he buys a car for his mum
– he buys a politician
– John buys toothpaste
– he reads a book on international politics
– books on international politics are exciting
– John sold a book on antiques
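One way to see which fragments each stored example shares with the input is to extract the common contiguous word runs. The sketch below uses `difflib.SequenceMatcher` from the Python standard library as a convenient stand-in; it is an assumption, not the fragment matcher of any particular EBMT system, and `shared_fragments` is a hypothetical name.

```python
from difflib import SequenceMatcher


def shared_fragments(source, example):
    """Return contiguous word runs that source and example have in common."""
    a = source.lower().split()
    b = example.lower().split()
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    # get_matching_blocks() ends with a zero-length sentinel, filtered out here.
    return [" ".join(a[m.a:m.a + m.size])
            for m in matcher.get_matching_blocks() if m.size > 0]


source = "He buys a book on international politics"
for example in [
    "he buys a notebook",
    "John buys toothpaste",
    "he reads a book on international politics",
]:
    print(example, "->", shared_fragments(source, example))
```

Note that `SequenceMatcher` reports one non-overlapping alignment per pair rather than every possible shared fragment, so it gives a quick view of the overlap, not an exhaustive fragment index.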

Bibliography
H. Somers, Example Based MT (review article), Machine Translation 14(2).