Introduction to Machine Translation

Slides:



Advertisements
Similar presentations
Speech-to-Speech Translation Hannah Grap Language Weaver, Inc.
Advertisements

1 Statistical Machine Translation Bonnie Dorr Christof Monz CMSC 723: Introduction to Computational Linguistics Lecture 8 October 27, 2004.
The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
Machine Translation II How MT works Modes of use.
Help communities share knowledge more effectively across the language barrier Automated Community Content Editing PorTal.
Probabilistic Language Processing Chapter 23. Probabilistic Language Models Goal -- define probability distribution over set of strings Unigram, bigram,
Machine Translation (Level 2) Anna Sågvall Hein GSLT Course, September 2004.
Speech Translation on a PDA By: Santan Challa Instructor Dr. Christel Kemke.
CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:
Erasmus University Rotterdam Frederik HogenboomEconometric Institute School of Economics Flavius Frasincar.
Machine Translation Prof. Alexandros Potamianos Dept. of Electrical & Computer Engineering Technical University of Crete, Greece May 2003.
Course Summary LING 575 Fei Xia 03/06/07. Outline Introduction to MT: 1 Major approaches –SMT: 3 –Transfer-based MT: 2 –Hybrid systems: 2 Other topics.
1.3 Executing Programs. How is Computer Code Transformed into an Executable? Interpreters Compilers Hybrid systems.
Machine Translation History of Machine Translation Difficulties in Machine Translation Structure of Machine Translation System Research methods for Machine.
MACHINE TRANSLATION TRANSLATION(5) LECTURE[1-1] Eman Baghlaf.
March 1, 2009 Dr. Muhammed Al-Mulhem 1 ICS 482 Natural Language Processing INTRODUCTION Muhammed Al-Mulhem March 1, 2009.
MACHINE TRANSLATION A precious key to communicate beyond linguistic barriers 1.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
The Internet as a Publishing Channel Teppo Räisänen LIIKE/OAMK.
Machine translation Context-based approach Lucia Otoyo.
Machine Translation Dr. Radhika Mamidi. What is Machine Translation? A sub-field of computational linguistics It investigates the use of computer software.
INTRODUCTION TO COMPUTING CHAPTER NO. 06. Compilers and Language Translation Introduction The Compilation Process Phase 1 – Lexical Analysis Phase 2 –
Chapter 10: Compilers and Language Translation Invitation to Computer Science, Java Version, Third Edition.
Globalisation and machine translation Machine Translation (MT) The ‘decoding’ paradigm Ambiguity Translation models Interlingua and First Order Predicate.
1 Natural Language Processing Gholamreza Ghassem-Sani Fall 1383.
1 Computational Linguistics Ling 200 Spring 2006.
Thanks to Dan Klein of UC Berkeley and Chris Manning of Stanford for many of the materials used in this lecture. CS 479, section 1: Natural Language Processing.
© Copyright 2013 ABBYY NLP PLATFORM FOR EU-LINGUAL DIGITAL SINGLE MARKET Alexander Rylov LTi Summit 2013 Confidential.
Natural Language Processing Guangyan Song. What is NLP  Natural Language processing (NLP) is a field of computer science and linguistics concerned with.
Evolution of Machine Translation: systems and use John Hutchins [ homepages/WJHutchins] [
Approaches to Machine Translation CSC 5930 Machine Translation Fall 2012 Dr. Tom Way.
Research Topics CSC Parallel Computing & Compilers CSC 3990.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
C SC 620 Advanced Topics in Natural Language Processing Lecture 25 5/4.
Languages at Inxight Ian Hersey Co-Founder and SVP, Corporate Development and Strategy.
Jan 2005CSA4050 Machine Translation II1 CSA4050: Advanced Techniques in NLP Machine Translation II Direct MT Transfer MT Interlingual MT.
MT and the Future Eduard Hovy Information Sciences Institute University of Southern California.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
Natural Language Processing Group Computer Sc. & Engg. Department JADAVPUR UNIVERSITY KOLKATA – , INDIA. Professor Sivaji Bandyopadhyay
23.3 Information Extraction More complicated than an IR (Information Retrieval) system. Requires a limited notion of syntax and semantics.
King Faisal University جامعة الملك فيصل Deanship of E-Learning and Distance Education عمادة التعلم الإلكتروني والتعليم عن بعد [ ] 1 جامعة الملك فيصل عمادة.
Jan 2012MT Architectures1 Human Language Technology Machine Translation Architectures Direct MT Transfer MT Interlingual MT.
Introduction to Machine Translation
Arabic NLP: Overview, the State of the Art Challenges and Opportunities Ali Farghaly.
Computer Systems Nat 5 Computing Science
Automatic Calculation of Translator Productivity Improvement
Compiler Design (40-414) Main Text Book:
RECENT TRENDS IN SMT By M.Balamurugan, Phd Research Scholar,
Approaches to Machine Translation
How to teach translation technologies
Computer Systems Nat 5 Computing Science
Techniques and Principles in Language Teaching
عمادة التعلم الإلكتروني والتعليم عن بعد
LACONEC A Large-scale Multilingual Semantics-based Dictionary
Viewing Information and Communication Technology as Pervasive Science Rajeev Sangal International Institute of Information Technology Hyderabad
Are End-to-end Systems the Ultimate Solutions for NLP?
Head, IT Systems Section
Semantic Web: Commercial Opportunities and Prospects
9.a Report on IPC-related IT systems IPC Committee of Experts 50
Approaches to Machine Translation
Introduction to Machine Translation
Computational Linguistics: New Vistas
Web Mining Department of Computer Science and Engg.
Business Careers: Human Resources A discussion with Lacrystal Horne’s High Point Central High School Class.
Knowledge Representation for Natural Language Understanding
Chapter 10: Compilers and Language Translation
Data Analysis and R : Technology & Opportunity
Introduction to Artificial Intelligence
Artificial Intelligence 2004 Speech & Natural Language Processing
Presentation transcript:

Introduction to Machine Translation CSC 5930 Machine Translation Fall 2016 Dr. Tom Way

History of Machine Translation

History of Machine Translation (Based on work by John Hutchins, mt-archive.info) Before the computer: In the mid 1930s, a French-Armenian Georges Artsrouni and a Russian Petr Troyanskii applied for patents for ‘translating machines’. The pioneers (1947-1954): the first public MT demo was given in 1954 (by IBM and Georgetown University). Machine translation was one of the first applications envisioned for computers

History of MT (2) Warren Weaver, PhD was an American scientist, mathematician, and science administrator. He is widely recognized as one of the pioneers of machine translation, and as an important figure in creating support for science in the United States.

History of MT (3) First demonstrated by IBM in 1954 with a basic word-for-word translation system

History of MT (4) The decade of optimism (1954-1966) ended with the… ALPAC (Automatic Language Processing Advisory Committee) report in 1966: “There is no immediate or predictable prospect of useful machine translation."

History of MT (5) The ALPAC Report The ALPAC (Automatic Language Processing Advisory Committee) was a govt. committee of seven scientists. Their 1966 report was very skeptical of the progress in computational linguistics and machine translation.

History of MT (6) The aftermath of the ALPAC report… Research on machine translation virtually stopped from 1966 to 1980

History of MT (7) Then, a rebirth… The 1980s: Interlingua, example-based Machine Translation The 1990s: Statistical MT The 2000s: Hybrid MT The 2010s: Google, real-time, mobile, Crowdsourcing, more hybrid approaches

Machine Translation today

Where are we now? Huge potential/need due to the internet, globalization and international politics. Quick development time due to Statistical Machine Translation (SMT), the availability of parallel data and computers. Translation is reasonable for language pairs with a large amount of resources. Start to include more “minor” languages.

Rule-based MT The Vauquois Triangle

Statistical MT The Rosetta Stone

What is MT good for? Rough translation: web data Computer-aided human translation Translation for limited domain Cross-lingual IR Machines beat humans at: Speed: much faster than humans Memory: can easily memorize millions of word/phrase translations. Manpower: machines are much cheaper than humans Fast learner: it takes minutes or hours to build a new system. Never complain, never get tired, …

Interest in Machine Translation (1) Commercial interest: U.S. has invested in machine translation (MT) for intelligence purposes MT is popular on the web—it is the most used of Google’s special features EU spends more than $1 billion on translation costs each year. (Semi-)automated translation could lead to huge savings

Interest in Machine Translation (2) Academic interest: One of the most challenging problems in NLP research Requires knowledge from many NLP sub-areas, e.g., lexical semantics, syntactic parsing, morphological analysis, statistical modeling,… Being able to establish links between two languages allows for transferring resources from one language to another

Goals & Uses Translating Summarizing Communicating Pre-editing Grammar analysis Analyzing text Understanding text and images

Do we really need Machine Translation?

Languages on the Internet

Languages on Twitter

Languages in Los Angeles

Why do we need MT?

Why do we need MT?

Why do we need MT?

Why is MT hard?

Why is MT hard?

Why is MT hard?

Why is MT hard? For example… Commercial system “Language Weaver” created in 2002 Uses statistical techniques from cryptography and machine to acquire statistical models from human translations Sold in 2010 for $42.5 million

“Language Weaver” SMT System – Comparison: Arabic to English v.3.0 - February 2005 v.2.0 – October 2003 v.2.4 – October 2004