GOOD, MULTILINGUAL interpretation, translation, resources What can we do for the OG-08? Christian BOITET GETA, CLIPS, IMAG-campus UJF & CNRS, Grenoble,

Presentation transcript:

GOOD, MULTILINGUAL interpretation, translation, resources
What can we do for the OG-08?
Christian BOITET
GETA, CLIPS, IMAG-campus UJF & CNRS, Grenoble, France

28/10/03, G. Fafiotte & Ch. Boitet, slide 2
2 applications, 1 common resource
- Communication in 2 languages
- Multilingual dissemination of information
- Building multilingual lexical resources

Slide 3
Fully automatic HQ all-purpose Speech Translation
…is a dream! (as underlined by Prof. Feng ZhiWei)
- even more than FAHQMT of any text
- up to 30% "clarification" turns (Oviatt & Cohen 91)
For QUALITY bilingual communication
2 complementary feature sets, to be introduced in order:
- Interpretation Aids
  - Immediately applicable (as Translation Aids!)
  - But need humans, even if through the net
- Human control/participation
  - Feedback & control (reformulate if SR or reverse MT is bad…)
  - Correction & interactive disambiguation

Slide 4
1. Communication in 2 languages
- All-purpose multilingual communication
  - human interpreters (volunteer students?)
  - all languages!
- System integrating machine aids
  - dictionaries, SR for better understanding
  - management (human resources, rendez-vous…)
  - up to automatic speech translation
    - task-related: 100% automatic, WITH feedback, control
    - all-purpose: with some interactive disambiguation

Slide 5
Scenario 1
- 2 persons meet…
  - ex: French visitor and Chinese soccer fan
  - manage to say something in English, but
- They connect to the IBI service
  - IBI = intermittent Beijing interpretation
  - using 1 or 2 PDAs (or cell phone, or HH-PC…)
- If Chinese is not one of the 2 languages
  - simply use 2 human interpreters (L1-C-C-L2)
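The interpreter rule in Scenario 1 (one interpreter when Chinese is one of the two languages, otherwise two interpreters chained through Chinese, L1-C-C-L2) can be sketched as follows. This is only an illustration under stated assumptions: the function name `interpreter_chain` and the language codes are hypothetical and are not part of the IBI service described on the slide.

```python
from typing import List, Tuple


def interpreter_chain(lang1: str, lang2: str, pivot: str = "zh") -> List[Tuple[str, str]]:
    """Return the interpreting hops needed between two speakers.

    Each hop is a (from_language, to_language) pair handled by one human
    interpreter. The pivot defaults to Chinese, as in the slide's L1-C-C-L2.
    """
    if lang1 == lang2:
        return []                      # same language: no interpreter needed
    if pivot in (lang1, lang2):
        return [(lang1, lang2)]        # Chinese is involved: one interpreter
    # Neither speaker uses Chinese: chain two interpreters through the pivot.
    return [(lang1, pivot), (pivot, lang2)]


if __name__ == "__main__":
    print(interpreter_chain("fr", "zh"))  # French visitor, Chinese fan
    print(interpreter_chain("fr", "ja"))  # neither speaks Chinese
```

Using Chinese as the pivot keeps the pool of interpreters to pairs of the form (some language, Chinese), which is presumably the easiest pool to recruit locally.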

Slide 7
Scenario 2
- 2 persons meet…
  - ex: American visitor and Chinese soccer fan
  - use ST & other machine aids to converse
  - connect to IBI when ST breaks down :-(
- interpreter looks at dialog log, then helps
  - using on-the-fly built dialog dictionary
  - and may leave them again if s/he is needed by others and they say OK
- Then they continue on their own.

Slide 9
Scenario 3
- Afterwards, from hotel & from home…
  - they have a distant conversation
  - using their PC + headset [+ videocam]
  - using a multimodal, multilingual chat
- no human interpreter, but user involvement
  - control, feedback, interactive disambiguation

Slide 10
Scenarios for ERiM-Interp (diagram 1)
ERIM-i platform: distant ‘intermittent’ interpreting
Diagram labels:
- Communication and Collection servers
- Speaker 1, Speaker 2
- Professional, or casual situation (recorded if agreement)
- On-line oral translation (recorded if agreement)
- professional interpreter / volunteer interpreter

Slide 11
Scenarios for ERiM-Interp (diagram 7)
ERIM-i platform: distant ‘intermittent’ interpreting
Diagram labels:
- Communication and Collection servers
- Speaker 1, Speaker 2
- Professional, or casual situation (recorded if agreement)
- On-line oral translation (recorded if agreement)
- professional interpreter / volunteer interpreter
NEW SCENARIOS:
- a foreign researcher or University colleague, available for intermittent volunteer interpreting, from his/her usual workstation (office)
- a professional interpreter working from home or in an office
- a multilingual multimodal ‘hot-webline’ for medical, technical, legal… assistance
- a ‘junior’ interpreter (volunteer)
- … cf. user evaluation / demand

Slide 12
2. Dissemination of information
- Supposing the situation is:
  - From Chinese & English to other languages
  - Written source, general + many domains
  - High quality needed
- Then:
  - FAHQMT is still a dream!
  - Only practical way = disambiguation in source
    - preedition (annotation)
    - interactive disambiguation

Slide 13
How?
- 2 possible strategies
  - Cooperate with MT vendors (not 1!)
  - Build a system from research efforts
- Technical aspects
  - "Split" system & introduce interactive disambiguation after analysis
  - "UNL++" as practical linguistico-semantic pivot
  - More sophisticated interlinguas may be better, but are usable only by very, very few developers (if any)

Slide 14
3. Lexical resources
- Put what you have on the Papillon site
  - in the "contributed dictionaries" format
  - basically, it is ANY XML form
- Transform into "Papillon format" (refine the ore)
  - DiCo-NL for monolingual entries
  - Add info to lexies (word senses) & refine them
  - Connect lexies (through "axies")
- Generate application-oriented lexicons
  - on the fly (real-time server mode) or off-line
  - no IPR problems here (Linux way, GPL)

Cooperative Building of a Large, Rich Multilingual Lexical Database of Monolingual Dictionaries & Interlingual Links
to generate open source dictionaries on the Web
Christian Boitet & Emmanuel Planas
GETA, CLIPS, IMAG, CNRS, INPG & UJF, Grenoble, France
{Christian.Boitet,
PAPILLON:

Slide 16
Internal Architecture of the Database
(architecture derived from Dr. Gilles Sérasset’s Ph.D. thesis)
Diagram contents:
- French Dictionary: vocable "carte" (n.f.), with lexie "carte à jouer" and lexie "carte géographique"
- Interlingual Dictionary:
  - Acception 343, UNL: card(icl>play)
  - Acception 345, UNL: map(fld>geography)
- Japanese Dictionary: 地図, カード

Slide 17
PAPILLON diagram
- Interlingual links motivated by translations = "AXIEs"
- Possibility to link 1 lexie to >1 acception
- Links to other representations: AXIE --1----n--> UW
Diagram contents:
- French DiCo: vocable "carte" (n.f.); lexie carte.1 "carte à jouer"; lexie carte.2 "carte géographique"
- Japanese DiCo: 地図, カード
- a Thai DiCo
- English DiCo: vocable "card" (N); lexie card.1 "playing card"; lexie card.2 "money card"; vocable = lexie "map"
- Interlingual links:
  - Acception 343, UNL: card(icl>play), card(icl>thing)…
  - Acception 345, UNL: map(fld>geography)
  - Acception 1002, UNL: card(fld>money)
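The lexie/axie architecture above, and the idea of generating application-oriented lexicons from it, can be illustrated with a small sketch. It is not the Papillon implementation or its XML format: the class names `Lexie` and `Axie`, the function `bilingual_lexicon`, and the language codes are hypothetical. The entries come from the slide, but which lexie attaches to which acception is an assumption inferred from the glosses, since the diagram's link lines are not in the transcript.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass(frozen=True)
class Lexie:
    """One word sense in a monolingual dictionary."""
    lang: str      # language code (hypothetical convention)
    vocable: str   # headword the lexie belongs to
    lexie_id: str  # e.g. "carte.1"
    gloss: str


@dataclass
class Axie:
    """Interlingual acception linking lexies of several languages."""
    axie_id: int
    unl_uws: List[str]  # one axie may point to several UNL UWs
    lexies: List[Lexie] = field(default_factory=list)


def bilingual_lexicon(axies: List[Axie], src: str, tgt: str) -> Dict[str, List[str]]:
    """Map each source lexie id to the target vocables sharing an axie with it."""
    result: Dict[str, List[str]] = {}
    for axie in axies:
        targets = [lx.vocable for lx in axie.lexies if lx.lang == tgt]
        for lx in axie.lexies:
            if lx.lang == src:
                # A lexie may appear in more than one axie, so accumulate.
                result.setdefault(lx.lexie_id, []).extend(targets)
    return result


# Entries from the slide; the lexie-to-acception links are assumed from glosses.
carte1 = Lexie("fra", "carte", "carte.1", "carte à jouer")
carte2 = Lexie("fra", "carte", "carte.2", "carte géographique")
kaado = Lexie("jpn", "カード", "カード.1", "card")   # lexie id is hypothetical
chizu = Lexie("jpn", "地図", "地図.1", "map")        # lexie id is hypothetical
card1 = Lexie("eng", "card", "card.1", "playing card")
card2 = Lexie("eng", "card", "card.2", "money card")
map_en = Lexie("eng", "map", "map", "map")

AXIES = [
    Axie(343, ["card(icl>play)", "card(icl>thing)"], [carte1, kaado, card1]),
    Axie(345, ["map(fld>geography)"], [carte2, chizu, map_en]),
    Axie(1002, ["card(fld>money)"], [card2]),
]

if __name__ == "__main__":
    print(bilingual_lexicon(AXIES, "fra", "jpn"))
    print(bilingual_lexicon(AXIES, "eng", "fra"))
```

Because translations are derived through the shared axie rather than stored pairwise, adding a new language only requires linking its lexies to existing acceptions. A sense with no counterpart yet, such as card.2 "money card" for French in this toy data, simply yields an empty list.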