© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 1 Proposals for solving some problems in UNL encoding International Conference on.

Slides:



Advertisements
Similar presentations
From the UNL hypergraph to GETA's multilevel tree Etienne BLANC GETA, CLIPS-IMAG BP 53, F Grenoble cedex 09
Advertisements

CSCI N241: Fundamentals of Web Design Copyright ©2004 Department of Computer & Information Science Introducing XHTML: Module B: HTML to XHTML.
1 STRUCTURAL AND LEXICAL TRANSFER from a UNL GRAPH to a NATURAL LANGUAGE DEPENDENCY TREE Etienne BLANC, Gilles SERASSET, WangJu TSAI GETA, CLIPS-IMAG.
The Universal Networking Language UNL Foundation United Nations University Institute of Advanced Studies United Networking Language ® UNU/IAS.
6-1 Chapter Goals Determine whether a problem is suitable for a computer solution Describe the computer problem-solving process and relate it to Polya’s.
Problem Solving and Algorithm Design
Chapter 6 Problem Solving and Algorithm Design. 6-2 Chapter Goals Determine whether a problem is suitable for a computer solution Describe the computer.
Free construction of a free dictionary of synonyms using computer science Viggo Kann and Magnus Rosell KTH, Stockholm Talk given by Viggo at Amherst College.
The KB on its way to Web 2.0 Lower the barrier for users to remix the output of services. Theo van Veen, ELAG 2006, April 26.
References Kempen, Gerard & Harbusch, Karin (2002). Performance Grammar: A declarative definition. In: Nijholt, Anton, Theune, Mariët & Hondorp, Hendri.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
XSL Unit 6 November 2. XSL –eXtensible Stylesheet Language –Basically a stylesheet for XML documents XSL has three parts: –XSLT –XPath –XSL-FO.
Chapter 6 Problem Solving and Algorithm Design Nell Dale John Lewis.
XML on Semantic Web. Outline The Semantic Web Ontology XML Probabilistic DTD References.
Evaluating an MT French / English System Widad Mustafa El Hadi Ismaïl Timimi Université de Lille III Marianne Dabbadie LexiQuest - Paris.
XML October 24, Unit 6. What is XML? Stands for eXtensible Markup Language It is a markup language, like HTML But, –XML is designed to markup data –HTML.
Introducing HTML & XHTML:. Goals  Understand hyperlinking  Understand how tags are formed and used.  Understand HTML as a markup language  Understand.
MACHINE TRANSLATION A precious key to communicate beyond linguistic barriers 1.
Requirements for DSML 2.0. Summary RFC 2251 fidelity Represent existing directory protocols with new transport syntax Backwards compatibility with DSML.
CLARIN tools for workflows Overview. Objective of this document  Determine which are the responsibilities of the different components of CLARIN workflows.
1/24 17/7/2002 (Papillon-02) Translation in Papillon (Ch. Boitet) The translation of examples, citations, definitions and glosses in the Papillon project.
GOOD, MULTILINGUAL interpretation, translation, resources What can we do for the OG-08? Christian BOITET GETA, CLIPS, IMAG-campus UJF & CNRS, Grenoble,
Universal Networking Language (UNL) by Pantha Kanti Nath (05IT6021) Under the Guidance of Prof. Debasis Samanta School of Information Technology Indian.
Artificial Intelligence for Universal Networking Language (UNL) (Perspective Bengali Language) By Deen Islam Muslim ID: Ariful Hoque Tuhin ID:
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Towards Translating between XML and WSML based on mappings between.
Logics for Data and Knowledge Representation Semantic Matching.
SupervisorStudent Dr. Atilla ELÇİHussam Hussein ABUAZAB Assoc. Prof Fall 2007 Ontology-based Support for Human Disease Study CMPE 583 WEB SEMANTICS:
Phase 2: Systems Analysis
NLP superficial and lexic level1 Superficial & Lexical level 1 Superficial level What is a word Lexical level Lexicons How to acquire lexical information.
Adding Whole Numbers © Math As A Second Language All Rights Reserved next #5 Taking the Fear out of Math
Chapter 1 Understanding the Web Design Environment Principles of Web Design, 4 th Edition.
1 XML Data Management Course Outline and Organisation Werner Nutt.
What is XML (Extensible Markup Language)? XML is basically a better comma delimited file. Example: Your client asks you to write a new reporting system.
Session IV Chapter 9 – XML Schemas
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
November 2003CSA4050: Semantics I1 CSA4050: Advanced Topics in NLP Semantics I What is semantics for? Role of FOL Montague Approach.
Chapter 1, Part II: Predicate Logic With Question/Answer Animations.
Intro to XML Dr. Lam TECM5191. Why XML? Text CHRISLAM138 to
A roadmap for MT : four « keys » to handle more languages, for all kinds of tasks, while making it possible to improve quality (on demand) International.
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
17 Apr 2002 XML Syntax: Documents Andy Clark. Basic Document Structure Element tags – Elements have associated attributes Text content Miscellaneous –
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Comparability of language data and analysis Using an ontology for linguistics Scott Farrar, U.
Using Surface Syntactic Parser & Deviation from Randomness Jean-Pierre Chevallet IPAL I2R Gilles Sérasset CLIPS IMAG.
XML Extras Outline 1 - XML in 10 Points 2 - XML Family of Technologies 3 - XML is Modular 4 - RDF and Semantic Web 5- XML Example: UK GovTalk Group’s Schema.
© Copyright 2013 STI INNSBRUCK “How to put an annotation in HTML?” Ioannis Stavrakantonakis.
Introduction to PowerPoint The Basics of Microsoft Word 2007 Excel.
Andy Dawson– University College London 1 EABH SUMMER SCHOOL Web Page Construction Andy Dawson Department of Information Studies, UCL.
1 Indexing The syntax for creating a index is: CREATE [UNIQUE] INDEX index_name ON table_name (column1, column2,... column_n) [ COMPUTE STATISTICS ]; Why.
How To Do NPV’s ©2007 Dr. B. C. Paul Note – The principles covered in these slides were developed by people other than the author, but are generally recognized.
Semantic Object Language By: Jason Wells Semantic Research Inc. /sol_whitepaper.pdf Presented By:
(c) University of Washington02-1 CSC 143 Java Object and Class Relationships: Interfaces Reading: Ch. 9 (on Java interfaces)
XP Tutorial 9New Perspectives on HTML and XHTML, Comprehensive 1 Working with XHTML Creating a Well-Formed Valid Document Tutorial 9.
11/23/00UNU/IAS/UNL Centre1 The Universal Networking Language United Nations University Institute of Advanced Studies United Networking Language ® UNU/IAS.
HTML Review * is used as a reference for most of the notes in this powerpoint.
UNL Document Summarization Virach Sornlertlamvanich, Tanapong Potipiti and Thatsanee Charoenporn Information Research and Development Division National.
Introduction to modeling
XML Extensible Markup Language
XML Schema – XSLT Week 8 Web site:
Of 24 lecture 11: ontology – mediation, merging & aligning.
Our Co-Teaching Experiences Hamish Rolls, Jo Kyeongseon Hogye Middle School.
The UNL Program A program created by the United Nations University / Institute of Advanced Studies Now carried out by the UNDL Foundation
1 Representing and Reasoning on XML Documents: A Description Logic Approach D. Calvanese, G. D. Giacomo, M. Lenzerini Presented by Daisy Yutao Guo University.
Standardization of Lexicon
Text Analytics in ITS 2.0: Annotation of Named Entities
Parsing & Context-Free Grammars Hal Perkins Autumn 2011
Implementing Language Extensions with Model Transformations
CSc4730/6730 Scientific Visualization
La télé LEARNING OBJECTIVE: to talk about tv programmes and express whether you like them or not SUCCESS CRITERIA: Grade D+ detailed description of tv.
Extracting Recipes from Chemical Academic Papers
Implementing Language Extensions with Model Transformations
Presentation transcript:

© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 1 Proposals for solving some problems in UNL encoding International Conference on Universal Knowledge and Language (ICUKL2002), Goa, November 2002 Christian BOITET GETA, CLIPS, IMAG, Grenoble

© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 2 Which problems? What Igor said "remains to be done" 1.representation of multi-word concepts (« long UWs »); 2.elliptical expressions; 3.treatment of arguments both in the UW dictionary and in the UNL expressions and 1.conventions about attributes 2.XML formats for UNL documents

© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 3 Representation of multi-word concepts (long UWs) — 1 Problematic examples of "UNKNOWN LONG UWs" "Institute of Advanced studies (UNU/IAS)"(icl>…) "East-Asia cooperation office" East-Asia cooperation office east-asia cooperation office(icl>…) "Tokyo University" "University of Kyoto" "World Bank(icl>…)"

© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 4 Representation of long UWs — 2 What are the problems? 1.No hope of including all these long UWs in our UNL-LLL dictionaries  because of potentially immense, unbounded number of such UWs  Maybe never more than 5%, 10% of them in open domains 2.Necessity to include an analyzer of English compounds in order to translate "unknown long UWs" piece by piece.  but such compounds are extremely ambiguous

© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 5 Let us think a bit more  Proper nouns CAN be decomposed.  This is NOT to say that their translation is always compositional. Compositional: World Bank ==> Banque du Mondefalse Idiomatic: World Bank ==> Banque mondialecorrect  So that we should have a solution allowing BOTH Compositional deconversion if the long UW is unknown Idiomatic deconversion after it put in the UNL-LLL dictionary

© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 6 Proposal of a solution Origin  Proposed by H.Uchida at a meeting in Tokyo (1999?)  Not yet included but still needed and still the best Principle  Headword encodes a UNL representation of the compound Possible syntax entity) "(mod(bank(icl> entity) … or a better one!

© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 7 How to deconvert  Case 1: on) is not in the UNL-FR dictionary ==> French deconverter "unwraps" into a scope of the UNL-graph

© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 8 Another example Compositional deconversion  Université de Tokyo  University of Tokyo  Universität von Tokyo  Tokyo no daigaku (or Tokyo ni daigaku) Idiomatic deconversion  Université de Tokyo (or Todai!)  Tokyo University / University of Tokyo  Universität Tokyo  Tokyo daigaku / Todai

© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 9 Elliptical expressions Example Do you prefer the first or the second solution? I prefer the first.  Je préfère le premier?  Je préfère la première? ==> A bad deconversion will be very misleading. Possible solution Encode the elided element and on That is equivalent to "preedit" the input text  I prefer the first solution. …and in the spirit of the new idea by H.Uchida of preediting for semantic relations

© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 10 Treatment of arguments in the UW dictionary in the UNL expressions See talk by I.Bogulslavskij The solution proposed entails 1.a very small change in the UNL syntax  Allow on arcs hence also on restrictions by 2.a discipline in the UW creation  all arguments should appear as restrictions

© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 11 "Argument-full" + "readable" UW Argument-full Readable look(icl>do, for something Even more readable look for something

© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 12 Continuing that list… look for something look at something or look at something look like something look like something might also cover "look as" in "he looks as a good man" or look as looks as if… for something

© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 13 Attributes The problem ==> un lion, les lions, lions? We don't know whether definiteness has been computed ==> it ==> use or not ==> it is UNKNOWN ==> compute default Solution: for every attribute XXXX, for +XXXX (1 or for -XXXX (0 or false) nothingfor XXXX unknown (? or undefined)

© Ch. Boitet & Wang-Ju Tsai (GETA, CLIPS) ICUKL-2002, Goa, 25-29/11/02 14 XML formats for UNL documents A minimal UNL-xml format strictly equivalent of UNL-htmlr –proposed & used by Tsai W.J. for the SWIIVRE-UNL web site & his Ph.D. Methodology for defining and using other, more detailed UNL-xml-xyz formats: –xyz is an application (e.g. a graphical editor, or statistics- gathering tool, etc.), –Automatic parsing of the basic UNL-xml format introduces new tags, –An object document model (DOM) suitable for application xyz can then be defined and used.