ISO TC37/SC4 N429 ISO/TC37/SC4/TDG6 Language Resource Ontologies 2007-11-11/12, Busan 2007-11-11/12, Busan HASIDA Koiti HASIDA Koiti

Slides:



Advertisements
Similar presentations
Registry breakout group DC-8, National Library of Canada 5 October 2000.
Advertisements

Metadata vocabularies and ontologies Dr. Manjula Patel Technical Research and Development
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
RDF Schemata (with apologies to the W3C, the plural is not ‘schemas’) CSCI 7818 – Web Technologies 14 November 2001 Van Lepthien.
The Semantic Web – WEEK 4: RDF
CS570 Artificial Intelligence Semantic Web & Ontology 2
ANSI TAG 37 Committee F43 Language Services and Products Interagency Language Roundtable September 30, 2011 Sue Ellen Wright ISO TC 37, Terminology and.
MLIF: A Metamodel to Represent and Exchange Multilingual Textual Information ISO TC37 SC4 WG Samuel Cruz-Lara, Gil Francopoulo, Laurent Romary,
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
OWL-AA: Enriching OWL with Instance Recognition Semantics for Automated Semantic Annotation 2006 Spring Research Conference Yihong Ding.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
From SHIQ and RDF to OWL: The Making of a Web Ontology Language
1 Technologies and Modelling Frameworks XML ontology RDF taxonomy OWL thesaurus Semantic Web.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
RDF: Concepts and Abstract Syntax W3C Recommendation 10 February Michael Felderer Digital Enterprise.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
/21LIRICS IAG Meeting Barcelona LIRICS IAG Meeting /21 Universitat Pompeu Fabra Barcelona Introduction Gerhard Budin.
CLARIN web services and workflow Marc Kemps-Snijders.
Practical RDF Chapter 1. RDF: An Introduction
Standards for language resources the ISO/TC 37(/SC 4) perspective
►Thierry Declerck (DFKI GmbH, LT Lab. Saarbrücken, Germany) Standards and Infrastructures for Language Resources.
LIRICS mid-term review 1 LIRICS WP3: Morpho-syntactic and syntactic annotations Thierry Declerck DFKI-LT - Saarbrücken 23rd May 2006.
ISO Project Semantic Annotation Framework, Part 2: Dialogue Acts Editorial Group first meeting Pisa, September 2008 TC 37/SC 4/WG 2 Kiyong.
Working group on multimodal meaning representation Dagstuhl workshop, Oct
Experiments with ODD outside the TEI framework Laurent Romary & Piotr Banski The ISO-TEI connection.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
OWL Capturing Semantic Information using a Standard Web Ontology Language Aditya Kalyanpur Jennifer Jay Banerjee James Hendler Presented By Rami Al-Ghanmi.
Logics for Data and Knowledge Representation
The 2010 Secretary’s Annual Report on ISO/TC37/SC4 “Language resource management” /
LIRICS Mid-term Review 1 LIRICS WP2 – NLP Lexica Monica Monachini CNR-ILC - Pisa 23rd May 2006.
ISO TC 37 / SC4 Language Resources An overview (Ammended 2-5 février 2002) Laurent Romary.
Towards multimodal meaning representation Harry Bunt & Laurent Romary LREC Workshop on standards for language resources Las Palmas, May 2002.
Resource Description Framework (RDF) Course: Electronic Document Team member: Ding Feng Ding Wei Wang Ling Date:
Nicoletta Calzolari Berlin, October PWI ISO SC 4/WG 4 Lexicon-Ontology relations PWI Nicoletta Calzolari Exploratory meeting.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Comparability of language data and analysis Using an ontology for linguistics Scott Farrar, U.
Technology – Broad View Aspects that play a role when integrating archives leave the details of some core topics to the 2. day Bernhard Neumair:Base Technologies.
Chapter 7: Resource Description Framework (RDF) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley,
SynAF:Provo ISO Meeting Thierry Declerck, DFKI GmbH.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
It’s all semantics! The premises and promises of the semantic web. Tony Ross Centre for Digital Library Research, University of Strathclyde
Evidence from Metadata INST 734 Doug Oard Module 8.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
ISO/TC37/SC4/TDG6 Language Resource Ontologies , Marrakech HASIDA Koiti CfSR, AIST, Japan.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
ISO/TC37/SC4/TDG6 Language Resource Ontologies , Pisa HASIDA Koiti CfSR, AIST, Japan.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
TMF - Terminological Markup Framework Laurent Romary Laboratoire LORIA (CNRS, INRIA, Universités de Nancy) ISO meeting London, 14 August 2000.
Towards Linguistically Grounded Ontologies Paul Buitelaar, Philipp Cimiano, Peter Haase, and Michael Sintek Proceedings of the 6 th European Semantic Web.
ISO/TC37/SC4/N377 secretary report
Metadata, Resources, and the RDF 김민수 Chapter 1. Creating the Sementic Web with RDF2 Overview Knowledge Representation Library Metadata RDFRDF.
Towards a roadmap for standardization in language technology Laurent Romary & Nancy Ide Loria-INRIA — Vassar College.
Extending the MDR for Semantic Web November 20, 2008 SC32/WG32 Interim Meeting Vilamoura, Portugal - Procedure for the Specification of Web Ontology -
ISO/IEC JTC 1/SC 32 Plenary and WGs Meetings Jeju, Korea, June 25, 2009 Jeong-Dong Kim, Doo-Kwon Baik, Dongwon Jeong {kjd4u,
ISO/TC37/SC4 Draft Resolution
SemAF – Basics: Semantic annotation framework Harry Bunt Tilburg University isa -6 Joint ISO - ACL/SIGSEM workshop Oxford, January 2011 TC 37/SC.
Formats, interoperability and standards Marc Kemps-Snijders.
ISO TC37/SC4 N435 Nov 12, 2007 Presented by Miran Choi/ETRI Written by Jae Sung Lee/Chungbuk National Univ.
Extending the Metadata Registry for Semantic Web - Enforcing the MDR for supporting ontology concept - May 28, 2008 ISO/IEC JTC 1/SC 32 WG 2 Meeting Sydney,
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
An Introduction and UML Profile for the Web Ontology Language (OWL) October 23, 2002 Elisa F. KendallMark E. Dutra CEO & FounderChief Architect
Linked Data & Semantic Web Technology The Semantic Web Part 4. Resource Description Framework (1) Dr. Myungjin Lee.
Implementing the TEI Feature System Declaration Gary F. Simons SIL International ___________________________ TEI Members Meeting 11 Oct 2002, Chicago.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
The Semantic Web By: Maulik Parikh.
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Zachary Cleaver Semantic Web.
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Resource Description Framework (RDF)
Presentation transcript:

ISO TC37/SC4 N429 ISO/TC37/SC4/TDG6 Language Resource Ontologies /12, Busan /12, Busan HASIDA Koiti HASIDA Koiti ITRI, AIST, Japan ITRI, AIST, Japan

Agenda 1st Day (Sunday, November 11, 2007) 1st Day (Sunday, November 11, 2007) 16:30/16:45 Introduction to ISO/TC37/SC4/TDG6 (Koiti Hasida) 16:30/16:45 Introduction to ISO/TC37/SC4/TDG6 (Koiti Hasida) 16:45/18:00 RDF Issues (Koiti Hasida, etc.) 16:45/18:00 RDF Issues (Koiti Hasida, etc.) 18:00/18:15 break 18:00/18:15 break 18:15/20:00 Ontology Issues (Paul Buitelaar, Aldo Gangemi, etc.) 18:15/20:00 Ontology Issues (Paul Buitelaar, Aldo Gangemi, etc.) (Dinner will be provided for the participants) (Dinner will be provided for the participants) 2nd Day (Monday, November 12, 2007) 2nd Day (Monday, November 12, 2007) 15:00/16:30 Ontology of Taxonomies (Bodil Nistrup Madsen) 15:00/16:30 Ontology of Taxonomies (Bodil Nistrup Madsen) 16:30/16:40 break 16:30/16:40 break 16:40/17:20 Ontology Work Guide for Web Content (Miran Choi) 16:40/17:20 Ontology Work Guide for Web Content (Miran Choi) 17:20/18:00 Wrap-Up 17:20/18:00 Wrap-Up 2

ISO/TC37 Terminology and Other Language and Content Resources SC1: Principles and Methods SC1: Principles and Methods SC2: Terminography and Lexicography SC2: Terminography and Lexicography SC3: Computer Applications for Terminology SC3: Computer Applications for Terminology ISO12620: Data Categories ISO12620: Data Categories SC4: Language Resources Management SC4: Language Resources Management Technical Committee Intl. Standardization Org. Sub-Committee 3

ISO/TC37/SC4 WG1 Basic Description and Mechanisms for Language Resources (Laurent Romary) WG1 Basic Description and Mechanisms for Language Resources (Laurent Romary) ISO :2006 Feature Structure - Part 1:Feature System Representation (FSR) Kiyong Lee ISO :2006 Feature Structure - Part 1:Feature System Representation (FSR) Kiyong Lee ISO Feature Structure-Part 2: Feature System Declaration (FSD) Kiyong Lee ISO Feature Structure-Part 2: Feature System Declaration (FSD) Kiyong Lee ISO Linguistic Annotation Framework (LAF) Nancy Ide ISO Linguistic Annotation Framework (LAF) Nancy Ide ISO Citation of Electronic Resource (CitER) Daan Broeder ISO Citation of Electronic Resource (CitER) Daan Broeder WG2 Representation Schemes (Kiyong Lee) WG2 Representation Schemes (Kiyong Lee) ISO Word Segmentation - Part1: Basic concepts and General Principle (WordSeg-1) Maosong Sun ISO Word Segmentation - Part1: Basic concepts and General Principle (WordSeg-1) Maosong Sun ISO Word Segmentation - Part 2: Chinese, Japanese and Korean (WordSeg-2) ISO Word Segmentation - Part 2: Chinese, Japanese and Korean (WordSeg-2) ISO Semantic Annotation Framework - Part1: Time and Events (SemAF/Time) ISO Semantic Annotation Framework - Part1: Time and Events (SemAF/Time) ISO Syntactic Annotation Framework (SynAF) Thierry Declerck ISO Syntactic Annotation Framework (SynAF) Thierry Declerck ISO Morpho-Syntactic Annotation Framework (MAF) ISO Morpho-Syntactic Annotation Framework (MAF) WG3 Multilingual Information Representation WG3 Multilingual Information Representation ISO Multi-lingual information framework (MLIF) ISO Multi-lingual information framework (MLIF) WG4 Lexical Resources/database (Nicoretta Calzolari) WG4 Lexical Resources/database (Nicoretta Calzolari) ISO Lexical Markup Framework ISO Lexical Markup Framework TDG1 Data Categories for Metadata (Peter Wittenburg) TDG1 Data Categories for Metadata (Peter Wittenburg) TDG2 Data Categories for Morphsyntax (Gil Francopoulo) TDG2 Data Categories for Morphsyntax (Gil Francopoulo) TDG3 Data Categories for Semantic Content Representation (Harry Bunt) TDG3 Data Categories for Semantic Content Representation (Harry Bunt) TDG4 Syntactic Data Categories (Thierry Declerck) TDG4 Syntactic Data Categories (Thierry Declerck) TDG5 Machine Readable Dictionaries (Monte George) TDG5 Machine Readable Dictionaries (Monte George) TDG6 Language Resource Ontologies (Koiti Hasida) TDG6 Language Resource Ontologies (Koiti Hasida) 4 Thematic Domain Grp.

TDG6 Issues ontologization ontologization DC, LAF, LMF, FS, MAF, SemAF, SynAF, TDG3, etc. DC, LAF, LMF, FS, MAF, SemAF, SynAF, TDG3, etc. Cf. the Pisa group’s work on LMF Cf. the Pisa group’s work on LMF extension of RDF (and ontology framework) to more straightforwardly address linguistic information extension of RDF (and ontology framework) to more straightforwardly address linguistic information extended RDF instead of XML extended RDF instead of XML nodes embedding nodes … rdf:Container? nodes embedding nodes … rdf:Container? publish TRs publish TRs launch ISs launch ISs 5

Ontologization ontology-based reformulation ontology-based reformulation Most current standards are based on XML and lack formal metamodeling tools. Most current standards are based on XML and lack formal metamodeling tools. not XML but RDF as base description and modeling tool not XML but RDF as base description and modeling tool Semantic interpretation is standardized not for XML but for RDF. Semantic interpretation is standardized not for XML but for RDF. something like N3 for textual representation something like N3 for textual representation ontology as schema ontology as schema not DTD, XML Schema, RELAXNG, etc. not DTD, XML Schema, RELAXNG, etc. 6

Motivations of Ontologization Lack of formal tool by which to write schemas fully addressing the specifications. Lack of formal tool by which to write schemas fully addressing the specifications. DCR model lacks descriptive power. DCR model lacks descriptive power. 7

Semantic Mess of XML Semantic interpretation of XML is not standardized but rather arbitrary. Semantic interpretation of XML is not standardized but rather arbitrary. Many inconsistent `standards’ on overlapping issues. Many inconsistent `standards’ on overlapping issues. Huge standards containing many different semantic interpretation manners. Huge standards containing many different semantic interpretation manners. e.g., MPEG-7 > 2000 pages e.g., MPEG-7 > 2000 pages 8

RDF Resource Description Framework Resource Description Framework W3C recommendation W3C recommendation basis of ontology standards such as RDFS, OWL, and SKOS. basis of ontology standards such as RDFS, OWL, and SKOS. graph model graph model textual representation textual representation XML XML N3 N3 9

RDF Graph m:homePagem:homePage m:attendingm:attending m:givenNamem:givenName FredFred m:has m:has 10

RDF in Text XML N3 <rdf:RDF xmlns:rdf=" xmlns:m=" xmlns=" xmlns:p=" Fred <rdf:RDF xmlns:rdf=" xmlns:m=" xmlns=" xmlns:p=" m:. m:homePage. p:GivenName "Fred"; p:has ; m:. m:homePage. p:GivenName "Fred"; p:has ; m:attending. 11

<rdf:RDF xmlns:rdf=" xmlns:m=" xmlns=" xmlns:p=" Fred <rdf:RDF xmlns:rdf=" xmlns:m=" xmlns=" xmlns:p=" Fred m:homePagem:homePage m:attendingm:attending m:givenNamem:givenName FredFred m:has m:has 12

m:homePagem:homePage m:attendingm:attending m:givenNamem:givenName FredFred m:. m:homePage. p:GivenName "Fred"; p:has ; m:. m:homePage. p:GivenName "Fred"; p:has ; m:attending. 13

ISO 24610: Feature Structure typed feature structure as in HPSG, etc. typed feature structure as in HPSG, etc. ISO : Feature Structure Representation ISO : Feature Structure Representation cf. ISO : Feature System Declaration cf. ISO : Feature System Declaration graph model graph model AVM (attribute-value matrix) AVM (attribute-value matrix) textual encoding textual encoding XML XML 14

FS Graph determinerdeterminer POSPOS SPECIFIERSPECIFIER ORTHORTHlala HEADHEAD AGRAGR AGRAGR nounnoun POSPOS ORTHORTH pommepomme singularsingular NUMBERNUMBER 15

FS in AVM SPECIFIER HEAD POSdeterminer ORTH`la’ AGR [1] [NUMBER singular] POSnoun ORTH`pomme’ AGR [1] 16

FS in XML la pomme la pomme 17

FS in RDF (i.e., Graph) determinerdeterminer POSPOS SPECIFIERSPECIFIER ORTHORTHlala HEADHEAD AGRAGR AGRAGR nounnoun POSPOS ORTHORTHpommepomme singularsingular NUMBERNUMBER 18

FS in RDF/N3 [ fs:specifier [ fs:pos fs:determiner; fs:orth "la"; fs:agr ] fs:head [ fs:pos fs:noun; fs:orth "pomme"; fs:agr ] ] fs:number fs:singular. [ fs:specifier [ fs:pos fs:determiner; fs:orth "la"; fs:agr ] fs:head [ fs:pos fs:noun; fs:orth "pomme"; fs:agr ] ] fs:number fs:singular. 19

Ontologies Subsume Feature Systems Features are partial functions, whereas RDF properties are mappings in general (possibly partial functions). Features are partial functions, whereas RDF properties are mappings in general (possibly partial functions). Usual feature systems have no taxonomy of features, whereas usual ontologies have taxonomies of properties (e.g., due to rdfs:subPropertyOf). Usual feature systems have no taxonomy of features, whereas usual ontologies have taxonomies of properties (e.g., due to rdfs:subPropertyOf). 20

Embedded Situation in N3 Formula Tom believes that Mary kisses Bill. Tom believes that Mary kisses Bill. [a :believe; :experiencer ; :content { a :kiss; :agent ; :patient } ] [a :believe; :experiencer ; :content { a :kiss; :agent ; :patient } ] 21

FS Ontologization (Summary) RDF ⊃ FS RDF ⊃ FS Use N3 (instead of XML) for textual representation of FS as a kind of RDF. Use N3 (instead of XML) for textual representation of FS as a kind of RDF. Use ontologies for feature-system declarations. Use ontologies for feature-system declarations. 22

XML vs. RDF <syn:S> Tom Tom doesn’t look doesn’t look good. good.</syn:S> ● doesn’t look ●. TomTom goodgood dict1:gooddict1:good semsem a node embedding nodes S NP AJP 23

RDF in Graph vs. N3 24 [a syn:S; syn:child [a syn:NP; syn:child ”Tom”]; syn:child ” doesn’t look ”; syn:child [a syn:AJP; :sem dict1:good; syn:child ”good”]; syn:child ”.” ] [a syn:S; syn:child [a syn:NP; syn:child ”Tom”]; syn:child ” doesn’t look ”; syn:child [a syn:AJP; :sem dict1:good; syn:child ”good”]; syn:child ”.” ] optional and ordered ● doesn’t look ●. TomTom goodgood dict1:gooddict1:good :sem:sem S NP AJP

XML vs. RDF/N3 25 [a syn:S; [a syn:NP; ”Tom”]; ” doesn’t look ”; [a syn:AJP; :sem dict1:good; ”good”]; ”.” ] [a syn:S; [a syn:NP; ”Tom”]; ” doesn’t look ”; [a syn:AJP; :sem dict1:good; ”good”]; ”.” ] <syn:S> Tom Tom doesn’t look doesn’t look good. good.</S><syn:S> Tom Tom doesn’t look doesn’t look good. good.</S>

Cf. XML vs. RDF/XML 26 ” ” Tom doesn’t look ” good. ” ” Tom doesn’t look ” good. <syn:S> Tom Tom doesn’t look doesn’t look good. good.</S><syn:S> Tom Tom doesn’t look doesn’t look good. good.</S> Of course we won’t do this.

Merits of N3 over XML N3 is restricted to RDF, whereas XML has unnecessary freedom. N3 is restricted to RDF, whereas XML has unnecessary freedom. It’s confusing to use XML to represent RDF encoding XML. It’s confusing to use XML to represent RDF encoding XML. N3 can readily represent embedded situations. (cf. named graph) N3 can readily represent embedded situations. (cf. named graph) 27

Weaknesses of DCR Metamodel DCR metamodel cannot address DCR metamodel cannot address sorts of DCs: such as unary predicate, binary relation, symmetric binary relation, etc. sorts of DCs: such as unary predicate, binary relation, symmetric binary relation, etc. types of the domain (1 st arg.) and the range (2 nd arg.) of binary relations (properties) types of the domain (1 st arg.) and the range (2 nd arg.) of binary relations (properties) 28

Discussion What’s essential is the graph model. What’s essential is the graph model. Textual representation is not essential. Textual representation is not essential. But people tend to specify XML representation, etc. W3C insist on plain- test encoding. But people tend to specify XML representation, etc. W3C insist on plain- test encoding. 29

2 nd Day Purposes of ontology, taxonomy, data model, etc.? Purposes of ontology, taxonomy, data model, etc.? CAOS vs. TDG6 Sem. Roles? CAOS vs. TDG6 Sem. Roles? A concept is a conjunction of characteristics. … partial equivalence A concept is a conjunction of characteristics. … partial equivalence DC schema and ontology are different. DC schema and ontology are different. But DC metamodel may be defined as an ontology. But DC metamodel may be defined as an ontology. 30

Resolutions TDG6 appoints Paul Buitelaar to launch and lead the project for ontologizing LMF, collaborating with Nicoletta Calzorali and other experts. TDG6 appoints Paul Buitelaar to launch and lead the project for ontologizing LMF, collaborating with Nicoletta Calzorali and other experts. 31