www.landc.be


Using Cross-Lingual Information to Cope with Underspecification in Formal Ontologies
W. Ceusters a, I. Desimpel a, B. Smith b, S. Schulz c
a Language and Computing nv., Zonnegem, Belgium
b IFOMIS, Leipzig, Germany
c Dept. of Medical Informatics, Freiburg University Hospital, Germany

Presentation overview
– Ontologies and underspecification
– Implementation of a novel algorithm to detect underspecification
– Evaluation of results
– Applications
– Conclusion

From concept-based representations to ontology
“Ontology” in Information Science:
– “An ontology is a description (like a formal specification of a program) of the concepts and relationships that can exist for an agent or a community of agents.” (Tom Gruber)
“Ontology” in Philosophy:
– “Ontology is the science of what is, of the kinds and structures of objects, properties, events, processes and relations in every area of reality.” (Barry Smith)

What is ontological underspecification?
SARS: “Severe Acute Respiratory Syndrome”
A tentative description (in CEN/TC251 MOSE style):
– ISA respiratory syndrome
– HAS-ONSET acute
– HAS-SEVERITY severe
A DL classifier using this description would classify ANY respiratory syndrome that is acute and severe as SARS, and not just the particular disease now recognised as being caused by a rapidly mutating coronavirus.
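The effect can be sketched in a few lines of Python. This is only a simplified stand-in for a DL classifier, not the behaviour of any particular reasoner: a definition is treated as a set of (relation, value) pairs, and an entity is classified under a concept when it satisfies all of them. The function name and the two example conditions are invented for illustration.

```python
# Hypothetical, simplified stand-in for a DL classifier: an entity falls under
# a concept when it satisfies every (relation, value) pair of the definition.

SARS_DEFINITION = frozenset({
    ("ISA", "respiratory syndrome"),
    ("HAS-ONSET", "acute"),
    ("HAS-SEVERITY", "severe"),
})


def is_classified_as(entity, definition):
    """Return True if the entity satisfies every criterion in the definition."""
    return definition <= entity


# Invented example: a different condition that is also acute and severe.
other_syndrome = frozenset({
    ("ISA", "respiratory syndrome"),
    ("HAS-ONSET", "acute"),
    ("HAS-SEVERITY", "severe"),
    ("HAS-CAUSE", "smoke inhalation"),
})

# Invented example: a severe but chronic condition.
chronic_syndrome = frozenset({
    ("ISA", "respiratory syndrome"),
    ("HAS-ONSET", "chronic"),
    ("HAS-SEVERITY", "severe"),
})

print(is_classified_as(other_syndrome, SARS_DEFINITION))    # True: wrongly classified as SARS
print(is_classified_as(chronic_syndrome, SARS_DEFINITION))  # False
```

Because the definition says nothing about the cause, nothing in it can keep the first condition out of the SARS class.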

“Minimal ontological commitment”
“An ontology should make as few claims as possible about the world being modeled, allowing the parties committed to the ontology freedom to specialize and instantiate the ontology as needed. Since ontological commitment is based on consistent use of vocabulary, ontological commitment can be minimized by specifying the weakest theory (allowing the most models) and defining only those terms that are essential to the communication of knowledge consistent with that theory.”
– Thomas R. Gruber, Toward Principles for the Design of Ontologies Used for Knowledge Sharing, 1993

Pros and cons of minimal ontological commitment
Some arguments in favour:
– it is better to have partial information than no information at all
– reasoning with less information is faster than with lots of information
– less risk of descriptive errors
Some arguments against:
– it reduces the applicability of the ontology
– knowing that a specific entity in the real world fits a class in the ontology allows you to infer some characteristics of that entity, but knowing that an entity has some characteristics does not allow you to infer that it fits a specific class
– simple subsumption-based reasoning goes wrong quickly
Key issue: it is a doctrine, hence it may be rejected, and we believe the arguments against are strong enough to do so!

Underspecification can be very subtle
(Fistula which <
  isPartitivelyTo AbdominalSkin
  isPartitivelyFrom Colon
  isSpecificImmediateConsequenceOf SurgicalConstructingProcess
>) name ColostomyStructure
Grail-6; Dec 2002
Just any surgical construction?

From underspecification to wrong classification

Objectives As developers and users of LinkBase, we want to avoid such mistakes

LinkBase architecture
[Diagram; labelled components: Formal Domain Ontology; Cassandra Linguistic Ontology; Lexicon and Grammar for Language A; Lexicon and Grammar for Language B; MEDDRA, ICD, SNOMED, ICPC, Others...; Proprietary Terminologies]

Objectives
As developers and users of LinkBase, we want to avoid such mistakes.
Approach
Expand an existing LinkFactory algorithm (FRVP) such that it takes linguistic information into account.

Mechanism: finding cross-roads

Ranking of best results in case of multiple cross-roads: x5 or x3?
Applying a cost function based on a mixture of:
– shortest path
– type of links traversed

Long distance intersections
PNAS polymer: no direct ISA link to any of the concepts queried for; many non-ISA links traversed → high cost
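A cost function of this kind might be sketched as follows. This is not the FRVP algorithm itself, whose details are not given in the slides; it is a minimal illustration that assumes cross-roads are concepts reachable from every query concept, that cost is the summed cheapest path length, and that non-ISA links cost more than ISA links. The weights, link names and toy graph are all invented.

```python
import heapq

# Assumed weights: ISA links are cheap, every other link type costs more.
LINK_COST = {"ISA": 1.0}
DEFAULT_LINK_COST = 3.0


def cheapest_paths(graph, start):
    """Dijkstra over typed links; graph maps node -> [(link_type, neighbour)]."""
    dist = {start: 0.0}
    queue = [(0.0, start)]
    while queue:
        d, node = heapq.heappop(queue)
        if d > dist.get(node, float("inf")):
            continue  # stale queue entry
        for link_type, nxt in graph.get(node, []):
            nd = d + LINK_COST.get(link_type, DEFAULT_LINK_COST)
            if nd < dist.get(nxt, float("inf")):
                dist[nxt] = nd
                heapq.heappush(queue, (nd, nxt))
    return dist


def rank_crossroads(graph, query_concepts):
    """Return (cost, concept) pairs for concepts reachable from all queries."""
    per_query = [cheapest_paths(graph, q) for q in query_concepts]
    shared = set(per_query[0])
    for dist in per_query[1:]:
        shared &= set(dist)
    return sorted((sum(d[n] for d in per_query), n) for n in shared)


# Invented toy graph with two candidate cross-roads, x3 and x5.
graph = {
    "a": [("ISA", "x3"), ("HAS-LOCATION", "x5")],
    "b": [("ISA", "x3"), ("ISA", "x5")],
}
print(rank_crossroads(graph, ["a", "b"]))  # [(2.0, 'x3'), (4.0, 'x5')]
```

In this toy case x3 ranks first because both query concepts reach it over ISA links only, whereas reaching x5 requires a dearer non-ISA link.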

Basic improvement: starting the search with words instead of concepts
Homonym disambiguation required!

Additional improvements
Pick up also concepts associated with terms containing only a subset of the words from the query term, to be able to deal with:
– terms containing words not associated with LinKBase® concepts
– semi-tautologies: dorsal back pain, knee joint arthropathy
Language-specific term generator based on inflection-, derivation-, and clause-generation rules, with prevention of overgeneration by checking whether such constructed combinations of words qualify as terms for an existing concept in LinKBase®.
Generate larger sections for a given word by checking the ontology also for translations and/or possible synonyms of the word and its generated words in other languages.
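The first improvement, matching on a subset of the query words, can be illustrated with a small inverted index. The sketch below is an assumption about how such a lookup could work, not the LinKFactory implementation; the lexicon entries and concept identifiers are invented.

```python
from collections import defaultdict

# Invented toy lexicon: term -> concept identifier.
TERMS = {
    "back pain": "C_BACK_PAIN",
    "knee arthropathy": "C_KNEE_ARTHROPATHY",
    "knee joint": "C_KNEE_JOINT",
}


def build_index(terms):
    """Map each word to the (term, concept) pairs whose term contains it."""
    index = defaultdict(set)
    for term, concept in terms.items():
        for word in term.split():
            index[word].add((term, concept))
    return index


def candidate_concepts(query, index):
    """Concepts whose terms use only words that occur in the query term."""
    query_words = set(query.lower().split())
    hits = set()
    for word in query_words:
        for term, concept in index.get(word, ()):
            if set(term.split()) <= query_words:
                hits.add(concept)
    return sorted(hits)


index = build_index(TERMS)
print(candidate_concepts("dorsal back pain", index))
# ['C_BACK_PAIN']
print(candidate_concepts("knee joint arthropathy", index))
# ['C_KNEE_ARTHROPATHY', 'C_KNEE_JOINT']
```

The word "dorsal" has no entry in the toy lexicon, yet the query still reaches the back pain concept; the semi-tautology "knee joint arthropathy" reaches both shorter terms.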

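An example
[Diagram: the query term “pulmonary embolism” is linked through English and French words and terms (pulmonary, pulmonaire, embolism, embolie, infarction pulmonaire, infarctus du poumon, lung, poumon, lung embolism, embolie pulmonaire, pulmonary infarction) to concepts C1, C2 and C3; annotation: “when more ontological information available”]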

FRVP versus TermModeling

Evaluation with double purpose
– Quantification of effect
– Applicability for Quality Control

Experiment design
Random selection of 100 terms from LinKBase®, all of them associated with concepts for which explicit conceptual information is lacking.
Application of 6 languages plus Morphosaurus® MIDs.
We ran 7 tests; for each, a separate base language was chosen and the other languages were then added in order of next least available terms. As an exception, the MID language was always added last.
For quantification purposes we used the cost function described earlier: the gain in cost after applying additional linguistic information is a good measure of how much implicit information could be used.
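The test procedure could be sketched as below. Two readings are assumed here and may not match the original experiment: "next least available terms" is taken to mean ascending number of available terms, and the gain is taken to be the drop in cost from one step to the next. The language codes, term counts and cost values are invented placeholders, not results from the study.

```python
def language_order(base, term_counts, last="MID"):
    """Base language first, then the others by ascending term count, MID last."""
    others = [lang for lang in term_counts if lang not in (base, last)]
    others.sort(key=lambda lang: term_counts[lang])
    tail = [last] if last in term_counts and last != base else []
    return [base] + others + tail


def cost_gains(costs):
    """Cost reduction after each added language; costs[0] is the base-only cost."""
    return [prev - cur for prev, cur in zip(costs, costs[1:])]


# Invented placeholder data for three languages plus the MID language.
term_counts = {"A": 500, "B": 120, "C": 300, "MID": 900}

print(language_order("A", term_counts))    # ['A', 'B', 'C', 'MID']
print(language_order("MID", term_counts))  # ['MID', 'B', 'C', 'A']
print(cost_gains([10.0, 7.0, 7.0, 6.5]))   # [3.0, 0.0, 0.5]
```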

Some results for the 72nd term in French

Results: “winner takes nearly all”
[Chart; axis label: Language processed]

Some applications

Improving classification
The concept acute viral infection does not yet subsume acute viral respiratory infection.

Finding missing links

Finding different concepts with same meaning

Finding mistakes (say no more)

Conclusion
We have shown that there is an objectively measurable value in exploiting the implicit linguistic-semantic information present in multilingual annotations of concepts to resolve the problem of formal underspecification in ontologies.
Hence, multilingual annotations are an additional means of quality assurance in ontologies, adding a dimension that cannot be covered by description logics alone.