Www.landc.be 1 LinkSuite™: formally robust ontology-based data and information integration Werner Ceusters a, Barry Smith b, James Matthew Fielding b a.

Presentation transcript:

1 LinkSuite™: formally robust ontology-based data and information integration Werner Ceusters a, Barry Smith b, James Matthew Fielding b a Language & Computing nv (L&C) b Institute for Formal Ontology and Medical Information Science

2

3 The problem
A (simple?) question...
– What genes are involved in juvenile diabetes?
... may lead to many more questions:
– Where is the answer to be found?
  knowledge sources: text books, scientific papers, ...
  information sources: physician reports, medical records, ...
  data sources: clinical laboratory databases, ...
– Is there a known correct answer?
– How should the question be phrased for machine processing?
– ...

4 Partial solutions are available. Same question – different answers

5 How to solve this?
By developing a framework for data-, information- and ontology-integration
– across all levels of generalisation
– including information in both structured and unstructured forms,
which requires three tasks to be dealt with properly:
1. identifying the basic ontological foundations of a framework expressive enough to describe life science data at all levels;
2. carrying out the research in information engineering needed to create technology able to exploit this ontological framework in a way that can support the integration of massively heterogeneous structured and semi-structured life science databases;
3. developing the tools for natural language understanding in the domain of the life sciences needed to extract structured data from free text documents.
our approach to “ontology”; L&C’s LinkSuite

6 “Ontology” N. Guarino, P. Giaretta, "Ontologies and Knowledge Bases: Towards a Terminological Clarification". In Towards Very Large Knowledge Bases: Knowledge Building and Knowledge Sharing, N. Mars (ed.), pp IOS Press, Amsterdam, 1995.

7 From buzz-word to the “O-word” “An ontology is a classification methodology for formalizing a subject's knowledge or belief system in a structured way. Dictionaries and encyclopedias are examples of ontologies.” (X1) “A terminology (or classification) is a kind of ontology by definition and it should preserve (and "understand") the relationships between the 1,000s of terms in it or else it would become a mere dictionary (or at best a thesaurus).” (X2) “Ontologies are Web pages that contain a mystical unifying force that gives differing labels common meaning.” (X3)

8 If, later, you can remember just one thing from this presentation, then make sure it is this one: if you use the word “ontology”, ALWAYS be specific about what you understand by it.

9 My understanding of an ontology
A computer-understandable representation of some pre-existing domain of REALITY, reflecting the properties of the objects within its domain in such a way that there obtain substantial and systematic correlations between reality and the ontology itself (modified from Barry Smith).
– to be used by software (agents) in a machine, and NOT by humans
– does not rely on what people know or think, hence no “concepts”
– instance driven, although it accepts universals that are not instantiated
– does not “create” or “constrain” reality
– the T-Box has no meaning without the A-Box

10 Ontological theories = theories between reality and “the ontology” (“ontology” as a representation)
– Granular Partition Theory (T. Bittner & B. Smith)
– Logic of Classes (B. Smith)

11 Theory of granular partitions (B. Smith) Think of it as Alberti’s grid

12 Granular partitions: main principles
– a partition is the drawing of a (typically complex) fiat boundary over a certain domain
– a partition typically comes with labels and/or an address system
– partitions are artefacts of our cognition
– a partition is transparent (veridical)
– bona fide objects exist independently of our partitions; fiat objects are determined by partitions
– different partitions may represent cuts through the same reality which are skew to each other
– entities (existing in reality) located in the same cell of a partition share common characteristics
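
The last two principles can be illustrated with a toy sketch. It assumes, purely for illustration, that a partition can be modelled as a function assigning each entity to a labelled cell; the entity names, characteristics and the `partition` helper are hypothetical and not part of the theory's formal apparatus.

```python
from collections import defaultdict

# Hypothetical toy domain: entity name -> characteristics (illustrative only).
ENTITIES = {
    "rex": {"species": "dog", "habitat": "house"},
    "tom": {"species": "cat", "habitat": "house"},
    "akela": {"species": "wolf", "habitat": "forest"},
}


def partition(domain, cell_of):
    """Project a partition onto a domain: cell_of maps an entity's
    characteristics to the label of the cell in which it is located."""
    cells = defaultdict(set)
    for name, traits in domain.items():
        cells[cell_of(traits)].add(name)
    return dict(cells)


# Two partitions of the same reality whose boundaries are skew to each other.
by_habitat = partition(ENTITIES, lambda t: t["habitat"])
by_family = partition(
    ENTITIES,
    lambda t: "canine" if t["species"] in {"dog", "wolf"} else "feline",
)
```

Here `rex` shares a cell with `tom` under one partition and with `akela` under the other: the entities stay the same, only the fiat boundaries drawn over them differ.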

13 Logic of classes
primitive:
– entities: particulars versus universals
– relation inst such that:
  all classes are universals; all instances are particulars
  some universals are not classes, hence have no instances: pet, adult, physician
  some particulars are not instances; e.g. some mereological sums
subsumption defined resorting to instances:
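
The formula from this slide did not survive in the transcript. As a minimal sketch, assuming the usual instance-based reading (A is subsumed by B when every instance of A is also an instance of B), subsumption can be computed from the inst relation alone. The particulars, class names and function names below are hypothetical.

```python
# Hypothetical inst relation, given as (particular, class) pairs.
INST = {
    ("fido", "dog"),
    ("fido", "mammal"),
    ("felix", "cat"),
    ("felix", "mammal"),
}


def instances_of(cls, inst=INST):
    """All particulars that stand in the inst relation to cls."""
    return {particular for (particular, c) in inst if c == cls}


def is_a(sub, sup, inst=INST):
    """Assumed reading of subsumption: every instance of sub is an instance of sup."""
    return instances_of(sub, inst) <= instances_of(sup, inst)
```

Under this reading `is_a("dog", "mammal")` holds and `is_a("mammal", "dog")` does not. A name with no instances would be subsumed vacuously by everything, so the sketch only makes sense for classes in the slide's sense, which always have instances.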

14 Basic Formal Ontology
Basic Formal Ontology consists in a series of sub-ontologies (most properly conceived as a series of perspectives on reality), the most important of which are:
– SnapBFO, a series of snapshot ontologies (O_ti), indexed by times
– SpanBFO, a single videoscopic ontology (O_v).
Each O_ti is an inventory of all entities existing at a time. O_v is an inventory (processory) of all processes unfolding through time.
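
The difference between the two perspectives can be sketched as two different queries over the same toy data. This is an illustration under simplifying assumptions (integer time points, inclusive lifespans); the class names, fields and sample entities are hypothetical and not taken from BFO itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Continuant:
    """An entity that exists in full at each time of its lifespan."""
    name: str
    exists_from: int
    exists_until: int  # inclusive


@dataclass(frozen=True)
class Process:
    """An entity that unfolds through time."""
    name: str
    start: int
    end: int


def snapshot(continuants, t):
    """Snap perspective: inventory of the entities existing at time t."""
    return {c.name for c in continuants if c.exists_from <= t <= c.exists_until}


def span(processes):
    """Span perspective: one inventory of all processes with their extents."""
    return {p.name: (p.start, p.end) for p in processes}


CONTINUANTS = [Continuant("patient", 0, 80), Continuant("tumour", 50, 60)]
PROCESSES = [Process("tumour growth", 50, 58)]
```

`snapshot(CONTINUANTS, 40)` contains only the patient, while `snapshot(CONTINUANTS, 55)` also contains the tumour. The growth process never appears in any snapshot; it is listed once, with its temporal extent, in `span(PROCESSES)`.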

15

16 UMLS Semantic Types Entity Event Language Organisation Group Attribute Idea or Concept Finding Organism Attribute Intellectual Product Occupation Or Discipline Group Substance Organism Anatomical Structure Manufactured Object Behaviour Daily or Recreational activity Occupational Activity Machine Activity Laboratory Procedure Diagnostic Procedure Therapeutic Procedure Individual Behaviour Social Behaviour Health care Activity Research Activity Educational Activity Governmental or Regulatory Activity Injury or Poisoning Natural Phenomenon Or Process Human-caused Phenomenon Or Process Environment Effect of Humans Physical Object Conceptual Entity Phenomenon Or Process Activity Biologic Function Physiologic Function Pathologic Function Organ or Tissue Function Organism Function Mental Process Cell Function Molecular Function Genetic Function Disease or Syndrome Mental or Behavioural Dysfunction Neoplastic Process Cell or Molecular Dysfunction Experimental Model of Disease

17 L&C’s LinkSuite™

18 Technology overview structured text LinKFactory Server MaDBoKS TeSSI indexer Information Extraction System LinKFactory Client

19 LinKBase Formal Domain Ontology Lexicon Grammar Language A Lexicon Grammar Language B Cassandra Linguistic Ontology MEDDRA ICD SNOMED ICPC Others... Proprietary Terminologies

20 Based on formal ontology
HAS-PARTIAL-SPATIAL-OVERLAP
IS-TOPO-INSIDE-OF
IS-GEO-INSIDE-OF
IS-INSIDE-CONVEX-HULL-OF
IS-PARTLY-IN-CONVEX-HULL-OF
IS-OUTSIDE-CONVEX-HULL-OF
HAS-DISCONNECTED-REGION
HAS-EXTERNAL-CONNECTING-REGION
HAS-DISCRETED-REGION
HAS-TANG.-SPAT.-PART
HAS-NON-TANG.-SPAT.-PART
IS-SPAT.-EQUIV.-OF
IS-TANG.-SPAT.-PART-OF
IS-NON-TANG.-SPAT.-PART-OF
HAS-PROPER-SPATIAL-PART
IS-PROPER-SPAT.-PART-OF
HAS-SPATIAL-PART
IS-SPATIAL-PART-OF
HAS-OVERLAPPING-REGION
HAS-CONNECTING-REGION
HAS-SPATIAL-POINT-REFERENCE

21 Linking external ontologies MESH-2001 : “Seizures” MESH-2001 : “Convulsions” Snomed-RT : “Convulsion” Snomed-RT : “Seizure” L&C : Convulsion L&C : Seizure L&C : Health crisis L&C : Epileptic convulsion IS-A IS-narrower-than IS-A Has-CCC

22 Managing different views External ontology Internal ontology Criteria Mappings Definitions Terms

23 Ontological theory inside LinKBase
– if you know that a real-world entity satisfies the Full Definition of a domain-entity-type, then you may infer that that object is an instance of that type;
– if a real-world entity is an instance of a domain-entity, all that is said about the domain-entity applies to the instance;
– the statement “A-Link-B” says something about all instances of A, but nothing about instances of B unless the Link is declared to have an inverse.
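
These three rules can be sketched in a few lines. The code below is a hypothetical miniature, not LinKBase's actual data model or API: the labels are borrowed from the parsing slide that follows, while the definition, the statement, the inverse link name and the function names are invented for illustration.

```python
# Hypothetical miniature knowledge base; nothing here is LinKBase's actual API.
FULL_DEFINITIONS = {
    # domain-entity-type -> test that a real-world entity must satisfy
    "Cancer patient": lambda e: e.get("kind") == "patient"
    and "malignant neoplasm" in e.get("phenomena", ()),
}

# Statements of the form A-Link-B (illustrative content).
STATEMENTS = [
    ("Cancer patient", "HAS-HEALTHCARE-PHENOMENON", "Malignant neoplasm"),
]


def classify(entity):
    """Rule 1: satisfying a Full Definition licenses the instance-of inference."""
    return {t for t, satisfied_by in FULL_DEFINITIONS.items() if satisfied_by(entity)}


def statements_about(type_name, inverses=None):
    """Rules 2 and 3: what may be asserted of every instance of type_name.

    A statement A-Link-B is returned for A; it is returned for B only if an
    inverse of Link has been declared in `inverses`.
    """
    inverses = inverses or {}
    forward = [(link, b) for a, link, b in STATEMENTS if a == type_name]
    backward = [
        (inverses[link], a)
        for a, link, b in STATEMENTS
        if b == type_name and link in inverses
    ]
    return forward + backward


mr_smith = {"kind": "patient", "phenomena": ["malignant neoplasm"]}
```

`classify(mr_smith)` yields `{"Cancer patient"}`, so everything returned by `statements_about("Cancer patient")` applies to him. `statements_about("Malignant neoplasm")` stays empty until an inverse link is passed in explicitly, which mirrors the asymmetry stated in the third rule.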

24 Ontology based parsing
Example sentence: “Mr. Smith has a pulmonary carcinoma”
Steps: 1. Parsing 2. Relating 3. Inferring
[Diagram: ONTOLOGY fragment. Node labels: Patient, Cancer patient, Having a healthcare phenomenon, Healthcare phenomenon, Malignant neoplasm, Generalised Possession, Human lung carcinoma. Link labels: IS-A, Is-possessor-of, Has-Healthcare-phenomenon, Has-possessor, Has-possessed.]

25 L&C Parser output

26 Information Extraction

27 Semantic indexing

28 Conclusions There is a huge need for life science data integration technology able to deal with both structured and unstructured data formats. To keep the data manageable, the technology should be able to understand the data. The proper sort of ontology is a means to accomplish this. Based on several POCs, L&C’s LinKSuite can be claimed to be a successful attempt to exploit these insights. But humble as we are, we understand that it is still far from where it should be.