Data and ontology integration issues in the biosciences Marijke Keet Napier University, 10 Colinton Road, Edinburgh EH10 5DT /

Slides:



Advertisements
Similar presentations
Chapter 7 System Models.
Advertisements

OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
Ontological Resources and Top-Level Ontologies Nicola Guarino LADSEB-CNR, Padova, Italy
Database Design: ER Modelling (Continued)
E-Science AHM, Nottingham. September 20 th, Glen Dobson: An OWL Ontology for QoS Glen Dobson (Russell Lock, Ian Sommerville)
1/ 26 AGROVOC and the OWL Web Ontology Language: the Agriculture Ontology Service - Concept Server OWL model NKOS workshop Alicante,
DELOS WP5 Workshop: Semantic Interoperability in DL systems, 17 th September 2004, Bath, UK Semantic Interoperability in Digital Library Systems Task 3:
KR-2002 Panel/Debate Are Upper-Level Ontologies worth the effort? Chris Welty, IBM Research.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
ARCHITECTURES FOR ARTIFICIAL INTELLIGENCE SYSTEMS
Music Encoding Initiative (MEI) DTD and the OCVE
An Overview of Ontologies and their Practical Applications Gianluca Correndo
1 CIS224 Software Projects: Software Engineering and Research Methods Lecture 11 Brief introduction to the UML Specification (Based on UML Superstructure.
IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
Ontology From Wikipedia, the free encyclopedia In philosophy, ontology (from the Greek oν, genitive oντος: of being (part. of εiναι: to be) and –λογία:
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 8 Slide 1 System models.
The Semantic Web Week 13 Module Website: Lecture: Knowledge Acquisition / Engineering Practical: Getting to know.
7M701 1 Software Engineering Systems Models Sommerville, Ian (2001) Software Engineering, 6 th edition: Chapter 7 (some items)
How can Computer Science contribute to Research Publishing?
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
Biological Ontologies Neocles Leontis April 20, 2005.
Systems Engineering Foundations of Software Systems Integration Peter Denno, Allison Barnard Feeney Manufacturing Engineering Laboratory National Institute.
Foundations This chapter lays down the fundamental ideas and choices on which our approach is based. First, it identifies the needs of architects in the.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
10 December, 2013 Katrin Heinze, Bundesbank CEN/WS XBRL CWA1: DPM Meta model CWA1Page 1.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 7 Slide 1 System models l Abstract descriptions of systems whose requirements are being.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
FRE 2672 Urban Ontologies : the Towntology prototype towards case studies Chantal BERDIER (EDU), Catherine ROUSSEY (LIRIS)
SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China.
Ontology Development in the Sciences Some Fundamental Considerations Ontolytics LLC Topics:  Possible uses of ontologies  Ontologies vs. terminologies.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 7 Slide 1 System models l Abstract descriptions of systems whose requirements are being.
Chapter 4 System Models A description of the various models that can be used to specify software systems.
Ontologies and Classifications
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
Ontology-Driven Information Retrieval Nicola Guarino Laboratory for Applied Ontology Institute for Cognitive Sciences and Technology (ISTC-CNR) Trento-Roma,
Categories of Vocabulary Compatibility Dmitry Lenkov Oracle.
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
Chapter 7 System models.
System models l Abstract descriptions of systems whose requirements are being analysed.
LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.
CS3773 Software Engineering Lecture 04 UML Class Diagram.
Software Engineering, 8th edition Chapter 8 1 Courtesy: ©Ian Somerville 2006 April 06 th, 2009 Lecture # 13 System models.
Sommerville 2004,Mejia-Alvarez 2009Software Engineering, 7th edition. Chapter 8 Slide 1 System models.
Taken from Schulze-Kremer Steffen Ontologies - What, why and how? Cartic Ramakrishnan LSDIS lab University of Georgia.
Semantic Data & Ontologies CMPT 455/826 - Week 5, Day 2 Sept-Dec 2009 – w5d21.
Andreas Abecker Knowledge Management Research Group From Hypermedia Information Retrieval to Knowledge Management in Enterprises Andreas Abecker, Michael.
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
SOCoP 2013 Workshop: Vision and Strategy Gary Berg-Cross SOCoP Executive Secretary Nov NSF Stafford II facility Wilson Blvd, Ballston VA.
Oreste Signore- Quality/1 Amman, December 2006 Standards for quality of cultural websites Ministerial NEtwoRk for Valorising Activities in digitisation.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 8 Slide 1 System models.
Artificial Intelligence: Research and Collaborative Possibilities a presentation by: Dr. Ernest L. McDuffie, Assistant Professor Department of Computer.
Of 24 lecture 11: ontology – mediation, merging & aligning.
The Agricultural Ontology Server (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Food and Agriculture Organization.
Ontologies COMP6028 Semantic Web Technologies Dr Nicholas Gibbins
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
COMP6215 Semantic Web Technologies
ece 627 intelligent web: ontology and beyond
Ontology From Wikipedia, the free encyclopedia
Data Models.
Abstract descriptions of systems whose requirements are being analysed
Semantic Web - Ontologies
Methontology: From Ontological art to Ontological Engineering
2. An overview of SDMX (What is SDMX? Part I)
Ontology-Based Approaches to Data Integration
Presentation transcript:

Data and ontology integration issues in the biosciences Marijke Keet Napier University, 10 Colinton Road, Edinburgh EH10 5DT / Presentation at the Micro-Array Department, University of Amsterdam

Overview presentation Data integration ontology Ontologies kinds, formalisation, bias & bioscience [after the break] Ontology integration categorisation, some challenges

Overview presentation Data integration ontology Ontologies kinds, formalisation, bias & bioscience [after the break] Ontology integration categorisation, some challenges

Data heterogeneity Schematic data type, labelling, aggregation, generalisation Semantic naming, scaling and units, confounding Intensional domain, integrity constraints Based on Goh (1996)

Integrating data e.g. DB 1 has attribute name colour and value green and DB 2 with color and 2DE60E Data is different, but the conceptualisation is the same. Capture this agreement in an ontology. Shorthand: specification of a shared conceptualisation (Gruber), but better: An ontology is a logical theory accounting for the intended meaning of a formal vocabulary, i.e. its ontological commitment to a particular conceptualisation of the world. The intended models of a logical language using such a vocabulary are constrained by its ontological commitment. An ontology indirectly reflects this commitment (and the underlying conceptualisation) by approximating these intended models. (Guarino, 1998).

Overview presentation Data integration ontology Ontologies kinds, formalisation, bias & bioscience [after the break] Ontology integration categorisation, some challenges

Kinds of ontologies Representation ontologies: conceptualisations that underlie knowledge representation formalisms. Top-level ontologies: generic and intermediate ontology concepts. This can be on top of a domain ontology or as stand-alone effort; main aspect is domain independence. Generic ontologies consist of the general, foundational aspects of a conceptualisation (a lower branch in a top-level) Intermediate ontologies are slightly more tailored towards a conceptualisation of a specific domain. There may not be references to generic ontologies. Domain ontologies specialize in a subset of generic ontologies in a domain or sub-domain. Application ontologies (…): the UoD is even narrower than a domain ontology.

Levels of formalisation (1-2) Heavyweight ontologies Lightweight ontologies Catalogue of normalised terms : is a simple list without inclusion order, axioms or glosses. Glossed catalogue : a catalogue with natural language glossary entries, e.g. a dictionary of medicine. Prototype-based ontology : types and subtype are distinguished by prototypes rather than definitions and axioms in a formal language Taxonomy : is a collection of concepts having a partial order induced by inclusion. Axiomatised taxonomy : as taxonomy, but then with axioms and stated in a formal language. Context library / axiomatised ontology : a set of axiomatised taxonomies with relations among them, like the inclusion of one context into another one, or the use of a concept from one in the other one. Formal ontology Informal ontology Semi-formal ontology?

Formalisations (2-2) Ontological precision Axiomatized theory Glossary Thesaurus Taxonomy DB/OO scheme tennis football game field game court game athletic game outdoor game Catalog game athletic game court game tennis outdoor game field game football game NT athletic game NT court game RT court NT tennis RT double fault game(x) activity(x) athletic game(x) game(x) court game(x) athletic game(x) y. played_in(x,y) court(y) tennis(x) court game(x) double fault(x) fault(x) y. part_of(x,y) tennis(y) precision: the ability to catch all and only the intended meaning (for a logical theory, to be satisfied by intended models) Gangemi (2004)

Overview presentation Data integration ontology Ontologies kinds, formalisation, bias & bioscience [after the break] Ontology integration categorisation, some challenges

Ontology integration (1-4) Combining different conceptualisations (views on reality)… somehow. System, language/syntax, structure, and semantic integration. Latter most difficult. Structure and/versus semantic integration example example Anarchy of terminology, definitions and methodologies (now at least 24 terms and 48 definitions & methodologies) Organise into levels of integration. Develop taxonomy of ontology integration?levels

Ontology integration (2-4) back Example structure/semantics

Ontology integration (3-4) Use in/for applications Increase in level of integration Unification, total compatibility, merging [similar subject domains] Merging [different subject domains], partial compatibility Mapping, approximations, helper model, alignment, intersection ontology Queried ontologies, hybrid ontologies Extending, incremental loading Increase in (perceived) difficulty of operation Initial categorisation

Ontology integration (4-4) (In)formal ontologies (In)consistencies in ontology design decisions during development (relationships) detail detail Top-down versus/combined with bottom-up Using foundational aspects in ontology development decreases the chance of design inconsistencies and facilitates integration Subject domain heterogeneity example example Conflicting goals More conflicts and mismatches here here Some challenges

(In)consistencies in ontology design decisions (1-2) Subsumption versus instantiation: if A isA B, then all instances of A are also instances of B. The latter says a instanceOf A, i.e. a is an individual (particular, instance) and not a subtype of A. Desiderata to create the hierarchy. Like keeping function, structure, process separate. E.g. the OBO phenotype ontology does not: %attribute\:excretory_function ; PATO: %attribute\:urination ; PATO: %attribute\:urine_composition ; PATO:

(In)consistencies in ontology design decisions (2-2) E.g. aseptate hypha isa hypha [aseptate = hypha without cross walls] and hypha in mycelium isa hypha. Former is about a special kind of hypha, the latter takes topology as distinction for subtyping -> are distinct factors though treated as a same kind of isA where in the FAO hypha subsumes both. Allow multiple inheritance - or not? partOf : such as parthood, proper parthood, connection, external connection, tangential parthood, interior parthood, partial coincidence and located-in (see e.g. Smith and Rosse, 2004; Donnelly, 2004) Properties and meta properties (see Guarino and Welty (2000) for details) back

Conflicts and mismatches Factors affecting ontology combination tasks Practical problems: finding matchings, diagnosis repeatability, software usability, social factors of cooperation, goals Mismatches between ontologies - language level syntax, logical representation, semantics of primitives, language expressivity, precision - ontology level - conceptualisation content/UoD, concept scope, relationship scope, context, aggregation, accuracy - explication terminological: hyper-/hyponyms (generalization), homonyms, synonyms modelling style: paradigm, entity/concept description encoding Versioning: identification, traceability, translation

Overview presentation Data integration ontology Ontologies kinds, formalisation, bias & bioscience Ontology integration categorisation, some challenges

Ontologies for bioscience (1-3) Theory (3) New empirical axioms/laws (universal) (4) Facts with an empirical basis (1) Empirical axioms/laws (universal) (2) Formation of a theory Explanation Confirmation Formation of hypothesis Induction, confirmation Prediction Confirmation Prediction

Ontologies for bioscience (2-3) Ontologies as engineering artefacts - Facilitate knowledge reuse, interoperability Modelling practice Another item in the problem-solvers toolbox Part of a new/improved software system - SW tools for ontology development, maintenance, integration Ontologies embedded in science - Top-level ontologies Attempt to understand, what/why - W.r.t. bioscience Co-defining concepts? Part of falsification paradigm and steps 2, 3 of standard view -> synergy, mutually beneficial process, but…

Ontologies for bioscience (3-3) The very essence of scientific progress is change, redefinition and creation of new concepts. -> ontology subject to (extensive) modification. Complicates integration Concepts underspecified, hypotheses and theories exist simultaneously. -> accommodate this in an ontology? E.g. a library of alternative views ontologies, with loose coupling instead of integration? -> capture what is, what can be, (and what might be?) Biological data is more complicated than technological and practice data. -> more here here Systems Thinking, integrative concepts, holism and process-orientation contradict with objectifying knowledge in ontologies -> interdisciplinary work of ontologists with scientists Empiricism and the theoretical methodology in life sciences. -> bottom-up resp. top-down procedures for ontology development

Formalising biological knowledge Challenging biological data characteristics detail detail Are these aspects real challenges, or due to limited expressiveness of non-formal approaches and software modelling paradigms (ER, OO, …), or maybe due to limited knowledge of both the domain expert and ontologist? Applied sciences within bio (medicine, ecology, environmental sciences), contexts detail detail next back

Main biological data characteristics No legacy, no full knowledge of UoD. -> Former might be alleviated over time; double curriculum, but still difference in science and engineering approaches Gradations/non-discrete data, occasional relationships, conditionality. -> Separate layer of sw, or semantics intricate part of bio data? Uncertainties, postulations, importance of parameters, properties. -> characteristic of conducting scientific research; lack vs abundance of data can be argued as design decision, not characteristic of the data/concepts; upgrading of concepts Definitional problems and lack of standardisation in nomenclature. -> Is the surface layer of next point; overabundance of semi- standards; can be in itself interdisciplinary within bioscience Disagreements between and within research groups, alternative hypotheses and theories coexist. -> There is not one what is; development of multiple theories, concepts before agreement is part of doing scientific research; library of models, aliases back

Applied bioscience Emphasis core sciences: All-inclusive comprehensive models Emphasis applied bioscience: Conceptually representing the integration of various core disciplines, Only what is relevant in limited context example back

Example applied bioscience back

References and more info (1-2) Donnelly, M. (2004). On parts and holes: the spatial structure of the human body. MEDINFO 2004, San Francisco, USA. Gangemi, A. (2004). Some design patterns for domain ontology building and analysis. Manchester January. Goh, C.H. (1996). Representing and reasoning about semantic conflicts in heterogeneous information sources. PhD, MIT. Guarino, N. (1998). Formal Ontology and Information Systems. In: Formal Ontology in Information Systems, Proceedings of FIOS'98, Trento, Italy, Amsterdam: IOS Press. Guarino, N. and Welty, C. (2000). A formal ontology of properties. Proceedings of 12th Int. Conf. on Knowledge Engineering and Knowledge Management, Lecture Notes on Computer Science, Springer Verlag. Keet, C.M. (2004). Ontology development and integration for the biosciences. Technical Report, Napier University, Edinburgh, UK. Smith, B. and Rosse, C. (2004). The role of foundational relations in the alignment of biomedical ontologies. Proceedings of MEDINFO, San Francisco, USA.

References and more info (2-2) Some websites with different perspectives/aims/information on ontologies LOA IFOMIS Ontology Formal Ontology RE Kent WonderWeb project JF Sowa SUMO AAAI page Links to a few of groups developing tools KAON Protégé VU STARLab

Thank you!