Enabling Systems Genetics to Translational Medicine: The PATO approach George Gkoutos Department of Genetics University of Cambridge.

Slides:



Advertisements
Similar presentations
Discovering Disease Associations using a Biomedical Semantic Web: Integration and Ranking One of the principal goals of biomedical research is to elucidate.
Advertisements

Biological pathway and systems analysis An introduction.
Species-Neutral vs. Multi-Species Ontologies Barry Smith.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Wrapup. NHGRI strategic plan What does the NIH think genomics should be for the next 10 years? [Nature, Feb. 2011]
Combating major diseases Gender aspects in research Gender aspects in research have a particular relevance to this theme as risk factors, biological mechanisms,
Bridging Multiple Ontologies: Route to Representation of the Liver Immune Response Anna Maria Masci, Jeffrey Roach, Bernard de Bono, Pierre Grenon, Lindsay.
Gene Ontology John Pinney
A Systematic approach to the Large-Scale Analysis of Genotype- Phenotype correlations Paul Fisher Dr. Robert Stevens Prof. Andrew Brass.
Automated tools to help construction of Trait Ontologies Chris Mungall Monarch Initiative Gene.
16 November 2004Biomedical Imaging BMEN Biomedical Imaging of the Future Alvin T. Yeh Department of Biomedical Engineering Texas A&M University.
Iowa State University Animal Science Department Bioinformatics & Computational Biology Program - 01/16/06 1 Overview of Animal Trait Ontology and PATO.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Room for Lunch: Arlington Room Room for Evening Reception: Grand Prairie Room.
What Do Toxicologists Do?
CISC667, F05, Lec24, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) DNA Microarray, 2d gel, MSMS, yeast 2-hybrid.
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Biological pathway and systems analysis An introduction.
PATO An ontology for phenotypes. The development of PATO is the work of George Gkoutos, supported by the NCBO, working in Cambridge.
Knowledgebase Creation & Systems Biology: A new prospect in discovery informatics S.Shriram, Siri Technologies (Cytogenomics), Bangalore S.Shriram, Siri.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Computational Biology and Informatics Laboratory Development of an Application Ontology for Beta Cell Genomics Based On the Ontology for Biomedical Investigations.
Representing, Querying and Mining Knowledge about Autism Phenotypes
Chapter 13. The Impact of Genomics on Antimicrobial Drug Discovery and Toxicology CBBL - Young-sik Sohn-
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
Resurrecting SOWG BS, Baltimore, CTS Ontology Workshop April
BIOMARKERS Diagnostics and Prognostics. OMICS Molecular Diagnostics: Promises and Possibilities, p. 12 and 26.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
Leveraging Ontologies for Human Immunology Research Barry Smith, Alexander Diehl, Anna- Maria Masci Presented at Leveraging Standards and Ontologies to.
Abstract Background: In this work, a candidate gene prioritization method is described, and based on protein-protein interaction network (PPIN) analysis.
Agent-based methods for translational cancer multilevel modelling Sylvia Nagl PhD Cancer Systems Science & Biomedical Informatics UCL Cancer Institute.
Ontology of Disease and the OBO Foundry Chris Mungall NCBO GO Nov 2006.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Linking Animal Models and Human Diseases Supported by NIH P41 HG002659, U54 HG004028, & R01 HG Cambridge University & the University of Oregon.
Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
1- Introduction of Pathology
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
2 3 where in the body ? where in the cell ?
By: Amira Djebbari and John Quackenbush BMC Systems Biology 2008, 2: 57 Presented by: Garron Wright April 20, 2009 CSCE 582.
1 Image-Based Biomedical Big Data Analytics Jens Rittscher Department of Engineering Science, Nuffield Department of Medicine, University of Oxford.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Need for common standard upper ontology
Introduction to Biomedical Ontology for Imaging Informatics Barry Smith, PhD, FACMI University at Buffalo May 11, 2015.
Shankar Subramaniam University of California at San Diego Data to Biology.
Phenotype And Trait Ontology (PATO) and plant phenotypes
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Inflammatory Bowel Diseases November 19, 2007 NCDD Meeting Chair: Daniel K. Podolsky, MD Vice Chair: Eugene B. Chang, MD.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Effect of Alcohol on Brain Development NormalFetal Alcohol Syndrome.
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Linking Animal Models and Human Diseases
TDM in the Life Sciences Application to Drug Repositioning *
Towards a unified MOD resource: An Overview
TITIN ANDRI WIHASTUTI SCHOOL OF NURSING FACULTY OF MEDICINE
Diagnostics Scientific Committee
The Teleost Anatomy Ontology: computable evolutionary morphology for teleost fishes Wasila Dahdul University of South Dakota & National Evolutionary Synthesis.
Development of the Amphibian Anatomical Ontology
Functional Annotation of the Horse Genome
Lixia Yao, James A. Evans, Andrey Rzhetsky  Trends in Biotechnology 
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
PREDICT.
DICOM – A Preclinical Perspective
LiSyM- Pillar II Chronic liver disease progression
Presentation transcript:

Enabling Systems Genetics to Translational Medicine: The PATO approach George Gkoutos Department of Genetics University of Cambridge

Exploring the Phenome Key EU/NIH missions: – integration and analysis of disease data within and across species  diagnostic and therapeutic advances at the clinical level – identification of causative genes for Mendelian orphan diseases

Power of the Phenotype The meaningful cross species translation of phenotype is essential  phenotype-driven gene function discovery and comparative pathobiology Goal - “A platform for facilitating mutual understanding and interoperability of phenotype information across species and domains of knowledge amongst people and machines” …..

Phenotype And Trait Ontology (PATO) phenotypes may be described in many different dimensions, e.g. – the biochemical ('alcohol dehydrogenase null') – the cellular ('cell division arrested at metaphase’) – the anatomical ('eye absent') – the behavioral (‘hyperactive’) – etc. in whatever dimension and granularity, however, there is a commonality so that phenotypic descriptions can be decomposed into two parts – An entity that is affected. This entity may be an enzyme, an anatomical structure or a complex biological process. – The qualities of that entity.

Type and Sources Type of data Behaviour and cognition, Clinical chemistry and haematology, Hormonal and Metabolic Systems, Cardiovascular, Allergy and Infectious diseases, Sensory Systems, Central/Peripheral Nervous and Skeletal Muscle Systems, Cancer Phenotyping, Bone, Cartilage, Arthritis, Osteoporosis, Necropsy Exam, Pathology, Histology, etc. etc. etc. Source of phenotype information – Literature – Experimental data – Various representation methodologies – Complex phenotype data

PATO today PATO is now being used as a community standard for phenotype description – many consortia (e.g. Phenoscape, The Virtual Human Physiology project (VPH), IMPC, BIRN, NIF) – most of the major model organism databases, (e.g. example Flybase, Dictybase, Wormbase, Zfin, Mouse genome database (MGD)) – international projects

PATO’s Semantic Framework Conceptual Layer Semantic Components Layer Unification Layer Formalisation Layer Integration Layer

PATO’s Conceptual Layer PATO Species Independent Core Ontologies (e.g. anatomy, biological process, chemistry) EQ Phenotype Description Entity (E)Quality (Q) PATO Species Independent EQ Phenotype Description

Mouse Body weight PATO Species Independent Mouse Anatomy (MA) EQ Phenotype Description Body(E)Weight(Q) PATO Species Independent EQ mouse body weight

Semantic Components Layer Behavior – NeuroBehavior Ontology – Behavioral Phenotype Ontology Pathology Physiology – Cerebellar ataxia Create links to behavioral observation to physiology manifestations Cell Phenotype Quantitative measurement (Units Ontology)

PATO’s Unification Layer Following the GO paradigm, several examples of attempts to formalize species specific phenotype description have been adopted: e.g. Mammalian Phenotype Ontology (MP), Plant & Trait Ontology, Human Phenotype Ontology (HPO), etc. Advantages – Easy for annotation – Control – Complex phenotypic information Disadvantages – lack of rigidity e.g. quantitative data – ontology management e.g. expansion – incapable of bridging different phenotype descriptions (for either the same or separate species)

HELLP syndrome Liver failure Hepatic failure Pregnancy related premature death Glomerular vascular disorder Abnormal glomeruli Hypertension Thrombocytopenia Renal Failure Renal failure Hepatic necrosis Acute and subacute liver necrosis Proteinuria Haemolytic anaemia Anaemia haemolytic HPOMP

PATO-based definitions Aristotelian definitions (genus-differentia) A *which* inheres_in an [Term] id: MP: name: decreased body weight namespace: mammalian_phenotype_xp Synonym: low body weight Synonym: reduced body weight def: " lower than normal average weight “[] is_a: MP: ! abnormal body weight intersection_of: PATO: ! decreased weight intersection_of: inheres_in MA: ! adult mouse

HELLP syndrome Liver failure Hepatic failure Pregnancy related premature death Glomerular vascular disorder Abnormal glomeruli Hypertension Thrombocytopenia Renal Failure Renal failure Hepatic necrosis Acute and subacute liver necrosis Proteinuria Haemolytic anaemia Anaemia haemolytic HPOMP

HELLP syndrome Liver failure Hepatic failure Pregnancy related premature death Glomerular vascular disorder Abnormal glomeruli Hypertension E: Blood (MA) Q: Increased pressure (PATO) Hypertension E: Blood (FMA) Q: Increased pressure (PATO) Thrombocytopenia E: Platelet(CL) Q: Decreased number (PATO ) Thrombocytopenia E: Platelet (CL) Q: Decreased number (PATO) Renal Failure E: Renal system process (GO) Q: disfunctional (PATO) Renal failure E: Renal system process (GO) Q: disfunctional (PATO) Hepatic necrosis E: Liver (MA) Q: Necrosis (MPATH) Acute and subacute liver necrosis E: Liver (FMA) Q:Necrotic (PATO) Proteinuria Haemolytic anaemia Anaemia haemolytic E: Hepatocobiliary system process (GO) Q: disfunctional (PATO) E: Hepatocobiliary system process (GO) Q: disfunctional (PATO) E: Glomerulus (MA) Q: abnormal ( PATO) E: Glomerulus (FMA) Q: abnormal ( PATO) E: Urine(MA) Q: Increased concentration E2:Protein( CheBI) E: Urine(FMA) Q: Increased concentration E2:Protein( CheBI)

Progress to date

Comparative Phenomics

PATO Conceptual Layer PATO ProcessObject EQ EQ Model link Entities (E) from GO, CheBI, FMA etc. to Qualities (Q) from PATO  EQ statements

PATO NBOMPATHGOCHEBIUBERONCLUO Semantic Components Layer EQ Behavior Pathology Physiology UBERON Cell Phenotype Measurements (Units Ontology)

EQCPOYPOTOHPOFBCVMPWBPhenotype Unification Layer PATO NBOMPATHGOCHEBIUBERONCLUO Provision of PATO based equivalence definitions

PATO NBOMPATHGOCHEBIUBERONCLUO EQCPOYPOTOHPOFBCVMPWBPhenotype UBERON- Integrating Species- Centric Anatomies

Formalisation Layer PATO NBOMPATHGOCHEBIUBERONCLUO EQCPOYPOTOHPOFBCVMPWBPhenotype transform OWL ontologies into OWL EL  enable tractable reasoning

PATO NBOMPATHGOCHEBIUBERONCLUO EQCPOYPOTOHPOFBCVMPWBPhenotype Integration Layer

Cross Species Data Integration

Cross species integration framework A PATO-based cross species phenotype network based on experimental phenotype data for 5 model organisms yeast, fly, worm, fish, mouse and human integration of anatomy and phenotype ontologies – exploit through OWL reasoning – more than 500,000 classes and 1,500,000 axioms PhenomeNET forms a network with more than complex phenotype nodes representing complex phenotypes

PhenomeNet quantitative evaluation based on predicting orthology, pathway, disease Receiver Operating Characteristic (ROC) Curve analysis Area Under Curve (AUC) = 0.7

E1: Aorta(FMA) Q: overlap with (PATO) E2: Membranous part of the interventricular septum (FMA) Candidate disease gene prioritization Predict all known human and mouse disease genes Adam19 and Fgf15 mouse genes using zebrafish phenotypes - mammalian homologues of Cx36.7 and Nkx2.5 are involved in TOF

Enhance the network e.g. – Semantics e.g Behavior and pathology related phenotypes etc. – Methods e.g. text mining, machine learning etc. PhenomeNET now significantly outperforms previous phenotype- based approaches of predicting gene–disease associations Performance matches gene prioritization methods based on prior information about molecular causes of a disease AUC = 0.9

IRDiRC

dbGAP dbSNP ClinVar

Translational Research

The power of phenotype Candidate disease gene prioritization Copy number variations Rare and orphan diseases Functional validation of human variation studies (e.g GWAS) identification of pathogenicity of human mutations new therapeutic strategies

Novel drug discovery and repurposing

Phenotype-based drug discovery and repurposing Variety of methods successfully being applied for drug repositioning and the suggestions of potentially novel drugs Can a phenotype of gene which the drug interacts be used to predict diseases in which the drug is active?

Results AUC = 0.65 PharmGKB 0.63 FDA 0.69 CTD

Future work integrated system for the analysis and prediction of drug– disease associations with emphasis on orphan diseases include other drug resources such DrugBank and CTD combine them with other methods such as: – drug response – gene expression profiles – drug–drug similarity – drug–disease similarity – text mining of known associations employ other computational approaches (machine learning approach, statistical testing, semantic similarity)

Mathematical Modelling

Model-based investigation of optimal cancer chemotherapy mathematical modelling of cancer progression and optimal cancer chemotherapy cancer dynamics, pharmacokinetic and drug-related toxicity models  study the effect of widely used anti-cancer agents irinotecan (CPT-11) and 5-fluorouracil (5-FU) include drug related side-effects categorised in terms of undesirability of the side-effect as well as the frequency of appearance

models replicate animal data successfully optimal administration:  5-FU  CPT-11 future directions – experimental validation – specific cancer characteristics, drug resistance, metastasis and cell-cycle b) Optimal control a) Model predictions alongiside experimental data

RICORDO - Towards Physiology knowledge representation Virtual Physiology Human (VPH) - “A major challenge for the future how is to integrate physiology knowledge into robust and fully reliable computer models and "in silico" environments” The RICORDO approach ( – ontology based framework for the description of VPH models and data – connect distributed repositories with software tools – standardization of the minimal information content Goal - qualitative representation of physiology

Translational Medicine Diagnostic Strategies Systems Genetics Therapeutic strategies Human Variation Disease Gene Prioritsation CNVs Orphan Diseases Functional Analysis etc Animal Models Systematic genome-wide phenotypingForward GeneticsReverse GeneticsModel Organism DatabasesLiteratureExperimental Data Translational Medicine Personalised Medicine translation Cross Species Integration