The Comparative Toxicogenomics Database (CTD):

Slides:



Advertisements
Similar presentations
Environmental Health Tracking Technical Team Meeting 1 Future Assessment and Needs Assessment Advisory Discussion Craig Wolff IT/GIS Manager March 5, 2003.
Advertisements

1 Inference of Regulatory Networks via Systems Genetics Ina Hoeschele.
CTD: a resource for predicting chemical-gene-disease networks Carolyn Mattingly North Carolina State University.
Statistical methods and tools for integrative analysis of perturbation signatures Mario Medvedovic Laboratory for Statistical Genomics and Systems Biology.
Carcinogen Classification Criteria Patricia Richter Ph.D., DABT Tobacco Products Scientific Advisory Committee June 8, 2010.
Human Heredity, the Environment and Disease: Nature and Nurture National Human Genome Research Institute National Institutes of Health U.S. Department.
Pathways & Networks analysis COST Functional Modeling Workshop April, Helsinki.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
SCB : 1 Department of Computer Science Simulation and Complexity SCB : Simulating Complex Biosystems Susan Stepney Department of Computer Science Leo Caves.
What Do Toxicologists Do?
Introduction of Cancer Molecular Epidemiology Zuo-Feng Zhang, MD, PhD University of California Los Angeles.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Approaches to Assessing Unintended Health Effects of Genetically Engineered Foods.
Modeling Functional Genomics Datasets CVM Lessons 4&5 10 July 2007Bindu Nanduri.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Carolyn J. Mattingly The Mount Desert Island Biological Laboratory Salisbury Cove, Maine The Comparative Toxicogenomics Database (CTD): Predicting mechanisms.
Knowledge Integration for Gene Target Selection Graciela Gonzalez, PhD Juan C. Uribe Contact:
Office of Research and Development National Center for Computational Toxicology May 18, 2011 RASS December 11, 2013 Elaine Cohen Hubal Chemical Safety.
Knowledgebase Creation & Systems Biology: A new prospect in discovery informatics S.Shriram, Siri Technologies (Cytogenomics), Bangalore S.Shriram, Siri.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Identification of network motifs in lung disease Cecily Swinburne Mentor: Carol J. Bult Ph.D. Summer 2007.
Bioinformatics and it’s methods Prepared by: Petro Rogutskyi
Big Data Network Genomics Network Inference and Perturbation to Study Chemical-Mediated Cancer Induction Stefano Monti Section of Computational.
Beyond the Human Genome Project Future goals and projects based on findings from the HGP.
National Institutes of Health National Institute of General Medical Sciences FINDINGS Chemical World Toxicologist Serrine Lau: Investigating Molecular.
Bioinformatics and medicine: Are we meeting the challenge?
Networks and Interactions Boo Virk v1.0.
Biomedical Research.
Expanding Biomedical Research in Maine Mount Desert Island Biological Laboratory Patricia H. Hand, Ph.D. Administrative Director.
NIH Council of Councils Meeting November 21, 2008 LINCS Library of Integrated Network-based Cellular Signatures.
Agent-based methods for translational cancer multilevel modelling Sylvia Nagl PhD Cancer Systems Science & Biomedical Informatics UCL Cancer Institute.
Inform Development of Regenerative Therapies Identify regulatory units for regeneration Identify genes inactivated in animal systems that cannot regenerate.
Bioinformatics lectures at Rice University Li Zhang Lecture 9: Networks and integrative genomic analysis
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Introduction to DNA Microarrays: Functional Mining of Array Patterns Michael F. Miles, M.D., Ph.D. Depts. of Pharmacology/Toxicology and Neurology and.
“My Experiment” and What I Want to Discover My experiment involved comparing the effect of long term ozone exposure on gene expression in a wild type and.
COMPUTATIONAL ANALYSIS OF MULTILEVEL OMICS DATA FOR THE ELUCIDATION OF MOLECULAR MECHANISMS OF CANCER Presented by Azeez Ayomide Fatai Supervisor: Junaid.
Biological Networks & Systems Anne R. Haake Rhys Price Jones.
TOXICOGENOMICS.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Ferran Sanz – GRIB (IMIM-UPF) Bioinformatics: How it can support the Family of International Classifications? Ferran Sanz Research Programme on Biomedical.
Clément Frainay PhD Student Supervisors : Dr Daniel Zalko, Dr Fabien Jourdan INRA Toxalim (French National Institute of Agronomic Research)
Supplemental Figure 2. IPA Gene Network Analysis. The top gene networks for DEGS more highly expressed (Panels A- B) and less highly expressed (Panels.
McKim Conference on Predictive Toxicology The Inn of Lake Superior Duluth, Minnesota September 16-18, 2008 Toxicity Pathways as an Organizing Concept Gilman.
Copyright OpenHelix. No use or reproduction without express written consent1.
High throughput biology data management and data intensive computing drivers George Michaels.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
General Concepts in QSAR for Using the QSAR Application Toolbox
BDPGx - A Big Data Platform for Graph-based Pharmacogenomics Data
Networks and Interactions
FIFRA SAP Meeting February 2, 2010
Microarray Experiment Design and Data Interpretation
STRING Large-scale data and text mining
James M. Fadool Biological Science
Cancer, reproductive system diseases, lipid metabolism 48 22
Susan Makris U.S. EPA, Office of Research and Development
Dept of Biomedical Informatics University of Pittsburgh
Biomedical Research.
Ingenuity Knowledge Base
By Michael Fraczek and Caden Boyer
Department of Chemical Engineering
Lecture 7: Biological Network Crosstalk Y. Z
The Impact of Network Medicine in Gastroenterology and Hepatology
Standards Development for Metabolomics
The Genetic Basis for Cancer Treatment Decisions
Microbes, Microbiota, and Colon Cancer
Anoor Paripati, MBBS, Chris Kingsley, PhD, Glen J. Weiss, MD 
Rey-Long Liu Dept. of Medical Informatics Tzu Chi University Taiwan
Part I. Introduction and Genetic Engineering
Presentation transcript:

The Comparative Toxicogenomics Database (CTD): Predicting mechanisms of toxicity Carolyn J. Mattingly The Mount Desert Island Biological Laboratory Salisbury Cove, Maine

Chemicals in commerce > 80,000 ~2,000 added/year ~8,000 are carcinogens No toxicity data for ~40% of the 3,300 “high production volume” chemicals Full toxicity data for only 25% of chemicals in consumer products

Time, 1947

What’s the relationship between chemicals and disease? Genes/Proteins Cell Death/ Differentiation DNA Repair Cell Cycle Control Chemical Distribution/ Metabolism Disease DISEASE

Exploring environment-gene-disease connections What diseases are associated with Bisphenol A (BPA)? Which BPA-induced genes function during development? What biological functions are affected by BPA? Which molecular pathways are affected by exposure to BPA? Which other chemicals have interaction profiles similar to BPA? What are target genes that are common to BPA and arsenic?

Curated Data MeSH® (modified) CTD interactions chemical-gene Chemicals MeSH® (modified) CTD interactions chemical-gene interactions chemical-disease relationships Genes Diseases gene-disease relationships Entrez-Gene MeSH/OMIM

Prioritizing Curation EPA Superfund EPA ToxCast NTP Users/Collaborators

Curated Data chemical-gene interactions chemical-disease relationships Chemicals chemical-gene interactions chemical-disease relationships 199,453 9,524 6,143 Genes Diseases gene-disease relationships

Integrated Data chemical-gene interactions chemical-disease Chemicals chemical-gene interactions chemical-disease relationships Genes Diseases gene-disease relationships

Creating inferences chemical-gene interactions chemical-disease Chemicals chemical-gene interactions chemical-disease relationships Genes Diseases gene-disease relationships

Creating inferences chemical-gene interactions chemical-disease Chemicals chemical-gene interactions chemical-disease relationships Genes Diseases gene-disease relationships

Inferred chemical-disease relationships BPA AGR2… Prostatic Neoplasms Inferred chemical-disease relationships

Cancer and urologic diseases BPA-Prostate cancer genes Cancer and urologic diseases Generated using Ingenuity Pathway Analysis

Chemical-disease inferences ~190,000 transitive inferences between chemicals and diseases Transitive Inference If ‘A’ interacts with ‘B’ and ‘C’ interacts with ‘B’, then infer that ‘A’ interacts with ‘C’ How to assess which inferences are “good” or not? A B C

Bisphenol A and Lung Neoplasms Geometric Cvw = |N(v) N(w)|2 |N(v)| . |N(w)| 457 other genes or diseases BPA g1 g2 22 Genes g3 139 other Chemicals or genes … Lung Neoplasms g22

Geometric Cvw for “Real” C-D Inferences Geometric Cvw for shuffled C-D Inferences

Inferred chemical-pathway relationships BPA IKBKB… AML Inferred chemical-pathway relationships

Tools

Tools

Tools

Tools: VennViewer 127 1357 118 4 76 21 Interacting Genes/Proteins Folic acid Arsenicals Pathways 4 76 21 Folic acid Arsenicals

Tools

MDIBL: Effects of arsenic on immune function 64 1689 20 Array data CTD data 0 10 100

MDIBL: Effects of arsenic on immune function 64 1689 20 Array data CTD data 0 10 100 Mattingly, C. J., T. Hampton, K. Brothers, N. E. Griffin and A. J. Planchart (2009). Perturbation of defense pathways by low-dose arsenic exposure in zebrafish embryos. Environ Health Perspect doi:10.1289/ehp.0900555.

NIEHS: Identifying chemical-gene-disease networks Gohlke, J., R. Thomas, Y. Zhang, M. D. Rosenstein, A. P. Davis, C. Murphy, C. J. Mattingly, K. G. Becker and C. J. Portier (2009). The Genetic And Environmental Pathways to Complex Diseases. BMC Syst Biol.May 5 3:46.

EPA: Exploring the environmental etiology of autistic disorders Characterizing these chemicals Structure Regulatory features (e.g., High production, Carcinogen) Function (e.g., Associated pathways) Other associated diseases (e.g., Neurological) 2096 Chemicals 213 Genes 213 Genes Autism Mark Coralles, EPA

In Progress Tag Clouds Text mining Statistical analysis of data inferences Gene Ontology enrichment analysis

Coming Up Analysis tools and visualization capabilities Integration of additional data sets (SNPs, Chemical codes) Exposure data curation

Curating exposure data Develop exposure ontology Define scope of data to be curated Test curation protocol Curate and integrate data in CTD Exposure data Diseases Genes Chemicals chemical-disease relationships chemical-gene interactions gene-disease

Acknowledgements Allan Peter Davis, PhD Cindy Murphy, PhD Scientific Curators Allan Peter Davis, PhD Cindy Murphy, PhD Cynthia Saraceni-Richards, PhD Susan Mockus, PhD Scientific Software Engineers Michael C Rosenstein, JD Thomas Wiegers System Administrator Roy McMorran James L. Boyer, MD (Yale) http://ctd.mdibl.org/ Zebrafish work Antonio Planchart, PhD Thomas Hampton (Dartmouth) Funding NIEHS AND NLM (ES014065) NCRR (RR016463) Contact Us! cmattin@mdibl.org ctd@mdibl.org