Software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel.

Slides:



Advertisements
Similar presentations
Chemical named entity recognition and literature mark-up Colin Batchelor Informatics Department Royal Society of Chemistry
Advertisements

EcoTherm Plus WGB-K 20 E 4,5 – 20 kW.
AP STUDY SESSION 2.
Semantic Integration of Social and Domain Knowledge in a Collaborative Network Platform Luís Carneiro Supervisor: Professor António Lucas Soares
1 End-User Programming to Support Classroom Activities on Small Devices Craig Prince University of Washington VL/HCC 2008.
Chapter 7 System Models.
OvidSP Flexible. Innovative. Precise. Introducing OvidSP Resources.
Objectives: Generate and describe sequences. Vocabulary:
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
The CODS Protégé Server. Goals 3 Collaborative Ontology Development Approaches Browse with limited Edit Version Control (analogous to cvs, svn) But should.
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
BioPortal Status and Plans September 2011 Ray Fergerson NCBO Project Director Stanford University 1.
…to Ontology Repositories Mathieu dAquin Knowledge Media Institute, The Open University From…
ASTM Member Website Tools Jeff Adkins Diane Trinsey 1 September 2012 Officers Training Workshop.
September 2013 ASTM Officers Training Workshop September 2013 ASTM Officers Training Workshop ASTM Member Website Tools September 2013 ASTM Officers Training.
1 Web Search Environments Web Crawling Metadata using RDF and Dublin Core Dave Beckett Slides:
David Burdett May 11, 2004 Package Binding for WS CDL.
We need a common denominator to add these fractions.
Presented to: By: Date: Federal Aviation Administration Registry/Repository in a SOA Environment SOA Brown Bag #5 SWIM Team March 9, 2011.
1 State Wildlife Action Plans Wiki: Business Transformation Tutorial Brand Niemann July 5, 2008
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Create an Application Title 1Y - Youth Chapter 5.
Determine Eligibility Chapter 4. Determine Eligibility 4-2 Objectives Search for Customer on database Enter application signed date and eligibility determination.
CALENDAR.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Jone Garmendia, Head of Cataloguing 25 November 2011 The National Archives Taxonomy.
Copyright 2006 Digital Enterprise Research Institute. All rights reserved. MarcOnt Initiative Tools for collaborative ontology development.
Searching for Drug Information In PubMed PubMed for Trainers Spring 2014 U.S. National Library of Medicine (NLM) and NLM Training Center.
1 Click here to End Presentation Software: Installation and Updates Internet Download CD release NACIS Updates.
The 5S numbers game..
Knowledge Extraction from Technical Documents Knowledge Extraction from Technical Documents *With first class-support for Feature Modeling Rehan Rauf,
NCBO-I2B2 Collaboration Overview and Use Cases Nigam Shah
Welcome. © 2008 ADP, Inc. 2 Overview A Look at the Web Site Question and Answer Session Agenda.
Break Time Remaining 10:00.
PP Test Review Sections 6-1 to 6-6
© 2011 TIBCO Software Inc. All Rights Reserved. Confidential and Proprietary. Towards a Model-Based Characterization of Data and Services Integration Paul.
PubMed Searching: Automatic Term Mapping (ATM) PubMed for Trainers, Spring 2014 U.S. National Library of Medicine (NLM) and NLM Training Center.
EIS Bridge Tool and Staging Tables September 1, 2009 Instructor: Way Poteat Slide: 1.
VOORBLAD.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
Adding Up In Chunks.
FAFSA on the Web Preview Presentation December 2013.
SLP – Endless Possibilities What can SLP do for your school? Everything you need to know about SLP – past, present and future.
MaK_Full ahead loaded 1 Alarm Page Directory (F11)
Sets Sets © 2005 Richard A. Medeiros next Patterns.
Artificial Intelligence
Before Between After.
25 seconds left…...
: 3 00.
We will resume in: 25 Minutes.
ANSC644 Bioinformatics-Database Mining 1 ANSC644 Bioinformatics §Carl J. Schmidt §051 Townsend Hall §
Clock will move after 1 minute
PSSA Preparation.
Immunobiology: The Immune System in Health & Disease Sixth Edition
Chapter 13 The Data Warehouse
Physics for Scientists & Engineers, 3rd Edition
Select a time to count down from the clock above
Copyright Tim Morris/St Stephen's School
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
Pronalaženje Skrivenog Znanja
IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
ECCR Overview/MLSCN. NIH Roadmap Series of initiatives designed to pursue major opportunities in biomedical research and gaps in current knowledge that.
Indiana University School of Indiana University ECCR Summary Infrastructure: Cheminformatics web service infrastructure made available as a community resource.
Semantic Wikis Expedition #52 Conor Shankey CEO July 18, 2006
Presentation transcript:

software servers search tool versioning programming XML RDF OWL data sets semantic web PubChem screening fluorescence small molecule biological assay novel chemical tools chemical probes high-thoughput screening (HTS) ChemBank PDSP pharmaceutical chemical biology cheminformatics biological pathways disease networks structural biology biomedical knowledge technologyend point ATP Luciferin Coupled activity viability Beta-Lactamase Induction binding based calcium redistribution caspase activity dehydrogenase activity cyclic AMP redistribution energy transfer enzyme reporter enzyme substrate based Fluorogenic substrate GFP induction standards controlled vocabulary indexing subject indexing schemes authorized terms taxonomies thesauri subject headings natural language library tags homographs synonyms polysemes concepts structure search knowledge specificity article meta-data information exchange classification nomenclature semantic domain properties annotation object classes individuals BioAssay Ontology (BAO) Stephan Schürer, PhD ICBO, Buffalo, July

 One of the most important approaches to find novel entry points for drug discovery programs  Historically in pharmaceutical companies  Since ~2005, massive NIH effort (MLI) to make HTS accessible to public sector research  PubChem is the major repository of HTS data  More recently: EU-OpenScreen project Background for BioAssay Ontology High-throughput screening 2

 Lack of standardized assay annotations  No standardized endpoint names or formats  Data is rarely re-used(!)  Common queries cannot be asked  Analysis across different data sets is difficult  Integration with other databases is difficult  No knowledge model for assays and screening results Motivation for BioAssay Ontology Large public screening data sets PubChem, ChEMBL, PDSP, ChemBank, Binding DB 3

Identify inhibitors of kinases in biochemical assays. Identify compounds active in multiple luciferase reporter gene assays. Identify compounds active in cell viability assays and organize by cell lines and assay types. Identify active compounds in assays related to pathway X. … Queries the Ontology should be able to answer 4

5  Leverage the aggregated corpus of publically available HTS data to infer molecular mechanism of actions (MMOA) of small molecule perturbagens in biological model systems. Schürer et al. “BioAssay Ontology Annotations Facilitate Cross-Analysis of Diverse High-throughput Screening Data Sets” J Biomol Screen 2011 (16),

 BAOSearch Software (beta):  Query, explore, download BAO-annotated PubChem content  Some semantic search capabilities  Project Website and Wiki with relevant materials and documentation: BAO Products and Resources 6

 Application / user focus vs. “universal” ontologies  Efficiency vs. “realism” of representations  Rapid application development  Orthogonal ontologies vs. Ontology mapping  Universal “realism” vs. domain or application-specific  Chemical bond: 2D structure graph, 3D rule based, molecular mechanics, semi-empirical, up-initio QM  Disease  Virtual world Questions / Discussion points 7

 Collaborative ontology development  Collaborative vs. individual effort  Control over development and focus / application focus  Rapid application development  Quality  Aligning BAO to upper level ontology (BFO)  Benefits vs. required resources  Do upper level ontologies matter for specialized applications? Questions / Discussion points 8

 Aligning BAO with OBI  Some level of overlap  OBI: process-oriented (model the investigation)  BAO: purpose of categorization and analysis of HTS data  BAO model becomes more complex if based on OBI  How do we do it practically  Define missing assays to OBI and MIREOT back?  Quick term templates (QTT)?  Define our relations as short-cut relationships (using RO)? Questions / Discussion points 9

Additional slides 10

BAO-facilitated Example for Analysis (Luciferase Assays) Details in: Schürer et al. “BioAssay Ontology Annotations Facilitate Cross-Analysis of Diverse High-throughput Screening Data Sets” J Biomol Screen 2011 (16),

Panel Assay Single Conc Other Conc-response Assay Count Most promiscuous reporter gene compounds

Reporter DR Reporter SC Viability DR Viability SC Enz Activ DR Enz Activ SC ATP DR ATP SC Luciferin DR Luciferin SC Promiscuity Index Compounds Luciferase Enzyme Inhibitors Generally cytotoxic

Examples: Cytotoxic Series Cluster Reporter PCIdx: 0.56 Cluster Reporter Active: 58 Cluster Viability PCIdx: 0.64 Cluster Viability Active 27 Cluster Reporter PCIdx: 0.48 Cluster Reporter Active: 23 Cluster Viability PCIdx: 0.45 Cluster Viability Active 10 Cluster Reporter PCIdx: 0.41 Cluster Reporter Active: 29 Cluster Viability PCIdx: 0.57 Cluster Viability Active 13 Daunorubicin Emetine

Examples: Luciferase Inhibitor Series Cluster Size: 6 Cluster Reporter PCIdx: 0.61 Cluster Reporter Active: 101 Cluster EnzActivity PCIdx: 0.58 Cluster EnzActivity: 15 Cluster Size: 4 Cluster Reporter PCIdx: 0.38 Cluster Reporter Active: 52 Cluster EnzActivity PCIdx: 0.61 Cluster EnzActivity: 11 Cluster Size: 5 Cluster Reporter PCIdx: 0.46 Cluster Reporter Active: 77 Cluster EnzActivity PCIdx: 0.58 Cluster EnzActivity: 14 Schürer et al. “BioAssay Ontology Annotations Facilitate Cross-Analysis of Diverse High-throughput Screening Data Sets” J Biomol Screen 2011 (16),

1)Development of the Bioassay Ontology 2)Annotation of assays and assay results (content curation) 3)Development of software tools BAO Project: Three major components 16

BAO design to describe assays

Application of BAO: BAO Search Software 18

19

20 BAO: Concept Search

21 Biochemical Assays with IC50 < 1  M

22

23 Chemical structure search

 BioAssay Ontology (NCBO bioportal and project site):  Terminology / annotations for biochemical assays: > Assay Annotation Template  Over 1000 BAO-annotated assays from PubChem (available in BAOSearch) BAO Products and Resources 24

Chris Mader Amar Koleti Nakul Datar Sreeharsha Venkatapuram Felimon Gayanilo Mark Southern Saminda Abeyruwan Uma Vempati Magdalena Przydzial Kunie Sakurai Robin Smith Yuanyuan Jia Caty Chung Ubbo Visser Vance Lemmon Mitsunori Ogihara Nick Tsinoremas Acknowledgements 25