PubChem: An Open Repository for Chemical Structure and Biological Activity Information Steve Bryant The NIH Biowulf Cluster: 10 Years of Scientific Supercomputing.

Slides:



Advertisements
Similar presentations
Leveraging ChemAxon Cheminformatics in an Integrated Drug Discovery and Development Platform Zhenbin Li, Paul Starbard, Jim Gregory, Donald Chen, Paul.
Advertisements

PubMed Review Medical Library Association Annual Meeting May 20 – 22, 2007 Philadelphia.
NCBI/WHO PubMed/Hinari Course NCBI Literature Databases: PubMed Background.
Introduction to PubMed® (pubmed.gov)
PubMed Central ANCHASL Spring Meeting April 1, 2005 Robert James Associate Director of Public Services Duke University.
MICB 405 Bioinformatics Mini-Lab #1 – NCBI’s Entrez Dr. Joanne Fox We gratefully acknowledge the funding for the development of these.
Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
Genome databases and webtools for genome analysis Become familiar with microbial genome databases Use some of the tools useful for analyzing genome Visit.
1.
CDD – a conserved domain database Aron Marchler-Bauer NCBI, National Library of Medicine, NIH DIMACS Workshop on Protein Domains: Identification, Classification.
Strategies towards improving the utility of scientific big data Evan Bolton, PhD National Center for Biotechnology Information (NCBI) National Library.
NATIONAL LIBRARY OF MEDICINE The PubMed ID and Entrez, PubMed and PubMed Central Edwin Sequeira National Center for Biotechnology Information June 21,
PubMed for Trainers, Spring 2012 U.S. National Library of Medicine (NLM) and NLM Training Center LinkOut for Libraries.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
Chemical Genetics – Biol503
Sequence/Structure Alignment Resources from NCBI Steve Bryant Protein Data Bank Rutgers University November 19, 2005.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
OMICS Group Contact us at: OMICS Group International through its Open Access Initiative is committed to make genuine and.
Molecular Library and Imaging Francis Collins, NHGRI Tom Insel, NIMH Rod Pettigrew, NIBIB Building Blocks and Pathways Francis Collins,NHGRI Richard Hodes,
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
Introductory Overview
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
Moving forward our shared data agenda: a view from the publishing industry ICSTI, March 2012.
Bioinformatics.
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
X-ray crystallography NMR cryoEM Experimental approaches for structural biology.
Yike Guo/Jiancheng Lin InforSense Ltd. 15 September 2015 Bioinformatics workflow integration.
NIH Roadmap and Chemoinformatics Jeffery Loo NLM Associate Fellow Welch Library Journal Club 2004/12/7.
Evan Bolton, PhD Jian Zhang, PhD Gang Fu, PhD Jun. 15, 2015 U.S. National Center for Biotechnology Information (NCBI)
Copyright OpenHelix. No use or reproduction without express written consent1.
NCBI FieldGuide NCBI Molecular Biology Resources January 2008 Using Entrez.
Board on Research Data and Information, National Research Council “Changing Roles of Libraries in Support of Scientific Data Activities” June 3, 2010 More.
BioHealthBase: A Web-based Database and Analysis Resource for Francisella Shubhada Godbole 1, Jyothi Noronha 1, Burke Squires 1, Victoria Hunt 1, Ed Klem.
ChemModLab: A Web-based Cheminformatics Modeling Laboratory S. Stanley Young + ECCR and ChemSpider Teams.
8 October 2009Microbial Research Commons1 Toward a biomedical research commons: A view from NLM-NIH Jerry Sheehan Assistant Director for Policy Development.
Open source software and web services for designing therapeutic molecules G. P. S. Raghava, Head Bioinformatics Centre, Institute of Microbial Technology,
Copyright OpenHelix. No use or reproduction without express written consent1.
Development of an Information Service Program in Molecular Biology and Genetics Ansuman Chattopadhyay, PhD Information Specialist in Molecular Biology.
NCBI FieldGuide NCBI Molecular Biology Resources March 2007 Using Entrez.
NCBI Literature Databases: PubMed
ECCR Overview/MLSCN. NIH Roadmap Series of initiatives designed to pursue major opportunities in biomedical research and gaps in current knowledge that.
A collaborative tool for sequence annotation. Contact:
December 1, Classification Analysis of HIV RNase H Bioassay Lianyi Han Computational Biology Branch NCBI/NLM/NIH Rocky ‘07.
Copyright OpenHelix. No use or reproduction without express written consent1 1.
Use of Machine Learning in Chemoinformatics
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Basic Translational Clinical New Pathways to Discovery Harmonization Target ID & Valid. Phases I-III Research Teams of Future Translational Cores Clinical.
NCBI: something old, something new. What is NCBI? Create automated systems for knowledge about molecular biology, biochemistry, and genetics. Perform.
NCBI PubMed NCBI Literature Databases: PubMed Session #1, April 28, 2005 Session #2, April 29, 2005 Ho Chi Minh City, VietNam.
PubChem—Substance, Compound, BioAssay Part 1: Essentials Principles of May 24, 2007.
Molecular Modeling in Drug Discovery: an Overview
Indiana University School of Indiana University ECCR Summary Infrastructure: Cheminformatics web service infrastructure made available as a community resource.
PubChem Search Features Stephen Bryant Wolfram Data Summit Scientific and Technical Data Session September 9-10, 2010.
Keeping Current: Genetics Resources. This workshop will provide an overview of NCBI resources for finding-- Background information & journal articles.
PubChem BioAssay: Link chemical research to GenBank and beyond
Cheminformatics and Metabolism Team The EBI Enzyme Portal.
Introduction to PubChem BioAssay
Chemical Informatics and Cyberinfrastructure Collaboratory
Directly Upload Data From An ELN Into PubChem
Introduction to PubChem BioAssay
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
APPLICATIONS OF BIOINFORMATICS IN DRUG DISCOVERY
Mobilizing EPA’s CompTox Chemistry Dashboard Data on Mobile Devices
Lesson 3 Bioinformatics Laboratory
Consortium: National networks in 16 European countries.
Consortium: National networks in 16 European countries.
BioGRID: Biological General Repository for Interaction Datasets
SUBMITTED BY: DEEPTI SHARMA BIOLOGICAL DATABASE AND SEQUENCE ANALYSIS.
Presentation transcript:

PubChem: An Open Repository for Chemical Structure and Biological Activity Information Steve Bryant The NIH Biowulf Cluster: 10 Years of Scientific Supercomputing February 3, 2009

… NIH “Molecular Libraries” … Basic design / approach … Current discovery tools / example … Planned discover tools … New discovery tools ? PubChem Overview …

NIH Molecular Libraries Program … Molecular Libraries Screening Centers Network (MLSCN) Compound Repository (MLSMR) Instrumentation Chemical Diversity Assay Development Predictive ADMET Technology Development Screening Informatics Cheminformatics Research Centers

Molecular Libraries BioAssays … Investigator Customized Assay Screen Hit picking, confirmation, secondary screens Hit List Optimization Chemistry Compound Repository Assay Peer review

Molecular Libraries Components …

MLSCN Created … 2005

MLPCN Created … 2008

… NIH “Molecular Libraries” overview … Basic design / approach … Current discovery tools / example … Planned discover tools … New discovery tools ? PubChem Overview …

… “GenBank model” … direct depositions by investigators … highly automated (low database cost) … 25 year precedents in biology … less precedent in chemical biology PubChem Approach …

Growth In PubChem Contributing Organizations

… Contributed substance records … with chemical structure … chemical names and comments … links to contributor web sites … contributed links to other NCBI biomedical databases PubChem Contents …

Growth In PubChem Substances / Compounds

PubChem Standardization...

… Contributed bioassay records … with assay description / protocol … links to tested substances … summary and detailed test results … links to contributor web sites and other NCBI databases PubChem Contents …

Growth In PubChem BioAssays

Growth In PubChem Tested Substances

… NIH “Molecular Libraries” overview … Basic design / approach … Current discovery tools / example … Planned discover tools … New discovery tools ? PubChem Overview …

… Optimize “discoverability” for molecular biologists by integrating PubChem into NCBI’s Entrez / PubMed Search Engine … Chemical structure search … Bioassay result search … Structure-activity tools PubChem Retrieval System …

NCBI’s Entrez Search Engine...

Entrez Links and Neighbors... Protein Sequences Protein 3D Structure Activity Profile Similarity PubChem Small Molecules PubMed Literature Bioactivity Screens VAST Structure Similarity Term Frequency Statistics Chemical Structure Similarity 2,000,000 users... 60,000,000 hits... … per day Target Sequence Similarity

PubChem Users per Day

Search for “Shoichet inhibitors”...

PubMed Article Retrieved...

Link to PubChem Records...

“Kaempferol” in PubChem...

Similar Compounds in PubChem...

“Quercetin” in PubChem...

Compare Protein / Ligand Complexes...

Link to Another Structure...

Tyrosine Kinase Family Member...

Links from “Quercetin” to PubMed...

PubMed Records...

Links from Quercetin to BioAssays...

BioAssay records...

BioAssay where “Active”...

Entrez Links and Neighbors... Protein Sequences Protein 3D Structure Activity Profile Similarity PubChem Small Molecules PubMed Literature Bioactivity Screens VAST Structure Similarity Term Frequency Statistics Chemical Structure Similarity 2,000,000 users... 60,000,000 hits... … per day Target Sequence Similarity

… Optimize “discoverability” for molecular biologists by integrating PubChem into NCBI’s Entrez / PubMed Search Engine … Chemical structure search … Bioassay result search … Exploratory structure-activity tools PubChem Retrieval System …

Compounds Similar to Quercetin...

PubChem Bioactivity Analysis...

PubChem Structure-Activity...

Active Compound Cluster...

BioAsay Cluster...

Another BioAssay Cluster...

PubMed Connection...

PubChem Structure-Activity...

… NIH “Molecular Libraries” overview … Basic design / approach … Current discovery tools / example … Planned discover tools … New discovery tools ? PubChem Overview …

… Bottom-line “Summaries” of multi-step Molecular Libraries screens … “Chemical Reagent” links for gene and protein records when possible … Add 3D-conformer similarity to structure-activity analysis … Support multi-target “panel” screens Planned Discovery Tools …

“Quercetin” in PubChem...

“Quercetin” Similar Conformers...

… NIH “Molecular Libraries” overview … Basic design / approach … Current discovery tools / example … Planned discover tools … New discovery tools ? PubChem Overview …

Systems-biology “pathway” links among chemical biology screens / results … Links to bioactivity information derived from scientific literature, literature abstraction, and other sources … New Discovery Tools ?

“Quercetin” in PubChem...

“Quercetin” NLM Toxicology...

“Quercetin” NLM Toxicity...

Evan Bolton Jie Chen Svetlana Dracheva Lewis Geer Lianyi Han Jane He Siqian He Karen Karapetian Vahan Simonyan Ben Shoemaker Wenyao Shi Tugba Suzek Paul Thiessen Valery Tkachenko Jiyao Wang Yanli Wang Jewen Xiao Jian Zhang