Presentation is loading. Please wait.

Presentation is loading. Please wait.

PubChem: An Open Repository for Chemical Structure and Biological Activity Information Steve Bryant The NIH Biowulf Cluster: 10 Years of Scientific Supercomputing.

Similar presentations


Presentation on theme: "PubChem: An Open Repository for Chemical Structure and Biological Activity Information Steve Bryant The NIH Biowulf Cluster: 10 Years of Scientific Supercomputing."— Presentation transcript:

1 PubChem: An Open Repository for Chemical Structure and Biological Activity Information Steve Bryant The NIH Biowulf Cluster: 10 Years of Scientific Supercomputing February 3, 2009

2 … NIH “Molecular Libraries” … Basic design / approach … Current discovery tools / example … Planned discover tools … New discovery tools ? PubChem Overview …

3 NIH Molecular Libraries Program … Molecular Libraries Screening Centers Network (MLSCN) Compound Repository (MLSMR) Instrumentation Chemical Diversity Assay Development Predictive ADMET Technology Development Screening Informatics Cheminformatics Research Centers

4 Molecular Libraries BioAssays … Investigator Customized Assay Screen Hit picking, confirmation, secondary screens Hit List Optimization Chemistry Compound Repository Assay Peer review

5 Molecular Libraries Components …

6 MLSCN Created … 2005

7 MLPCN Created … 2008

8 … NIH “Molecular Libraries” overview … Basic design / approach … Current discovery tools / example … Planned discover tools … New discovery tools ? PubChem Overview …

9 … “GenBank model” … direct depositions by investigators … highly automated (low database cost) … 25 year precedents in biology … less precedent in chemical biology PubChem Approach …

10 Growth In PubChem Contributing Organizations

11 … Contributed substance records … with chemical structure … chemical names and comments … links to contributor web sites … contributed links to other NCBI biomedical databases PubChem Contents …

12 Growth In PubChem Substances / Compounds

13 PubChem Standardization...

14

15 … Contributed bioassay records … with assay description / protocol … links to tested substances … summary and detailed test results … links to contributor web sites and other NCBI databases PubChem Contents …

16 Growth In PubChem BioAssays

17 Growth In PubChem Tested Substances

18 … NIH “Molecular Libraries” overview … Basic design / approach … Current discovery tools / example … Planned discover tools … New discovery tools ? PubChem Overview …

19 … Optimize “discoverability” for molecular biologists by integrating PubChem into NCBI’s Entrez / PubMed Search Engine … Chemical structure search … Bioassay result search … Structure-activity tools PubChem Retrieval System …

20 NCBI’s Entrez Search Engine...

21 Entrez Links and Neighbors... Protein Sequences Protein 3D Structure Activity Profile Similarity PubChem Small Molecules PubMed Literature Bioactivity Screens VAST Structure Similarity Term Frequency Statistics Chemical Structure Similarity 2,000,000 users... 60,000,000 hits... … per day Target Sequence Similarity

22 PubChem Users per Day

23 Search for “Shoichet inhibitors”...

24 PubMed Article Retrieved...

25 Link to PubChem Records...

26 “Kaempferol” in PubChem...

27 Similar Compounds in PubChem...

28 “Quercetin” in PubChem...

29 Compare Protein / Ligand Complexes...

30 Link to Another Structure...

31 Tyrosine Kinase Family Member...

32 Links from “Quercetin” to PubMed...

33 PubMed Records...

34 Links from Quercetin to BioAssays...

35 BioAssay records...

36 BioAssay where “Active”...

37

38

39

40 Entrez Links and Neighbors... Protein Sequences Protein 3D Structure Activity Profile Similarity PubChem Small Molecules PubMed Literature Bioactivity Screens VAST Structure Similarity Term Frequency Statistics Chemical Structure Similarity 2,000,000 users... 60,000,000 hits... … per day Target Sequence Similarity

41 … Optimize “discoverability” for molecular biologists by integrating PubChem into NCBI’s Entrez / PubMed Search Engine … Chemical structure search … Bioassay result search … Exploratory structure-activity tools PubChem Retrieval System …

42 Compounds Similar to Quercetin...

43 PubChem Bioactivity Analysis...

44

45 PubChem Structure-Activity...

46 Active Compound Cluster...

47 BioAsay Cluster...

48 Another BioAssay Cluster...

49 PubMed Connection...

50 PubChem Structure-Activity...

51 … NIH “Molecular Libraries” overview … Basic design / approach … Current discovery tools / example … Planned discover tools … New discovery tools ? PubChem Overview …

52 … Bottom-line “Summaries” of multi-step Molecular Libraries screens … “Chemical Reagent” links for gene and protein records when possible … Add 3D-conformer similarity to structure-activity analysis … Support multi-target “panel” screens Planned Discovery Tools …

53 “Quercetin” in PubChem...

54 “Quercetin” Similar Conformers...

55 … NIH “Molecular Libraries” overview … Basic design / approach … Current discovery tools / example … Planned discover tools … New discovery tools ? PubChem Overview …

56 Systems-biology “pathway” links among chemical biology screens / results … Links to bioactivity information derived from scientific literature, literature abstraction, and other sources … New Discovery Tools ?

57 “Quercetin” in PubChem...

58 “Quercetin” NLM Toxicology...

59 “Quercetin” NLM Toxicity...

60 http://pubchem.ncbi.nlm.nih.gov Evan Bolton Jie Chen Svetlana Dracheva Lewis Geer Lianyi Han Jane He Siqian He Karen Karapetian Vahan Simonyan Ben Shoemaker Wenyao Shi Tugba Suzek Paul Thiessen Valery Tkachenko Jiyao Wang Yanli Wang Jewen Xiao Jian Zhang


Download ppt "PubChem: An Open Repository for Chemical Structure and Biological Activity Information Steve Bryant The NIH Biowulf Cluster: 10 Years of Scientific Supercomputing."

Similar presentations


Ads by Google