Presentation is loading. Please wait.

Presentation is loading. Please wait.

DATA MINING FOR SMALL MOLECULE ALLOSTERIC INHIBITORS

Similar presentations


Presentation on theme: "DATA MINING FOR SMALL MOLECULE ALLOSTERIC INHIBITORS"— Presentation transcript:

1 DATA MINING FOR SMALL MOLECULE ALLOSTERIC INHIBITORS
Douglas R. Houston, University of Edinburgh

2 Introduction to Virtual Screening
Virtual High-Throughput Screening Attempts to simulate the assay plate well in silico Models the interaction between small molecule and protein receptor Pros - Fast and cheap (compared to HTS) Cons - Absolute requirement for certain data

3 Requirements for vHTS DATA SOFTWARE HARDWARE WORKFLOW
Receptor and/or ligand structure(s) - both is best If no ligand is present in the structure then active site must be located Chemical database - which compounds? SOFTWARE Massive choice, but almost everything available has been independently benchmarked HARDWARE Linux cluster has become most cost-effective WORKFLOW Most difficult to determine optimal choice

4 CODASS COmbining Docking And Similarity Searching
Utilises both major classes of in silico ligand discovery: Structure-based methods Docking Ligand-based methods Topology Pharmacophoric Descriptor RECEPTOR

5 Requirements for vHTS DATA
Receptor and/or ligand structure(s) - both is best If no ligand is present in the structure then active site must be located Chemical database - which compounds?

6 CODASS data Receptor structure Ligand structure
Ligand binding conformation Compound database

7 EDULISS Compound Database
Estimated to be 1060 “drug-like” molecules possible Total compounds synthesised to date estimated to be ~50m Combining the databases of several compound suppliers results in a total of ~5m compounds Are so many compounds necessary? Only a small proportion of chemical space has properties that: Allow them to bind to biological targets Are likely to be orally bioavailable Are consistent with good pharmacokinetics Will not give rise to toxicity

8 Enriching EDULISS Rule of n
Lipinski rule of 5 Oprea rule of 4 Astex rule of 3 All filter according to the same physicochemical properties, e.g. H-bond donors (NH, OH) H-bond acceptors (Heteroatoms) cLogP MW

9 Enriching EDULISS Selected filters: Oprea “lead-like”
H-bond acceptors < 9 H-bond donors < 6 MW 200 – 480 cLogP -4 – 4.2 cLogS > -5 Rotatable bonds < 11 1,000,000 “virtual compounds” can be screened in ~10 days with current hardware 5,000,000 1,000,000

10 Requirements for vHTS Hardware Software
Linux cluster has become most cost-effective Software Massive choice, but almost everything available has been independently benchmarked

11 CODASS Hardware and Software
114-core Linux cluster Software Docking/scoring LIDAEUS Autodock Vina Scoring X-Score DSX Pharmacophoric searching UFSRAT ROCS Topological searching Wiener index

12 Simulating protein-ligand interaction
Two separate problems: Docking Can the docking program find the correct ligand binding pose? Scoring Can the program’s scoring algorithm judge correctly which ligands bind more tightly?

13 Can docking software predict ligand binding conformation?
PDBbind database contains 3,214 crystal structures of proteins in complex with ligands with known affinities 228 high-quality structures from this form the “core set” PDBbind core set 65 protein targets each in complex with a high- medium- and low-affinity ligand An additional 33 protein targets each in complex with one ligand

14 Can docking software predict ligand binding conformation?
Results: In 122 cases, Autodock’s prediction of the binding pose was correct (RMSD < 2.0 Å) - 54% In141 cases, Vina’s prediction of the binding pose was correct (RMSD < 2.0 A) - 62% In those cases where RMSD < 2 Å between the Autodock and Vina prediction, both predictions were correct in 82% of cases Conclusion: Only those compounds where the binding pose predictions of Autodock and Vina agree should be considered “Posematch” can be introduced into vHTS workflow Structure of RNA-dependent RNA polymerase in complex with a thiophene-based non-nucleoside inhibitor (PDB ID: 2D3U) Structure of O-GlcNAcase in complex with the inhibitor NButGT (PDB ID: 2VVS)

15 Can docking/scoring software predict ligand binding affinity?
Vina and Autodock both attempt to predict affinity X-Score and Drugscore are standalone scoring algorithms How well do the predictions made by these programs correlate with measured affinities? Correct docking poses were ranked by the various algorithms Predicted rankings were compared to known rank Spearman’s rank correlation coefficient

16 Can docking/scoring software predict ligand binding affinity?
= 1 Perfectly correlated = 0 Perfectly uncorrelated Results: Autodock: 0.57 Vina: 0.66 X-Score: 0.71 DSX: 0.64 a e b d c = -1 Perfectly anticorrelated

17 Can docking/scoring software predict ligand binding affinity?
Autodock: 0.57 X-Score: 0.74

18 Can docking/scoring software predict ligand binding affinity?
Results Autodock: 0.57 Vina: 0.66 X-Score: 0.74 DSX: 0.64 Comparative assessment of scoring functions on a diverse test set. Cheng T, et. al. Chem Inf Model Apr;49(4):

19 Can docking/scoring software predict ligand binding affinity?
Results Autodock: 0.57 Vina: 0.66 X-Score: 0.74 DSX: 0.64 Comparative assessment of scoring functions on a diverse test set. Cheng T, et. al. Chem Inf Model Apr;49(4):

20 Requirements for vHTS Workflow
Most difficult to determine optimal choice

21 Which workflow is optimal?
Quality of results is dependent on: Which programs are selected The order in which the programs are used Program selection and sequence depends on the starting data available Receptor structure? Ligands? Ligand binding location? Ligand binding conformation?

22 Example 1 - Ligands known but no target structure
Ligand-based methods only Generate conformers 5m CombiSearch Cluster EDULISS A battery of similarity search algorithms creates a database tailored to your target Searchable database of 5m commercially available 3D compounds Visual analysis Purchase & test

23 Example 2 - Target structure available but no ligands
Structure-based methods only Parallelised rigid-body docking; screen millions of compounds in hours 25m Generate conformers LIDAEUS 0.5m Parallelised fast flexible docking Vina 5k Parallelised rigorous flexible docking models electrostatics 100k 100 5m Autodock CombiSearch Binding modes are compared - matches are 90% likely to be correct PoseMatch EDULISS A battery of topological and pharmacophoric search algorithms creates a database tailored to your target X-Score + DSX Docked poses are graded using best scoring algorithms currently available Searchable database of 5m commercially available 3D compounds Visual analysis Purchase & test

24 Pharmaco-phore search
Example 3 - Target structure and ligands available Both ligand- and structure-based methods Parallelised rigid-body docking; screen millions of compounds in hours 25m Generate conformers LIDAEUS 0.5m Parallelised fast flexible docking Vina 5k 100 Parallelised rigorous flexible docking models electrostatics 100k 5m Autodock Pharmaco-phore search 100k Binding modes are compared - matches are 90% likely to be correct PoseMatch EDULISS X-Score + DSX Docked poses are graded using best scoring algorithms currently available Topological search Searchable database of 5m commercially available 3D compounds Visual analysis LIGAND STRUCTURES Purchase & test

25 Pharmaco-phore search
Example 4 - Target structure available and ligand binding conformation known Both ligand- and structure-based methods Parallelised rigid-body docking; screen millions of compounds in hours 25m Generate conformers LIDAEUS 0.5m Parallelised fast flexible docking Vina 5k 100 Parallelised rigorous flexible docking models electrostatics 100k 5m Autodock Pharmaco-phore search 100k Binding modes are compared - matches are 82% likely to be correct PoseMatch EDULISS X-Score + DSX Docked poses are graded using best scoring algorithms currently available Pharmacophore search Searchable database of 5m commercially available 3D compounds Visual analysis LIGAND CONFORMATIONS Purchase & test

26 CODASS vHTS against PYK
Human PYK is inactivated by T3 or phenylalanine Crystals structure shows that Phe binds in pocket distinct from known active or effector sites

27 CODASS vHTS against PYK

28 Pharmaco-phore search
PYK “receptor” structure available, plus structure and binding conformation of Phe ligand Parallelised rigid-body docking; screen millions of compounds in hours 25m Generate conformers LIDAEUS 0.5m Parallelised fast flexible docking Vina 5k 100 Parallelised rigorous flexible docking models electrostatics 100k 5m Autodock Pharmaco-phore search 100k Binding modes are compared - matches are 90% likely to be correct PoseMatch EDULISS X-Score + DSX Docked poses are graded using best scoring algorithms currently available Pharmacophore search Searchable database of 5m commercially available 3D compounds Visual analysis Purchase & test

29 Pharmaco-phore search
PYK “receptor” structure available, plus structure and binding conformation of Phe ligand Parallelised rigid-body docking; screen millions of compounds in hours 25m Generate conformers LIDAEUS 0.5m Parallelised fast flexible docking Vina 5k 100 Parallelised rigorous flexible docking models electrostatics 100k 5m Autodock Pharmaco-phore search 100k Binding modes are compared - matches are 90% likely to be correct PoseMatch EDULISS X-Score + DSX Docked poses are graded using best scoring algorithms currently available Pharmacophore search Searchable database of 5m commercially available 3D compounds Visual analysis “virtual hits” Purchase & test

30 CODASS vHTS against PYK
Creating a post-screening pharmacophoric filter

31 CODASS vHTS against PYK

32 CODASS vHTS against PYK
Summary: 5 million commercially available compounds were docked into a known allosteric pocket of PYK Lists of predicted binders were collated by merging multiple affinity prediction methods 20-30 compounds will be acquired and tested in assay What hit rate can we expect?

33 CODASS hit rates Virtual screening success rates using in-house tools:
Average hit rate: 49% Compare to 1% and 5% “industry standard” for HTS and vHTS Protein Targets Binding Pocket Tested Actives Hit Rate Cyclophilin A PPIase 14 6 42% ERCC1 and XPF Protein-Protein 27 16 59% XPF nuclease domain Nuclease 18 12 66% tRNA isoleucine lysidine synthetase (TILS) Enzyme 17 7 41% Mdm2 (Ring domain) and E2 19 5 26% 11β-hydroxysteroid dehydrogenase (11βHSD1) 20 35% Eukaryotic E3 ligase 138 91 Leishmania phsphofructokinase (PFK) 8 62 %

34 Conclusion CODASS can offer a cost-effective alternative to High- Throughput Screening for inhibitor discovery CODASS is applicable to any target enzyme for which the structure is known

35 Acknowledgements Dr. Hugh Morgan Dr. Steve Shave Dr. Paul Taylor
Prof. Malcolm Walkinshaw


Download ppt "DATA MINING FOR SMALL MOLECULE ALLOSTERIC INHIBITORS"

Similar presentations


Ads by Google