DATA MINING FOR SMALL MOLECULE ALLOSTERIC INHIBITORS

Slides:



Advertisements
Similar presentations
Design of high-content and focused libraries to improve the development of new active compounds in the framework of the rational drug discovery Design.
Advertisements

Analysis of High-Throughput Screening Data C371 Fall 2004.
Improving enrichment rates A practical solution to an impractical problem Noel O’Boyle Cambridge Crystallographic Data Centre
Jürgen Sühnel Institute of Molecular Biotechnology, Jena Centre for Bioinformatics Jena / Germany Supplementary Material:
Why multiple scoring functions can improve docking performance - Testing hypotheses for rescoring success Noel M. O’Boyle, John W. Liebeschuetz and Jason.
Computational Drug Design Apr 2010 Postgrad course on Comp Chem Noel M. O’Boyle.
Molecular dynamics refinement and rescoring in WISDOM virtual screenings Gianluca Degliesposti University of Modena and Reggio Emilia Molecular Modelling.
1 PharmID: A New Algorithm for Pharmacophore Identification Stan Young Jun Feng and Ashish Sanil NISSMPDM 3 June 2005.
Establishing a Successful Virtual Screening Process Stephen Pickett Roche Discovery Welwyn.
FLEX* - REVIEW.
In-silico screening without structural comparisons: Peptides to non-peptides in one step Maybridge Workshop Oct ‘03 Bregenz Austria.
An Integrated Approach to Protein-Protein Docking
BL5203: Molecular Recognition & Interaction Lecture 5: Drug Design Methods Ligand-Protein Docking (Part I) Prof. Chen Yu Zong Tel:
Pharmacophore-based Molecular Docking Bert E. Thomas, Diane Joseph- McCarthy, Juan C.Avarez.
Design of Small Molecule Drugs Targeted to RNA RNA Ontology Group May
Molecular Docking Using GOLD Tommi Suvitaival Seppo Virtanen S Basics for Biosystems of the Cell Fall 2006.
Chemoinformatics in Molecular Docking and Drug Discovery.
1111 Discovery Novel Allosteric Fragment Inhibitors of HIV-1 Reverse Transcriptase for HIV Prevention A/Prof Gilda Tachedjian Retroviral Biology and Antivirals.
Chemoinformatics in Drug Design
Protein Structure and Drug Discovery Workshop To be held at Monash University, Mebourne, Australia October 3 rd to 4 th 2006 Molecular Visualization Learn.
Bioinformatics Ayesha M. Khan Spring Phylogenetic software PHYLIP l 2.
eHiTS Score Darryl Reid, Zsolt Zsoldos, Bashir S. Sadjad, Aniko Simon, The next stage in scoring function evolution: a new statistically.
Computational Techniques in Support of Drug Discovery October 2, 2002 Jeffrey Wolbach, Ph. D.
Molecular Descriptors
Topological Summaries: Using Graphs for Chemical Searching and Mining Graphs are a flexible & unifying model Scalable similarity searches through novel.
Introduction to Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Confab – Systematic generation of diverse low-energy conformers Noel M O’Boyle, 1 Tim Vandermeersch, 2 and Geoffrey R Hutchison. 3 1 Analytical and Biological.
High-Throughput Virtual Molecular Docking: Hadoop Implementation of AutoDock4 on a Private Cloud Sally R. Ellingson Graduate Research Assistant Center.
Faculté de Chimie, ULP, Strasbourg, FRANCE
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
Page 1 SCAI Dr. Marc Zimmermann Department of Bioinformatics Fraunhofer Institute for Algorithms and Scientific Computing (SCAI) Grid-enabled drug discovery.
Ligand-based drug discovery No a priori knowledge of the receptor What information can we get from a few active compounds.
SimBioSys Inc.© Slide #1 Enrichment and cross-validation studies of the eHiTS high throughput screening software package.
Altman et al. JACS 2008, Presented By Swati Jain.
1 © Patrick An Introduction to Medicinal Chemistry 3/e Chapter 10 DRUG DESIGN: OPTIMIZING TARGET INTERACTIONS Part 2: Section 10.2.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
Hierarchical Database Screenings for HIV-1 Reverse Transcriptase Using a Pharmacophore Model, Rigid Docking, Solvation Docking, and MM-PB/SA Junmei Wang,
R L R L L L R R L L R R L L water DOCKING SIMULATIONS.
Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013.
Surflex: Fully Automatic Flexible Molecular Docking Using a Molecular Similarity-Based Search Engine Ajay N. Jain UCSF Cancer Research Institute and Comprehensive.
Computational Approach for Combinatorial Library Design Journal club-1 Sushil Kumar Singh IBAB, Bangalore.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Elon Yariv Graduate student in Prof. Nir Ben-Tal’s lab Department of Biochemistry and Molecular Biology, Tel Aviv University.
Docking and Virtual Screening Using the BMI cluster
Molecular Modeling in Drug Discovery: an Overview
TIDEA Target (and Lead) Independent Drug Enhancement Algorithm.
HP-SEE In the search of the HDAC-1 inhibitors. The preliminary results of ligand based virtual screening Ilija N. Cvijetić, Ivan O. Juranić,
Natural products from plants
Structure-based inhibitor design and validation: Application to Plasmodium falciparum glutathione S-transferase Marli Botha MSc. Bioinformatics.
Structural Bioinformatics Elodie Laine Master BIM-BMC Semester 3, Genomics of Microorganisms, UMR 7238, CNRS-UPMC e-documents:
Page 1 Computer-aided Drug Design —Profacgen. Page 2 The most fundamental goal in the drug design process is to determine whether a given compound will.
Lipinski’s rule of five
Theoretical investigating DNA binding properties in Rad51D as a way to find missing facts in Homologous Recombination Repair mechanism Lecturer: Mohammad.
Computational Docking Experiments to Find a Ligand that Will Bind to Xanthine Oxidase Lysengkeng Her and Thao Yang Department of Chemistry, University.
Pharmaco-phore search
APPLICATIONS OF BIOINFORMATICS IN DRUG DISCOVERY
Ligand-Based Structural Hypotheses for Virtual Screening
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
Virtual Screening.
“Structure Based Drug Design for Antidiabetics”
Ligand Docking to MHC Class I Molecules
An Integrated Approach to Protein-Protein Docking
Flexible alignment in 3D & applications
William J. Zuercher, Jonathan M. Elkins, Stefan Knapp 
Cheminformatics Basics
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Julia Salas Case Study, CS379a
Presentation transcript:

DATA MINING FOR SMALL MOLECULE ALLOSTERIC INHIBITORS Douglas R. Houston, University of Edinburgh

Introduction to Virtual Screening Virtual High-Throughput Screening Attempts to simulate the assay plate well in silico Models the interaction between small molecule and protein receptor Pros - Fast and cheap (compared to HTS) Cons - Absolute requirement for certain data

Requirements for vHTS DATA SOFTWARE HARDWARE WORKFLOW Receptor and/or ligand structure(s) - both is best If no ligand is present in the structure then active site must be located Chemical database - which compounds? SOFTWARE Massive choice, but almost everything available has been independently benchmarked HARDWARE Linux cluster has become most cost-effective WORKFLOW Most difficult to determine optimal choice

CODASS COmbining Docking And Similarity Searching Utilises both major classes of in silico ligand discovery: Structure-based methods Docking Ligand-based methods Topology Pharmacophoric Descriptor RECEPTOR

Requirements for vHTS DATA Receptor and/or ligand structure(s) - both is best If no ligand is present in the structure then active site must be located Chemical database - which compounds?

CODASS data Receptor structure Ligand structure Ligand binding conformation Compound database

EDULISS Compound Database Estimated to be 1060 “drug-like” molecules possible Total compounds synthesised to date estimated to be ~50m Combining the databases of several compound suppliers results in a total of ~5m compounds Are so many compounds necessary? Only a small proportion of chemical space has properties that: Allow them to bind to biological targets Are likely to be orally bioavailable Are consistent with good pharmacokinetics Will not give rise to toxicity

Enriching EDULISS Rule of n Lipinski rule of 5 Oprea rule of 4 Astex rule of 3 All filter according to the same physicochemical properties, e.g. H-bond donors (NH, OH) H-bond acceptors (Heteroatoms) cLogP MW

Enriching EDULISS Selected filters: Oprea “lead-like” H-bond acceptors < 9 H-bond donors < 6 MW 200 – 480 cLogP -4 – 4.2 cLogS > -5 Rotatable bonds < 11 1,000,000 “virtual compounds” can be screened in ~10 days with current hardware 5,000,000 1,000,000

Requirements for vHTS Hardware Software Linux cluster has become most cost-effective Software Massive choice, but almost everything available has been independently benchmarked

CODASS Hardware and Software 114-core Linux cluster Software Docking/scoring LIDAEUS Autodock Vina Scoring X-Score DSX Pharmacophoric searching UFSRAT ROCS Topological searching Wiener index

Simulating protein-ligand interaction Two separate problems: Docking Can the docking program find the correct ligand binding pose? Scoring Can the program’s scoring algorithm judge correctly which ligands bind more tightly?

Can docking software predict ligand binding conformation? PDBbind database contains 3,214 crystal structures of proteins in complex with ligands with known affinities 228 high-quality structures from this form the “core set” PDBbind core set 65 protein targets each in complex with a high- medium- and low-affinity ligand An additional 33 protein targets each in complex with one ligand

Can docking software predict ligand binding conformation? Results: In 122 cases, Autodock’s prediction of the binding pose was correct (RMSD < 2.0 Å) - 54% In141 cases, Vina’s prediction of the binding pose was correct (RMSD < 2.0 A) - 62% In those cases where RMSD < 2 Å between the Autodock and Vina prediction, both predictions were correct in 82% of cases Conclusion: Only those compounds where the binding pose predictions of Autodock and Vina agree should be considered “Posematch” can be introduced into vHTS workflow Structure of RNA-dependent RNA polymerase in complex with a thiophene-based non-nucleoside inhibitor (PDB ID: 2D3U) Structure of O-GlcNAcase in complex with the inhibitor NButGT (PDB ID: 2VVS)

Can docking/scoring software predict ligand binding affinity? Vina and Autodock both attempt to predict affinity X-Score and Drugscore are standalone scoring algorithms How well do the predictions made by these programs correlate with measured affinities? Correct docking poses were ranked by the various algorithms Predicted rankings were compared to known rank Spearman’s rank correlation coefficient

Can docking/scoring software predict ligand binding affinity? = 1 Perfectly correlated = 0 Perfectly uncorrelated Results: Autodock: 0.57 Vina: 0.66 X-Score: 0.71 DSX: 0.64 a e b d c = -1 Perfectly anticorrelated

Can docking/scoring software predict ligand binding affinity? Autodock: 0.57 X-Score: 0.74

Can docking/scoring software predict ligand binding affinity? Results Autodock: 0.57 Vina: 0.66 X-Score: 0.74 DSX: 0.64 Comparative assessment of scoring functions on a diverse test set. Cheng T, et. al. Chem Inf Model. 2009 Apr;49(4):1079-93.

Can docking/scoring software predict ligand binding affinity? Results Autodock: 0.57 Vina: 0.66 X-Score: 0.74 DSX: 0.64 Comparative assessment of scoring functions on a diverse test set. Cheng T, et. al. Chem Inf Model. 2009 Apr;49(4):1079-93.

Requirements for vHTS Workflow Most difficult to determine optimal choice

Which workflow is optimal? Quality of results is dependent on: Which programs are selected The order in which the programs are used Program selection and sequence depends on the starting data available Receptor structure? Ligands? Ligand binding location? Ligand binding conformation?

Example 1 - Ligands known but no target structure Ligand-based methods only Generate conformers 5m CombiSearch Cluster EDULISS A battery of similarity search algorithms creates a database tailored to your target Searchable database of 5m commercially available 3D compounds Visual analysis Purchase & test

Example 2 - Target structure available but no ligands Structure-based methods only Parallelised rigid-body docking; screen millions of compounds in hours 25m Generate conformers LIDAEUS 0.5m Parallelised fast flexible docking Vina 5k Parallelised rigorous flexible docking models electrostatics 100k 100 5m Autodock CombiSearch Binding modes are compared - matches are 90% likely to be correct PoseMatch EDULISS A battery of topological and pharmacophoric search algorithms creates a database tailored to your target X-Score + DSX Docked poses are graded using best scoring algorithms currently available Searchable database of 5m commercially available 3D compounds Visual analysis Purchase & test

Pharmaco-phore search Example 3 - Target structure and ligands available Both ligand- and structure-based methods Parallelised rigid-body docking; screen millions of compounds in hours 25m Generate conformers LIDAEUS 0.5m Parallelised fast flexible docking Vina 5k 100 Parallelised rigorous flexible docking models electrostatics 100k 5m Autodock Pharmaco-phore search 100k Binding modes are compared - matches are 90% likely to be correct PoseMatch EDULISS X-Score + DSX Docked poses are graded using best scoring algorithms currently available Topological search Searchable database of 5m commercially available 3D compounds Visual analysis LIGAND STRUCTURES Purchase & test

Pharmaco-phore search Example 4 - Target structure available and ligand binding conformation known Both ligand- and structure-based methods Parallelised rigid-body docking; screen millions of compounds in hours 25m Generate conformers LIDAEUS 0.5m Parallelised fast flexible docking Vina 5k 100 Parallelised rigorous flexible docking models electrostatics 100k 5m Autodock Pharmaco-phore search 100k Binding modes are compared - matches are 82% likely to be correct PoseMatch EDULISS X-Score + DSX Docked poses are graded using best scoring algorithms currently available Pharmacophore search Searchable database of 5m commercially available 3D compounds Visual analysis LIGAND CONFORMATIONS Purchase & test

CODASS vHTS against PYK Human PYK is inactivated by T3 or phenylalanine Crystals structure shows that Phe binds in pocket distinct from known active or effector sites

CODASS vHTS against PYK

Pharmaco-phore search PYK “receptor” structure available, plus structure and binding conformation of Phe ligand Parallelised rigid-body docking; screen millions of compounds in hours 25m Generate conformers LIDAEUS 0.5m Parallelised fast flexible docking Vina 5k 100 Parallelised rigorous flexible docking models electrostatics 100k 5m Autodock Pharmaco-phore search 100k Binding modes are compared - matches are 90% likely to be correct PoseMatch EDULISS X-Score + DSX Docked poses are graded using best scoring algorithms currently available Pharmacophore search Searchable database of 5m commercially available 3D compounds Visual analysis Purchase & test

Pharmaco-phore search PYK “receptor” structure available, plus structure and binding conformation of Phe ligand Parallelised rigid-body docking; screen millions of compounds in hours 25m Generate conformers LIDAEUS 0.5m Parallelised fast flexible docking Vina 5k 100 Parallelised rigorous flexible docking models electrostatics 100k 5m Autodock Pharmaco-phore search 100k Binding modes are compared - matches are 90% likely to be correct PoseMatch EDULISS X-Score + DSX Docked poses are graded using best scoring algorithms currently available Pharmacophore search Searchable database of 5m commercially available 3D compounds Visual analysis 1000-2000 “virtual hits” Purchase & test

CODASS vHTS against PYK Creating a post-screening pharmacophoric filter

CODASS vHTS against PYK

CODASS vHTS against PYK Summary: 5 million commercially available compounds were docked into a known allosteric pocket of PYK Lists of predicted binders were collated by merging multiple affinity prediction methods 20-30 compounds will be acquired and tested in assay What hit rate can we expect?

CODASS hit rates Virtual screening success rates using in-house tools: Average hit rate: 49% Compare to 1% and 5% “industry standard” for HTS and vHTS Protein Targets Binding Pocket Tested Actives Hit Rate Cyclophilin A PPIase 14 6 42% ERCC1 and XPF Protein-Protein 27 16 59% XPF nuclease domain Nuclease 18 12 66% tRNA isoleucine lysidine synthetase (TILS) Enzyme 17 7 41% Mdm2 (Ring domain) and E2 19 5 26% 11β-hydroxysteroid dehydrogenase (11βHSD1) 20 35% Eukaryotic E3 ligase 138 91 Leishmania phsphofructokinase (PFK) 8 62 %

Conclusion CODASS can offer a cost-effective alternative to High- Throughput Screening for inhibitor discovery CODASS is applicable to any target enzyme for which the structure is known

Acknowledgements Dr. Hugh Morgan Dr. Steve Shave Dr. Paul Taylor Prof. Malcolm Walkinshaw