Automated Design of ligands with targeted polypharmacology Jérémy Besnard PhD University of Dundee ELRIG Drug Discovery '13 Manchester 3rd September 2013.

Slides:



Advertisements
Similar presentations
SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
Advertisements

1 Real World Chemistry Virtual discovery for the real world Joe Mernagh 19 May 2005.
The Drug Discovery Process
Analysis of High-Throughput Screening Data C371 Fall 2004.
Case Study: Dopamine D 3 Receptor Anthagonists Chapter 3 – Molecular Modeling 1.
PhysChem Forum, 29 Nov 2006, Newhouse 1 Memories and the future: From experimental to in silico physical chemistry Han van de Waterbeemd AstraZeneca, DMPK.
Department Author Lead-like Properties, High- throughput Screening and Combinatorial Library Design Andy Davis, Simon Teague, Tudor Oprea, John Steele,
Jürgen Sühnel Institute of Molecular Biotechnology, Jena Centre for Bioinformatics Jena / Germany Supplementary Material:
Lipinski’s rule of five
Establishing a Successful Virtual Screening Process Stephen Pickett Roche Discovery Welwyn.
Integrating Bayesian Networks and Simpson’s Paradox in Data Mining Alex Freitas University of Kent Ken McGarry University of Sunderland.
S TRUCTURAL B IOINFORMATICS. A subset of Bioinformatics concerned with the of biological structures - proteins, DNA, RNA, ligands etc. It is the first.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
Support Vector Machines Kernel Machines
BL5203: Molecular Recognition & Interaction Lecture 5: Drug Design Methods Ligand-Protein Docking (Part I) Prof. Chen Yu Zong Tel:
Doug Brutlag 2011 Genomics, Bioinformatics & Medicine Drug Development
Conformetrix A new dimension in drug discovery Conformetrix © All rights reserved. Conformetrix Ltd Background technology and its application to.
Computational Techniques in Support of Drug Discovery October 2, 2002 Jeffrey Wolbach, Ph. D.
Combinatorial Chemistry and Library Design
Knowledgebase Creation & Systems Biology: A new prospect in discovery informatics S.Shriram, Siri Technologies (Cytogenomics), Bangalore S.Shriram, Siri.
Asia’s Largest Global Software & Services Company Genomes to Drugs: A Bioinformatics Perspective Sharmila Mande Bioinformatics Division Advanced Technology.
Topological Summaries: Using Graphs for Chemical Searching and Mining Graphs are a flexible & unifying model Scalable similarity searches through novel.
A genetic algorithm for structure based de-novo design Scott C.-H. Pegg, Jose J. Haresco & Irwin D. Kuntz February 21, 2006.
Introduction to Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
EXPLORING CHEMICAL SPACE FOR DRUG DISCOVERY Daniel Svozil Laboratory of Informatics and Chemistry.
CS 790 – Bioinformatics Introduction and overview.
Designing a Tri-Peptide based HIV-1 protease inhibitor Presented by, Sushil Kumar Singh IBAB,Bangalore Submitted to Dr. Indira Ghosh AstraZeneca India.
Faculté de Chimie, ULP, Strasbourg, FRANCE
Drug Discovery Process Massimiliano Beltramo, PhD.
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
Page 1 SCAI Dr. Marc Zimmermann Department of Bioinformatics Fraunhofer Institute for Algorithms and Scientific Computing (SCAI) Grid-enabled drug discovery.
Ligand-based drug discovery No a priori knowledge of the receptor What information can we get from a few active compounds.
Combinatorial Chemistry. Synthesis of many structures (diversity) combinatorial technology, combinatorial library molecular diversity What is Combinatorial.
BioPaths-Catalyze Drug Discovery, Development and Clinical Research
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
DuPont Pharmaceuticals Company Utilization of Spotfire™ in Quality Assessment & Analysis of Screening Data Thomas D.Y. Chung, Ph.D. Sr. Director, Leads.
BREED: Generating Novel Inhibitors through Hybridization of Known Ligands (A. C. Pierce, G. Rao, and G. W. Bemis) Richard S. L. Stein CS 379a February.
Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013.
Design of a Compound Screening Collection Gavin Harper Cheminformatics, Stevenage.
Identification of structurally diverse Growth Hormone Secretagogue (GHS) agonists by virtual screening and structure-activity relationship analysis of.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
A model collaboration using the Pool Susanne Hollinger, Ph.D., J.D. Chief Intellectual Property Officer Emory University.
Shows tendency for mergers. These big companies may be shrinking – much research is now outsourced to low cost countries like Latvia, India, China and.
Improving compound–protein interaction prediction by building up highly credible negative samples Toward more realistic drug-target interaction predictions.
Docking and Virtual Screening Using the BMI cluster
신기술 접목에 의한 신약개발의 발전전망과 전략 LGCI 생명과학 기술원. Confidential LGCI Life Science R&D 새 시대 – Post Genomic Era Genome count ‘The genomes of various species including.
Pharmaceutical Approaches to Antiviral Drug Discovery
Molecular Modeling in Drug Discovery: an Overview
TIDEA Target (and Lead) Independent Drug Enhancement Algorithm.
HP-SEE In the search of the HDAC-1 inhibitors. The preliminary results of ligand based virtual screening Ilija N. Cvijetić, Ivan O. Juranić,
Page 1 Computer-aided Drug Design —Profacgen. Page 2 The most fundamental goal in the drug design process is to determine whether a given compound will.
Lipinski’s rule of five
Drug Discovery &Development
Can Drug Discovery Research be Done At An Undergraduate Institution?
Rule Induction for Classification Using
APPLICATIONS OF BIOINFORMATICS IN DRUG DISCOVERY
Biotechnology Objectives: At the end of this lecture we will be able to identify and describe the uses of biotechnology in society.
Ligand-Based Structural Hypotheses for Virtual Screening
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
Virtual Screening.
Volume 19, Issue 1, Pages (January 2012)
Insight into the Pharmaceutical Industry
Drug Design and Drug Discovery
Machine Learning / AI in Drug Discovery
ORGANIC PHARMACEUTICAL CHEMISTRY IV
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Considerations for Successful Biopharmaceutical Product Development: Discovery to Proof of Concept -A Panel Discussion Stanley C. McDermott, PharmD, MS,
Presentation transcript:

Automated Design of ligands with targeted polypharmacology Jérémy Besnard PhD University of Dundee ELRIG Drug Discovery '13 Manchester 3rd September 2013 Medicinal Informatics

Background Increasing cost of R&D High failure rate for compounds in Phase II and III Phase II failures: 2008–2010. The 108 failures are divided according to reason for failure when reported (87 drugs). The total success rate is 18 % between 2008 and 2009 (Arrowsmith, Nature Reviews Drug Discovery 2011)

Possible solution Improve efficacy and safety by better understanding polypharmacological profile of a compound Two proteins are deemed interacting in chemical space (joined by an edge) if both bind one or more compound. Paolini et al. Nature Biotechnology, 2006

Designing Ligands Challenging to test one compound versus multiple targets: costs, which panel to use, more complicated SAR, increasing difficulty of multiobjective optimisation Computational methods can provide – Design ideas – Prediction of activities – When possible ADME predictions

Design ideas – De Novo Drug Design Compound structures are generated by an algorithm Predefined rules to create/modify structures User defined filters – Molecular property space (MW, LogP) – Primary activities to improve – Side activities to avoid

Can we design against a polypharmacological profile? Drug Design is a multi-dimensional optimisation problem Polypharmacology profile design increases the number of dimensions but not the type of the problem – Multiple biological activities – ADMET properties – Drug-like properties Automating drug design is the strategy we have taken to deal with the design decision complexity of multi- target optimisation

Drug Optimisation Road Decisions: – Exploration – Improvement Guides – Structure – Previous SAR – Med Chem Knowledge Biologically active chemical space Synthesised Compounds Decision to synthesize Lead Clinical Candidate

Algorithm Compounds Generate Virtual compounds Background knowledge Med Chem design rules Machine Learning Predict properties Phys-Chem Activities (primary and anti target) Novelty Define Objectives Results expand knowledge-base Patent WO A2 Multi- objective prioritization Final Population Synthesis optimal molecules Test in bio-assays Assess molecules Top cpds + Random set X run Analyse

Background knowledge ChEMBL – 30 years of publications Total ~ 3M endpoints Total 40,000 papers Total ~ 660,000 cpds

Algorithm Compounds Generate Virtual compounds Background knowledge Med Chem design rules Machine Learning Predict properties Phys-Chem Activities (primary and anti target) Novelty Define Objectives Results expand knowledge-base Patent WO A2 Multi- objective prioritization Final Population Synthesis optimal molecules Test in bio-assays Assess molecules Top cpds + Random set X run Analyse

Transformations Try to find new transformations Set of ~700 – Tactics to design analogs – Not synthetic reactions – Derived from literature Semi-automatic

Algorithm Compounds Generate Virtual compounds Background knowledge Med Chem design rules Machine Learning Predict properties Phys-Chem Activities (primary and anti target) Novelty Define Objectives Results expand knowledge-base Patent WO A2 Multi- objective prioritization Final Population Synthesis optimal molecules Test in bio-assays Assess molecules Top cpds + Random set X run Analyse

Model Categorical model – Active if activity < 10μM Use 2D structural information

Bayesian Good feature: 23 times in training set, 15 times in active molecule: Weight = 2.46 “A molecule” Bad feature: 360 times in training set, Never in active molecule: Weight = Moderate good feature: 389 times in training set, 7 times in active molecule: Weight = 0.10 Moderate bad feature: 4 times in training set, Never in active molecule: Weight = Score= = 0.59 High score means high confidence of activity. Low (negative) score means high confidence of inactivity Score ~ 0: either cancellation of good and bad, or unknown W. Van Hoorn, Scitegic User Group Meeting, Feb 2006, La Jolla

Algorithm Compounds Generate Virtual compounds Background knowledge Med Chem design rules Machine Learning Predict properties Phys-Chem Activities (primary and anti target) Novelty Define Objectives Results expand knowledge-base Patent WO A2 Multi- objective prioritization Final Population Synthesis optimal molecules Test in bio-assays Assess molecules Top cpds + Random set X run Analyse

Prioritization Objectives – Activity – CNS score or QED – Anti Target Example – Receptor 1 and 2 activity – Good CNS score – No α1 (a, b and d) activity – -> n dimensions Achievement Objective Objective 1 Objective 2 QED: see Bickerton et al., Quantifying the chemical beauty of drugs. Nature Chemistry, 4(February 2012)

Algorithm Compounds Generate Virtual compounds Background knowledge Med Chem design rules Machine Learning Predict properties Phys-Chem Activities (primary and anti target) Novelty Define Objectives Results expand knowledge-base Patent WO A2 Multi- objective prioritization Final Population Synthesis optimal molecules Test in bio-assays Assess molecules Top cpds + Random set X run Analyse

Experimental Validation Does it actually work? Evolution of a drug (SOSA) – Look at possible side activity of drugs Donepezil: acetylcholinesterase inhibitor used for Alzheimer disease Potential activity for dopamine D4 receptor Confirmed experimentally at 600nM: design ligands with Donepezil as a hit to improve D4 activity Dopamine D2 receptor studied (lower prediction, not active) Wermuth, C. G. Selective optimization of side activities: the SOSA approach. Drug discovery today, 11(3-4), 2006

What are Dopamine D2 and D4 receptors? Belong to the GPCR family Mainly present in the CNS Involved in cognition, memory, learning… Targets for several neuropsychiatric disorders like Parkinson’s disease, Schizophrenia, Attention-deficit hyperactivity disorder, Bipolar disorder… Data (4,400 activities for D2 and 1,500 for D4) and screening facilities available

Two studies Two receptors as objectives – D2: will lead to work on selectivity toward multiple receptors – D4: will lead to work on selectivity and novelty

D2 as objective 1 st series of compounds with high D2 prediction

Results CNS penetration for compound 3: brain/blood ratio = 0.5

Next objectives: reduce anti- target activity Polypharmacology primary activity – Combination profile of multiple GPCRs: 5HT1a, D2, D3, D4 Selectivity over alpha 1 anti-targets – Alpha 1a, 1b and 1d – Inhibitors induce vasodilatation Novelty: remove known scaffolds Good phys-chem properties: need to cross blood-brain- barrier Multiple calculations and look at the results for synthetically attractive compounds

Optimisation results for 5-HT1A/D2/D3/D4/CNS/α1 selectivity/CNS objectives Highest ranked compound

Path

Results

Selectivity Need to include selectivity in the algorithm: – Alpha adrenoreceptor 1 inhibitors versus other targets Ratio Ki D2 receptor / Ki α Receptor

D4 objective Improve D4 activity Good ADME score Bayesian = 25 D4 Ki=614nM Bayesian = 105 D4 Ki=9nM

Screening data Ki Binding Assays (nM) Bayesian Model Predictions

Experimental Data Compound 13 is selective for D4 receptor with pKi = 8 It crosses the BBB (Ratio of 7.5) In vivo experiments on with comparison to D4-KO mouse showed effects that the compound acts on target

Morpholino series However Cpd 13 is commercial and thus not novel New objective: starting from 13, keep activity, filter non-novel chemotype, D4 selectivity over other targets, CNS penetrant Example of top ranked compound

Morpholino series 24 analogues were synthesised around 2 scaffolds

Matrix of results

Lead Series Criteria Met Ki<100nM Highly novel chemotype at level of carbon framework Chemotype is D4 selective CNS penetrable Patent filing (WO )

Further characterization Functional data – Compounds are antagonist or inverse agonist hERG (K ion channel): inhibition can cause sudden death – 27s: EC50 = 3μM Blood-Brain-Barrier – 27s: in vivo brain/blood ratio of 2.0 Stability – Compound itself: oxidation possible indoline > indole – Metabolic stability: high clearance > need improvement (Cl i, = 25 mL/min/g) Compound 27s can be classified as a lead for D4 selective inverse agonist. From the series, there is also a potential of dual 5HT1A/D4 ligands

How to improve the algorithm Better model: better prediction can help reducing false positives and detect potential other activities Different methods – Predictions: other machine learning, 2D/3D similarity (USR-USRCAT), docking – Idea generator: real synthetic reactions, group replacements (MMPs) More knowledge on the method itself – Where it works – When to stop Hussain. Computationally Efficient Algorithm to Identify Matched Molecular Pairs ( MMPs ) in Large Data Sets, J.Chem.Inf.Model., 4, 2010 Ballester. Ultrafast shape recognition to search compound databases for similar molecular shapes. Journal of computational chemistry, 28(10), 2007 Schreyer. USRCAT: real-time ultrafast shape recognition with pharmacophoric constraints. Journal of Cheminformatics, 4(27), 2012

Conclusion We have designed an algorithm to generate and predict compounds against polypharmacological profile The algorithm can adapt to the situation: improve activity, selectivity, novelty We have shown proof of concept that we can automatically invent patentable compounds Results were experimentally validated and it generated a lead compound – this study has been published (Besnard al,. Nature, 492(7428), 2012)

The technology has been licensed to Ex Scientia Ltd (spin off - ) Ex Scientia in its first year has had further successes applying the algorithm to the design of various other gene families including ion channels, GPCRs and enzymes (“stay tuned”)

Acknowledgments Pr. Andrew Hopkins Richard Bickerton ALH group Pr. Ian Gilbert Gian Filippo Ruda Karen Abecassis Kevin Read and DMPK group Barton group Brenk group Pr. Bryan Roth (UNC-CH - NIH) Vincent Setola Roth lab Pr. William Wetsel (Duke University Medical School) Wetsel group CLS IT support (Jon) Accelrys support