Dispensing Processes Impact Computational and Statistical Analyses Sean Ekins 1, Joe Olechno 2 Antony J. Williams 3 1 Collaborations in Chemistry, Fuquay.

Slides:



Advertisements
Similar presentations
Microarray statistical validation and functional annotation
Advertisements

Shape and Color Clustering with SAESAR Norah E. MacCuish, John D. MacCuish, and Mitch Chapman Mesa Analytics & Computing, Inc.
Discovery Studio AtlasStore: Protein/Ligand Database Steve Potts, Ph.D., MBA Product Manager Biological Informatics
Analysis of High-Throughput Screening Data C371 Fall 2004.
Christopher Reynolds Supervisor: Prof. Michael Sternberg Bioinformatics Department Division of Molecular Biosciences Imperial College London.
Data Collection Techniques. Biblical Reference But where can wisdom be found? Where does understanding dwell? Job 28:12.
1 Sequential Screening S. Stanley Young NISS HTS Workshop October 25, 2002.
PhysChem Forum, 29 Nov 2006, Newhouse 1 Memories and the future: From experimental to in silico physical chemistry Han van de Waterbeemd AstraZeneca, DMPK.
Proprietary and Confidential © AstraZeneca 2008 FOR INTERNAL USE ONLY Opportunities to bridge the technology gap between academia and industry. The view.
Progress In Computational Toxicology Sean Ekins 1,2 1 Collaborations in Chemistry, 5616 Hilltop Needmore Road, Fuquay Varina, NC27526, USA, NC. 2 Collaborative.
3D Molecular Structures C371 Fall Morgan Algorithm (Leach & Gillet, p. 8)
In silico small molecule discovery Sales Target gene Discover hit Hit to lead Optimise lead Clinical Target gene identified with a viable assay High throughput.
Why Are We Still Doing Industrial Age Drug Discovery For Neglected Diseases in The Information Age? Sean Ekins Collaborations In Chemistry, Fuquay Varina,
Jürgen Sühnel Institute of Molecular Biotechnology, Jena Centre for Bioinformatics Jena / Germany Supplementary Material:
Lipinski’s rule of five
Royal Society of Chemistry developments to support open drug discovery Antony Williams, Ken Karapetyan, Valery Tkachenko, Colin Batchelor Alexey Pshenichnov.
Why you need this App Sean Ekins 1, Alex M. Clark 2 1 Collaborations in Chemistry, 5616 Hilltop Needmore Road, Fuquay Varina, NC 27526, U.S.A. 2 Molecular.
1 PharmID: A New Algorithm for Pharmacophore Identification Stan Young Jun Feng and Ashish Sanil NISSMPDM 3 June 2005.
Establishing a Successful Virtual Screening Process Stephen Pickett Roche Discovery Welwyn.
Cheminformatics II Apr 2010 Postgrad course on Comp Chem Noel M. O’Boyle.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
Analysis of Drug-Gene Interaction Data Florian Ganglberger Sebastian Nijman Lab.
Chemoinformatics in Drug Design
Introduction The goal of translational bioinformatics is to enable the transformation of increasingly voluminous genomic and biological data into diagnostics.
A novel interactive tool for multidimensional biological data analysis Zhaowen Luo, Xuliang Jiang Serono Research Institute, Inc.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
1 The Discovery Informatics Framework Pat Rougeau President and CEO MDL Information Systems, Inc. Delivering the Integration Promise American Chemical.
Topological Summaries: Using Graphs for Chemical Searching and Mining Graphs are a flexible & unifying model Scalable similarity searches through novel.
Abstract Recent repurposing project tendering calls by the National Center for Advancing Translational Sciences (US) and the Medical Research Council (UK)
Introduction In order for us to learn from the extensive prior literature we have collated information on molecules screened versus Mycobacterium tuberculosis.
Introduction to Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Designing a Tri-Peptide based HIV-1 protease inhibitor Presented by, Sushil Kumar Singh IBAB,Bangalore Submitted to Dr. Indira Ghosh AstraZeneca India.
Dispensing Processes Profoundly Impact Biological, Computational and Statistical Analyses Sean Ekins 1, Joe Olechno 2 Antony J. Williams 3 1 Collaborations.
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
Leading scientists towards openness Antony J Williams 1 and Sean Ekins 2 1 Royal Society of Chemistry, Wake Forest, NC 2 Collaborations In Chemistry, Fuquay.
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
Construction of cancer pathways for personalized medicine | Presented By Date Construction of cancer pathways for personalized medicine Predictive, Preventive.
Ligand-based drug discovery No a priori knowledge of the receptor What information can we get from a few active compounds.
The Benefits of Participation in the Social Web of Science Antony Williams Research Square October 30 th 2014.
QSAR Study of HIV Protease Inhibitors Using Neural Network and Genetic Algorithm Akmal Aulia, 1 Sunil Kumar, 2 Rajni Garg, * 3 A. Srinivas Reddy, 4 1 Computational.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
Protein Sequence Analysis - Overview - NIH Proteomics Workshop 2007 Raja Mazumder Scientific Coordinator, PIR Research Assistant Professor, Department.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
DuPont Pharmaceuticals Company Utilization of Spotfire™ in Quality Assessment & Analysis of Screening Data Thomas D.Y. Chung, Ph.D. Sr. Director, Leads.
Computer-aided drug discovery (CADD)/design methods have played a major role in the development of therapeutically important small molecules for several.
PharmaMiner: Geometric Mining of Pharmacophores 1.
Data Mining Consultant GlaxoSmithKline: US Pharma IT
Catalyst TM What is Catalyst TM ? Structural databases Designing structural databases Generating conformational models Building multi-conformer databases.
Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013.
Challenge Problem: Link Mining Lise Getoor University of Maryland, College Park.
Design of a Compound Screening Collection Gavin Harper Cheminformatics, Stevenage.
Novel Chalcone Derivatives: Synthesis, Evaluation of Anti-prostate Cancer Activities Through Cathepsin B Inhibition And In-Silico Studies. Dalia Hussein.
Ensemble Approaches Yield New Scaffolds and New Binding Sites Heather A. Carlson Department of Medicinal Chemistry College of Pharmacy University of Michigan.
The Promise and the Reality of Turning Chemical Literature into Information Tom Doman Chicago ACS – Wiggins Symposium 3/25/2007.
MRC Technology an independent life science medical research charity Biotech & Money 2016 Mr Andrew Mercieca, Director of Finance
Discovery of Therapeutics to Improve Quality of Life Ram Samudrala University of Washington.
G Global Health Design Webinar #5 Open Source Malaria Next Steps Tue 24 th May 8am East Coast US, Brazil, 1pm UK, 2pm CET, SAST, 10pm Sydney This webinar.
Molecular Modeling in Drug Discovery: an Overview
HP-SEE In the search of the HDAC-1 inhibitors. The preliminary results of ligand based virtual screening Ilija N. Cvijetić, Ivan O. Juranić,
Introduction to PubChem BioAssay
TDM in the Life Sciences Application to Drug Repositioning *
Lipinski’s rule of five
Next Generation Preprint Service
Virtual Screening.
Probing the Probes: Fitness Factors For Small Molecule Tools
Structure-based drug design: progress, results and challenges
Rational for the 5R Philosophy
Probing the Probes: Fitness Factors For Small Molecule Tools
Volume 17, Issue 2, Pages (February 2010)
Patrick: An Introduction to Medicinal Chemistry 6e
Presentation transcript:

Dispensing Processes Impact Computational and Statistical Analyses Sean Ekins 1, Joe Olechno 2 Antony J. Williams 3 1 Collaborations in Chemistry, Fuquay Varina, NC. 2 Labcyte Inc, Sunnyvale, CA. 3 Royal Society of Chemistry, Wake Forest, NC. Disclaimer: SE and AJW have no affiliation with Labcyte and have not been engaged as consultants

“If I have seen further than others, it is by standing upon the shoulders of giants.” Isaac Newton Where do scientists get chemistry/ biology data?  Databases  Patents  Papers  Your own lab  Collaborators  Some or all of the above?  What is common to all? – quality issues

From data hoarding to open data (IMDB) Me Linked Open data cloud 2011 (Wikipedia) Pharma company data hoarding - to open data

Simple Rules for licensing “open” data Williams, Wilbanks and Ekins. PLoS Comput Biol 8(9): e , : NIH and other international scientific funding bodies should mandate …open accessibility for all data generated by publicly funded research immediately Could data ‘open accessibility’ equal ‘Disruption’ Ekins, Waller, Bradley, Clark and Williams. DDT In press, 2013 As we see a future of increased database integration the licensing of the data may be a hurdle that hampers progress and usability.

Data can be found – but …..drug structure quality is important  More groups doing in silico repositioning  Target-based or ligand-based  Network and systems biology  integrating or using sets of FDA drugs..if the structures are incorrect predictions will be too..  Need a definitive set of FDA approved drugs with correct structures  Also linkage between in vitro data & clinical data

Southan et al., DDT, 18: (2013) Finding structures of Pharma molecules is hard NCATS and MRC made molecule identifiers from pharmas available with no structures

Structure Quality Issues NPC Browser Database released and within days 100’s of errors found in structures Williams and Ekins, DDT, 16: (2011) Science Translational Medicine 2011

Substructure # of Hits # of Correct Hits No stereochemistry Incomplete Stereochemistry Complete but incorrect stereochemistry Gonane Gon-4-ene Gon-1,4-diene Data Errors in the NPC Browser: Analysis of Steroids Williams, Ekins and Tkachenko, DDT 17: (2012)

DDT editorial Dec This editorial led to the current work Its not just structure quality we need to worry about

How do you move a liquid? Images courtesy of Bing, Tecan McDonald et al., Science 2008, 322, 917. Belaiche et al., Clin Chem 2009, 55, Plastic leaching

Moving liquids with sound Images courtesy of Labcyte Inc. Tipless transfer No cross-contamination, carryover or leachates Accurate, precise - assay miniaturization

Spicer et al., In Drug Discovery Technology: Boston, Data appears randomly scattered. 24% had IC 50 values > 3 fold weaker using tip- based dispensing. 8% produced no value using tip-based dispensing. No analysis of molecule properties. Tip-based Serial Aqueous dilution vs. Acoustic dispensing comparison

Wingfield et al., American Drug Discovery 2008, 3, ~ point IC 50 values. Compounds more active when using acoustic dispensing. Correlation in data is poor with many compounds showing >10 fold shift in potency depending on dispensing method. No analysis of molecule properties. Tip-based Serial Aqueous dilution vs. Acoustic dispensing comparison

Wingfield, J. Drug Discovery 2012: Manchester, UK, 2012 Inhibition of tyrosine kinases at 10 µM for ~10,000 compounds. False +ve from acoustic transfer (as measured by subsequent IC 50 analyses) = 19% of hits. False +ve from tip-based transfers = 55% of all hits. 60 more compounds were identified as active with acoustic transfer. No analysis of molecule properties. Tip-based Serial Aqueous dilution vs. Acoustic dispensing comparison

Using literature data from different dispensing methods to generate computational models Few molecule structures and corresponding datasets are public Using data from 2 AstraZeneca patents – Tyrosine kinase EphB4 pharmacophores (Accelrys Discovery Studio) were developed using data for 14 compounds IC 50 determined using different dispensing methods Analyzed correlation with simple descriptors (SAS JMP) Calculated LogP correlation with log IC 50 data for acoustic dispensing (r 2 = 0.34, p < 0.05, N = 14) Barlaam, B. C.; Ducray, R., WO 2009/ A1, 2009 Barlaam, B. C.; Ducray, R.; Kettle, J. G., US 7,718,653 B2, 2010

Barlaam, B. C.; Ducray, R., WO 2009/ A1, 2009 Barlaam, B. C.; Ducray, R.; Kettle, J. G., US 7,718,653 B2, 2010 Examples of IC 50 values produced via acoustic transfer with direct dilution vs those generated with Tip-based dispensing Acoustically-derived IC 50 values were 1.5 to fold lower than for Tip-based dispensing

A graph of the log IC 50 values for tip-based serial dilution and dispensing versus acoustic dispensing with direct dilution shows a poor correlation between techniques (R 2 = 0.246). acoustic technique always gave a more potent IC 50 value

Hydrophobic features (HPF) Hydrogen bond acceptor (HBA) Hydrogen bond donor (HBD) Observed vs. predicted IC 50 r Acoustic mediated process Tip-based process Ekins, Olechno and Williams, Submitted 2012 AcousticTip based Tyrosine kinase EphB4 Pharmacophores Cyan = hydrophobic Green = hydrogen bond acceptor Purple = hydrogen bond donor Each model shows most potent molecule mapping

An additional 12 compounds from AstraZeneca Barlaam, B. C.; Ducray, R., WO 2008/ A1, of these compounds had data for tip based dispensing and 2 for acoustic dispensing Calculated LogP and logD showed low but statistically significant correlations with tip based dispensing (r 2 = 0.39 p < 0.05 and 0.24 p < 0.05, N = 36) Used as a test set for pharmacophores The two compounds analyzed with acoustic liquid handling were predicted in the top 3 using the ‘acoustic’ pharmacophore The ‘Tip-based’ pharmacophore failed to rank the retrieved compounds correctly Test set evaluation of pharmacophores

Automated receptor-ligand pharmacophore generation method Pharmacophores for the tyrosine kinase EphB4 generated from crystal structures in the protein data bank PDB using Discovery Studio version Cyan = hydrophobic Green = hydrogen bond acceptor Purple = hydrogen bond donor Grey = excluded volumes Each model shows most potent molecule mapping Bioorg Med Chem Lett 2010, 20, Bioorg Med Chem Lett 2008, 18, Bioorg Med Chem Lett 2008, 18, Bioorg Med Chem Lett 2011, 21,

In the absence of structural data, pharmacophores and other computational and statistical models are used to guide medicinal chemistry in early drug discovery. Our findings suggest acoustic dispensing methods could improve HTS results and avoid the development of misleading computational models and statistical relationships. Automated pharmacophores are closer to pharmacophore generated with acoustic data – all have hydrophobic features – missing from Tip- based pharmacophore model Importance of hydrophobicity seen with logP correlation and crystal structure interactions Public databases should annotate this meta-data alongside biological data points, to create larger datasets for comparing different computational methods. Summary

Strengths and Weaknesses Small dataset size – focused on one compound series No previous publication describing how data quality can be impacted by dispensing and how this in turn affects computational models and downstream decision making. No comparison of pharmacophores generated from acoustic dispensing and tip-based dispensing. No previous comparison of pharmacophores generated from in vitro data with pharmacophores automatically generated from X-ray crystal conformations of inhibitors. Severely limited by number of structures in public domain with data in both systems Reluctance of many to accept that this could be an issue

The stuff of nightmares?  How much of the data in databases is generated by tip based serial dilution methods  How much is erroneous  Do we have to start again?  How does it affect all subsequent science – data mining etc  Does it impact Pharmas productivity?