Private to Public Domain Transfer

Slides:



Advertisements
Similar presentations
CONFIDENTIAL INFORMATION KnowTox, a Franco- Hungarian collaborative project relative to toxicity.
Advertisements

The Drug Discovery Process
Vendor Update SLA P&HT Division Meeting April 8, 2008.
T H O M S O N S C I E N T I F I C Ann Wescott April 8, 2008 Prous Science Update SLA DPHT Spring Meeting 2008.
Challenges in new drug discovery in South Asia
Services | Research | Training | Industry Small Molecules Resources at the EBI Dr. Louisa Bellis Chemical Content Curator, ChEMBL Group EMBL-EBI, UK Bioinformatics.
The Statisticians Role in Pharmaceutical Development
Integrative Organs Systems Scientists and Drug Discovery: The Link Between Big Pharma and Academia Glenn A. Reinhart, Ph.D. Senior Group Leader Integrative.
SGC OxfordSGC TorontoSGC Stockholm How Canada might develop its drug discovery sector.
Influencing Change in Research, Treatment Protocols, and New Drug Development.
Lipinski’s rule of five
The Role of Human Genetics in Drug Discovery
S TRUCTURAL B IOINFORMATICS. A subset of Bioinformatics concerned with the of biological structures - proteins, DNA, RNA, ligands etc. It is the first.
U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES NATIONAL INSTITUTES OF HEALTH Working with FDA: Biological Products and Clinical Development Critical Path.
Cancer Nanotechnology: New Opportunities for Targeted Therapies FDA Public Meeting October 10, 2006 Piotr Grodzinski, Ph.D. Director, Nanotechnology for.
Doug Brutlag 2011 Genomics, Bioinformatics & Medicine Drug Development
Career Opportunities for PharmDs in the Pharmaceutical Industry: Research & Development.
Discovery of new medicines through new models of collaboration Simon Ward Professor of Medicinal Chemistry & Director of Translational Drug Discovery Group.
Bioinformatics Ayesha M. Khan Spring Phylogenetic software PHYLIP l 2.
Howard Fillit, MD Executive Director Improving Animal Trials for Alzheimer’s Disease: Recommendations for Best Practices.
Stages of drug development
Drug discovery and development
Application of Toxicology Databases in Drug Development (Estimating potential toxicity) Joseph F. Contrera, Ph.D. Director, Regulatory Research and Analysis.
Improving the competitiveness of the pharmaceutical industry Dominique Limet Senior Vice-President and Area Director Southern and Eastern Europe.
Obtaining and Negotiatiing a Position in Industry/Biotech Roy G. Smith Ph.D. Director, Huffington Center on Aging Professor, Department of Molecular and.
U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Institute of Neurological Disorders and Stroke U.S. DEPARTMENT OF HEALTH.
Figure 4.1 NEW PRODUCT DEVELOPMENT PROCESS Finance Corporate strategy and portfolio decisions Regulatory affairs Marketing and sales + market research.
Knowledgebase Creation & Systems Biology: A new prospect in discovery informatics S.Shriram, Siri Technologies (Cytogenomics), Bangalore S.Shriram, Siri.
Asia’s Largest Global Software & Services Company Genomes to Drugs: A Bioinformatics Perspective Sharmila Mande Bioinformatics Division Advanced Technology.
1 Value Assessment of Development-Stage Assets Pharmaceutical Products, Medical Devices, and Related Intellectual Property Frank S. Castellana, M.D., Eng.Sc.D.
Stefan Franzén Introduction to clinical trials.
Chapter 13. The Impact of Genomics on Antimicrobial Drug Discovery and Toxicology CBBL - Young-sik Sohn-
1 Discovering new drugs in Africa Defeating Malaria Together Kelly Chibale PhD FRSSAf University of Cape Town.
Marine Drug Development and Delivery Prof. Dr. Basavaraj K. Nanjwade M. Pharm., Ph. D Department of Pharmaceutics KLE University College of Pharmacy BELGAUM ,
CS 790 – Bioinformatics Introduction and overview.
TOPICS IN (NANO) BIOTECHNOLOGY
Exploratory IND Studies
Computational Toxicology and Virtual Development in Drug Design
NICE Evidence Search workshop Revised reference slide set August
Drug Discovery Process Massimiliano Beltramo, PhD.
ChEMBL– Open Access Database For Drug Discovery By – Udghosh Singh M.S.(Pharm), 3 rd Sem Pharmacoinformatics.
Metals in Medicine Consortium Sydney Cancer Centre.
Anne Hersey ChEMBL Group, EMBL-EBI ChEMBL – A Database of Bioactive Drug-like Small Molecules.
The New Drug Development Process (www. fda. gov/cder/handbook/develop
The FDA: Basic Facts It takes 12 to 15 years to develop a single drug Only 1 in 10,000 potential medications makes it completely through the process Only.
Intellectual Property Rights and Pharmaceutical Industry
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Privacy Symposium / HIPAA Summit
Proteomics Session 1 Introduction. Some basic concepts in biology and biochemistry.
The pharmaceutical R&D process
PubChem: An Open Repository for Chemical Structure and Biological Activity Information Steve Bryant The NIH Biowulf Cluster: 10 Years of Scientific Supercomputing.
European Patients’ Academy on Therapeutic Innovation The key principles of pharmacology.
A model collaboration using the Pool Susanne Hollinger, Ph.D., J.D. Chief Intellectual Property Officer Emory University.
26th April 2016, BioTrinity - London
Shows tendency for mergers. These big companies may be shrinking – much research is now outsourced to low cost countries like Latvia, India, China and.
신기술 접목에 의한 신약개발의 발전전망과 전략 LGCI 생명과학 기술원. Confidential LGCI Life Science R&D 새 시대 – Post Genomic Era Genome count ‘The genomes of various species including.
Pharmaceutical Approaches to Antiviral Drug Discovery
Molecular Modeling in Drug Discovery: an Overview
Synthetic Chemistry for the Life Sciences Dial – a – Molecule Workshop, University of Liverpool Jan-12 Where / how can chemistry make greater contribution.
Cheminformatics and Metabolism Team The EBI Enzyme Portal.
Introduction to PubChem BioAssay
Intersecting different databases to define the inner and outer limits of the data-supported druggable proteome
Drug Discovery – Setting the Scene
APPLICATIONS OF BIOINFORMATICS IN DRUG DISCOVERY
An Introduction to Medicinal Chemistry 3/e
New Targets, New Modalities, New Challenges – The Inconvenient Path of Human Genetics in Drug Discovery Enabling Precision Medicine: The Role of Genetics.
Benjamin Wooden, Nicolas Goossens, Yujin Hoshida, Scott L. Friedman 
Drug Design and Drug Discovery
Rational for the 5R Philosophy
Considerations for Successful Biopharmaceutical Product Development: Discovery to Proof of Concept -A Panel Discussion Stanley C. McDermott, PharmD, MS,
Presentation transcript:

ChEMBL – Large-Scale Open Access Data for Drug Discovery John Overington EMBL-EBI jpo@ebi.ac.uk

Private to Public Domain Transfer Five year strategic award from Wellcome Trust Large-scale Drug Discovery Structure Activity Relationship (SAR) data Linking small molecule structures to ‘targets’ and pharmacological activities – Chemogenomics/Chemical Biology ‘Open Access’, ‘User Friendly’, ‘Translational’, ‘Free’ Multiple access mechanisms Full database download, web front-ends, web services Actively support ad hoc sabbaticals (academic and commercial) at EMBL-EBI

ChEMBL Research Strategy Comprehensively catalogue historical drug discovery Include successes and failures Drugs can be small molecules, recombinant proteins, siRNA, etc. Derive rules for drug discovery ‘success’ from these data Target selection and prioritisation Lead discovery, optimisation, candidate selection

Drug Discovery Process (simplified) Clinical Trials Target Discovery Lead Discovery Lead Optimisation Preclinical Development Phase 1 Phase 2 Phase 3 Launch Target identification Microarray profiling Target validation Assay development Biochemistry Clinical/Animal disease models Medicinal Chemistry Structure-based drug design Selectivity screens ADMET screens Cellular/Animal disease models Pharmacokinetics High-throughput Screening (HTS) Fragment-based screening Focused libraries Screening collection Toxicology In vivo safety pharmacology Formulation Dose prediction Safety & Efficacy PK tolerability Indication Discovery & expansion Efficacy Discovery Development Use Med. Chem. SAR Clinical Candidates Drugs >450,000 distinct compounds ~25,000 distinct lead series ~1,300 drugs ~12,000 candidates

ChEMBL: Launched Drugs Database of all approved drugs Chemistry and sequence ‘aware’ Contents Small molecules and biological therapeutics USANs, INNs, research codes, other synonyms Pharmaceutical properties, prodrugs, dosage, form, etc PK data and metabolites, black box warnings, etc. 1,378 chemically distinct ‘drugs’, 324 distinct molecular targets Controlled vocabulary indications dictionary and hierarchy

Synthetic small molecule New Drugs 2006-2009 Enzyme mAb Peptide Other Protein Synthetic small molecule Natural Product

ChEMBL: Launched Drugs Nat. Rev. Drug Disc., 5, pp. 993-996 (2006)

ChEMBL: Drug Dosage nmol mmol mmol ~150-200mmol Binned log10 mole dose 80 nmol mmol mmol 70 60 50 40 30 Metformin, Hydroxyurea 20 Steroids, thyroids 10 -8.4 -8.08 -7.76 -7.44 -7.12 -6.8 -6.48 -6.16 -5.84 -5.52 -5.2 -4.88 -4.56 -4.24 -3.92 -3.6 -3.28 -2.96 -2.64 -2.32 Binned log10 mole dose

Affinity Of Drugs For Their Targets Retrieved Ki, Kd, IC50, EC50, pA2, … endpoints for drugs against their ‘efficacy targets’ 400 350 300 250 Frequency 200 150 100 50 2 3 4 5 6 7 8 9 10 11 12 -log10 affinity 10mM 1mM 100mM 10mM 1mM 100nM 10nM 1nM 100pM 10pM 1pM

Function for Drug Efficacy/Affinity Empirical function that estimates the probability of in vivo activity for a compound with acceptable PK characteristics as a function of target affinity 1.0 0.8 mM mM nM pM 0.6 P(efficacy) 0.4 0.2 0.0 2 4 6 8 10 12 -log10 Affinity

ChEMBL: Clinical Candidates Database of clinical development candidates Contains ~10,000 2-D structures Estimated size ~35-45,000 compounds Work in progress Deeper coverage of key gene families e.g. Protein kinases, 184 distinct clinical candidates VEGFR 90 80 70 PDGFR 60 50 p38a 40 30 20 C-Kit 10 Aurora ErbB CDK Launched III II I Clinical candidates by target Kinase clinical candidates by highest phase

Industry Productivity File Registration number vs USAN date 800000 700000 600000 500000 400000 300000 200000 100000 1960 1965 1970 1975 1980 1985 1990 1995 2000 2005 2010

Industry Productivity 70 64 USANs/100,000 compounds 60 16 Drugs/100,000 compounds 50 40 30 1.9 USANs/100,000 compounds 20 0.4 Drugs/100,000 compounds 10 1- 100,000 100,001- 200,000 200,001- 300,000 300,001- 400,000 400,001- 500,000 500,001- 600,000 600,001- 700,000 700,001, 800,000 File registration number range USAN assignment typically at entry to phase 3

ChEMBL: SAR data StARLITe Bioactivity Bioactive compounds Link through to validated synthetic routes and assay protocols Bidirectionally linking compounds to/from targets Built from 12 primary journals J.Med.Chem. Biorg.Med.Chem., PNAS, JBC, Bioorg.Med.Chem.Letts., Eur.J.Med.Chem., DMD, Xenobioitica, Nature, Science, AACR, J.Nat.Prod. StARlite 1 – June 2001 StARlite 31 – August 2008 StARLITe Bioactivity Compound Target Ki=4.5 nM >Thrombin (Homo sapiens) MAHVRGLQLPGCLALAALCSLVHSQHVFLAPQQARSLLQRVRRANTFLEEVRKGNLERECVEETCSYEEAFEALESSTATDVFWAKYTACETARTPRDKLAACLEGNCAEGLGTNYRGHVNITRSGIECQLWRSRYPHKPEINSTTHPGADLQENFCRNPDSSTTGPWCYTTDPTVRRQECSIPVCGQDQVTVAMTPRSEGSSVNLSPPLEQCVPDRGQQYQGRLAVTTHGLPCLAWASAQAKALSKHQDFNSAVQLVENFCRNPDGDEEGVWCYVAGKPGDFGYCDLNYCEEAVEEETGDGLDEDSDRAIEGRTATSEYQTFFNPRTFGSGEADCGLRPLFEKKSLEDKTERELLESYIDGRIVEGSDAEIGMSPWQVMLFRKSPQELLCGASLISDRWVLTAAHCLLYPPWDKNFTENDLLVRIGKHSRTRYERNIEKISMLEKIYIHPRYNWRENLDRDIALMKLKKPVAFSDYIHPVCLPDRETAASLLQAGYKGRVTGWGNLKETWTANVGKGQPSVLQVVNLPIVERPVCKDSTRIRITDNMFCAGYKPDEGKRGDACEGDSGGPFVMKSPFNNRWYQMGIVSWGEGCDRDGKYGFYTHVFRLKKWIQKVIDQFGE

Drug Optimisation Imidazole triazole Prototype 1st generation 2nd generation 3rd generation 4th generation Metronidazole 1962 Tinidazole 1970 Terconazole 1980 Posaconazole 2005 Itraconazole 1984 Clotrimazole 1970 Ketoconazole 1978 Azomycin (1956) Streptomyces natural product trichomonacidal ‘toxic’ Sulconazole 1980 Miconazole 1970 Voriconazole 2002 Fluconazole 1988 Econazole 1972 Bifonazole 1981 Fosfluconazole 2004 After W. Sneader

Counts refer to StARlite release 31 ChEMBL SAR Contents Abstracted from 26,299 papers from 12 journals Monthly update cycle - optimised curation pipeline Autocuration tools – clean up and index other large SAR datasets Updates and ongoing curation process all data, not simply new article data 521,237 compound records 440,055 distinct compound structures 5,439 targets 3,512 protein molecular targets ~2,200 orthologous targets (1,644 human) 1,936,969 million experimental bioactivities Counts refer to StARlite release 31

Interface and Searching

Interface and Searching

Interface and Searching

Interface and Searching

Interface and Searching

Interface and Searching

Interface and Searching

Rule-based Optimisation – Bioisosteres Identify data-driven ‘rational’ lead-optimisation strategies Useful in automated design e.g. Replacement of carboxylic acid Reflect synthetic ease and expectation for functional effect Search StARLITe for functional group Search for all ‘contexts’ where acid has been replaced Retrieve assay value StARLITe DIC50 tetrazole sulphonamide ester sulphonic acid Effect on affinity (-log10 IC50) Frequency (%) 4 2 6 -2 -4 -6 10 40 20 60 50 30

Typical Compound Collection - Novartis benzene pyridine piperidine piperazine cyclohexane pyrimidine indole imidazole naphthalene morpholine thiophene pyrazole pyrrolidine thiazole furan quinoline cyclopropane benzimidazole imidazoline pyrrole cyclopentane pyran quinazoline benzthiazole benzodioxole isoxazole purine tetrahydrofuran triazole benzofuran tetrahydroisoquinoline adamantane tetrazole triazine isoquinoline Ertl, Koch and Roggo, Novartis

Screening File Comparison - Novartis Depleted fragments 35 tetrazole Enriched fragments 30 purine tetrahydrofuran 25 20 Novartis rank 15 pyrrolidine pyrazole 10 morpholine 5 pyrimidine piperidine benzene StARLITe rank pyridine 5 10 15 20 25 30 35

Genome-Scale Druggability Assessment Nat. Rev. Drug. Disc., 8, pp. 900-907 (2008) Nature 460, 352-358 (2009) Now possible to rapidly map chemical intervention points onto genomic data In ‘real time’ as gene model is developed Develop therapeutic hypotheses for expert review/analysis/validation Reuse existing drugs/clinical candidates in new contexts Anticipate required optimisation (comparative modelling, etc)

Marks et al., Lancet, 367, pp. 668-678 (2006) Indication Discovery Marks et al., Lancet, 367, pp. 668-678 (2006) Map chemical biology/pharmacology data onto microarray datasets Rapid path to clinic and patient benefit Develop therapeutic hypotheses for expert review/analysis/validation Reuse existing drugs/clinical candidates in new contexts Marks et al., Lancet, 367, pp. 668-678 (2006)

The ChEMBL-og - www.chemblog.org