1 Schedule 8:30-9:30 Introduction 9:40- 10:45 Analysis Methods 10:55-12:00 Design and Analysis 12:00 Lunch 1:00-2:05 Design and Analysis I (Will and Stan)

Slides:



Advertisements
Similar presentations
CONFIDENTIAL INFORMATION KnowTox, a Franco- Hungarian collaborative project relative to toxicity.
Advertisements

1 Real World Chemistry Virtual discovery for the real world Joe Mernagh 19 May 2005.
Analysis of High-Throughput Screening Data C371 Fall 2004.
1 Sequential Screening S. Stanley Young NISS HTS Workshop October 25, 2002.
1 PharmID: A New Algorithm for Pharmacophore Identification Stan Young Jun Feng and Ashish Sanil NISSMPDM 3 June 2005.
Faculty of Computer Science © 2006 CMPUT 605February 04, 2008 Novel Approaches for Small Bio-molecule Classification and Structural Similarity Search Karakoc.
Cheminformatics II Apr 2010 Postgrad course on Comp Chem Noel M. O’Boyle.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
A Study on Feature Selection for Toxicity Prediction*
ABSTRACT Modern drug discovery process involves mapping pharmaceutical knowledge about target proteins and ligands as well as sophisticated computer science.
Active Learning Strategies for Drug Screening 1. Introduction At the intersection of drug discovery and experimental design, active learning algorithms.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Application and Efficacy of Random Forest Method for QSAR Analysis
Large-scale computational design and selection of polymers for solar cells Dr Noel O’Boyle & Dr Geoffrey Hutchison ABCRF University College Cork Department.
Predicting Highly Connected Proteins in PIN using QSAR Art Cherkasov Apr 14, 2011 UBC / VGH THE UNIVERSITY OF BRITISH COLUMBIA.
1 Data mining of toxic chemicals & database-based toxicity prediction Jiansuo Wang & Luhua Lai Institute of Physical Chemistry, Peking University P. R.
GGAGATTCTGGGCCACTTTGGTTCCCCATGAGCCAAGACGGCACTTCTAATTTGCATTCCCTACCGGAGTCCCTGTCTGTAGCCAGCCTGGCTTTCAGCTGGTGCCCAAAGTGACAAATGTATCTGCAATGACAAAGGTAC CCTGGAAGGGCTCGCCCTCTGCGGAATTTCAGTTCATGCAGGCCTTGGTGCTTCCACATCTGTCCAAGGGCCTTTCAAATGTGACTTTTAACTCTGTGGATTGATTTGCCCGG
Computational Techniques in Support of Drug Discovery October 2, 2002 Jeffrey Wolbach, Ph. D.
Problems and Opportunities for Machine Learning in Drug Discovery (Can you find lessons for Systems Biology?) George S. Cowan, Ph.D. Computer Aided Drug.
Topological Summaries: Using Graphs for Chemical Searching and Mining Graphs are a flexible & unifying model Scalable similarity searches through novel.
A genetic algorithm for structure based de-novo design Scott C.-H. Pegg, Jose J. Haresco & Irwin D. Kuntz February 21, 2006.
Introduction to Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Validating New Techniques for HTS data analysis Alain Calvet, Kjell Johnson and George S. Cowan Pfizer Global Research and Development Ann Arbor Laboratories.
Qualitative and Quantitative Research Quantitative Deductive: transforms general theory into hypothesis suitable for testing Deductive: transforms general.
1 Innovative Science To Improve Public Health EDKB: Endocrine Disruptors Knowledge Base at the FDA Huixiao Hong, Ph.D. Center for Bioinformatics Division.
High Throughput Experimentation: Computational Requirements John M. Newsam Molecular Simulations Inc. (A Pharmacopeia subsidiary) “Workshop on Combinatorial.
Computational Toxicology and Virtual Development in Drug Design
Faculté de Chimie, ULP, Strasbourg, FRANCE
ChemModLab: A Web-based Cheminformatics Modeling Laboratory S. Stanley Young + ECCR and ChemSpider Teams.
1 PowerMV Chemical Data Mining Environment S. Stanley Young Jun Feng and Jack Liu NISS MPDM, McMaster University 4 June 2005.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
Ligand-based drug discovery No a priori knowledge of the receptor What information can we get from a few active compounds.
Empirical Validation of the Effectiveness of Chemical Descriptors in Data Mining Kirk Simmons DuPont Crop Protection Stine-Haskell Research Center 1090.
Martin Waldseemüller's World Map of 1507 Zanjan. Roberto Todeschini Viviana Consonni Davide Ballabio Andrea Mauri Alberto Manganaro chemometrics molecular.
Combinatorial Chemistry. Synthesis of many structures (diversity) combinatorial technology, combinatorial library molecular diversity What is Combinatorial.
QSAR Study of HIV Protease Inhibitors Using Neural Network and Genetic Algorithm Akmal Aulia, 1 Sunil Kumar, 2 Rajni Garg, * 3 A. Srinivas Reddy, 4 1 Computational.
© 2001, Boehringer, Inc. - All Rights Reserved. SCA: New Cluster Algorithm for Structural Diversity Analysis and Applications Presented at Spotfire Users.
Statistical Analysis of Inlining Heuristics in Jikes RVM Jing Yang Department of Computer Science, University of Virginia.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
Alessandro Pedretti MetaPies, an annotated database for metabolism analysis and prediction: results and future perspectives L’Aquila November 21, 2011.
COMPUTERS IN BIOLOGY Elizabeth Muros INTRO TO PERSONAL COMPUTING.
Selecting Diverse Sets of Compounds C371 Fall 2004.
Brad Evans, Ph.D. Pfizer Global Research and Development
Seascape/Indira Ghosh, JNU Introduction. Seascape Intro Indo-US company with a mission to foster international collaborative research, education and software.
Data Mining Consultant GlaxoSmithKline: US Pharma IT
1 High Throughput Target Identification Stan Young, NISS Doug Hawkins, U Minnesota Christophe Lambert, Golden Helix Machine Learning, Statistics, and Discovery.
Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013.
Design of a Compound Screening Collection Gavin Harper Cheminformatics, Stevenage.
CZ3253: Computer Aided Drug design Lecture 6: QSAR part II Prof. Chen Yu Zong Tel: Room 07-24,
Number Sets. Symbols for Number Set Counting numbers ( maybe 0, 1, 2, 3, 4, and so on) Natural Numbers: Positive and negative counting numbers (-2, -1,
Identification of structurally diverse Growth Hormone Secretagogue (GHS) agonists by virtual screening and structure-activity relationship analysis of.
Introduction Support Vector Regression QSAR Problems and Data SVMs for QSAR Linear Program Feature Selection Model Selection and Bagging Computational.
Computational Approach for Combinatorial Library Design Journal club-1 Sushil Kumar Singh IBAB, Bangalore.
A Discriminatively Trained, Multiscale, Deformable Part Model Yeong-Jun Cho Computer Vision and Pattern Recognition,2008.
Shows tendency for mergers. These big companies may be shrinking – much research is now outsourced to low cost countries like Latvia, India, China and.
Improving compound–protein interaction prediction by building up highly credible negative samples Toward more realistic drug-target interaction predictions.
신기술 접목에 의한 신약개발의 발전전망과 전략 LGCI 생명과학 기술원. Confidential LGCI Life Science R&D 새 시대 – Post Genomic Era Genome count ‘The genomes of various species including.
Page 1 Computer-aided Drug Design —Profacgen. Page 2 The most fundamental goal in the drug design process is to determine whether a given compound will.
Parallel Density-based Hybrid Clustering
ATOM Accelerating Therapeutics for Opportunities in Medicine
CS548 Fall 2017 Decision Trees / Random Forest Showcase by Yimin Lin, Youqiao Ma, Ran Lin, Shaoju Wu, Bhon Bunnag Showcasing work by Cano,
Virtual Screening.
A Unifying View on Instance Selection
Topological Ordering Algorithm: Example
Statistics 2 for Chemical Engineering lecture 5
Topological Ordering Algorithm: Example
Topological Ordering Algorithm: Example
M.Pavan, P.Gramatica, F.Consolaro, V.Consonni, R.Todeschini
Topological Ordering Algorithm: Example
Presentation transcript:

1 Schedule 8:30-9:30 Introduction 9:40- 10:45 Analysis Methods 10:55-12:00 Design and Analysis 12:00 Lunch 1:00-2:05 Design and Analysis I (Will and Stan) 2:15-3:15 Pooling 3:25-4:00 Pooling Discussion 4:00-4:30 General Discussion

2 HTS Directions NISS Affiliates Workshop 25 October 2002

3 Statistical Analysis Corporate Collection Commercial Compounds Virtual Compounds Statistical Model Selected Compounds Initial Compounds Active Compounds Filters Screen Compounds Stat Methods Chem Descriptors Model Data Sets Sequential Screening Paradigm

4 Data / Descriptors / Stat Method Data : Protein binding Cell-based Whole Animal, e.g. Toxicology Descriptors : BCUT, Atom Type Counts, Topological, 3D, etc. Analysis : LR, PLS, RP, NN, Latent Class, SVM, etc. Combined (e.g. RP and LR)

5 Data Sets – Need for benchmarking data sets Binding : NCI Cell-based: none Tox : Mutagenicity, others Pooled : none

6 Software Needs Compound viewer/editor: MolViewer : JMP/smiles Descriptor Calculations: Dragon? Environment to code new algorithms: R, MatLab, MOE Standard stat methods: SAS, S, R, JMP Special Data Mining Code : ChemTree, CompChem vendors LeadScope, BioReason, etc.

7 Statistical Analysis Corporate Collection Commercial Compounds Virtual Compounds Statistical Model Selected Compounds Initial Compounds Active Compounds Filters Screen Compounds Stat Methods Chem Descriptors Model Data Sets Sequential Screening Paradigm

8 Initial Sample 1. Focused, knowledge-based 2. No knowledge => Diverse 3. Redundant to support analysis 4. Large enough to start process 5. Random is generally OK!

9 Rational Screening Goals 1. Optimize bioactivity, (increase selectivity) 2. Reduce the cost of experimental data 3. Increase the number of “active” classes 4. Increase speed of knowledge acquisition

10 Irrational Screening Goals 1. Find all active compounds. 2. Cost is no object. 3. Time is no object. 4. Find every active class.

11 Find Multiple Chemical Classes

12 Literature DM Hawkins, SS Young and A Rusinko. “Analysis of a Large Structure-Activity Data Set Using Recursive Partitioning” QSAR 16: (1997). MJ Valler and D Green. “Diversity screening versus focussed screening in drug discovery” Drug Discovery Today 5: (2001). MFM Engels and P Venkatarangan. “Smart screening: Approaches to efficient HTS” Current Opinions in Drug Discovery&Development 4: (2001)