Presentation is loading. Please wait.

Presentation is loading. Please wait.

A novel approach to analysis of primary HTS data Compound Set Enrichment Thibault VarinAnsgar Schuffenhauer Gubler, H., Parker, C., Zhang, JH., Raman,

Similar presentations


Presentation on theme: "A novel approach to analysis of primary HTS data Compound Set Enrichment Thibault VarinAnsgar Schuffenhauer Gubler, H., Parker, C., Zhang, JH., Raman,"— Presentation transcript:

1 A novel approach to analysis of primary HTS data Compound Set Enrichment Thibault VarinAnsgar Schuffenhauer Gubler, H., Parker, C., Zhang, JH., Raman, P., Ertl, P.

2 INTRODUCTION | Compound Set Enrichment | Thibault Varin | 10/07/142 Compound Set Enrichment

3 Introduction  Active series identification: Can relevant SAR be extracted from primary HTS data?  Are activity data binary or continuous? | Compound Set Enrichment | Thibault Varin | 10/07/143

4 Introduction Active series identification | Compound Set Enrichment | Thibault Varin | 10/07/144 Hypothesis 1: Within primary HTS screening data, structure activity relationships (SAR) are apparent and can be used to help selecting active compound classes.

5 Introduction Are the activity data binary or continuous? | Compound Set Enrichment | Thibault Varin | 10/07/145 Scaffold 1Scaffold 2 Activity Binary activity: -1 active / 5 inactives -Scaffold 1 = Scaffold 2 Continuous activity: Scaffold 1 > Scaffold 2 Active compound (binary) Inactive compound (binary)

6 Introduction Are the activity data binary or continuous? | Compound Set Enrichment | Thibault Varin | 10/07/146 Threshold 1 Activity Threshold 2 Activity Binary scaffold activity is different according to the threshold Active compound (binary) Inactive compound (binary) Hypothesis 2: Methods based on an activity cut-off distort the activity information leading to the incorrect assignment of active series of compounds.

7 METHODS | Compound Set Enrichment | Thibault Varin | 10/07/147 Compound Set Enrichment

8 The Scaffold Tree – Visualization of the Scaffold Universe by Hierarchical Scaffold Classification A. Schuffenhauer, P. Ertl et al. J. Chem. Inf. Model., 47, 47, 2007 Methods The Scaffold Tree classification | Compound Set Enrichment | Thibault Varin | 10/07/148

9 Methods Datasets | Compound Set Enrichment | Thibault Varin | 10/07/149 PubChem Annotation from CRC Simulation of the primary screening data -7 PubChem bioassays - Ranging from 9389 to 263679 compounds - Ranging from 0.03 to 26.29% of active compounds Hypothesis 1

10 Methods Single hypothesis test: summary procedure  1. State the null and the alternative hypotheses -H 0 : „the scaffold is inactive“ -H 1 : „the scaffold is active“  2. Specify a significance level: α=0.01  3. Compute the statistics and the p-value ) →p-value=probability that the scaffold is inactive (H 0 )  4. Decision step: -p-value> α: H 0 is accepted -p-value< α: H 0 is rejected and then H 1 is accepted „The scaffold is active“ | Compound Set Enrichment | Thibault Varin | 10/07/1410

11 Methods The KS and the Binomial hypothesis tests | Compound Set Enrichment | Thibault Varin | 10/07/1411 Continuous data KS test Binary data Binomial test Actives Inactives Bioassay Scaffold H 0 : there is no difference in the activity distribution defined by compounds having the scaffold S3-2 and the background distribution H 0 : there is no difference in the proportion of active compounds for compounds having the scaffold S3-2 and the proportion of active compounds for the full dataset.

12 Methods Multiple hypothesis tests: Bonferroni correction  Problem of false positives α =probability to identify as active an inactive scaffold (for each test done...) 100 inactive scaffolds: probability to identify an „active“ by chance is equal 63% (1-0.99 100 ))  Suggests to test each scaffold at a critical significance level equal to α = 0.01 / Nbr of scaffolds  Makes the assumption that the individual tests are independent  Each level in the Scaffold Tree have been done separately | Compound Set Enrichment | Thibault Varin | 10/07/1412

13 Methods Determining the activity of classes | Compound Set Enrichment | Thibault Varin | 10/07/1413 Hypo 1 Hypo 2 Scaffold activity evaluation Comparison of results Multiple hypothesis test correction (Bonferroni)

14 RESULTS | Compound Set Enrichment | Thibault Varin | 10/07/1414 Compound Set Enrichment

15 Results Comparison of KSP and BTP predictions | Compound Set Enrichment | Thibault Varin | 10/07/1415 Bioassay Total BPCA significantly actives BPCA non significantly actives KSPBTPΔBPCAKSPBTPΔKSPBTPΔ Hydroxysteroid dehydrogenase 330231+99199183168+1514763+84 Caspase-1331114+2175220329112+217 PK124+81233091+8 Luciferase6712+55151311+2541+53 Luciferase17848+130413235-314613+133 CYP450 2C95833+2534 31+3242+22 CYP450 3A412164+5760 53+76111+50 With: -KSP: KS Prediction -BTP: Binomial Threshold Prediction -Δ : KSP-BTP -BPCA: Binomial PubChem Annotation Both KSP and BTP retrieve BPCA significantly active classes Number of active classes: KSP > BTP Most of new KSP active classes are not BPCA significantly actives

16 Results KSP significantly active scaffolds that are in Pubchem inactives | Compound Set Enrichment | Thibault Varin | 10/07/1416 Inconclusives? Inconclusive? Inconclusives? Compound activity (PubChem Annotation) Active Inconclusive Inactive WA

17 Results Prioritize nodes instead of individual scaffolds | Compound Set Enrichment | Thibault Varin | 10/07/1417 Scaffold activity (KS Prediction / Bonferroni) Non significantly active Significantly active

18 Results Visualization tool (Peter Ertl) | Compound Set Enrichment | Thibault Varin | 10/07/1418

19 CONCLUSION | Compound Set Enrichment | Thibault Varin | 10/07/1419 Compound Set Enrichment

20 Conclusion Compound Set Enrichment | Compound Set Enrichment | Thibault Varin | 10/07/1420  Validation of initial hypotheses  A method to mine HTS data and identify active series of compounds Chemical classification: Scaffold Tree Statistical analysis: Kolmogorov-Smirnov hypothesis test Multiple hypothesis test correction: Bonferroni correction  Use all primary data  No activity cut-off  Identification of new active scaffolds not necessarily represented by very active compounds (latent hits) during the primary screen

21 With many thanks to | Compound Set Enrichment | Thibault Varin | 10/07/1421 Acknowledgments Primary mentor: - Ansgar Schuffenhauer Scientific advisers: -Christian Parker -Hanspeter Gubler -Ji-Hu Zhang -Peter Ertl -Edgar Jacoby Help: MLI group Fellowship: Education office Discussions: -Martin Beibel -Sebastian Bergling -Meir Glick -Alain Dietrich -Marie-Cecile Didiot

22 Questions? | Compound Set Enrichment | Thibault Varin | 10/07/1422


Download ppt "A novel approach to analysis of primary HTS data Compound Set Enrichment Thibault VarinAnsgar Schuffenhauer Gubler, H., Parker, C., Zhang, JH., Raman,"

Similar presentations


Ads by Google