QSAR AND CHEMOMETRIC APPROACHES TO THE SCREENING OF POPs FOR ENVIRONMENTAL PERSISTENCE AND LONG RANGE TRANSPORT FOR ENVIRONMENTAL PERSISTENCE AND LONG.

Slides:



Advertisements
Similar presentations
SOME PRACTICAL ISSUES IN COMPOSITE INDEX CONSTRUCTION Lino Briguglio and Nadia Farrugia Department of Economics, University of Malta Prepared for the INTERNATIONAL.
Advertisements

C A INTRODUCTION An Environmental Quality Objective (EQO), intended as a real “No Effect Concentration” (NEC), is not accessible experimentally. The usual.
Design of Experiments Lecture I
Issues of Reliability, Validity and Item Analysis in Classroom Assessment by Professor Stafford A. Griffith Jamaica Teachers Association Education Conference.
Appendix 3 Frank Wania Evaluating Persistence and Long Range Transport Potential of Organic Chemicals Using Multimedia Fate Models.
MODELLING OF PHYSICO-CHEMICAL PROPERTIES FOR ORGANIC POLLUTANTS F. Consolaro, P. Gramatica and S. Pozzi QSAR Research Unit, Dept. of Structural and Functional.
Business Research for Decision Making Sixth Edition by Duane Davis Chapter 7 Foundations of Measurement PowerPoint Slides for the Instructor’s Resource.
PROBABILISTIC ASSESSMENT OF THE QSAR APPLICATION DOMAIN Nina Jeliazkova 1, Joanna Jaworska 2 (1) IPP, Bulgarian Academy of Sciences, Sofia, Bulgaria (2)
ABSTRACT The BEAM EU research project focuses on the risk assessment of mixture toxicity. A data set of 124 heterogeneous chemicals of high concern as.
4 Th Iranian chemometrics Workshop (ICW) Zanjan-2004.
CALIBRATION Prof.Dr.Cevdet Demir
A quick introduction to the analysis of questionnaire data John Richardson.
8 th Iranian workshop of Chemometrics 7-9 February 2009 Progress of Chemometrics in Iran Mehdi Jalali-Heravi February 2009 In the Name of God.
Introduction to Management Science
Scaling and Attitude Measurement in Travel and Hospitality Research Research Methodologies CHAPTER 11.
1 Chapter 17: Introduction to Regression. 2 Introduction to Linear Regression The Pearson correlation measures the degree to which a set of data points.
DERIVING LINEAR REGRESSION COEFFICIENTS
Application and Efficacy of Random Forest Method for QSAR Analysis
Factor Analysis Psy 524 Ainsworth.
9-1 Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Multicriteria Decision Making Chapter 9.
1 IES 371 Engineering Management Chapter 10: Location Week 11 August 17, 2005 Objectives  Identify the factors affecting location choices  Explain how.
Combining Statistical and Physical Considerations in Deriving Targeted QSPRs Using Very Large Molecular Descriptor Databases Inga Paster and Mordechai.
Non ionic organic pesticide environmental behaviour: ranking and classification F. Consolaro and P. Gramatica QSAR Research Unit, Dept. of Structural and.
Molecular Descriptors
RESULT and DISCUSSION In order to find a relation between the three rate reaction constant (k OH, k NO3 and k O3 ) and the structural features of chemicals,
Sung Kyu (Andrew) Maeng. Contents  QSAR Introduction  QSBR Introduction  Results and discussion  Current QSAR project in UNESCO-IHE.
Surveillance monitoring Operational and investigative monitoring Chemical fate fugacity model QSAR Select substance Are physical data and toxicity information.
Life Cycle of Products Source: Melanen et al Metals flows and recycling of scrap in Finland. The Finnish Environment 401. Finnish Environment Institute,
ABSTRACT Bioconcentration by aquatic biota is an important factor in assessing the environmental behaviour and potential hazard evaluation of a chemical,
CONCLUSIONS CONCLUSIONS - Missing values of the principal physico-chemical properties are predicted by validated regression models by using different kinds.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Chapter 11 Environmental Performance of a Flowsheet.
The aquatic toxicity values of 57 esters, with experimental and predicted LC50 in fish, EC50 in Daphnia and seaweed and IGC in Entosiphon sulcatum, were.
Paola GRAMATICA a, Paola LORENZINI a, Angela SANTAGOSTINO b and Ezio BOLZACCHINI b a University of Insubria, Dep. of Structural and Functional Biology,
Identifying Applicability Domains for Quantitative Structure Property Relationships Mordechai Shacham a, Neima Brauner b Georgi St. Cholakov c and Roumiana.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
University of Texas at AustinMichigan Technological University 1 Module 2: Evaluating Environmental Partitioning and Fate: Approaches based on chemical.
DESIRABILITY OF POPs ACCORDING TO THEIR ATMOSPHERIC MOBILITY The main goal pursued in this work is the formulation of a POP ranking by atmospheric mobility.
TOXICITY MODELLING OF “EEC PRIORITY LIST 1” COMPOUNDS TOXICITY MODELLING OF “EEC PRIORITY LIST 1” COMPOUNDS Council Directive 76/464/EEC of the European.
Martin Waldseemüller's World Map of 1507 Zanjan. Roberto Todeschini Viviana Consonni Davide Ballabio Andrea Mauri Alberto Manganaro chemometrics molecular.
Paola Gramatica, Elena Bonfanti, Manuela Pavan and Federica Consolaro QSAR Research Unit, Department of Structural and Functional Biology, University of.
QSAR Study of HIV Protease Inhibitors Using Neural Network and Genetic Algorithm Akmal Aulia, 1 Sunil Kumar, 2 Rajni Garg, * 3 A. Srinivas Reddy, 4 1 Computational.
ABSTRACT The behavior and fate of chemicals in the environment is strongly influenced by the inherent properties of the compounds themselves, particularly.
K. Kolomvatsos 1, C. Anagnostopoulos 2, and S. Hadjiefthymiades 1 An Efficient Environmental Monitoring System adopting Data Fusion, Prediction & Fuzzy.
P. Gramatica and F. Consolaro QSAR Research Unit, Dept. of Structural and Functional Biology, University of Insubria, Varese, Italy.
MODELING MATTER AT NANOSCALES 3. Empirical classical PES and typical procedures of optimization Classical potentials.
Organic pollutants environmental fate: modeling and prediction of global persistence by molecular descriptors P.Gramatica, F.Consolaro and M.Pavan QSAR.
Maximizing value and Minimizing base on Fuzzy TOPSIS model
Selecting Diverse Sets of Compounds C371 Fall 2004.
Selection of Molecular Descriptor Subsets for Property Prediction Inga Paster a, Neima Brauner b and Mordechai Shacham a, a Department of Chemical Engineering,
Log Koc = MW nNO – 0.19 nHA CIC MAXDP Ts s = 0.35 F 6, 134 = MW: molecular weight nNO: number of NO bonds.
Location Planning and Analysis Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent.
Correlation & Regression Analysis
F.Consolaro 1, P.Gramatica 1, H.Walter 2 and R.Altenburger 2 1 QSAR Research Unit - DBSF - University of Insubria - VARESE - ITALY 2 UFZ Centre for Environmental.
Using Population Data to Address the Human Dimensions of Population Change D.M. Mageean and J.G. Bartlett Jessica Daniel 10/27/2009.
MUTAGENICITY OF AROMATIC AMINES: MODELLING, PREDICTION AND CLASSIFICATION BY MOLECULAR DESCRIPTORS M.Pavan and P.Gramatica QSAR Research Unit, Dept. of.
A) I. I. Mechnikov National University, Chemistry Department, Dvorianskaya 2, Odessa 65026, Ukraine, b) Department of Molecular.
P. Gramatica 1, H. Walter 2 and R. Altenburger 2 1 QSAR Research Unit - DBSF - University of Insubria - VARESE - ITALY 2 UFZ Centre for Environmental Research.
A molecular descriptor database for homologous series of hydrocarbons ( n - alkanes, 1-alkenes and n-alkylbenzenes) and oxygen containing organic compounds.
Roberto Todeschini Viviana Consonni Manuela Pavan Andrea Mauri Davide Ballabio Alberto Manganaro chemometrics molecular descriptors QSAR multicriteria.
Use of Machine Learning in Chemoinformatics
A new protein-protein docking scoring function based on interface residue properties Reporter: Yu Lun Kuo (D )
Lipinski’s rule of five
PHYSICO-CHEMICAL PROPERTIES MODELLING FOR ENVIRONMENTAL POLLUTANTS
Nahid Abbas and Sonal Dubey
Hierarchical Classification of Calculated Molecular Descriptors
P. Gramatica1, F. Consolaro1, M. Vighi2, A. Finizio2 and M. Faust3
Product moment correlation
M.Pavan, P.Gramatica, F.Consolaro, V.Consonni, R.Todeschini
Understanding How the Ranking is Calculated
Presentation transcript:

QSAR AND CHEMOMETRIC APPROACHES TO THE SCREENING OF POPs FOR ENVIRONMENTAL PERSISTENCE AND LONG RANGE TRANSPORT FOR ENVIRONMENTAL PERSISTENCE AND LONG RANGE TRANSPORT Paola Gramatica a, Ester Papa a and Stefano Pozzi b a) Department of Structural and Functional Biology, University of Insubria - Varese (Italy) b) Laboratory of environmental Studies (SPAA) - Lugano (Switzerland) QSARResearchUnit D 13 Global Mobility Index The inherent tendency of compounds towards global mobility is regulated mainly by volatility, water solubility, Kow and Koa. Global Mobility Index A Global Mobility Index is obtained from the linear combination, by PCA, of the physico- chemical properties: the PC1 score (EV%=74.6%) in Fig. 2. The chemicals on the right side of are those with the major tendency to mobility. The need for a scientific foundation for the criteria used to evaluate persistence and long-range transport (LRT) potential of POPs (Persistent Organic Pollutants) in the environment has been recently highlighted 1. Persistence is a necessary condition for long-range transport, however persistent chemicals are not necessarily subject to long-range transport: the inherent tendency of compounds towards global mobility must also be taken into account. The half-life of organic pollutants in various compartments is among the most commonly used criterion for studying persistence, but these studies are severely hindered by the limited availability of experimental degradation half-life data, thus there is an incentive to develop reliable procedures, like QSAR/QSPR, to estimate lacking data. The same is true for physico-chemical properties particularly relevant for determining mobility potential 2. As the Long Range Transport potential of POPs is due to the contemporaneous influence of their persistence in the environment and their inherent tendency to mobility, the finding of the best combination of chemical properties minimizing LRT is a multicriteria problem and can be approached positively through MultiCriteria Decision-Making (MCDM) techniques 3 : procedures for combining the magnitude of several properties into a single quantitative measure of overall quality. For modeling and predicting half life we used a data set of 141 organic compounds, for which half-life experimental values in different compartments are available from Howard 4, Mackay 5 and Rodan 6. The molecular structure has been represented by a wide set of molecular descriptors 7 calculated by a software developed by R.Todeschini 7,8 : Constitutional descriptors(56), Topological descriptors(69), Walk counts (20), Bcut descriptors (64), Galvez indices (21), 2D Autocorrelations (96), Charge descriptors (7), Aromaticity descriptors (4), Molecular profiles (40), Geometrical descriptors(18), 3D MoRSE descriptors (160), WHIM descriptors 9 (99), GETAWAY descriptors (196), Empirical descriptors (3). The selection of the best subset variables for modelling half-life was done by a Genetic Algorithm (GA-VSS) approach, where the response is obtained by ordinary least square regression (OLS). All the calculations have been performed by using the leave-one-out (LOO) and leave-more-out (LMO) procedures and the scrambling of the responses for the validation of the models (MOBY-DIGS package) 10. Introduction General Persistence Index The Principal Component Analysis (PCA) of the experimental and predicted half-life of 141 pollutants in various media allows the ranking of the chemicals according to their overall half-life and relative persistence in different media. general Persistence Index A general Persistence Index is obtained from the linear combination of half-life data in four environmental media (PC1 in Fig. 1).The chemicals on the right are the most globally persistent in the various compartments. Figure 1 PERSISTENCE MOBILITY Screening of Long Range Transport Potential MultiCriteria Decision-Making utility function Persistence Index Mobility IndexAir Half-life The finding of the best combination of chemical properties minimizing LRT can be approached by MultiCriteria Decision-Making (MCDM) techniques: procedures for combining the magnitude of several properties into a single quantitative measure of overall quality. The utility function is chosen here as the best combined criteria function and is applied to the most relevant properties determining the LRT, according to the following criteria, f(x), all expressed as the minimum: the general Persistence Index (Fig.1), deriving from the PCA combination of half-life in four environmental compartments, the Mobility Index (Fig.2), deriving from the cited physico-chemical properties and the Air Half-life, which is considered particularly relevant in determining LRT. The k=3 properties, equally weighted (by the weight ) and added in the utility function, according to the reported formula, allow a ranking of the studied chemicals according to their LRT potential, giving a LRT index ( F(x)). The chemicals, highlighted in Fig. 3, with the lowest utility (F(x) near 0) will exhibit highest LRT potential, while those with F(x) near 1 will have the lowest possibility for LRT. Figure 3 Figure 2 The QSPR (Quantitative Structure-Property Relationships) approach is applied here in two steps: first, to fill the gap in the experimental data of the studied properties and finally to model the scores of the MCDM function, the LRT index (Fig. 3). Different kinds of theoretical molecular descriptors have been used to obtain OLS regression models (Fig.4) and CART classification models (Fig. 5) with good predictive power (Q 2 LOO =86.8%, Q 2 LMO =86.2% and Misclassification Risk Cross val.=6.2%, respectively. Conclusions Conclusions fast pre-screening based simply on the knowledge of their molecular structure. The ranking of the studied chemicals according to their LRT potential, obtained by the utility function of MCDM, can be proposed as an alternative approach to others based on characteristic travel distance (CTD) 11. An additional advantage of this approach is that the application of the QSPR models (both regression and classification) on the scores of the MCDM utility function (defined as LRT index) can allow a fast pre-screening of existing and new chemicals for their inherent tendency to LRT, based simply on the knowledge of their molecular structure. References References 1- Klecka, G.M., Ed. (1999). SETAC Pellston Workshop Environ. Toxicol. Chem. (Suppl.), 18, 8 2- Gramatica, P., Pozzi, S., Consonni, V. and Di Guardo, A. (2001) SAR and QSAR in Environ. Res., in press. 3- Hendriks M.M.W.B., De Boer J.H., Smilde A.K. and Doornbos D.A. (1992) Chemom. Intell. Lab. Syst 16, Howard,P.H. et all. Handbook of environmental degradation rates (1991) ; 5- Mackay, Shiu, Ma Illustrated handbook of physical-chemical properties and environmental fate for organic chemicals (2000); 6- Rodan, B.D et all. Envir. Sci. technol.,33( (1999); 7- R.Todeschini and V.Consonni,Handbook of molecular descriptors (2000) Wiley; 8- R.Todeschini, DRAGON ver.1.0, Milano, 2000 free download from 9- R. Todeschini and P.Gramatica (1997) Quant. Struct.Act. Rel. 16, R. Todeschini, R. (1999). MOBY DIGS - Software for multilinear regression analysis and variable subset selection by Genetic Algorithm, rel. 2.1 Milan (Italy). 11- Beyer, A., Mackay, D., Matthies, M., Wania, F. and Webster E. (2000). Environ. Sci.Technol. 34, nC nC 7.00 E1u Assigned class Classification Tree Figure 5 Figure 4 1 3