General Concepts in QSAR for Using the QSAR Application Toolbox

Slides:



Advertisements
Similar presentations
QSAR-based Prediction of Inhalation Toxicity Kendall B. Wallace, Eli Petkova, Gilman D. Veith University of Minnesota – Duluth Medical School & International.
Advertisements

CE 510 Hazardous Waste Engineering
1 Development & Evaluation of Ecotoxicity Predictive Tools EPA Development Team Regional Stakeholder Meetings January 11-22, 2010.
Turning Point At the beginning of the course, we discussed three ways in which mathematics and statistics can be used to facilitate psychological science.
Chapter 7 Correlational Research Gay, Mills, and Airasian
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
~ Science for Life not for Grades!. Why choose Cambridge IGCSE Co-ordinated Sciences ? IGCSE Co-ordinated Sciences gives you the opportunity to study.
3 CHAPTER Cost Behavior 3-1.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
(IAQ). What is Risk Assessment? Risk assessment: provides information on the health risk Characterizes the potential adverse health effects of human exposures.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 22 Regression Diagnostics.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
1 Selected Current and Suggested Ideas on Uses of HPV Challenge Data Nhan Nguyen US EPA Characterizing Chemicals in Commerce: Using Data on High Production.
Correlational Research Chapter Fifteen Bring Schraw et al.
Charge Question 4-1: Please comment on the ecotoxicity studies selected to represent the most sensitive species in each of the risk scenarios (acute aquatic,
Patterns of Event Causality Suggest More Effective Corrective Actions Abstract: The Occurrence Reporting and Processing System (ORPS) has used a consistent.
Translating HPV Information into Plain Language Characterizing Chemicals in Commerce Austin, TX December, 12-14, 2006.
Identifying Applicability Domains for Quantitative Structure Property Relationships Mordechai Shacham a, Neima Brauner b Georgi St. Cholakov c and Roumiana.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
3. Statistics Test results on a drug such as (1) with17 variants (No. of compounds, n, = 17 ) differing only in X are tabulated with the values of physical.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Prof. Dr. Wolfgang Dekant Department of Toxicology University of Würzburg Germany Risk, Hazard, and Innovation.
Paola Gramatica, Elena Bonfanti, Manuela Pavan and Federica Consolaro QSAR Research Unit, Department of Structural and Functional Biology, University of.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Academic Research Academic Research Dr Kishor Bhanushali M
Question paper 1997.
McKim Conference on Predictive Toxicology
A "Reference Series" Method for Prediction of Properties of Long-Chain Substances Inga Paster and Mordechai Shacham Dept. Chem. Eng. Ben-Gurion University.
The McKim Conferences for the Strategic Use of Testing Gitchee Gumee Conference Center Duluth, Minnesota June 27-29, 2006.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Barcelona April, 2008 Overview of the QSAR Application Toolbox Gilman Veith International QSAR Foundation Duluth, Minnesota.
McKim Conference on Predictive Toxicology The Inn of Lake Superior Duluth, Minnesota September 25-27, 2007 Narcosis as a Reference Gilman Veith.
RISK DUE TO AIR POLLUTANTS
1 HETEROSCEDASTICITY: WEIGHTED AND LOGARITHMIC REGRESSIONS This sequence presents two methods for dealing with the problem of heteroscedasticity. We will.
QSAR in CANCER ASSESSMENT PURPOSE and AGENDA Gilman Veith Duluth MN May 19-21, 2010.
Consumer Behavior, Ninth Edition Schiffman & Kanuk Copyright 2007 by Prentice Hall Chapter 2 Consumer Research.
DOSE-RESPONSE ASSESSMENT
Yandell - Econ 216 Chap 1-1 Chapter 1 Introduction and Data Collection.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Toxicity vs CHEMICAL space
Theme 5. Association 1. Introduction. 2. Bivariate tables and graphs.
General Concepts in QSAR for Using the QSAR Application Toolbox
Department of Mathematics
OBJECTIVES Understanding what food chemicals are?
Statistical analysis.
Lecture 4: Risk Analysis
DATA COLLECTION METHODS IN NURSING RESEARCH
Ecotoxicology Day 2. Adam Peters and Graham Merrington 2017.
QSAR Application Toolbox: Step 12: Building a QSAR model
Leacock, Warrican and Rose (2009)
Using Derek Nexus to assess the safety of raw materials used in cosmetics and as part of a skin sensitisation ITS. Part 1: Using Derek Nexus to assess.
Statistical analysis.
Lecture 02.
Background This is a step-by-step presentation designed to take the first time user of the Toolbox through the workflow of a data filling exercise.
Chapter 7 The Hierarchy of Evidence
Reliability and Validity of Measurement
Correlation and Regression
RAM XI Training Summit October 2018
Category elements for assessing category consistency
Evaluating alert performance accounting for a metabolism
Experimental Design: The Basic Building Blocks
Biological Science Applications in Agriculture
EFSA’s Chemical Hazards Database
MGS 3100 Business Analysis Regression Feb 18, 2016
Relationships among Inflation, Interest Rates and Exchange Rates
Writing competence based questions
Presentation transcript:

General Concepts in QSAR for Using the QSAR Application Toolbox

Part 1 General Concepts in QSAR

Online Course Outline The need for predictive methods Basic terminology in QSAR development Selecting biological endpoints for modeling Using trends to define chemical categories Chemical categories for filling data gaps Overview of the QSAR Toolbox

Need for Predictive Methods Laboratory measurements of chemical toxicity must address many different hazards and responses (biological effects) under many exposure scenarios Regulatory risk (or safety) assessments rely heavily on the interpretation of bioassays, which are designed to reveal the spectrum of effects of a chemical Most assessments rely on batteries of bioassays intended to characterize important hazards such as short-term effects, carcinogenicity, mutagenicity, reproductive impairment and development deficits Screening-level assessments can cost from $1-5M, while comprehensive risk assessments can cost more than $60M in testing and analysis

Need for Predictive Methods Due to the high cost of animal tests, risk assessments based on such tests are limited to a small percentage of industrial chemicals Fewer than 10,000 chemicals have been tested for the major hazards; the majority of these chemicals have been tested for only a few hazards The world inventory of chemicals in commerce exceeds 160,000 chemicals and is growing by more than 3,000 new chemicals each year The collective capacity of all OECD member countries to conduct the SIDS initial hazard assessments was ~500 chemicals/year for the last decade

Need for Predictive Methods Alternative test methods that are more diagnostic and faster are one leading approach to fill the data gaps Non-testing alternative methods involve the use of chemical models to extrapolate the hazards of tested chemicals to similar untested chemicals The non-testing alternative method includes the use of quantitative structure-activity relationships (QSARs) that relate biological activity to structure Physical chemists, engineers and medicinal chemists have been reliably estimating the behavior of untested chemicals for more than 60 years

Basic Definitions in QSAR Chemistry is based on the simple premise that similar chemicals will behave similarly If two chemicals appear to be very similar but behave dramatically differently (e.g., stereoisomers), one’s perception of similarity is wrong Like most complex systems, the behavior of a chemical as a molecular system is largely derived from the electronic and steric properties of its structure Therefore, the field of QSAR research is concerned with methods for quantifying chemical similarity in order to improve ways of grouping similar chemicals Similarity is not an absolute, but must be determined within a specific context for a specific attribute or behavior

Basic Definitions in QSAR For toxicology, structure-activity relationships start with selecting a test endpoint such as lethality (LC50) or effect concentration (EC50) QSAR searches for relationships between chemical structure and activity so that the test endpoint can be predicted accurately from structure For example, industrial chemicals are classified as inhalation hazards when the 4-hour LC50 of a chemical for rats is less than 20 mg/l When LC50 values are compiled for 20 - 30 chemicals and chemical structure is represented by the vapor pressure (VP), a QSAR model can be formed In this example, the QSAR is log LC50 (rat, 4hr) = 0.69 log VP + 1.54, which allows the LC50 of untested similar chemicals to be estimated

Selecting Biological Endpoints QSARs can be used to estimate important toxicity endpoints for thousands of chemical structures in order to focus assessments on the greatest risks However, a single QSAR model for a toxicity endpoint like LC50 is only reliable for chemicals that are similar to the training set of chemicals In toxicology, similar chemicals are usually defined as those that cause toxic effects through the same toxicity mechanisms Therefore, QSAR models must first predict whether a chemical has the same toxicity mechanism for which a particular model was built If a chemical’s toxicity mechanisms differs from the one for which a particular model was built, it is, by definition, not similar and its effects cannot be estimated reliably with that same QSAR model

Steps to creating QSAR Models Choose a well-defined endpoint for biological activity that is relevant to the assessment Compile measured values of the biological endpoint using a consistent test method for similar chemicals (training set) –OR- Select a homologous series of relevant chemicals and systematically test all of them for the biological endpoint using a consistent method Identify the chemical attributes that are likely to be important in toxicity mechanisms and the endpoint, and then calculate for each chemical the “molecular descriptors” ( e.g., VP, Log P, pKa, etc.) that put those attributes in numerical terms Explore the statistical variances among the molecular descriptors and endpoint values, and identify relationships between the molecular descriptor and endpoint for the assessment

Simple Example for QSAR Compile data for lethality (LC50) in mice from 30-minute inhalation exposures from the literature In this example, restricting chemicals to simple aliphatic ethers increases the likelihood that the toxicity mechanism for lethality will be the same As shown on the next slide, estimate or measure the vapor pressure to be used as a molecular descriptor (selected from theory or by trial and error) Correlate LC50 values with the vapor pressure to get: log LC50 = 0.57 x log VP + 2.08 This regression equation is the QSAR for this endpoint and this class of chemicals even though the toxicity mechanism is not known

Chemical name CAS MW Table 1. LC50-30 min of aliphatic ethers in mice VP (mmHg) LC50 (mmol/m3) Diisobutyl ether 628-55-7 130.2 15 1200 Disec-butyl ether 6863-58-7 17 1000 Diethyl ether 60-29-7 74.1 537 6000 Diisopropyl ether 108-20-3 102.2 170 1500 Dimethyl ether 115-10-6 46.1 4450 37000 Dipropyl ether 111-43-3 60 1600 Divinyl ether 109-93-3 70.1 684 4700 Ethyl amyl ether 17952-11-3 116.2 18 Ethyl butyl ether 628-81-9 52 Ethyl cyclopropyl ether 5614-38-0 86.1 150 Ethyl isoamyl ether 628-04-6 30 Ethyl isobutyl ether 627-02-1 98 Ethyl isopropyl ether 625-54-7 88.1 250 2500 Ethyl propyl ether 628-32-0 185 Ethyl- sec-butyl ether 2679-87-0 1400 Ethyl tert-amyl ether 919-94-8 43 700 Ethyl tert-butyl ether 637-92-3 155 Ethyl vinyl ether 109-92-2 72.1 500 4500 Methyl amyl ether 628-80-8 55 1300 Methyl butyl ether 628-28-4 160 2000 Methyl cyclopropyl ether 540-47-6 410 1750 Methyl ethyl ether 540-67-0 60.1 1493 18000 Methyl isobutyl ether 625-44-5 210 Methyl isopropyl ether 598-53-8 550 5500 Methyl propyl ether 557-17-5 3500 Methyl sec-butyl ether 6795-87-5 230 Methyl tert-butyl ether 1634-04-4 244 Propyl isopropyl ether 627-08-7 85

Simple Example for QSAR Notice the dependence on VP (slope) is almost the same as the QSAR derived from the 4-hour exposure with rats shown earlier, suggesting the same structural attributes are controlling toxicity Notice the intercept is about 0.5 log units greater for the 30-minute test with mice versus the 4-hour test with rats If we assume the toxicity mechanism causing lethality is the same for chemicals in both sets, can you explain why the LC50 in mice is greater (lower toxicity) than the LC50 in rats? Statistical exploration of data compiled for different chemicals is one of several important methods for defining chemical similarity

Simple Example for QSAR This QSAR implies that the vapor pressure of a chemical is an important factor in determining that chemical’s potency in a lethality test Many other molecular descriptors would not correlate to toxicity, and the good correlation here points to structural attributes that influence VP Chemicals that cause lethality by other toxicity mechanisms, chemicals such as acrolein or phosgene, will appear as statistical outliers Therefore, in QSAR “outlier analysis” is often used to gain insight into chemical similarity as defined in terms of common mechanisms