Claude Beigel, PhD. Exposure Assessment Senior Scientist Research Triangle Park, USA Practical session metabolites Part II: goodness of fit and decision.

Slides:



Advertisements
Similar presentations
Claude Beigel, PhD. Exposure Assessment Senior Scientist Research Triangle Park, USA Practical session metabolites Part I: curve fitting.
Advertisements

Introduction to parameter optimization
26-27 Jan 2005 Page 1 FOCUS Kinetics training workshop Chapter 7 Recommended Procedures to Derive Endpoints for Parent Compounds Practical Exercise Ralph.
Claude Beigel, PhD. Exposure Assessment Senior Scientist Research Triangle Park, USA Practical session metabolites Part III: plenary discussion of results.
Outline Introduction Kinetic endpoints General fitting recommendations
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Objectives 10.1 Simple linear regression
Simple Linear Regression. G. Baker, Department of Statistics University of South Carolina; Slide 2 Relationship Between Two Quantitative Variables If.
Correlation and regression
Forecasting based on creeping trend with harmonic weights Creeping trend can be used if variable changes irregularly in time. We use OLS to estimate parameters.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
9. SIMPLE LINEAR REGESSION AND CORRELATION
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Simple Linear Regression Analysis
Quantitative Business Analysis for Decision Making Simple Linear Regression.
Inferences About Process Quality
Statistical Treatment of Data Significant Figures : number of digits know with certainty + the first in doubt. Rounding off: use the same number of significant.
Chemometrics Method comparison
Regression and Correlation Methods Judy Zhong Ph.D.
Inference for regression - Simple linear regression
Regression Analysis (2)
One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.
Simple Linear Regression Models
(a.k.a: The statistical bare minimum I should take along from STAT 101)
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
© 1998, Geoff Kuenning General 2 k Factorial Designs Used to explain the effects of k factors, each with two alternatives or levels 2 2 factorial designs.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
Introduction to Linear Regression
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 23/10/2015 9:22 PM 1 Two-sample comparisons Underlying principles.
Analysis of Residuals Data = Fit + Residual. Residual means left over Vertical distance of Y i from the regression hyper-plane An error of “prediction”
Go to Table of Content Single Variable Regression Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Chapter 14 Inference for Regression AP Statistics 14.1 – Inference about the Model 14.2 – Predictions and Conditions.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
1.1 Statistical Analysis. Learning Goals: Basic Statistics Data is best demonstrated visually in a graph form with clearly labeled axes and a concise.
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
COTOR Training Session II GL Data: Long Tails, Volatility, Data Transforms September 11, 2006.
Inference ConceptsSlide #1 1-sample Z-test H o :  =  o (where  o = specific value) Statistic: Test Statistic: Assume: –  is known – n is “large” (so.
Business Statistics: Contemporary Decision Making, 3e, by Black. © 2001 South-Western/Thomson Learning 8-1 Business Statistics, 3e by Ken Black Chapter.
Linear Regression Models Andy Wang CIS Computer Systems Performance Analysis.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Chi Square Test for Goodness of Fit Determining if our sample fits the way it should be.
Data handling Sabine Beulke, Central Science Laboratory, York, UK Kinetic Evaluation according to Recommendations by the FOCUS Work Group on Degradation.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Two Sample Problems  Compare the responses of two treatments or compare the characteristics of 2 populations  Separate samples from each population.
REGRESSION MODELS OF BEST FIT Assess the fit of a function model for bivariate (2 variables) data by plotting and analyzing residuals.
CORRELATION-REGULATION ANALYSIS Томский политехнический университет.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Stats Methods at IC Lecture 3: Regression.
Inference for Regression
Chapter 13 Test for Goodness of Fit
Chapter Six Normal Curves and Sampling Probability Distributions
Understanding Standards Event Higher Statistics Award
Linear Regression Models
Correlation and Regression
Everyone thinks they know this stuff
Inferences and Conclusions from Data
Two Independent Samples
CHAPTER 12 More About Regression
Chapter 14 Inference for Regression
One-Factor Experiments
DESIGN OF EXPERIMENT (DOE)
3.2. SIMPLE LINEAR REGRESSION
Presentation transcript:

Claude Beigel, PhD. Exposure Assessment Senior Scientist Research Triangle Park, USA Practical session metabolites Part II: goodness of fit and decision making

1 Recommended Tools for Assessing Goodness of Fit Same recommended approach as for parent substance Combination of visual assessment and statistical tests Visual assessment, although bringing some level of subjectivity, is necessary to discern between normal data variability (scattering) and systematic model deviation, this is not done by the statistical tests

2 Recommended Tools for Assessing Goodness of Fit Visual Assessment Visual check of model description of measured data and distribution of residuals (plot of residuals, Predicted - Observed) Systematic deviation indicates kinetic model may not be appropriate (unless deviation can be attributed to experimental artifacts) Example 8.2 of report, SFO-SFO fit

3 Recommended Tools for Assessing Goodness of Fit Visual Assessment Residuals should be randomly distributed on vertical axis Example 8.2 of report, SFO-FOMC fit

4 Recommended Tools for Assessing Goodness of Fit Statistical Indices Chi 2 ( 2 ) statistical test Minimum error percentage to pass 2 test at a 5% significance level –Needs to be performed for each substance individually, to avoid that good fit of the main substances (parent and/or major metabolite) overshadows goodness of fit of more minor substances (weighting issue) –Calculated from fitted versus observed substance data (use of average values for replicates is recommended) –Degrees of freedom for the substance defined as number of substance data points used in 2 test minus number of estimated parameters for the substance Do not count replicates if averages used, excludes data points set to 0 (metabolite at time-0) or not counted (<LOD/LOQ) Metabolite parameters defined as metabolite formation fraction and degradation rate parameters (dependent of kinetic model used)

5 Recommended Tools for Assessing Goodness of Fit Statistical Indices One-sided t-test for evaluating uncertainty of rate constant parameters To determine whether rate is significantly different from 0 –If p < 0.05, parameter is considered significantly different than zero –If p between 0.05 and 0.1, weight of evidence should be considered Especially important for metabolites that do not show a clear decline Because parameters in parent + metabolite fits (formation and degradation parameters) can be highly correlated, the t-test is performed at final step (all parameters fitted together) –Degrees of freedom defined as number of data points (including replicates) minus number of fitted parameters

6 Recommended Tools for Assessing Goodness of Fit Data Handling / Methodology Basic data handling Paste ModelMaker output (integration table) in Excel spreadsheet –Extract fitted values corresponding to measured times for each substance –Average replicates if necessary (an automated Excel spreadsheet may be created for that purpose, but not available yet) Minimum 2 error % for metabolites may be calculated using Parent degradation kinetics.xls file Paste measured Vs. fitted values in Chi2 all models worksheet, update number of parameters cell and click calculate

7 Recommended Tools for Assessing Goodness of Fit Data Handling / Methodology Residuals may be plotted in Parent degradation kinetics.xls file Valid only for 1- or 2-replicate data sets (if more, needs to be done manually) Paste measured Vs. fitted values (all replicates) in SFO no-reps or SFO 2-reps worksheet t-test for rate constant parameters may be performed using provided t-test.xls file For each rate constant parameter, enter parameter estimate and standard error, number of data points and number of parameters estimated

8 Hands-on Example 1 Exercise 1 Open ModelMaker file for example 1 From result table, extract fitted value for each sampling time, write down in output tables for parent, metabolite1 and metabolite2 Enter values in Metabolitesexamplesoutput.xls, averages are calculated automatically

9 Hands-on Example 1 Visual assessment Check ModelMaker plot of fit and answer following questions for each substance –Does fitted line adequately describe data, are there obvious over- or under- predictions (including day-0) Plot residuals for each substance and answer following questions –Do residuals show distinct pattern, are most of the points above or below 0- line, what is the magnitude? Statistical indices Calculate minimum 2 error percentage for each substance Perform t-test for all rate constant parameters (parent and metabolites) and record P-value and conclusion

10 Hands-on Example 1 Visual Assessment GraphAssessment / Remarks Parent Overall fit Residuals Metabolite1 Overall fit Residuals Metabolite2 Overall fit Residuals

11 Hands-on Example 1 Statistical Indices 2 -test Relevant Parameters Estimated (y/n) Number of Parameters Minimum 2 Error Percentage Parent Pini kP Metabolite1 ffM1 kM1 Metabolite2 ffM2 kM2 t-test Estimated Value Standard Error Number of Data Points Number of Estimated Parameters P-valueConclusion kP kM1 kM2

12 Hands-on Example 2, Parent FOMC Exercise 2 Open ModelMaker file for example 2, parent FOMC From result table, extract fitted value for each sampling time, write down in output tables for parent and metabolite Enter values in Metabolitesexamplesoutput.xls, averages are calculated automatically Perform visual assessment and calculate statistical indices

13 Hands-on Example 2, parent FOMC Visual Assessment GraphAssessment / Remarks Parent Overall fit Residuals Metabolite Overall fit Residuals

14 Hands-on Example 2, parent FOMC Statistical Indices 2 -test Relevant Parameters Estimated (y/n) Number of Parameters Minimum 2 Error Percentage Parent Pini P Metabolite ffM kM t-test Estimated Value Standard Error Number of Data Points Number of Estimated Parameters P-valueConclusion kM

15 Hands-on Example 2, parent DFOP Exercise 3 Open ModelMaker file for example 2, parent DFOP From result table, extract fitted value for each sampling time, write down in output tables for parent and metabolite Enter values in Metabolitesexamplesoutput.xls, averages are calculated automatically Perform visual assessment and calculate statistical indices

16 Hands-on Example 2, parent DFOP Visual Assessment GraphAssessment / Remarks Parent Overall fit Residuals Metabolite Overall fit Residuals

17 Hands-on Example 2, parent DFOP Statistical Indices 2 -test Relevant Parameters Estimated (y/n) Number of Parameters Minimum 2 Error Percentage Parent Pini g k1 k2 Metabolite ffM kM t-test Estimated Value Standard Error Number of Data Points Number of Estimated Parameters P-valueConclusion k1 k2 kM

18 Hands-on Example 2, Metabolite Decline Exercise 4 Open ModelMaker file for example 2, metabolite decline From result table, extract fitted value for each sampling time, write down in output tables for parent and metabolite Enter values in Metabolitesexamplesoutput.xls, averages are calculated automatically Perform visual assessment and calculate statistical indices

19 Hands-on Example 2, Metabolite Decline Visual & Statistical Assessment GraphAssessment / Remarks Metabolite Decline Overall fit Residuals 2 -test Relevant Parameters Estimated (y/n) Number of Parameters Minimum 2 Error Percentage Metabolite Mmax kM t-test Estimated Value Standard Error Number of Data Points Number of Estimated Parameters P-valueConclusion kM

20 Decision Making Trigger Endpoints Based on visual assessment and statistical indices, is SFO model acceptable for the metabolite (in combination with best-fit model for parent)? Yes use SFO DT50/90 endpoints No and clear decline of metabolite, use FOMC model for metabolite (in combination with best-fit model for parent) –If FOMC acceptable based on visual assessment and statistical indices, use FOMC DT50/90 endpoints –If not, model decline of metabolite with best-fit model and use decline DT50/90 as conservative endpoints No and no apparent decline of metabolite –Assess relevance of study with regard to metabolite –Check other studies –Study with metabolite may be needed

21 Decision Making Modeling Endpoints Based on visual assessment and statistical indices, is SFO model acceptable for the metabolite (in combination with appropriate model for parent)? Yes use modeling endpoints for metabolite No and clear decline –If formation fraction estimate is reliable, use with decline rate constant as conservative endpoints –If not, use formation fraction of 1 with decline rate constant as conservative endpoints –If metabolite biphasic, use appropriate higher-Tier approach (e.g. DFOP, PEARL) –If terminal metabolite and biphasic, use FOMC DT90/3.32 as half-life No and no apparent decline of metabolite –Assess relevance of study with regard to metabolite –Check other studies –Study with metabolite may be needed

22 Hands-on Examples Determine appropriate trigger and modeling endpoints for Example 1 metabolites 1 and 2 and Example 2 metabolite