Accuracy and Precision of Fitting Methods

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Design of Experiments Lecture I
3.3 Hypothesis Testing in Multiple Linear Regression
Objectives 10.1 Simple linear regression
BA 275 Quantitative Business Methods
Monte Carlo Methods and Statistical Physics
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Mitigating Risk of Out-of-Specification Results During Stability Testing of Biopharmaceutical Products Jeff Gardner Principal Consultant 36 th Annual Midwest.
Propagation of Error Ch En 475 Unit Operations. Quantifying variables (i.e. answering a question with a number) 1. Directly measure the variable. - referred.
Data Freshman Clinic II. Overview n Populations and Samples n Presentation n Tables and Figures n Central Tendency n Variability n Confidence Intervals.
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Discussion on four presentations on Study design and statistical modeling for accelerated stability testing Regular long term stability – shelf life is.
Oscar Go, Areti Manola, Jyh-Ming Shoung and Stan Altan
Simple Linear Regression Analysis
Chemometrics Method comparison
Review of normal distribution. Exercise Solution.
Objectives of Multiple Regression
Regression and Correlation Methods Judy Zhong Ph.D.
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Introduction to Linear Regression and Correlation Analysis
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
LINEAR REGRESSION Introduction Section 0 Lecture 1 Slide 1 Lecture 5 Slide 1 INTRODUCTION TO Modern Physics PHYX 2710 Fall 2004 Intermediate 3870 Fall.
1 CE 530 Molecular Simulation Lecture 7 David A. Kofke Department of Chemical Engineering SUNY Buffalo
1 Statistical Mechanics and Multi- Scale Simulation Methods ChBE Prof. C. Heath Turner Lecture 11 Some materials adapted from Prof. Keith E. Gubbins:
Juan Chen1, Sabine Thielges2, William R
Calibration Guidelines 1. Start simple, add complexity carefully 2. Use a broad range of information 3. Be well-posed & be comprehensive 4. Include diverse.
Simple Linear Regression Models
Chapter 15 Modeling of Data. Statistics of Data Mean (or average): Variance: Median: a value x j such that half of the data are bigger than it, and half.
880.P20 Winter 2006 Richard Kass 1 Confidence Intervals and Upper Limits Confidence intervals (CI) are related to confidence limits (CL). To calculate.
Stats for Engineers Lecture 9. Summary From Last Time Confidence Intervals for the mean t-tables Q Student t-distribution.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Estimation of Statistical Parameters
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 Part 4 Curve Fitting.
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Practical Statistical Analysis Objectives: Conceptually understand the following for both linear and nonlinear models: 1.Best fit to model parameters 2.Experimental.
Simple Linear Regression One reason for assessing correlation is to identify a variable that could be used to predict another variable If that is your.
© 2001 Prentice-Hall, Inc. Statistics for Business and Economics Simple Linear Regression Chapter 10.
General Confidence Intervals Section Starter A shipment of engine pistons are supposed to have diameters which vary according to N(4 in,
Brian Macpherson Ph.D, Professor of Statistics, University of Manitoba Tom Bingham Statistician, The Boeing Company.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Statistical Methods II&III: Confidence Intervals ChE 477 (UO Lab) Lecture 5 Larry Baxter, William Hecker, & Ron Terry Brigham Young University.
Simple Linear Regression. The term linear regression implies that  Y|x is linearly related to x by the population regression equation  Y|x =  +  x.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Propagation of Error Ch En 475 Unit Operations. Quantifying variables (i.e. answering a question with a number) 1. Directly measure the variable. - referred.
Statistics Presentation Ch En 475 Unit Operations.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 21 The Simple Regression Model.
CONFIDENCE INTERVALS.
Correlation – Recap Correlation provides an estimate of how well change in ‘ x ’ causes change in ‘ y ’. The relationship has a magnitude (the r value)
Chapter 8: Simple Linear Regression Yang Zhenlin.
Sampling Fundamentals 2 Sampling Process Identify Target Population Select Sampling Procedure Determine Sampling Frame Determine Sample Size.
The Importance of Sample Size and Its Varying Effects on Precision in Large-Scale Surveys Dipankar Roy, PhD Bangladesh Bureau of Statistics
Machine Learning 5. Parametric Methods.
Statistics Presentation Ch En 475 Unit Operations.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Global predictors of regression fidelity A single number to characterize the overall quality of the surrogate. Equivalence measures –Coefficient of multiple.
BPS - 5th Ed. Chapter 231 Inference for Regression.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Fundamentals of Data Analysis Lecture 11 Methods of parametric estimation.
The simple linear regression model and parameter estimation
Regression Analysis AGEC 784.
Statistical Methods For Engineers
Statistics Review ChE 477 Winter 2018 Dr. Harding.
Introduction to Instrumentation Engineering
Some Design Recommendations For ASAP Studies
Real-time Uncertainty Output for MBES Systems
Presentation transcript:

Accuracy and Precision of Fitting Methods A Scientific and Statistical Analysis of Accelerated Aging for Pharmaceuticals: Accuracy and Precision of Fitting Methods Kenneth C. Waterman, Ph.D. Jon Swanson, Ph.D. FreeThink Technologies, Inc. ken.waterman@freethinktech.com 2014

ken.waterman@freethinktech.com 2014 Outline Accuracy in accelerated aging Point estimates Linear estimates Isoconversion Uncertainty in predictions Isoconversion methods Arrhenius Distributions (MC vs. extrema isoconversion) Linear vs. non-linear Low degradant Conclusions ken.waterman@freethinktech.com 2014

Accuracy in Accelerated Aging ken.waterman@freethinktech.com 2014 Statistics must be based on accurate models Most shelf-life today determined by degradant growth not potency loss >50% Drug products show complex kinetics: do not show linear behavior Heterogeneous systems Secondary degradation Autocatalysis Inhibitors Diffusion controlled ken.waterman@freethinktech.com 2014

Complex Kinetics—Example ken.waterman@freethinktech.com 2014 Drug → primary degradant → secondary degradant ken.waterman@freethinktech.com 2014

ken.waterman@freethinktech.com 2014 Accelerated Aging Complex Kinetics 70°C 60°C 50°C Fixed time accelerated stability ken.waterman@freethinktech.com 2014

ken.waterman@freethinktech.com 2014 Accelerated Aging Complex Kinetics More unstable 70°C 60°C 50°C 30°C? Appears very non-Arrhenius Impossible to predict shelf-life from high T results ken.waterman@freethinktech.com 2014

Complex Kinetics: Real Example ken.waterman@freethinktech.com 2014 Accelerated Aging Complex Kinetics: Real Example 80C 70C 50C 60C 30C Real time data CP-456,773/60%RH ken.waterman@freethinktech.com 2014

Accelerated Aging—Isoconversion Approach 0.2% specification limit Isoconversion: %degradant fixed at specification limit, time adjusted ken.waterman@freethinktech.com 2014

Accelerated Aging—Isoconversion Approach Complex Kinetics Using 0.2% isoconversion 70°C 60°C 50°C 30°C ken.waterman@freethinktech.com 2014

Accelerated Aging—Isoconversion Approach Complex Kinetics—Real Example ASAPprime Shelf Life 1.2 yrs Experimental Shelf Life 1.2 yrs 70C 80C 50C 60C 30C Real time data CP-456,773/60%RH ken.waterman@freethinktech.com 2014

ken.waterman@freethinktech.com 2014 More Detailed Example k1 k2 A B C Time points @ 0, 3, 7, 14 and 28 days Shelf-life @25°C using 50, 60 and 70°C k1 = 0.000113%/d k2 = 0.01125%/d @50°C for “B” example (25 kcal/mol) k1 = 0.000112%/d k2 = 0.09%/d @50°C for “C” example (25 kcal/mol) ken.waterman@freethinktech.com 2014

Primary Degradant (“B”) Formation ken.waterman@freethinktech.com 2014 Method Shelf-life (yrs) @25°C Spec. 0.2% Spec. 0.5% Exact 1.43 4.45 4 linear rate constants @ each T 0.62 1.56 1 linear rate constant through 4 points @ each T 0.29 0.71 Single point at isoconversion @ each T Linear fitting of 4 points @ each T to determine intersection with specification 12.35 1.40 Determining intersection with specification using 2 points closest to specification @ each T (or extrapolating from last 2 points, when necessary) 1.36 3.19 ken.waterman@freethinktech.com 2014

ken.waterman@freethinktech.com 2014 Example @40°C Note R2 for line = 0.998 ken.waterman@freethinktech.com 2014

Secondary Degradant (“C”) Formation Method Shelf-life (yrs) @25°C Spec. 0.2% Spec. 0.5% Exact 2.02 4.01 4 linear rate constants @ each T 16.64 41.61 1 linear rate constant through 4 points @ each T 3.29 8.21 Single point at isoconversion @ each T Linear fitting of 4 points @ each T to determine intersection with specification 2.75 7.56 Determining intersection with specification using 2 points closest to specification @ each T (or extrapolating from last 2 points, when necessary) 2.06 4.78 ken.waterman@freethinktech.com 2014

ken.waterman@freethinktech.com 2014 Accuracy Both isoconversion and rate constant methods accurate when behavior is simple Only isoconversion is accurate when degradant formation is complex Carrying out degradation to bracket specification limit at each condition will increase reliability of modeling ken.waterman@freethinktech.com 2014

Estimating Uncertainty ken.waterman@freethinktech.com 2014 Need to use isoconversion for accuracy: defines a 2-step process Estimating uncertainty in isoconversion from degradant vs. time data Propagating to ambient using Arrhenius equation Error bars for degradant formation are not uniform Constant relative standard deviation (RSD) Minimum error of limit of detection (LOD) ken.waterman@freethinktech.com 2014

Isoconversion Uncertainty Methods ken.waterman@freethinktech.com 2014 Confidence Interval: 𝐶𝐼=𝜎 1 𝑛 + 𝑑 𝑜 − 𝑑 2 𝑑 𝑖 − 𝑑 2 Regression Interval: 𝑅𝐼=𝜎 1+ 1 𝑛 + 𝑑 𝑝 − 𝑑 2 𝑑 𝑖 − 𝑑 2 Stochastic: Monte-Carlo distribution Non-stochastic: 2n permutations of ±1σ Extrema: 2n permutations of ±1σ; normalize using zero-error isoconversion - minimum time (maximum degradant) of distribution ken.waterman@freethinktech.com 2014

Test Calculations: Model System ken.waterman@freethinktech.com 2014

Calculations Where Formulae Exist ken.waterman@freethinktech.com 2014 Calculation Method 5 Days (Interpolation) 40-Days (Extrapolation) Regression Interval 0.023% 0.102% Confidence Interval 0.012% 0.100% Stochastic 0.099% Non-Stochastic Extrema 0.020% 0.147% Fixed SD = 0.02% ken.waterman@freethinktech.com 2014

Isoconversion Uncertainty ken.waterman@freethinktech.com 2014 CI too narrow in interpolation regions (< experimental σ); also does not take into account error of fit RI better represents error for predictions RI and CI converge with extrapolation Extrema mimics RI in interpolation; more conservative in extrapolation Note: scientifically less confident in isoconversion extrapolations (model fit) ken.waterman@freethinktech.com 2014

Calculations Where Formulae ken.waterman@freethinktech.com 2014 Do Not Exist Calculation Method 5 Days (Interpolation) 40-Days (Extrapolation) Stochastic 0.016% 0.166% Non-Stochastic Extrema 0.027% 0.223% Fixed RSD = 10% with minimum error of 0.02% (LOD) ken.waterman@freethinktech.com 2014

Arrhenius Fitting Uncertainty ken.waterman@freethinktech.com 2014 Can use full isoconversion distribution from Monte-Carlo calculation Can use extrema calculation Normalized about time (x-axis, degradant set by specification limit) Normalized about degradant (y-axis, time set by zero-error intercept with specification limit) ken.waterman@freethinktech.com 2014

25°C Projected Rate Distributions ken.waterman@freethinktech.com 2014 60, 70, 80°C measurements @10 days; RSD=10%, LOD=0.02%; 25 kcal/mol 50% 2.38 X 10-4%/d 50% 2.34 X 10-4%/d 84.1% 1.42 X 10-4%/d 84.1% 1.43 X 10-4%/d 15.9% 3.83 X 10-4%/d 15.9% 4.05 X 10-4%/d Monte Carlo Isoconversion Monte Carlo Arrhenius Extrema Isoconversion Monte Carlo Arrhenius ken.waterman@freethinktech.com 2014

Arrhenius Fitting Uncertainty ken.waterman@freethinktech.com 2014 Distribution of ambient rates from Monte-Carlo or extrema calculations very similar In both cases, rate is not normally distributed Probabilities need to use a cumulative distribution function ken.waterman@freethinktech.com 2014

Arrhenius Fitting Uncertainty ken.waterman@freethinktech.com 2014 𝑘𝑖𝑠𝑜 𝑇 1 =𝐴 𝑒 − 𝐸 𝑎 𝑅 1 𝑇 2 Can be solved in logarithmic (linear) or exponential (non-linear) form With perfect data, point estimates of rate (shelf-life) will be identical A distribution at each point will generate imperfect fits Least squares will minimize difference between actual and calculated points Non-linear will weight high T more heavily Constant RSD means that higher rates will have greater errors ken.waterman@freethinktech.com 2014

Comparison of Arrhenius Fitting Methods   Extrapolated Shelf-life (years) at 25°C 84.1% Median 15.9% Mean Linear 3.86 2.31 1.43 2.70 Non-linear 7.12 2.33 0.90 5.41 Arrhenius based on isoconversion values @60, 70, 80°C Origin + point at 10 days; spec. limit (0.20%) RSD=10%; LOD = 0.02% Isoconversion distribution using extrema method True shelf-life equals 2.31 years ken.waterman@freethinktech.com 2014

Arrhenius Fitting Uncertainty ken.waterman@freethinktech.com 2014 Non-linear least squares fitting gives larger, less normal distributions of ambient rates Non-linear fitting’s greater weighting of higher temperatures makes non-Arrhenius behavior more likely to cause inaccuracies Since linear is also less computationally challenging, recommend use of linear fitting ken.waterman@freethinktech.com 2014

Low Degradant vs. Standard Deviation For low degradation rate (with respect to the SD), isoconversion less symmetric Becomes discontinuous @Δdeg = 0 (isoconversion = ∞) for any sampled point Can resolve by clipping points with MC Distribution meaning when most points removed? Can use extrema Define behavior with no regression line isoconversion Can define mean from first extrema intercept (2 X value) No perfect answers—modeling better when data show change ken.waterman@freethinktech.com 2014

ken.waterman@freethinktech.com 2014 Notes ICH guidelines allow ±2C and ±5%RH—average drug product shows a factor of 2.7 shelf-life difference within this range ASAP modeling uses both T and RH, both potentially changing with time—errors will change accordingly Assume mathematics the same, but need to focus on instantaneous rates ken.waterman@freethinktech.com 2014

ken.waterman@freethinktech.com 2014 Conclusions Modeling drug product shelf-life from accelerated data more accurate using isoconversion Isoconversion more accurate using points bracketing specification limit than using all points With isoconversion, regression interval (not confidence interval) includes error of fit, but difficult to calculate with varying SD Extrema method reasonably approximates RI for interpolation; more conservative for extrapolation Linear fitting of Arrhenius equation preferred ken.waterman@freethinktech.com 2014

ken.waterman@freethinktech.com 2014 Notes on King, Kung, Fung “Statistical prediction of drug stability based on non-linear parameter estimation” J. Pharm. Sci. 1984;73:657-662 Used rates based on each time point independently Changing rate constants not projected accurately for shelf-life Gives greater precision by treating each point as equivalent, even when far from isoconversion (32 points at 4 T’s gives better error bars than just 4 isoconversion values: more precise, but more likely to be wrong) Non-linear fitting to Arrhenius Weights higher T more heavily (and where they had most degradation) Made more sense with constant errors used for loss of potency Non-linear fitting in general bigger, less symmetric error bars, more likely to be in error if mechanism shift with T Used mean and SD for linear fitting, even when not normally distributed (i.e., not statistically valid method) Do not recommend general use of KKF method (fine for ideal behavior, loss of potency) ken.waterman@freethinktech.com 2014