Incorporating Historical Control Data when Comparing Tumor Incidence Rates in Rodent Cancer Bioassay Shyamal D. Peddada Biostatistics Branch National Inst.

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

Issues and Experience in Analyzing Transgenic Mouse Carcinogenicity Studies: An Industry Perspective Ronald Menton Wyeth Research 2005 FDA/Industry Statistics.
Evaluation of Live Phase Results from Carcinogenicity Studies Wherly Hoffman, Ph.D. Statistics and Information Sciences Lilly Research Laboratories.
BUS 220: ELEMENTARY STATISTICS
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
Inference in the Simple Regression Model
SADC Course in Statistics Introduction to Non- Parametric Methods (Session 19)
SADC Course in Statistics Comparing two proportions (Session 14)
SADC Course in Statistics (Session 09)
Chapter 7 Sampling and Sampling Distributions
International Workshop on Industrial Statistics Dalian, China June 2010 Shyam Upadhyaya UNIDO Use of IIP in other measures.
Department of Engineering Management, Information and Systems
On Comparing Classifiers : Pitfalls to Avoid and Recommended Approach
June 14. In Chapter 9: 9.1 Null and Alternative Hypotheses 9.2 Test Statistic 9.3 P-Value 9.4 Significance Level 9.5 One-Sample z Test 9.6 Power and Sample.
Chapter 7 Hypothesis Testing
1 A STEPWISE MAXIMUM MODULUS TEST WITH UNEQUAL VARIANCES D. Bristol and N. Stouffer Purdue Pharma LP, Princeton NJ.
Chapter 13: Chi-Square Test
Chapter 16 Goodness-of-Fit Tests and Contingency Tables
1 Epidemiologic Measures of Association Saeed Akhtar, PhD Associate Professor, Epidemiology Division of Epidemiology and Biostatistics Aga Khan University,
Chapter 4 Inference About Process Quality
Module 16: One-sample t-tests and Confidence Intervals
Putting Statistics to Work
Inferential Statistics
© The McGraw-Hill Companies, Inc., Chapter 10 Testing the Difference between Means and Variances.
Research Methodology Statistics Maha Omair Teaching Assistant Department of Statistics, College of science King Saud University.
Chapter 18: The Chi-Square Statistic
Estimation of TLD dose measurement uncertainties and thresholds at the Radiation Protection Service Du Toit Volschenk SABS.
Chapter 11: The t Test for Two Related Samples
January Structure of the book Section 1 (Ch 1 – 10) Basic concepts and techniques Section 2 (Ch 11 – 15): Inference for quantitative outcomes Section.
Hypothesis Testing. To define a statistical Test we 1.Choose a statistic (called the test statistic) 2.Divide the range of possible values for the test.
Shibing Deng Pfizer, Inc. Efficient Outlier Identification in Lung Cancer Study.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
1 Health Warning! All may not be what it seems! These examples demonstrate both the importance of graphing data before analysing it and the effect of outliers.
Bayesian inference “Very much lies in the posterior distribution” Bayesian definition of sufficiency: A statistic T (x 1, …, x n ) is sufficient for 
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Carcinogen Classification Criteria Patricia Richter Ph.D., DABT Tobacco Products Scientific Advisory Committee June 8, 2010.
Cox Model With Intermitten and Error-Prone Covariate Observation Yury Gubman PhD thesis in Statistics Supervisors: Prof. David Zucker, Prof. Orly Manor.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Resampling techniques
Evaluating Hypotheses
Statistical Methods for Analyzing Ordered Gene Expression Microarray Data Shyamal D. Peddada Biostatistics Branch National Inst. Environmental Health Sciences.
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
Experimental Evaluation
Survival Analysis A Brief Introduction Survival Function, Hazard Function In many medical studies, the primary endpoint is time until an event.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Hypothesis Testing.
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
T-Tests and Chi2 Does your sample data reflect the population from which it is drawn from?
Comparing Two Population Means
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
PARAMETRIC STATISTICAL INFERENCE
An Empirical Likelihood Ratio Based Goodness-of-Fit Test for Two-parameter Weibull Distributions Presented by: Ms. Ratchadaporn Meksena Student ID:
Bayesian Analysis and Applications of A Cure Rate Model.
1 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 12 The Analysis of Categorical Data and Goodness-of-Fit Tests.
False Discovery Rates for Discrete Data Joseph F. Heyse Merck Research Laboratories Graybill Conference June 13, 2008.
Chapter 16 The Chi-Square Statistic
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Confidence intervals and hypothesis testing Petter Mostad
1 From Mice to Men Cancer is not inevitable at Old Age Talk to OEHHA Oakland April 12th 2002 Richard Wilson Mallinckrodt Research Professor of Physics.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
Suppose we have T genes which we measured under two experimental conditions (Ctl and Nic) in n replicated experiments t i * and p i are the t-statistic.
Stats 845 Applied Statistics. This Course will cover: 1.Regression –Non Linear Regression –Multiple Regression 2.Analysis of Variance and Experimental.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
CHAPTER OVERVIEW Say Hello to Inferential Statistics The Idea of Statistical Significance Significance Versus Meaningfulness Meta-analysis.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Introduction to Biostatistics, Harvard Extension School, Fall, 2005 © Scott Evans, Ph.D.1 Contingency Tables.
Lecture Slides Elementary Statistics Twelfth Edition
Presentation transcript:

Incorporating Historical Control Data when Comparing Tumor Incidence Rates in Rodent Cancer Bioassay Shyamal D. Peddada Biostatistics Branch National Inst. Environmental Health Sciences (NIH) Research Triangle Park, NC

Outline of the talk The National Toxicology Program (NTP). NTPs 2-year rodent cancer bioassay. – Study design. – A motivating example. – Questions of interest. Brief review of some existing methods and the current practice. Proposed methodology. Illustration. Concluding remarks.

The National Toxicology Program (NTP) Established in 1978 (became permanent in 1981) One of the responsibilities of NTP is to provide information about potentially toxic chemicals. Humans are exposed to a large collection of chemicals some of which can cause cancer. So far NTP has evaluated nearly 550 chemicals.

Study design (NTP) Species: Rats and Mice Sex: Males and Females Number of dose groups for a given chemical: – Control, Low, Medium and High Number of animals per dose group: 50 Duration of study: 2 years Variable of interest: Tumor incidence in various organs

2-Butoxyethanol (TR-484): Male Mice. Hepatocellular Carcinoma Dose (ppm) Number of animals with tumors Route of exposure to the chemical: Inhalation.

2-Butoxyethanol (TR-484): Male Mice. Hepatocellular Carcinoma The P-value using NTPs trend test: Question: Does this suggest a dose-related trend in the incidence of hepetaocellular carcinoma? Question: Is the chemical potentially carcinogenic?

NTPs decision NTP not only relies on the p-value from the current data, but also uses other information in making decisions. – E.g. Any pre-cursors to the given tumor? Tumor incidence in the other sex and species? Mechanism/mode of action? Historically how often was the specific tumor seen? Other tumors? Etc.

Historical control data NTP maintains a database of control animals from each study by sex and species.

(Statistical) problem How to incorporate historical control (HC) data in evaluating the current experimental data for a given tumor? – Should HC be used at all? If so, what are the conditions/circumstances? – How to use the information?

Historical control data – heterogeneity among control groups Potential genetic drift over time. Differences among pathologists over time. Differences between sexes and species. Route of exposure to a chemical Inhalation Feed Water Subcutaneous Dermal Diet used (NIH-07, NTP 2000) Etc.

NTPs general strategy Select suitable controls from the HC database. Match the HC data with the current data by – sex and species – route of chemical exposure (i.e. feed study or inhalation study etc.) – study period (most recent to the current study – 5 year sliding window). Obtain the range of tumor incidence in the selected control groups. – This range is used by the NTP for making decisions.

2-Butoxyethanol (TR-484): Male Mice. Hepatocellular Carcinoma Observed relative frequency of tumors: 10/50, 11/50, 16/50, 21/50 NTPs trend test: P = Historical range: 11% to 48% (mean = 25.7%, sd = 10.4%) The current data are within the historical range ! - Therefore NTP classified this neoplasm under uncertain finding.

2-Butoxyethanol (TR-484): Forestomach squamous cell papillomas (Male mice) Tumor rates: 1/50, 1/50, 2/50, 2/50 P-value > Not significant. Historical range: 0% to 2% (mean = 0.5%, sd = 0.9%) Female mice: 0/50, 1/50, 2/50, 5/50 P –value for trend =

2-Butoxyethanol (TR-484): (0, 62.5, 125, 250 ppm) Notes: 1. This is a rare tumor according to the HC database. 2. Although it is rare, 5/150 dosed male mice had this tumor. 3. A significant trend is seen in female mice. NTP classified this tumor as an uncertain finding.

Question Can a formal statistical procedure be developed to incorporate historical control data when analyzing current data?

Tarone (1982) Beta-binomial model Data from current study Dose... Number of animals with tumors... Number of animals per dose group = The subscript c stands for current study.

Tarone (1982) = Number of historical control groups = Number of tumors in the j-th historical control group are assumed to be binomial random variables with binomial probability are assumed to i.i.d. beta distributed. This is in attempt allow for extra binomial variation between studies.

The hypotheses versus Here denotes the tumor rate in dose group i in the current study (denoted by c).

An important feature of the data Not all animals survive until the end of the study. Consequently, the binomial parameter is not a suitable parameter to study. – This parameter represents life time risk of developing tumors and not tumor incidence!

An example Control : All n = 50 survive until the end of the study (i.e. 2 years) and Y = 5 develop tumors. High dose: 30 animals die within the first 6 months. Of the remaining 20, Y = 2 develop tumors. – Should we compare 5/50 vs 2/50 Or is it 5/50 vs 2/20?

Other methods The following take a more direct approach by studying tumor incidence rather than life-time risk of developing tumors : –Ibrahim and Ryan (1996). Applicable if a tumor is known to be instantly lethal. –Ibrahim et al (1998). Applicable if a tumor is known to be non-lethal. –Dinse and Dunson (2001). A general Bayesian approach. Not very convenient to apply in practice.

NTP Board of Scientific Councilors recommendation (2005) None of the existing methods are endorsed by the NTP and in the 2005 meeting of the NTP Board of Scientific Councilors they recommended the development of a formal statistical procedure to incorporate historical control data. – (page 10)

Some notations and background: Analysis of current data

Poly-3 methodology (Bailer and Portier, 1988) Estimation of tumor incidence requires more data, e.g. interim animal sacrifice, or more assumptions. – Interim sacrifices are expensive. NTP developed tests based on hypothesis regarding life time tumor rates but adjust estimates to account for animal survival. – The resulting test procedure seems to perform well even for testing hypothesis on the tumor incidence.

Notations Bailer and Portier (1988, Biometrics): : Length of the study (current study) : Number of days the j-th animal survived. if the j-th animal in the i-th dose in the current study has a tumor. It is 0 otherwise.

Sample size correction for survival data if tumor present before death or. otherwise: Then effective sample size for the i-th group is:

NTPs Poly-3 test Poly-3 survival adjust tumor rate for the i-th dose group in the current study is given by: Using these estimates the NTP constructed the well-known Cochran-Armitage trend test. This is the Poly-3 test.

New test for trend (in current data): Basic idea 1. The classical Cochran-Armitage (CA) trend test is ideal for linear trend in dose-response. 2. The order-restricted inference based procedures work well for monotone responses that are not necessarily linear. 3. The trend test proposed in Peddada et al. (2005, 2006) is the maximum of tests 1 and 2. Thus this procedure works well for linear as well as non-linear, but monotonic responses.

Notations Let

Modified Cochran-Armitage trend test for NTP data

Williams type test for NTP data Let

Test for trend in the current data Peddada, et al. (2005, 2006) –As good as Cochran-Armitage trend test for linear response. –Maintains good power for non-linear but monotone response.

Null distribution For Denote Then the null distribution of can be simulated from

Analysis of current data making use of historical control data

Data Number of historical controls = k Number of animals in the m-th historical control group: Tumor status of the j-th animal in the control group:

Data Number of animals with tumors in the control group: Total number of animals with tumor among all historical controls

Basic ideas The current and all historical controls are random realizations from a population of control animals. If no animals die before the end of the study then we assume:

Basic ideas The proposed test statistic has two components: –As in Peddada et al. (2005, 2006) compare the dose groups from the current study with the current control group by computing: –Similarly, compare the dose groups from the current study with the historical control group by computing:

Basic ideas The two terms are analogous to respectively, except that they account for the between controls variability term, i.e.,

Proposed test Thus the proposed test statistic is then given by: Approximate null distribution of the above statistic can be derived very easily as follows:

Null distribution Let Similarly are isotonized values of

Null distribution Then the null distribution of the proposed test is approximated by the distribution of

Back to the NTP examples

2-Butoxyethanol (TR-484): Male Mice. Hepatocellular Carcinoma Observed relative frequency of tumors: 10/50, 11/50, 16/50, 21/50 NTPs trend test: P = Historical range: 11% to 48% (mean = 25.7%, sd = 10.4%) The current data are within the historical range ! - Therefore NTP classified this neoplasm under uncertain finding.

2-Butoxyethanol (TR-484): Forestomach squamous cell papillomas (Male mice) Tumor rates: 1/50, 1/50, 2/50, 2/50 P-value > Not significant. Historical range: 0% to 2% (mean = 0.5%, sd = 0.9%) Female mice: 0/50, 1/50, 2/50, 5/50 P –value for trend =

2-Butoxyethanol (TR-484): (0, 62.5, 125, 250 ppm) Hepatocellular Carcinoma P-value based on the new historical control test: Forestomach squamous cell papillomas P-value based on the new historical control test: 0.003

2-Butoxyethanol (TR-484): (0, 62.5, 125, 250 ppm) NTP: There was some evidence of carcinogenic activity of 2-Butoxyethanol in male B6C3F1 mice… based on Hemangiosarcoma of liver … A marginal increase in the forestomach squamous cell papilloma and an increase in the incidence of hepatocellular carcinoma may have been exposure related.

Simulation study

Simulation experiment As usually done, we generated independent Weibull random variables for (a) Tumor incidence and (b) Mortality. Weibull parameters were chosen so that the data resemble some of the commonly seen patterns in NTP studies. – Dose effect on mortality: None, Moderate. – Background tumor rate:.001 (rare tumor),.01,.05,.15,.30 (common tumor) – Heterogeneity of historical control groups: Low, Medium and High. – Tumor incidence shape parameter: 1.5, 3 and 6 – Tumor incidence ratio patterns: 5 different non-null patterns and 1 null pattern. – Total number of non-null patterns = 450. – Total number of null patterns = 90. – Each simulation was based on samples.

Some research questions Is there a better approach to this problem? Perhaps an empirical Bayes approach. How do we adjust for co-variables such as the body weight of the animal? – Note that the changes in body weight and animal survival and carcinogenicity are common response to changes in dose of a chemical. How do we perform simultaneous analyses of multiple tumors?

Concluding remarks The new historical control test procedure: (a) Operates at the desired level of significance. (b) Gains in power over NTPs Poly-3 trend test. (c) Computationally simple to implement (a user friendly JAVA- based multiplatform computer software package is available). (d) Unlike other existing statistical procedures, it is non- parametric. No complicated modeling/distributional assumptions are made.

Acknowledgements Gregg Dinse, NIEHS Grace Kissling, NIEHS Shawn Harris, Constella group Kevin McGowan, Constella group