Saturday, August 06, 2016 Farrokh Alemi, PhD.

Slides:



Advertisements
Similar presentations
Contingency Tables Prepared by Yu-Fen Li.
Advertisements

M2 Medical Epidemiology
Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)
Hypothesis Testing IV Chi Square.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #17.
Experimental Evaluation
5-3 Inference on the Means of Two Populations, Variances Unknown
Go to Table of ContentTable of Content Analysis of Variance: Randomized Blocks Farrokh Alemi Ph.D. Kashif Haqqi M.D.
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
Categorical Data Prof. Andy Field.
Analysis of Categorical Data
- Interfering factors in the comparison of two sample means using unpaired samples may inflate the pooled estimate of variance of test results. - It is.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Contingency tables Brian Healy, PhD. Types of analysis-independent samples OutcomeExplanatoryAnalysis ContinuousDichotomous t-test, Wilcoxon test ContinuousCategorical.
The binomial applied: absolute and relative risks, chi-square.
What is “collapsing”? (for epidemiologists) Picture a 2x2 tables from Intro Epi: (This is a collapsed table; there are no strata) DiseasedUndiseasedTotal.
Chapter 10 The t Test for Two Independent Samples
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Introduction to Biostatistics, Harvard Extension School, Fall, 2005 © Scott Evans, Ph.D.1 Contingency Tables.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Chapter 10: The t Test For Two Independent Samples.
Political Science 30: Political Inquiry. How Sure is Sure? Quantifying Uncertainty in Tables Using Two-Way Tables SAT scores and UC admissions What’s.
Comparing Counts Chi Square Tests Independence.
Basic Statistics The Chi Square Test of Independence.
Political Science 30: Political Inquiry
Chi-Square (Association between categorical variables)
Chapter 12 Analysis of count data.
Lecture Slides Elementary Statistics Twelfth Edition
The binomial applied: absolute and relative risks, chi-square
Association between two categorical variables
Lecture 8 – Comparing Proportions
CHAPTER 11 Inference for Distributions of Categorical Data
Chapter 8: Inference for Proportions
Elementary Statistics
Introduction to Inferential Statistics
Data Analysis for Two-Way Tables
Log Linear Modeling of Independence
Elementary Statistics
Analysis of count data 1.
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
2. Stratified Random Sampling.
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
2. Stratified Random Sampling.
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Receiver Operating Curves
Stratification Matters: Analysis of 3 Variables
Test of Independence in 3 Variables
Chapter 11: Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Comparing two Rates Farrokh Alemi Ph.D.
Chi Square (2) Dr. Richard Jackson
One way ANALYSIS OF VARIANCE (ANOVA)
CHAPTER 11 Inference for Distributions of Categorical Data
Wednesday, September 21, 2016 Farrokh Alemi, PhD.
Analyzing the Association Between Categorical Variables
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Section 11-1 Review and Preview
CHAPTER 11 Inference for Distributions of Categorical Data
Inference for Two Way Tables
Applied Statistics Using SPSS
CHAPTER 11 Inference for Distributions of Categorical Data
Applied Statistics Using SPSS
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Risk Adjusted P-chart Farrokh Alemi, Ph.D.
Case-control studies: statistics
Presentation transcript:

Saturday, August 06, 2016 Farrokh Alemi, PhD. Stratification Saturday, August 06, 2016 Farrokh Alemi, PhD. This lecture introduce the concept of stratification. The lecture is based on slides prepared by Dr. Lin and modified by Dr. Alemi.

Each group with one or more variables held constant Divide Into Subgroups Each group with one or more variables held constant In analysis, often we need to separate the analysis into several sub-group analysis; each subgroup holds one or more confounding variable as constant. These subgroups are called strata and the process of dividing the data into the subgroups is called stratification. In general the data will be stratified into k sub-groups according to one or more confounding variables. Each strata has the same levels of the confounders so the effect of exposure on outcome can be calculated without the influence of the confounders.

Same Levels of Confounders 1 … K Desired Outcome Not Desired Outcome Exposed (Cases) a1 b1 Not Exposed (Controls) c1 d1 Same Levels of Confounders 1 … K Desired Outcome Not Desired Outcome Exposed (Cases) a2 b2 Not Exposed (Controls) c2 d2 The data for each stratum consist of a 2 × 2 contingency tables relating exposure to disease. Each entry in the table is the count of cases that share all the same confounder levels and fall into the table. Here we are showing 2 strata each having a 2 by 2 level with counts within the same table sharing the confounder levels within the strata.

Same Levels of Confounders 1 … K Desired Outcome Not Desired Outcome Exposed (Cases) ai bi Not Exposed (Controls) ci di This table shows the relationship between exposure and outcome in the ith strata. Note that the relationships in this 2 by 2 table does not depend on the confounders as these levels are held constant, what scientist call ceteris paribus

Same Levels of Confounders 1 … K Desired Outcome Not Desired Outcome Exposed (Cases) ai bi Not Exposed (Controls) ci di Fisher’s exact test provides a method of testing relationships across strata. This test assumes that the distribution of count of patients who were exposed and had the desired outcome, shown as a subscript i, follows a hyper-geometric distribution.

Same Levels of Confounders 1 … K Desired Outcome Not Desired Outcome Exposed (Cases) ai bi Not Exposed (Controls) ci di The test procedure will be based on a comparison of the observed number of units in the cell a subscript i with the expected number of units for that cell. The test procedure is the same regardless of the order of the rows and columns.

Same Levels of Confounders 1 … K Desired Outcome Not Desired Outcome Exposed (Cases) ai bi Not Exposed (Controls) ci di Note that for each strata, the sum of all 4 counts is shown as n with subscript i.

Mantel-Haenszel Test for Association To test if the exposure is associated with certain outcomes across strata, we need to introduce Mantel-Haenszel Test for Association

Mantel-Haenszel Test for Association Observed Mantel-Haenszel Test for association over different strata is calculated in several steps. First the sum of observed values is calculated. Same Levels of Confounders 1 … K Desired Outcome Not Desired Outcome Cases ai bi Controls ci di

Mantel-Haenszel Test for Association Observed Expected Variance Then in steps 2 and 3 the expected value and variance are calculated. Same Levels of Confounders 1 … K Desired Outcome Not Desired Outcome Cases ai bi Controls ci di

Mantel-Haenszel Test for Association Observed Expected Variance Chi-square Finally the chi-square test statistic is calculated. The square difference of observed and expected divided by the variance has a chi-square distribution with 1 degree of freedom. This allows the test of statistical significance of the association between exposure and the outcome across strata. Same Levels of Confounders 1 … K Desired Outcome Not Desired Outcome Cases ai bi Controls ci di

Table 1: Satisfaction Across Teams Mantel-Haenszel Test for Association Table 1: Satisfaction Across Teams Physicians Nurses Complaint Yes No George, MD Jim, RN 53 424 Jill, RN 11 37 Smith, MD 16 4 139 Let us look at an example. There are 2 strata in this table. The nurses performance is stratified by the physicians in the team. The outcome of interest is whether the patient complained. Here we see counts of number of complaints for different teams. We can use these data to test the association across strata.

Mantel-Haenszel Test for Association Physicians Nurses Complaint Yes No Total George, MD Jim, RN 53 424 477 Jill, RN 11 37 48 64 461 525 Physician Nurses Complaint Yes No Total Smith, MD Jim, RN 16 Jill, RN 4 139 143 155 159 There are 2 strata in this table, what we have elsewhere called partial tables. The nurses performance is stratified by the physicians in the team. We can use these data to test the association between nurses and complaints while holding the physician the same.

Mantel-Haenszel Test for Association The number of complaints for Jim across the two strata is 53.. Physicians Nurses Complaint Yes No Total George, MD Jim, RN 53 424 477 Jill, RN 11 37 48 64 461 525 Physician Nurses Complaint Yes No Total Smith, MD Jim, RN 16 Jill, RN 4 139 143 155 159

Mantel-Haenszel Test for Association If these complaints were occurring randomly, we would expect 58 complaints. Physicians Nurses Complaint Yes No Total George, MD Jim, RN 53 424 477 Jill, RN 11 37 48 64 461 525 Physician Nurses Complaint Yes No Total Smith, MD Jim, RN 16 Jill, RN 4 139 143 155 159

Mantel-Haenszel Test for Association The variance of the count of complaints is 5.03. Physicians Nurses Complaint Yes No Total George, MD Jim, RN 53 424 477 Jill, RN 11 37 48 64 461 525 Physician Nurses Complaint Yes No Total Smith, MD Jim, RN 16 Jill, RN 4 139 143 155 159

Mantel-Haenszel Test for Association If we look up a chi-square table, we see that for two sided test, each side having .025 probability of error and 1 degree of freedom, the chi-square statistic is 5.024. Therefore, we cannot reject the hypothesis that Jim and Jill have the same rates of complaints. Physicians Nurses Complaint Yes No Total George, MD Jim, RN 53 424 477 Jill, RN 11 37 48 64 461 525 Physician Nurses Complaint Yes No Total Smith, MD Jim, RN 16 Jill, RN 4 139 143 155 159

Test of Homogeneity of Odds Ratio Ok so Jim and Jill are not significantly different from each other across the strata, but perhaps they are different in some but not other strata. Perhaps the performance of Jim and Jill depend on who they are team up with. To test for this possibility, we need to introduce test of homogeneity. This test evaluates the hypothesis that the odds ratio in each strata is the same value. Not the odds ratio is a particular value but that it is the same value.

Test of Homogeneity of Odds Ratio Effect Modifiers & Interactions In general, we stratify the study population into k strata according to the confounding variable, confounder C. If the underling (true) odds ratio is different across the k strata, then there is said to be interaction or effect modification between risk factor and confounder. Then the confounder C is referred to as an effect modifier.

Test of Homogeneity of Odds Ratio The log of odds ratio is calculated for each strata.

Test of Homogeneity of Odds Ratio The variance of the log odds ratio can be calculated as the inverse of the count of each cell. Notice that this value is not defined when a count is zero and therefore in these situations an approximate measure is calculated by increasing the zero to 0.5.

Test of Homogeneity of Odds Ratio The expected value for log of odds ratio is calculated as a weighted average of the log odds ratios across strata. This is what we expect the log odds ratio to be if all the odds ratio within different strata came from the same distribution.

Test of Homogeneity of Odds Ratio Finally, the chi square test for homogeneity can be examined with k minus one degree of freedom.

Test of Homogeneity of Odds Ratio Let us apply this to an example set of data. The log of the odds ratio for the first strata is -.87 and for the second one is 0.25. Note that because the cell has a count of zero, we have replaced the cell value with 0.5 and adjusted the table so that the marginal values stay the same. Physicians Nurses Complaint Yes No Total George, MD Jim, RN 53 424 477 Jill, RN 11 37 48 64 461 525 Physician Nurses Complaint Yes No Total Smith, MD Jim, RN 16 Jill, RN 4 139 143 155 159

Test of Homogeneity of Odds Ratio The variance, shown as w, is 7.18 in the first strata and .42 in the second. Physicians Nurses Complaint Yes No Total George, MD Jim, RN 53 424 477 Jill, RN 11 37 48 64 461 525 Physician Nurses Complaint Yes No Total Smith, MD Jim, RN 16 Jill, RN 4 139 143 155 159

Test of Homogeneity of Odds Ratio The expected log of the odds ratio is -.81 and the chi-square statistic for homogeneity with 2 minus 1 degrees of freedom is calculated to be 0.5. The hypothesis that these two odds ratio are the same is not rejected. Physicians Nurses Complaint Yes No Total George, MD Jim, RN 53 424 477 Jill, RN 11 37 48 64 461 525 Physician Nurses Complaint Yes No Total Smith, MD Jim, RN 16 Jill, RN 4 139 143 155 159

Estimate of Common Odds Ratio IF Estimate of Common Odds Ratio If there is a statistically significant common odds ratio across the strata, and if the impact is homogenous across strata, the estimate of the common odds ratio can be calculated. The Mantel-Haenszel test provides a test of significance of the relationship between exposure and outcome. If we reject the null hypothesis in Mantel-Haenszel test, then there exist an association but we still do not know the magnitude of the association. In addition, if we do not reject the null hypothesis of common odds ratio across stratum, we are reassured that the common odds ratio exists and is meaningful. If the true odds ratios are different, then it makes no sense to obtain a pooled-odds ratio estimate.

Estimate of Common Odds Ratio Neither Mantel-Haenszel nor Chi-square test for homogeneity of odds ratios give a measure of the strength of the association. The following procedures describe how to pool a measure of association across strata.

Estimate of Common Odds Ratio The estimated common odds ratio is calculated as indicated here.

Estimate of Common Odds Ratio In addition, the confidence interval for this association can be calculated based on the variance of this measure across strata.

Estimate of Common Odds Ratio Let us see an example. The common odds ratio is calculated to be 0.5. This means that the odds of a complaint for nurse Jim is half as many times as the odds of complaint for nurse Jill. This estimate is made across the two strata. This estimate makes sense only if the strata are not significantly different. Earlier we showed that they were not, so the calculation of this estimate makes sense. Physicians Nurses Complaint Yes No Total George, MD Jim, RN 53 424 477 Jill, RN 11 37 48 64 461 525 Physician Nurses Complaint Yes No Total Smith, MD Jim, RN 16 Jill, RN 4 139 143 155 159

Stratification allows us to examine the common impact across strata Take Home Lessons Stratification allows us to examine the common impact across strata These slides have shown a number of ways in which stratification can help us understand if a common odds ratio across strata exists and is statistically significant.