Presentation is loading. Please wait.

Presentation is loading. Please wait.

I is for Investigation Outbreak Investigation Methods from Mystery to Mastery.

Similar presentations


Presentation on theme: "I is for Investigation Outbreak Investigation Methods from Mystery to Mastery."— Presentation transcript:

1 I is for Investigation Outbreak Investigation Methods from Mystery to Mastery

2 Session V Analyzing Data

3 Session Overview Analysis planning and data cleaning Attack rates Hypothesis testing in analytic epidemiology –Measures of association –Tests of significance

4 Learning Objectives Understand the purpose of an analytic study in an epidemiologic outbreak investigation Generate measures of association for cohort and case-control studies Interpret measures of association (risk ratios, odds ratios) and corresponding confidence intervals Interpret a statistical test of significance

5 Basic Steps of an Outbreak Investigation 1.Verify the diagnosis and confirm the outbreak 2.Define a case and conduct case finding 3.Tabulate and orient data: time, place, person 4.Take immediate control measures 5.Formulate and test hypotheses 6.Plan and execute studies 7.Implement and evaluate control measures 8.Communicate findings

6 Analysis Planning and Data Cleaning

7 Purpose of Analysis Planning Tailor questionnaire to collect necessary data in the correct format Select the most appropriate epidemiologic methods Use data efficiently in analytic software

8 Factors Influencing Data Analysis Research question Exposure and outcome variables Study design Sampling

9 Steps for Analysis Planning 1.Work backwards from the research question(s) to design the most efficient data collection instrument 2.Study design will determine which statistical tests and measures of association you evaluate in the analysis output 3.Consider the need to present, graph, or map data

10 Analysis Planning: Step 1 1.Work backwards from the research question(s) to design the most efficient data collection instrument –Develop a sound data collection instrument –Collect pieces of information that can be counted, sorted, and recoded or stratified Analysis phase is not the time to realize that you should have asked questions differently!

11 Analysis Planning: Step 2 2.Study design will determine which statistical tools you will use –Use risk ratio (RR) with cohort studies and odds ratio (OR) with case-control studies Match sampling methods to the required special types of analysis

12 Analysis Planning: Step 3 3.Consider the need to present, graph, or map data –Consider continuous versus categorical data –Collect additional data needed to map data

13 Data Cleaning Check for accuracy –Outliers Check for completeness –Missing values Get to know the descriptive findings Determine whether or not to create or collapse data categories

14 Data Cleaning: Distribution of Variables “Outlier”

15 Data Cleaning: Outliers Value falling far outside the values of the sample –First or last cases in an outbreak –Very high or very low values for a variable May be due to a collection, coding or data entry error If not due to error, the outlier may represent: –Baseline level of illness / unrelated to outbreak –Outbreak source –Case exposed earlier than the others –Case exposed later than the others –Case with a long incubation period

16 Data Cleaning: Missing Values Determine missing values by performing frequency distribution Select missing values for follow up Reasons for missing values –Expected missing values Skip patterns Non applicable questions –Problems in data collection or data entry

17 Data Cleaning: Data Categories Determine which variables are continuous versus categorical Decide whether to collapse existing categories –Never, former, sometimes, frequently –Ever, never Assess need to categorize continuous variables

18 Attack Rates

19 Understanding Attack Rates Attack Rate (AR) # of people who became ill # of people at risk (for a limited period of time) Exposure-specific AR # people who were exposed and became ill # of people who were exposed

20 Food-Specific Attack Rates Consumed Item Did Not Consume Item ItemIllTotalAR(%)IllTotalAR(%) Chicken124626172959 Cake264361203263 Water102442335165 Green Salad42547832114 Asparagus4667426961 CDC. Outbreak of foodborne streptococcal disease. MMWR 23:365, 1974. This food is probably not the source of infection

21 Hypothesis Testing in Analytic Epidemiology

22 Hypothesis Generation vs. Hypothesis Testing Generate hypotheses –Interview officials, case-patients, and others –Review the literature –Assess descriptive epidemiology Test hypotheses –After hypotheses have been generated –Use analytic epidemiology

23 Descriptive Versus Analytic Epidemiology Descriptive EpidemiologyAnalytic Epidemiology Search for cluesClues available Formulate hypothesesTest hypotheses No comparison groupComparison group Answers: −How much −Who −What −When −Where Answers: −How −Why

24 Measures of Association Assess the strength of an association between an exposure and the outcome of interest Two widely used measures –Risk ratio (relative risk, RR) Used with cohort studies –Odds ratio (OR) Used with case-control studies

25 2 x 2 Tables Outcome ExposureYesNoTotal Yesaba + b Nocdc + d Totala + cb + da + b + c + d Used to summarize counts of disease and exposure in order to do calculations of association

26 2 x 2 Tables Outcome ExposureYesNoTotal Yesaba + b Nocdc + d Totala + cb + da + b + c + d a = number who are exposed and have the outcome b = number who are exposed and do not have the outcome c = number who are not exposed and have the outcome d = number who are not exposed and do not have the outcome

27 2 x 2 Tables a + b = total number who are exposed c + d = total number who are not exposed a + c = total number who have the outcome b + d = total number who do not have the outcome a + b + c + d = total study population Outcome ExposureYesNoTotal Yesaba + b Nocdc + d Totala + cb + da + b + c + d

28 Risk Ratio IllNot IllTotal ExposedABA+B UnexposedCDC+D Risk Ratio[A/(A+B)] [C/(C+D)]

29 Interpreting a Risk Ratio RR = 1.0 –No association between exposure and disease RR > 1.0 –Positive association (more exposure = more disease) RR < 1.0 –Negative association / protective effect (more exposure = less disease)

30 Risk Ratio Example IllWellTotal Ate alfalfa sprouts431154 Did not eat alfalfa sprouts 31821 Total462975 RR = (43 / 54) / (3 / 21) = 5.6

31 Odds Ratio CasesControls ExposedAB UnexposedCD Odds Ratio(A/C)/(B/D)=(A*D)/(B*C)

32 Interpreting an Odds Ratio OR = 1.0 –No association between exposure and disease OR > 1.0 –Positive association OR < 1.0 –Negative association

33 Odds Ratio Example CaseControlTotal Ate at restaurant X 602585 Did not eat at restaurant X 185573 Total 7880158 OR = (60 / 18) / (25 / 55) = 7.3

34 What to Do with a Zero Cell CaseControlTotal Ate at restaurant X600 Did not eat at restaurant X 185573 Total7855133 Try to recruit more study participants Add 1 to each cell Remember to document / report this!

35 Tests of Significance Indication of reliability of the association Answers the question: “How likely is it that the observed association may be due to chance?” Two main methods: –95% Confidence Intervals (CIs) –p-values

36 Confidence Intervals Consist of a lower bound and an upper bound –Example: RR=1.9, 95% CI: 1.1, 3.1 Allow the investigator to: –Evaluate statistical significance (does not include 1.0) –Assess the precision of the estimate (the odds ratio or risk ratio)

37 Confidence Interval Precision Narrow confidence intervals –More precise –Larger sample size, less random error –Example: OR=10, 95% CI: 9.0, 11.0 Wide confidence intervals –Less precise –Smaller sample size, more random error –Example: OR=10, 95% CI: 0.9, 44.0

38 p-values How likely the observed association would be to occur by chance alone, in the absence of a true association A small p-value –Means that it is unlikely to have observed the RR or OR observed if there was no true association –Interpreted as a true association A p-value of 0.05 –Indicates a 5% chance that the RR or OR was observed by chance alone –Often used as the cut-point for statistical significance

39 Plan and Execute Additional Studies Re-examine data or collect new data –Test new hypotheses –Gather more specific info –Example: Salmonella muenchen Investigate new hypotheses about transmission and prevention –Intervention studies –Example: Intensive hand-washing to prevent Norwalk on cruise ships

40 Session 5 Summary Analysis planning ensures useful data that can be used in the analytic phase to address study hypotheses Attack rates are descriptive statistics useful for comparing the risk of disease in groups with different exposures Analytic epidemiology allows you to test the hypotheses generated via review of descriptive statistics and the medical literature

41 Session 5 Summary The measure of association for case- control analytic studies is the odds ratio The measure of association for cohort analytic studies is the risk ratio Confidence intervals and p-values can be used to evaluate the statistical significance of measures of association

42 References and Resources Centers for Disease Control and Prevention (1992). Principles of Epidemiology, 2 nd ed. Atlanta, GA: Public Health Practice Program Office. Gordis L. (1996). Epidemiology. Philadelphia, WB Saunders. Rothman KJ. Epidemiology: An Introduction. New York, Oxford University Press, 2002. Stehr-Green, J. and Stehr-Green, P. (2004). Hypothesis Generating Interviews. Module 3 of a Field Epidemiology Methods course being developed in the UNC Center for Public Health Preparedness, UNC Chapel Hill.


Download ppt "I is for Investigation Outbreak Investigation Methods from Mystery to Mastery."

Similar presentations


Ads by Google