BIOST 536 Lecture 3 1 Lecture 3 – Overview of study designs Prospective/retrospective  Prospective cohort study: Subjects followed; data collection in.

Slides:



Advertisements
Similar presentations
Agency for Healthcare Research and Quality (AHRQ)
Advertisements

How would you explain the smoking paradox. Smokers fair better after an infarction in hospital than non-smokers. This apparently disagrees with the view.
If we use a logistic model, we do not have the problem of suggesting risks greater than 1 or less than 0 for some values of X: E[1{outcome = 1} ] = exp(a+bX)/
Observational Studies and RCT Libby Brewin. What are the 3 types of observational studies? Cross-sectional studies Case-control Cohort.
Epidemiologic study designs
Observational Designs Oncology Journal Club April 26, 2002.
1 Case-Control Study Design Two groups are selected, one of people with the disease (cases), and the other of people with the same general characteristics.
Categorical Data. To identify any association between two categorical data. Example: 1,073 subjects of both genders were recruited for a study where the.
Measures of Disease Association Measuring occurrence of new outcome events can be an aim by itself, but usually we want to look at the relationship between.
Measures of association
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Case-Control Studies. Feature of Case-control Studies 1. Directionality Outcome to exposure 2. Timing Retrospective for exposure, but case- ascertainment.
Measures of disease frequency (I). MEASURES OF DISEASE FREQUENCY Absolute measures of disease frequency: –Incidence –Prevalence –Odds Measures of association:
BIOST 536 Lecture 4 1 Lecture 4 – Logistic regression: estimation and confounding Linear model.
Analysis of Complex Survey Data
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
Introduction to Survival Analysis August 3 and 5, 2004.
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
Study Design / Data: Case-Control, Descriptives Basic Medical Statistics Course: Module C October 2010 Wilma Heemsbergen
Cohort Study.
Multiple Choice Questions for discussion
September 15. In Chapter 18: 18.1 Types of Samples 18.2 Naturalistic and Cohort Samples 18.3 Chi-Square Test of Association 18.4 Test for Trend 18.5 Case-Control.
Dr. Abdulaziz BinSaeed & Dr. Hayfaa A. Wahabi Department of Family & Community medicine  Case-Control Studies.
Epidemiologic Study Designs Nancy D. Barker, MS. Epidemiologic Study Design The plan of an empirical investigation to assess an E – D relationship. Exposure.
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Study Design. Study Designs Descriptive Studies Record events, observations or activities,documentaries No comparison group or intervention Describe.
Types of study designs Arash Najimi
Lecture 6 Objective 16. Describe the elements of design of observational studies: (current) cohort studies (longitudinal studies). Discuss the advantages.
Research Study Design. Objective- To devise a study method that will clearly answer the study question with the least amount of time, energy, cost, and.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 8 – Comparing Proportions Marshall University Genomics.
Smart designs Case control studies FETP India. Competency to be gained from this lecture Design a case control study.
Design and Analysis of Clinical Study 6. Case-control Study Dr. Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia.
Introduction to Survival Analysis Utah State University January 28, 2008 Bill Welbourn.
Epidemiologic design from a sampling perspective Epidemiology II Lecture April 14, 2005 David Jacobs.
Case-control study Chihaya Koriyama August 17 (Lecture 1)
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
Types of study designs.
Causal relationships, bias, and research designs Professor Anthony DiGirolamo.
Case-Crossover Studies.
BIOST 536 Lecture 1 1 Lecture 1 - Introduction Overview of course  Focus is on binary outcomes  Some ordinal outcomes considered Simple examples Definitions.
1 Lecture 6: Descriptive follow-up studies Natural history of disease and prognosis Survival analysis: Kaplan-Meier survival curves Cox proportional hazards.
Describing the risk of an event and identifying risk factors Caroline Sabin Professor of Medical Statistics and Epidemiology, Research Department of Infection.
Overview of Study Designs. Study Designs Experimental Randomized Controlled Trial Group Randomized Trial Observational Descriptive Analytical Cross-sectional.
Study designs. Kate O’Donnell General Practice & Primary Care.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
Case-Control Studies Abdualziz BinSaeed. Case-Control Studies Type of analytic study Unit of observation and analysis: Individual (not group)
Satistics 2621 Statistics 262: Intermediate Biostatistics Jonathan Taylor and Kristin Cobb April 20, 2004: Introduction to Survival Analysis.
Case Control Studies Dr Amna Rehana Siddiqui Department of Family and Community Medicine October 17, 2010.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
X Treatment population Control population 0 Examples: Drug vs. Placebo, Drugs vs. Surgery, New Tx vs. Standard Tx  Let X = decrease (–) in cholesterol.
Matched Case-Control Study Duanping Liao, MD, Ph.D Phone:
Types of Studies. Aim of epidemiological studies To determine distribution of disease To examine determinants of a disease To judge whether a given exposure.
Conditional Logistic Regression Epidemiology/Biostats VHM812/802 Winter 2016, Atlantic Veterinary College, PEI Raju Gautam.
Exact Logistic Regression
THE CHI-SQUARE TEST BACKGROUND AND NEED OF THE TEST Data collected in the field of medicine is often qualitative. --- For example, the presence or absence.
Fall 2002Biostat Inference for two-way tables General R x C tables Tests of homogeneity of a factor across groups or independence of two factors.
Chapter 9 Lecture Research Techniques: For the Health Sciences Fifth Edition © 2014 Pearson Education, Inc. Conducting Analytical Epidemiologic Studies.
1 Study Design Imre Janszky Faculty of Medicine, ISM NTNU.
Case control & cohort studies
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Case Control study. An investigation that compares a group of people with a disease to a group of people without the disease. Used to identify and assess.
Measures of disease frequency Simon Thornley. Measures of Effect and Disease Frequency Aims – To define and describe the uses of common epidemiological.
Journal Club Curriculum-Study designs. Objectives  Distinguish between the main types of research designs  Randomized control trials  Cohort studies.
Biostatistics Case Studies 2016
Lecture 8 – Comparing Proportions
Lecture 1: Fundamentals of epidemiologic study design and analysis
Chapter 18 Cross-Tabulated Counts
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Presentation transcript:

BIOST 536 Lecture 3 1 Lecture 3 – Overview of study designs Prospective/retrospective  Prospective cohort study: Subjects followed; data collection in real time following a planned study  Retrospective prospective cohort study: Data collected prospectively and stored (administrative data); study analytic plan developed later  Retrospective study: Exposure & other covariates may be collected by interview or record review after disease identification Outcome  Binary  Time to event

BIOST 536 Lecture 3 2 Randomized trial with a binary outcome What statistical hypotheses could we test?  Does treatment dose affect probability of success?  Does success increase with amount of treatment ?  Does success increase with actual dose level ? Risk difference, risk ratio, odds ratios appropriate Can use asymptotic methods or small sample methods TreatmentSuccessFailureTotal Placebo (0 mg)r1r1 n 1 - r 1 n1n1 Low dose (20 mg)r2r2 n 2 - r 2 n2n2 High dose (50 mg)r3r3 n 3 - r 3 n3n3

BIOST 536 Lecture 3 3 Example TreatmentY=1Y=0N Placebo (0 mg) Low dose (20 mg) High dose (50 mg) | dose trt cnt y | | | 1. | | 2. | | 3. | | 4. | | 5. | | 6. | | tabulate trt y [fw=cnt], row chi2 lr exact Enumerating sample-space combinations: stage 3: enumerations = 1 stage 2: enumerations = 21 stage 1: enumerations = 0 | y trt | 0 1 | Total | | 100 | | | | 100 | | | | 100 | | Total | | 300 | | Pearson chi2(2) = Pr = likelihood-ratio chi2(2) = Pr = Fisher's exact = 0.040

BIOST 536 Lecture 3 4 Overall test (2 df) with treatment categorical. gen trt2=(trt==2). gen trt3=(trt==3). logistic y trt2 trt3 [fw=cnt] Logistic regression Number of obs = 300 LR chi2(2) = 6.51 Prob > chi2 = Log likelihood = Pseudo R2 = y | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval] trt2 | trt3 | xi: logistic y i.trt [fw=cnt] i.trt _Itrt_1-3 (naturally coded; _Itrt_1 omitted) Logistic regression Number of obs = 300 LR chi2(2) = 6.51 Prob > chi2 = Log likelihood = Pseudo R2 = y | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval] _Itrt_2 | _Itrt_3 |

BIOST 536 Lecture 3 5 Test of trend and test of dose (1 df). logistic y trt [fw=cnt] Logistic regression Number of obs = 300 LR chi2(1) = 6.01 Prob > chi2 = Log likelihood = Pseudo R2 = y | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval] trt | logistic y dose [fw=cnt] Logistic regression Number of obs = 300 LR chi2(1) = 5.54 Prob > chi2 = Log likelihood = Pseudo R2 = y | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval] dose | Logistic regression model can accommodate categorical, ordinal, continuous covariates Can control for other variables through stratification or modeling

BIOST 536 Lecture 3 6 Prospective models with long-term follow-up Longer follow-up means a greater likelihood of the outcome not being observed Example of a cancer clinical trial: Bone marrow transplant vs. chemotherapy alone  Death (solid circle) is the outcome; observations may be “censored” (square) due to end of study

BIOST 536 Lecture 3 7 Interested in 18 month survival All outcomes could be classified at 18 months; some information loss in not using known failure times

BIOST 536 Lecture 3 8 Suppose one patient is censored early Not all outcomes at 18 months are known Need to use survival analysis to account for censoring; estimate hazard ratios (similar to risk ratios)

BIOST 536 Lecture 3 9 Case-control study What statistical hypotheses could we test?  Is exposure associated with case-control status?  Is case-control status associated with a trend in exposure level?  Is case-control status associated with the actual amount of exposure? Odds ratios appropriate Can use asymptotic methods or small sample methods ExposureCasesControls None (0 rads)n 11 n 01 Low (20 rads)n 12 n 02 High (50 rads)n 13 n 03 Totaln 1. n 0.

BIOST 536 Lecture 3 10 Example Same data as in the previous example Perform the same logistic regression analyses – obtain odds ratios for the associations of disease and exposure category, ordinal level of exposure, or actual estimated exposure Observational data, potential bias in exposure assessment soften the scientific conclusions (“associations”) ExposureCasesControls None (0 rads) 4951 Low (20 rads) 6238 High (50 rads) 6634 Total

BIOST 536 Lecture 3 11 Case-control Decide on criteria for cases and identify all cases possible Decide on criteria for controls and identify a number proportional to the number of cases (1 case to 1 control; 1-2; or 1-4) May be frequency matched on age/gender or other characteristics of the cases (logistic regression) May be tightly matched to a case in 1-m matching on several factors simultaneously (conditional logistic regression) Interpret the odds ratios

BIOST 536 Lecture 3 12 Cohort-embedded case-control studies Overall longitudinal cohort available from which possible cases can be identified Identify actual cases among the possible cases (intake diagnosis suggests possible myocardial infarction; identify actual MI cases) Identify controls (anyone without an actual MI) occurring at the same age as the case Decide on number of controls per case and randomly sample from all potential controls the same age as the case Assemble information retrospectively that was collected prior to the case’s diagnosis date Perform a matched case-control analysis (conditional logistic regression) Interpret the odds ratios

BIOST 536 Lecture 3 13 Estimation Measures for prospective designs  Incidence risk ratio  Risk difference  Attributable fraction Typically compare exposed incidence rate to unexposed incidence rate using the ratio 1. Incidence risk ratio may not depend on age (time) even though the incidence rates do 2. Mathematically convenient so extensively used May prefer a risk difference instead Cannot be used for case-control studies without more information

BIOST 536 Lecture 3 14 Attributable fraction Exposed attributable risk  Cole & McMahon, 1971  Used descriptively primarily Population attributable risk 1. Levin, Need to know the proportion of the population in the exposed group at time t, p t

BIOST 536 Lecture 3 15 Study designs 2 x 2 Tables can have either the table total (n), column totals (m 1, m 0 ) or row totals (n 1, n 0 ) fixed by design Cross-sectional design  Only n fixed  Choose n subjects at random and ascertain both exposure and disease status  Very inefficient for studying association since disease and/or exposure may be rare  Can estimate disease prevalence in unexposed and exposed  Estimates may not be stable for small a, b, m 1 ExposedUnexposed Diseaseabn 1 No Diseasecdn 0 m 1 m 0 n

BIOST 536 Lecture 3 16 Prospective cohort Choose subjects from a population that does not have disease at time t 0 and follow until t 1 Ascertain exposure prior to t 0 and disease status in ( t 0, t 1 ) Total n or the column totals (m 1, m 0 ) may be fixed by design  Greater efficiency usually if m 1 = m 0 Estimate the risk of developing disease in the interval [ t 0, t 1 ) by exposure status If only n fixed by design (no sampling on exposure) then risk of developing the disease in [ t 0, t 1 ) among the population is  Otherwise need to weight by exposure probabilities ExposedUnexposed Diseaseabn 1 No Diseasecdn 0 m 1 m 0 n

BIOST 536 Lecture 3 17 Prospective cohort facts If the risk ratio does not depend on t, then  Means the probability of not having disease in the interval if exposed is the probability of not having disease in the interval for the unexposed raised to the power of the risk ratio If the disease probability p i (t) is small, then  So the risk ratio is estimated by

BIOST 536 Lecture 3 18 Case-control design Choose subjects from a population that do or do not have disease in the interval ( t 0, t 1 ) Typically n 0, n 1 are fixed by design The proportions below do not estimate disease risk among exposed and unexposed Instead we estimate probability of exposure conditional on case status Without more information cannot get ExposedUnexposed Diseaseabn 1 No Diseasecdn 0 m 1 m 0 n

BIOST 536 Lecture 3 19 Case-control design estimation Let Y= disease and X=exposure and adopt a logistic regression model Then the odds ratio is estimated as We cannot estimate P(D|E) since we need  0 to do that We do get an estimated  0 from our logistic regression, but it is not the right  0 [more on that later]

BIOST 536 Lecture 3 20 Case-control design estimation Cannot estimate the risk difference Cannot estimate the population attributable risk without estimates of exposure rates in the population Case-control analysis also assumes that sampling rates for exposed and unexposed individuals are the same, otherwise bias can result Later will control for other sources of bias in case-control analysis