Cohort Studies, Relative Risk, and Attributable Risk STAT 6395 Spring 2008 Filardo and Ng.

Cohort Studies, Relative Risk, and Attributable Risk STAT 6395 Spring 2008 Filardo and Ng

Cohort studies A study in which a group of persons exposed to a factor of interest and a group of persons not exposed are followed Type of studies  Observational  Cohort studies and compared with respect to the incidence rate of the disease or other condition of interest Time

Types of Epidemiologic studies

Cohort A designated group of subjects who are followed (traced) over a period of time Type of studies  Observational  Cohort studies

Time Type of studies  Observational  Cohort studies

Cohort studies are also called: Prospective studies Retrospective cohort studies Follow-up studies Longitudinal studies Type of studies  Observational  Cohort studies

Cohort studies (study schema) Type of studies  Observational  Cohort studies

Among observational studies, cohort studies are the ‘gold standard’ Exposure precedes onset of disease (a necessary condition of causality) No differential recall of exposures by those who develop the disease compared to those who do not (recall bias) Exposure measurements are taken at baseline, as opposed to querying about past exposures that occurred before the onset of disease or assuming that current levels of biologic markers reflect past exposures (as in case-control studies) Type of studies  Observational  Cohort studies

Limitations of cohort studies Large number of study participants Many years of follow-up Expensive Losses to follow-up Main limitation: observational and not experimental Type of studies  Observational  Cohort studies

Non exposed comparison group can be internal or external When the cohort includes both exposed and unexposed individuals, the comparison group is internal, within the cohort When the entire cohort is exposed, need an external group for purposes of comparison Non exposed group  Internal or External

Internal comparison group: Occupational cohort study. Hypothesis: exposure to Chemical X causes one or more types of cancer. Cohort of workers employed in Factory A 40% of workers are exposed to Chemical X 60% of workers are not exposed to Chemical X. The unexposed workers would serve as in internal comparison group Non exposed group  Internal or External

External comparison group: Occupational cohort study Cohort of workers employed in Factory B All workers in Factory B are exposed to Chemical X External comparison group Workers in Factory C, where the workers have similar demographic characteristics to the workers in Factory B and not exposed to Chemical X General U.S. population (mortality rates from vital statistics data) Non exposed group  Internal or External

Two types of cohorts Cohort defined by an exposure or group of related exposures Purpose: test specific hypotheses about the exposure Study of rare exposures External comparison group if entire cohort is exposed Cohort defined by a factor unrelated to any particular exposure Data collection systems to test multiple hypotheses Sample of general population of defined geographic area Convenience sample Willingness of members to participate Logistic advantages, such as ease of follow-up Two types of cohorts  defined by the exposure or unrelated

Cohort defined by an exposure or group of related exposures Occupational cohort Japanese atomic bomb cohort Cohort of persons treated with radiotherapy for ankylosing spondylitis (inflammatory disease of the spine) Cohort of persons taking a particular drug Multicenter AIDS Cohort Study (drug addicts) Two types of cohorts  defined by the exposure or unrelated

Cohort defined by a factor unrelated to any particular exposure Framingham Heart Study Began in 1948 with 5,127 participants Cancer Prevention Study II Began in 1982 with 1,184,657 participants Nurses Health Study (females) Began in 1976 with 121,700 women Two types of cohorts  defined by the exposure or unrelated

Cohort defined by a factor unrelated to any particular exposure: advantages Data collection system to test multiple hypotheses about multiple exposures and disease outcomes Internal comparison group (unexposed members of the cohort) Two types of cohorts  defined by the exposure or unrelated

Analysis of a cohort study: Measure of associations Incidence rate (absolute risk) in exposed group: I exp = a/b Relative risk (RR) = I exp /I nonexp

I exp = a/(a+b) Relative risk (RR) = I exp /I nonexp Analysis of a cohort study: Measure of associations

Comparison is fundamental to epidemiology: Relative risk (RR) = incidence in exposed /incidence in non exposed The relative risk is a ratio (dimensionless) Always make clear which group is exposed and which is non exposed Measure of associations  Relative Risk

Interpretation of relative risk (RR) RR = 1 Risk in exposed = risk in non exposed No association RR > 1 Risk in exposed > risk in non exposed Positive association The larger the RR, the stronger the association May or may not be causal Measure of associations  Relative Risk Relative risk is a measure of association between the exposure and the disease

Interpretation of relative risk (RR) RR < 1 Risk in exposed < risk in non exposed Negative association The smaller the RR, the larger the negative association May or may not be causal If causal, indicates a protective effect Measure of associations  Relative Risk

Hypothetical cohort study of benzene (exposure) and leukemia (outcome) Hypothesis: benzene exposure increases the risk of leukemia Non exposed group to benzene Follow the two groups for 10 years Exposed group to benzene Time Yes leukemia No leukemia

Benzene exposure and leukemia *Per 100,000 person-years

Benzene exposure and leukemia

How much of the disease that occurs can be attributed to a particular exposure? Relative risk -- is a measure of association between an exposure and a disease Attributable risk -- the magnitude of disease incidence attributable to a specific exposure Attributable risk percent -- the percent of disease incidence attributable to a specific exposure The answer to this question tells us how much of the disease we can prevent if we eliminate the exposure Measure of associations  Attributable risk (exposure)

Attributable risk for the exposed group Attributable risk (exposed) = I exp – I nonexp Attributable risk percent (exposed) = [(I exp - I nonexp )/ I exp ] x 100 = = [(RR - 1)/RR] x 100 Incidence in non exposed group can be considered the background incidence, which would occur regardless of the exposure Measure of associations  Attributable risk (exposure)

Attributable risk for the exposed group Measure of associations  Attributable risk (exposure)

Attributable risk (exposed) tells us the most we can hope to accomplish in reducing the risk of disease among the exposed if we totally eliminated the exposure Measure of associations  Attributable risk (exposure)

AR(exposed) = 40 - 5 = 35 per 100,000 person-years AR percent (exposed) = [(40-5)/40] x 100 = 87.5% Benzene exposure and leukemia *Per 100,000 person-years Measure of associations  Attributable risk

Attributable risk for the total population Attributable risk (population) = I pop – I nonexp Attributable risk percent* (population) = [(I pop - I nonexp )/ I pop ] x 100 = Measure of associations  Attributable risk (population) Tells us what percent of disease in the total population is due to the exposure

Attributable risk (population) tells us the most we can hope to accomplish in reducing the incidence of disease in the total population if we totally eliminated the exposure Measure of associations  Attributable risk (population)

Attributable risk percent (population): alternate formula P x (RR-1) x 100 P x (RR-1) + 1 Where P is the population prevalence of the exposure Algebraically equivalent to the original formula AR percent (population) increases with: Increasing RR Increasing population prevalence of the exposure Measure of associations  Attributable risk (population)

Rare exposure (low population prevalence) RR = 2 (per year)

Common exposure (high population prevalence) (per year)

‘Dependence’ of AR% (population) on RR and prevalence of exposure

AR (factory pop.) = 22.5 - 5 = 17.5 per 100,000 person-years AR percent (factory pop.) = [(22.5-5)/22.5] x 100 = 77.8% Benzene exposure and leukemia *Per 100,000 person-years Measure of associations  Attributable risk (population)

Attributable risk percent (population): alternate formula P x (RR-1) _ x 100 P x (RR-1) + 1 Where P is the prevalence of the exposure in the population Measure of associations  Attributable risk (population) 0.5 (8 -1) x 100 = 77.8% 0.5 (8 -1) + 1

Incidence of leukemia in the general population attributable to benzene: hypothetical example… AR (population) = 5.25 - 5 = 0.25 per 100,000 per year AR percent (population) = [(5.25 - 5)/5.25] x 100 = 4.8% Measure of associations  Attributable risk (example) From cancer registry data we determine the incidence of leukemia in the general population to be 5.25 per 100,000 per year

Incidence of leukemia in the general population attributable to benzene: hypothetical example, alternate formula… AR percent (population) = P x (RR-1) x 100 = P x (RR-1) + 1 = [.007 x (8-1)] x 100 = 4.7% ≈ 4.8 [.007 x (8-1) +1] Measure of associations  Attributable risk (example) We are able to estimate that 0.7% of the general population has significant exposure to benzene

Relative risk vs. attributable risk RR is a measure of the strength of an association between an exposure and a disease, and is the measure used in etiologic studies AR is a measure of how much of the disease incidence is attributable to the exposure, and is useful in assessing the exposure’s public health importance AR (population) will vary among populations, depending upon the prevalence of the exposure Measure of associations  Relative risk vs. attributable risk

(Retrospective) Year 2008 Year 2020 Year 2008 Year 1960 Year 2008 Year 1990 Year 2010 Cohort studies by calendar time period of follow-up

Concurrent cohort study A cohort study in which the investigator assembles the cohort and measures baseline exposures in present time. The cohort is followed forward from present time into the future for a number of years (calendar time and follow-up time are concurrent), during which time disease outcomes are observed. Cohort Studies  Concurrent, retrospective, mixed design Time Present time Study ends Study cohort follow-up begins

Concurrent cohort study (cont.) Advantage: baseline exposure assessment and methods of follow-up for disease outcome are planned and implemented for purposes of the study Disadvantages Study takes many years to conduct High cost Cohort Studies  Concurrent, retrospective, mixed design

Historical (retrospective) cohort study A cohort study in which the investigator uses historical data, based on existing records of past exposures, to go back in time to assemble a cohort. Also using existing records, the investigator reconstructs the disease experience of the cohort from a defined point in the past to a point in the near present. Study cohort follow-up begins Study ends Present time Time Cohort Studies  Concurrent, retrospective, mixed design

Historical (retrospective) cohort study (cont.) Cohort Studies  Concurrent, retrospective, mixed design Advantage: less expensive than concurrent studies less time to conduct than concurrent studies Disadvantages the quality of exposure or disease outcome data is often inferior to the quality obtained in concurrent studies (due to the reliance on records that usually were collected for a purpose other than conducting an epidemiologic study)

Mixed design cohort study A cohort study in which the investigator uses historical data, based on existing records of past exposures, to go back in time to assemble a cohort. Also using existing records, the investigator reconstructs the disease experience of the cohort from a defined point in the past to a point in the future. Data for the cohort includes data from the past and data from present time into the future. Study cohort follow-up begins Study ends Present time Time Cohort Studies  Concurrent, retrospective, mixed design

Concurrent cohort study: Hepatocelluar carcinoma and hepatitis B virus -- a prospective study of 22,707 men in Taiwan (Beasley et al.). Cohort: male Taiwanese government civil servants Why civil servants? Life and health insurance system provided almost total ascertainment of death Retained insurance after retirement Hypothesis: hepatitis B virus infection causes hepatocellular carcinoma (HCC) Cohort Studies  Concurrent (example), retrospective, mixed design

Concurrent cohort study: Hepatocelluar carcinoma and hepatitis B virus -- a prospective study of 22,707 men in Taiwan (Beasley et al.). First cohort study of hepatitis B virus infection and Hepatocelluar carcinoma Cohort Studies  Concurrent (example), retrospective, mixed design

Why study was restricted to men? Incidence of HCC 3-4 times higher in men than in women There were many more male civil servants Average age of male civil servants higher than that of female civil servants Men stay in government service longer than women, and it is usually their only occupation Cohort Studies  Concurrent (example), retrospective, mixed design

Recruitment of study participants Men attending the Government Employees Clinic Center Men participating in another study (Cardiovascular Disease Study) Recruitment took place between November 3, 1975 and June 30, 1978 22,707 male government employees were recruited into the cohort Cohort Studies  Concurrent (example), retrospective, mixed design

Exposure measurement at baseline Blood sample to test for hepatitis B surface antigen (exposed group) 3,454 men tested positive Cohort Studies  Concurrent (example), retrospective, mixed design (nonexposed group) 19,253 men tested negative

Follow-up methods Cohort followed forward in time from the present (calendar time and follow-up time concurrent) to assess outcome (death from HCC) Cohort Studies  Concurrent (example), retrospective, mixed design

Follow-up methods Detection of death Health and life insurance system Identification of deaths caused by HCC Hospital records of all decedents were reviewed Study cohort subjects positive to hepatitis B subjects negative to hepatitis B death from HCC Cohort Studies  Concurrent (example), retrospective, mixed design

Results of follow-up Follow-up through December 31, 1980 307 members of cohort died 74 members of cohort had retired and canceled their insurance and could not be traced About 75,000 person-years of follow-up; average of 3.3 years per man Cohort Studies  Concurrent (example), retrospective, mixed design

HCC mortality Relative risk = 1,158/5.2 = 223 *(per 100,000); Mean follow-up 3.3 years Cohort Studies  Concurrent (example), retrospective, mixed design

Conclusions hepatitis B virus infection preceded (causes) the development of Hepatocelluar carcinoma The estimated relative risk (223) provides strong evidence that hepatitis B virus infection causes Hepatocelluar carcinoma Cohort Studies  Concurrent (example), retrospective, mixed design

Comparing incidence rates in different populations Calculate the number of deaths or cases of disease expected in the cohort if it had the same age-specific mortality or incidence rates as the external comparison group (usually general population) Compare the expected number with the actual observed number of deaths or disease in the cohort Usually also adjust for calendar-year time of death or disease incidence (Selikoff et al.) Indirect age-adjustment (often used in retrospective cohort studies) uses standard age-specific mortality or incidence rates from an external comparison group (usually the general population)

Standardized mortality ratio (SMR): observed number of deaths x 100 expected number of deaths Where expected = population of the study cohort X mortality rate of external population Standardized incidence ratio (SIR): observed number of cases x 100 expected number of cases Where expected = population of the study cohort X incidence rate of external population SMR and SIR can be interpreted as relative risks

…a key factor for the interpretation of standardized mortality ratios and incidence ratios is the construction of reliable 95% confidence intervals Cohort Studies  Concurrent, retrospective, mixed design

Hypothetical historical cohort study of lung cancer in uranium miners… Person-years lung cancer at mortality rate Expected Observed Age risk (per 100,000) deaths deaths 40-44 5,000 10 0.5 6 45-49 10,000 20 2.0 10 50-54 8,000 35 2.8 15 Totals 5.3 31 SMR = (31/5.3) x 100 = 585 Cohort Studies  Concurrent, retrospective, mixed design

Example of historical cohort study: Teta et al. Cancer incidence among cosmetologists Hypothesis: cosmetologists are at greater than average risk of respiratory cancer because they use many chemicals in their work Cohort: licensed Connecticut cosmetologists Cohort Studies  Concurrent, retrospective (example), mixed design Since 1925, cosmetologists in Connecticut were required to register annually with the State Dept. of Health

Construction of cohort The target population: 17,121 cosmetologists who: Were CT residents Were ever licensed as cosmetologists Began hairdressing school before Jan. 1, 1966 Data abstracted from registration records Full name, including former surnames Sex Date of birth Dates of first and last licenses Last known address Cohort Studies  Concurrent, retrospective (example), mixed design

Final cohort 11,845 females and 1,805 males Construction of cohort (continued) Cohort Studies  Concurrent, retrospective (example), mixed design Persons excluded from cohort 2,530 who had been licensed less than 5 years, 33 who had reported being diagnosed with cancer prior study, 908 for whom date of birth or sex were missing Initial cohort 17,121 subjects

Follow-up For a given individual, follow-up began either 4th year after the date of first license or 1935 for the few first licensed before 1931 Ascertained cancer incidence through Connecticut Tumor Registry Ascertained vital status through CT death certificate records Ascertained residence in CT through lists of licensed CT drivers and city directories Follow-up period: January 1, 1935  through September 30, 1978

For each cohort member, person-years at risk were counted until one of the following, whichever came first: » Last known year at a CT address » Date of death » Cancer diagnosis » September 30, 1978 241,580 person-years of follow-up Follow-up (continued)

Cancer in cosmetologists: Indirect age adjustment The general CT population was the external comparison group Used the cancer incidence rates from the Connecticut Tumor Registry as the standard incidence rates, specific for: Age Sex Calendar year Cancer site

Length of follow-up (years) Obs. Exp. SIR 5-14 4 4.85 82 15-24 8 7.22 111 25-34 14 11.49 122 > 35 23 11.28 204* Total 49 34.84 141* *P < 0.05 Cancer in cosmetologists (continued)

Cancer in cosmetologists: limitations History of specific exposures unavailable Smoking histories unavailable Authors present anecdotal evidence that cosmetologists had higher smoking rates than general female population ‘Adjustment’ for important confounders not possible

Conceptually, the designs of concurrent cohort studies and historical cohort studies are identical Start at baseline with exposed and non-exposed groups free of the disease of interest Identify new (incident) cases as we go forward in time from baseline Difference is the calendar time period of observation Concurrent cohort study Baseline: in the present Follow-up: forward into the future Historical cohort study Baseline: a time point in the past Follow-up: forward from that time point to the present

Cohort Studies, Relative Risk, and Attributable Risk STAT 6395 Spring 2008 Filardo and Ng.

Similar presentations

Presentation on theme: "Cohort Studies, Relative Risk, and Attributable Risk STAT 6395 Spring 2008 Filardo and Ng."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Cohort Studies, Relative Risk, and Attributable Risk STAT 6395 Spring 2008 Filardo and Ng.

Similar presentations

Presentation on theme: "Cohort Studies, Relative Risk, and Attributable Risk STAT 6395 Spring 2008 Filardo and Ng."— Presentation transcript:

Similar presentations

About project

Feedback