Presentation is loading. Please wait.

Presentation is loading. Please wait.

Causation. Associations may be due to Chance (random error) statistics are used to reduce it by appropriate design of the study statistics are used to.

Similar presentations


Presentation on theme: "Causation. Associations may be due to Chance (random error) statistics are used to reduce it by appropriate design of the study statistics are used to."— Presentation transcript:

1 Causation

2 Associations may be due to Chance (random error) statistics are used to reduce it by appropriate design of the study statistics are used to estimate the probability that the observed results are due to chance Bias (Systematic error) must be considered in the design of the study Confounding can be dealt with during both the design and the analysis of the study Causation

3 Associations may be due to Chance (random error) statistics are used to reduce it by appropriate design of the study statistics are used to estimate the probability that the observed results are due to chance Bias (Systematic error) must be considered in the design of the study Confounding can be dealt with during both the design and the analysis of the study Causation

4 Random error vs. Systematic error 70 ---- 70 69 70 71 70 68 72 69 71 ---- 70 71 ---- 71 70 71 72 73 70 71 70 69 70 71 ---- 71

5

6 Dealing with chance error During design of study Sample size Power During analysis (Statistical measures of chance) Test of statistical significance (P value) Confidence intervals

7 Statistical measures of chance I (Test of statistical significance) Observed association Association in Reality YesNo Yes No Type I error Type II error

8 P-value the probability the observed results occurred by chance the probability that an effect at least as extreme as that observed could have occurred by chance alone, given there is truly no relationship between exposure and disease (Ho) statistically non-significant results are not necessarily attributable to chance due to small sample size

9 Statistical Power Power = 1 – type II error Power = 1 - ß

10 P value 0.00001 Clinical Importance VS Statistical Significance

11 Statistical measures of chance II (Confidence intervals)

12 Question? 20 out of 100 participants: 20% 200 out of 1000 participants: 20% 2000 out of 10000 participants: 20% What is the difference?

13 Answer: Confidence Interval Definition: A range of values for a variable of interest constructed so that this range has a specified probability of including the true value of the variable for the population Characteristics: a measure of the precision (stability) of an observed effect the range within which the true magnitude of effect lies with a particular degree of certainty 95% C.I. means that true estimate of effect (mean, risk, rate) lies within 2 standard errors of the population mean 95 times out of 100 Confidence intervals get smaller (i.e. more precise or more certain) if the underlying data have less variation/scatter Confidence intervals get smaller if there are more people in your sample

14 95% Confidence Interval (95% CI) 20 out of 100 participants: 20% 95% CI: 12 to 28 80 out of 400 participants: 20% 95% CI: 16 to 24 2000 out of 10000 participants: 20% 95% CI: 19.2 to 20.8

15 How to Estimate CI? Standard Error (SE) 95% CI = statistic ± 1.96 SE Example: 95% CI of mean = sample mean ± SEM SD SEM = n

16 How to Estimate CI? (example) A sample of 100 participants Mean of their age 25 years SD of age: 10 CIM? CIM = 25 ± 1.96 * 10 / 100 CIM ~ from 23 to 27

17 Confidence Interval vs P value

18 Associations may be due to Chance (random error) statistics are used to reduce it by appropriate design of the study statistics are used to estimate the probability that the observed results are due to chance Bias (Systematic error) must be considered in the design of the study Confounding can be dealt with during both the design and the analysis of the study Causation

19 Associations may be due to Chance (random error) statistics are used to reduce it by appropriate design of the study statistics are used to estimate the probability that the observed results are due to chance Bias (Systematic error) must be considered in the design of the study Confounding can be dealt with during both the design and the analysis of the study Causation

20 Bias Any systematic error that results in an incorrect estimate of the association between risk factors and outcome

21 Types of Bias Selection bias – identification of individual subjects for inclusion in study on the basis of either exposure or disease status depends in some way on the other axis of interest Observation (information) bias – results from systematic differences in the way data on exposure or outcome are obtained from the various study groups

22 Selection Bias Self-selection bias Healthy (or diseased) people may seek out participation in the study Referral bias Sicker patients are referred to major health centers Diagnostic bias Diagnosis of outcome associated with exposure Non-response bias Response, or lack of it, depends on exposure Differential loss to follow-up Exposed (or unexposed) group followed with different intensity Berkson’s bias Hospitalization rates differ for by disease and presence/absence of the exposure of interest

23 Information Bias Data collection bias Bias that results from abstracting charts, interviews or surrogate interviews Recall bias Disease occurrence enhances recall about potential exposures Surveillance or detection bias The exposure promoted more careful evaluation for the outcome of interest Reporting bias Exposure may be under-reported because of attitudes, perceptions, or beliefs

24 Bias results from systematic flaws study design, data collection, analysis interpretation of results

25 Control of Bias Can only be prevented and controlled during the design and conduct of a study: Careful planning of measurements Formal assessments of validity Regular calibration of instruments Training of data collection personnel

26 Associations may be due to Chance (random error) statistics are used to reduce it by appropriate design of the study statistics are used to estimate the probability that the observed results are due to chance Bias (Systematic error) must be considered in the design of the study Confounding can be dealt with during both the design and the analysis of the study Causation

27 Associations may be due to Chance (random error) statistics are used to reduce it by appropriate design of the study statistics are used to estimate the probability that the observed results are due to chance Bias (Systematic error) must be considered in the design of the study Confounding can be dealt with during both the design and the analysis of the study Causation

28 Confounding Confounding results when the effect of an exposure on the disease (or outcome) is distorted because of the association of exposure with other factor(s) that influence the outcome under study.

29 Confounding Coffee MI Smoking, Stress Observed association, presumed causation Observed association True association

30 Properties of a confounder A confounding factor must be a risk factor for the disease. The confounding factor must be associated with the exposure under study in the source population. A confounding factor must not be affected by the exposure or the disease. The confounder cannot be an intermediate step in the causal path between the exposure and the disease.

31 Confounding High fat diet MI cholesterol Observed association, presumed causation Observed association True association

32 Control of Confounding During design of study Restriction Matching Randomization During analysis Stratified analysis Multivariate analysis

33 Confounding Coffee MI Smoking, Stress Observed association, presumed causation Observed association True association

34 Is there any confounding effect? Coffee drinking Myocardial Infarction YesNo Yes 3624 No 2436 OD= ? OD = 36×36/24×24 OD= 2.25

35 Stratification Coffee Drinking Myocardial Infarction YesNo Yes 26 No 1834 Coffee Drinking Myocardial Infarction YesNo Yes 3624 No 2436 Coffee Drinking Myocardial Infarction YesNo Yes 3418 No 62 Smokers Non Smokers

36 Stratification Coffee Drinking Myocardial Infarction YesNo Yes 26 No 1834 Coffee Drinking Myocardial Infarction YesNo Yes 3418 No 62 Smokers Non Smokers OD= ? OD = 34×2/18×6 OD= 0.63 OD= ? OD = 2×34/6×18 OD= 0.63

37 Crude and Common OR Crude OR = 2.25 Common OR = 0.63 So There is confounding effect.

38 Mantel Haenzel OD Sample sizes in the strata is not equal ? Common OR is a weighted summary of OR

39 Control of Confounding During design of study Restriction Matching Randomization During analysis Stratified analysis Multivariate analysis

40 () Disease Rates 3.09.0 15.0? Absent Factor B Present Factor A Absent Present 3.09.0 15.021.0 Absent Factor B Present Factor A Absent Present Interaction Additive Model

41 Interaction Multiplicative Model Disease Rates 3.09.0 15.0? Absent Factor B Present Factor A Absent Present 3.09.0 15.045.0 Absent Factor B Present Factor A Absent Present

42 Approach to Interaction and Confounding Report separate measures for levels of covariate Yes Is there interaction? No Is there confounding? Adjust for confounder Yes No No need for adjustment

43 Associations may be due to Chance (random error) statistics are used to reduce it by appropriate design of the study statistics are used to estimate the probability that the observed results are due to chance Bias (Systematic error) must be considered in the design of the study Confounding can be dealt with during both the design and the analysis of the study Causation

44 Associations may be due to Chance (random error) statistics are used to reduce it by appropriate design of the study statistics are used to estimate the probability that the observed results are due to chance Bias (Systematic error) must be considered in the design of the study Confounding can be dealt with during both the design and the analysis of the study Causation

45 DETERMINATION OF CAUSATION The general QUESTION:Is there a cause and effect relationship between the presence of factor X and the development of disease Y? One way of determining causation is personal experience by directly observing a sequence of events.

46 Personal Experience / Insufficient Long latency period between exposure and disease Common exposure to the risk factor Small risk from common or uncommon exposure Rare disease Common disease Multiple causes of disease

47 Scientific Evidences The answer is made by inference and relies on a summary of all valid evidence.

48 Nature of Evidence: 1. Replication of Findings – consistent in populations 2. Strength of Association – significant high risk 3. Temporal Sequence – exposure precede disease

49 Nature of Evidence: 4. Dose-Response – higher dose exposure, higher risk 5. Biologic Credibility – exposure linked to pathogenesis 6. Consideration of alternative explanations – the extent to which other explanations have been considered.

50 Nature of Evidence 7. Cessation of exposure (Dynamics) – removal of exposure – reduces risk 8. Specificity specific exposure is associated with only one disease 9. Experimental evidence

51 Bradford Hill Criteria (1965) criteria for assessing causality:  Consistency  Strength  Specificity  Temporality  Plausibility  Coherence  Dose Response  Analogy  Experimental evidence

52 Bradford Hill Criteria Hill stated None of my criteria can bring indisputable evidence for or against the cause-and-effect hypothesis None can be required as sufficient alone

53 Henle-Koch Postulates before a causative relationship can be accepted between a disease agent and the disease in question The agent must be shown to be present in every case of the disease by isolation in a pure culture. The agent must not be found in cases of other disease. Once isolated, the agent must be capable of reproducing the disease in experimental animals. The agent must be recovered from the experimental disease produced

54 Henle-Koch Postulates Useful in infectious diseases Not useful with complicated chronic diseases Heart disease Diabetes Cancer Violence

55 H. pylori Consistency association has been replicated in other studies Strength H. pylori is found in at least 90% of patients with duodenal ulcer Temporal relationship 11% of chronic gastritis patients go on the develop duodenal ulcers over a 10-year period. Dose response density of H.pylori is higher in patients with duodenal ulcer than in patients without

56 H. pylori Biologic plausibility originally – no biologic plausibility then H. pylori binding sites were found know H. pylori induces inflammation Cessation Eradication of H Pylori heals duodenal ulcers

57 SMOKING AND LUNG CANCER 1. Strength of Association: The relative risks for the association of smoking and lung cancer are very high 2. Biologic Credibility: The burning of tobacco produces carcinogenic compounds which are inhaled and come into contact with pulmonary tissue.

58 SMOKING AND LUNG CANCER 3. Replication of findings: The association of cigarette smoke and lung cancer is found in both sexes in all races, in all socioeconomic classes, etc. 4. Temporal Sequence: Cohort studies clearly demonstrate that smoking precedes lung cancer and that lung cancer does not cause an individual to become a cigarette smoker.

59 SMOKING AND LUNG CANCER 5. Dose-Response: The more cigarette smoke an individual inhales, over a life-time, the greater the risk of developing lung cancer. 6. Dynamics (cessation of exposure): Reduction in cigarette smoking reduces the risk of developing lung cancer.

60 Smoking is cited as a cause of lung cancer, however...... smoking is not necessary (is not a prerequisite) to get lung cancer. Some people get lung cancer who have never smoked.... smoking alone does not cause lung cancer. Some smokers never get lung cancer. Smoking is a member of a set of factors (i.e., web of causation) which cause lung cancer. The identity of all the other factors in the set are unknown. (One factor in the web of causation is probably genetic susceptibility.)

61 necessary / sufficient “A” is necessary – since it appears in each sufficient causal complex “A” is not sufficient – Disease Not PresentDisease Present

62 necessary / sufficient necessary and sufficient the factor always causes disease and disease is never present without the factor most infectious diseases necessary but not sufficient multiple factors are required cancer sufficient but not necessary many factors may cause same disease leukemia neither sufficient nor necessary multiple cause

63 necessary / sufficient Few causes are necessary and sufficient High cholesterol is neither necessary nor sufficient for CVD because many individuals who develop CVD do not have high serum cholesterol levels

64 Hierarchy in Establishing Causality trials randomized, double-blind, placebo- controlled with sufficient power and appropriate analysis cohort studies & case-control studies hypothesis specified prior to analysis case-series no comparison groups


Download ppt "Causation. Associations may be due to Chance (random error) statistics are used to reduce it by appropriate design of the study statistics are used to."

Similar presentations


Ads by Google