Download presentation
Presentation is loading. Please wait.
Published byArchibald Stone Modified over 9 years ago
1
Health and Disease in Populations 2002 Sources of variation (1) Paul Burton! Jane Hutton
2
Informal lecture objectives ¥Objective 1 ¥To enable the student to distinguish between observed data and the underlying tendencies which give rise to those data ¥Objective 2: ¥To understand the concepts of sources of variation and randomness
3
Formal lecture objectives for Random Variation (1) and (2) ¥ Objective 1 Distinguish between observed epidemiological quantities (incidence, prevalence, incidence rate ratio etc) and their true or underlying values. ¥ Objective 2 Discuss how observed epidemiological quantities depart from their true values because of random variation.
4
Formal lecture objectives for Sources of Variation Objective 3 Describe how observed values help us towards a knowledge of the true values by: ¥allowing us to test hypotheses about the true value (SoV 1) ¥allowing us to calculate a range within which the true value probably lies (SoV 2)
5
Drawing conclusions ¥Experiment ¥Flip a coin 10 times ¥Result ¥Observe 7 heads, 3 tails ¥Conclusions ¥Data wrong (e.g. a miscount) ¥Artefact ¥Chance ¥The coin is biased towards heads
6
Tendency versus observation ¥Coins tend to produce equal numbers of heads and tails, but what we observe may depart from this by random variation. ¥Random variation in health ¥On average, there are 4 cases of meningitis per month in Leicester; some months we observe 10, some months 0. ¥Smokers tend to be less healthy than non- smokers; but if we pick a few people at random, we might find that the smokers are healthier than the non-smokers.
7
Tendency versus observation ¥Epidemiologists, health planners etc. want to know about the underlying tendencies and patterns. However, as well as systematic variation, everything they observe is affected by random variation.
8
If we know about the underlying tendency, we can predict what we may reasonably expect to observe (probability theory).
9
Neonatal Intensive Care (NIC) cots ¥True requirement (1992 figures) 1/1,000 live births per annum ¥Health authority has approximately 12,000 live births per annum ¥On average 12 NIC `cots' will be required per year (this is the true tendency)
10
95% 18 29/30 19 99% 21
11
Obstetric beds (NIC cots) ¥ Often observe 8-16 cots being used ¥ Need 19 or more on 1/day per month ¥ Need 21 or more on 1% of days ¥ Hardly ever need more than 24 cots ¥ Provide 19 cots ¥ On average 12 are occupied = 63% occupancy True tendency observed distribution easy ¥ BUT how do we reverse the direction of inference? Observed distribution true tendency
12
Any questions?
13
Hypothesis testing Objective 3 Describe how observed values help us towards a knowledge of the true values by: ¥Allowing us to test hypotheses about the true value
14
Hypothesis testing ¥An hypothesis: A statement that an underlying tendency of scientific interest takes a particular quantitative value ¥The coin is fair (the probability of heads is 0.5) ¥The new drug is no better than the standard treatment (the ratio of survival rates = 1.0) ¥The true prevalence of tuberculosis in a given population is 2 in 10,000
15
Testing hypotheses Are the observed data consistent with the stated hypothesis? ¥Informally? ¥Formally?
16
Formally ¥ Calculate the probability of getting an observation as extreme as, or more extreme than, the one observed if the stated hypothesis was true. ¥ If this probability is very small, then either ¥ something very unlikely has occurred; or ¥ the hypothesis is wrong ¥ It is then reasonable to conclude that the data are incompatible with the hypothesis. The probability is called a p-value
17
Hypothesis: this coin is fair ¥ Observed data: 10 heads, 0 tails P-value: 0.002 (1 in 500) (exactly 2 1/ 1,024) ¥ Conclusion: Data inconsistent with hypothesis; strong evidence against the hypothesis ¥ Prior beliefs relevant here: ¥ 10 heads, 0 tails: (Is the coin biased?) ¥ 10 survivors, 0 deaths on new treatment X: (Does X work if historically 50% died)
18
An arbitrary convention P-value: p 0.05 Data inconsistent with hypothesis Substantive evidence against the hypothesis Reasonable to reject the hypothesis Statistically significant ¥ P-value: p>0.05 ¥ None of the above ¥ The mean surface temperature of the earth has increased by only 1°C over the last 50 years p=0.1 does not prove that there is no global warming!
19
Hypothesis tests ¥ The incidence of disease X in Warwickshire is significantly lower than in the rest of the UK (p=0.01) ¥ The death rate from disease Y is significantly higher in Barnsley than in Leicester (p=0.05) ¥ Patients on the new drug did not live significantly longer than those on the standard drug (p=0.4)
20
The null hypothesis The hypothesis to be tested is often called the null hypothesis (H 0 ) ¥ The ratio of death rates is 1.0 ¥ The prevalence in Warwickshire is the same as in Leicestershire p<=0.05 : substantial evidence against the hypothesis being tested, not that it is definitely false ¥ p>0.05: Data (not in-) consistent with the hypothesis. Little or no evidence against the hypothesis being tested, not that it is definitely true
21
An experiment: flip a coin 10 times ¥ Observed result: 7 heads, 3 tails ¥ Question: ¥ Is the coin biased? 0 0.001 * 1 0.010 * 2 0.044 * 3 0.117 * 4 0.205 5 0.246p = 2×(0.001+0.010+0.044+0.117) = 0.344 6 0.205 7 0.117 * 8 0.044 * 9 0.010 * 10 0.001 *
22
An experiment: flip a coin 10 times ¥Observed result: 7 heads, 3 tails ¥Data consistent with the coin being unbiased. Weak evidence against the null hypothesis ¥So: little evidence that the coin is biased ¥But: does not prove that the coin is unbiased
23
Problems ¥ Rejecting H 0 is not always much use. ¥ p<0.05 is arbitrary; nothing special happens between p=0.049 and p=0.051 ¥ p=0.000001and p=0.6 easy to interpret ¥ False positive results Statistical significance depends on sample size. Flip a coin 3 times minimum p=0.25 (i.e. 2×1/8) Statistically significant clinically important ¥ Nevertheless, p values are used a lot
24
A solution Objective 3 Describe how observed values help us towards a knowledge of the true values by: ¥Allowing us to test hypotheses about the true value ¥Providing us with a range within which the underlying tendency probably lies
25
Any questions?
Similar presentations
© 2025 SlidePlayer.com Inc.
All rights reserved.