Significance testing Ioannis Karagiannis (based on previous EPIET material) 18 th EPIET/EUPHEM Introductory course 28.09.2012.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Introductory Mathematics & Statistics for Business
Confidence intervals Kristin Tolksdorf (based on previous EPIET material) 18th EPIET/EUPHEM Introductory course
Significance testing and confidence intervals Ágnes Hajdu EPIET Introductory course
Sample size calculation
Sample size calculation
Critical review of significance testing F.DAncona from a Alain Morens lecture 2006.
Probability models- the Normal especially.
SADC Course in Statistics (Session 09)
Inferential Statistics and t - tests
Introduction to Hypothesis Testing
6. Statistical Inference: Example: Anorexia study Weight measured before and after period of treatment y i = weight at end – weight at beginning For n=17.
Putting Statistics to Work
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
1 Hypothesis Testing William P. Wattles, Ph.D. Psychology 302.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
1. Estimation ESTIMATION.
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE.
Lecture 11 Psyc 300A. Null Hypothesis Testing Null hypothesis: the statistical hypothesis that there is no relationship between the variables you are.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Inference.ppt - © Aki Taanila1 Sampling Probability sample Non probability sample Statistical inference Sampling error.
S519: Evaluation of Information Systems Social Statistics Inferential Statistics Chapter 8: Significantly significant.
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Chapter 9 Hypothesis Testing.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Inferential Statistics
Choosing Statistical Procedures
1. Statistics: Learning from Samples about Populations Inference 1: Confidence Intervals What does the 95% CI really mean? Inference 2: Hypothesis Tests.
© 2011 Pearson Prentice Hall, Salkind. Introducing Inferential Statistics.
Chapter 8 Introduction to Hypothesis Testing
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Let’s flip a coin. Making Data-Based Decisions We’re going to flip a coin 10 times. What results do you think we will get?
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Chapter 8 Introduction to Hypothesis Testing
Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.
Inferential Statistics 2 Maarten Buis January 11, 2006.
Significance testing and confidence intervals Col Naila Azam.
CHAPTER 17: Tests of Significance: The Basics
Hypothesis Testing A procedure for determining which of two (or more) mutually exclusive statements is more likely true We classify hypothesis tests in.
Inferential Statistics Body of statistical computations relevant to making inferences from findings based on sample observations to some larger population.
S-012 Testing statistical hypotheses The CI approach The NHST approach.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Copyright © 2011 Pearson Education, Inc. Putting Statistics to Work.
CHAPTER OVERVIEW Say Hello to Inferential Statistics The Idea of Statistical Significance Significance Versus Meaningfulness Meta-analysis.
T tests comparing two means t tests comparing two means.
Chapter 13 Understanding research results: statistical inference.
Today: Hypothesis testing. Example: Am I Cheating? If each of you pick a card from the four, and I make a guess of the card that you picked. What proportion.
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
15 Inferential Statistics.
Hypothesis Testing Hypothesis testing is an inferential process
Significance testing and confidence intervals
Statistical inference: distribution, hypothesis testing
MATH 2311 Section 8.2.
Statistical Tests P Values.
Statistical Inference
Hypothesis Tests for Proportions
Hypothesis Testing.
CHAPTER 12 Inference for Proportions
CHAPTER 12 Inference for Proportions
More on Testing 500 randomly selected U.S. adults were asked the question: “Would you be willing to pay much higher taxes in order to protect the environment?”
CHAPTER 10 Comparing Two Populations or Groups
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
Statistical Power.
Presentation transcript:

Significance testing Ioannis Karagiannis (based on previous EPIET material) 18 th EPIET/EUPHEM Introductory course

The idea of statistical inference Sample Population Conclusions based on the sample Generalisation to the population Hypotheses 2

Inferential statistics Uses patterns in the sample data to draw inferences about the population represented, accounting for randomness Two basic approaches: – Hypothesis testing – Estimation Common goal: conclude on the effect of an independent variable on a dependent variable 3

The aim of a statistical test To reach a deterministic decision (yes or no) about observed data on a probabilistic basis. 4

Why significance testing? Norovirus outbreak on a Greek island: The risk of illness was higher among people who ate raw seafood (RR=21.5). Is the association due to chance? 5

The two hypotheses There is a difference between the two groups (=there is an effect) Alternative Hypothesis (H 1 ) (e.g.: RR=21.5) When you perform a test of statistical significance, you reject or do not reject the Null Hypothesis (H 0 ) There is NO difference between the two groups (=no effect) Null Hypothesis (H 0 ) (RR=1) 6

Norovirus on a Greek island Null hypothesis (H 0 ): There is no association between consumption of raw seafood and illness. Alternative hypothesis (H 1 ): There is an association between consumption of raw seafood and illness. 7

Hypothesis testing Tests of statistical significance Data not consistent with H 0 : – H 0 can be rejected in favour of some alternative hypothesis H 1 (the objective of our study). Data are consistent with the H 0 : – H 0 cannot be rejected You cannot say that the H 0 is true. You can only decide to reject it or not reject it. 8

p value p value = probability that our result (e.g. a difference between proportions or a RR) or more extreme values could be observed under the null hypothesis H 0 rejected using reported p value 9

p values – practicalities Low p values = low degree of compatibility between H 0 and the observed data: association unlikely to be by chance you reject H 0, the test is significant High p values = high degree of compatibility between H 0 and the observed data: association likely to be by chance you dont reject H 0, the test is not significant 10

Levels of significance – practicalities We need of a cut-off ! 1% 5% 10% p value > 0.05 = H 0 not rejected (non significant) p value 0.05 = H 0 rejected (significant) BUT: Give always the exact p-value rather than significant vs. non-significant. 11

The limit for statistical significance was set at p=0.05. There was a strong relationship (p<0.001). …, but it did not reach statistical significance (ns). The relationship was statistically significant (p=0.0361) Examples from the literature p=0.05 Agreed convention Not an absolute truth Surely, God loves the 0.06 nearly as much as the 0.05 (Rosnow and Rosenthal, 1991) 12

p = 0.05 and its errors Level of significance, usually p = 0.05 p value used for decision making But still 2 possible errors: H 0 should not be rejected, but it was rejected : Type I or alpha error H 0 should be rejected, but it was not rejected : Type II or beta error 13

H 0 is true but rejected: Type I or error H 0 is false but not rejected: Type II or error Types of errors Decision based on the p value Truth No diff Diff 14

More on errors Probability of Type I error: – Value of α is determined in advance of the test – The significance level is the level of α error that we would accept (usually 0.05) Probability of Type II error: – Value of β depends on the size of effect (e.g. RR, OR) and sample size – 1- β : Statistical power of a study to detect an effect on a specified size (e.g. 0.80) – Fix β in advance: choose an appropriate sample size 15

Quantifying the association Test of association of exposure and outcome E.g. chi 2 test or Fishers exact test Comparison of proportions Chi 2 value quantifies the association The larger the chi 2 value, the smaller the p value – the more the observed data deviate from the assumption of independence (no effect). 16

Chi-square value 17

Norovirus on a Greek island 2x2 table Raw seafood No raw seafood IllNon ill %81% Expected proportion of ill and not ill : x19% ill x 81% non-ill x 19% ill x 81% non-ill Expected number of ill and not ill for each cell :

Chi-square calculation (29-6) 2 /6(9-31) 2 /31 (5-27) 2 /27 ( ) 2 / 114 Raw seafood No raw seafood IllNon ill χ 2 = 125 p < 0.001

Norovirus on a Greek island The attack rate of illness among consumers of raw seafood was 21.5 times higher than among non consumers of these food items (p<0.001). The p value is smaller than the chosen significance level of α = 5%. The null hypothesis is rejected. There is a < probability (<1/1000) that the observed association could have occured by chance, if there were no true association between eating imported raw seafood and illness. 20

C2012 vs facilitators The ultimate (eye) test. H 0 : the proportion of facilitators wearing glasses during the Tuesday morning sessions was equal to the proportion of fellows wearing glasses. H 1 : the above proportions were different. 21

C2012 vs facilitators Fellow Facilitator GlassesNo glasses %67% Expected proportion of ill and not ill : x33% +ve x67% -ve x33% +ve x67% -ve Expected number of ill and not ill for each cell :

Chi-square calculation (11-13) 2 /13(27-25) 2 /25 (6-4.6) 2 /4.6(8-9.4) 2 /9.4 Fellow Facilitator GlassesNo glasses 23 χ 2 = 1.11 p = 0.343

t-test Used to compare means of a continuous variable in two different groups Assumes normal distribution 24

t-test H 0 : fellows with glasses do not tend to sit further in the back of the room compared to fellows without glasses H 1 : fellows with glasses tend to sit further in the back of the room compared to fellows without glasses 25

t-test 26

Epidemiology and statistics 27

Criticism on significance testing Epidemiological application need more than a decision as to whether chance alone could have produced association. (Rothman et al. 2008) Estimation of an effect measure (e.g. RR, OR) rather than significance testing. 28

Suggested reading KJ Rothman, S Greenland, TL Lash, Modern Epidemiology, Lippincott Williams & Wilkins, Philadelphia, PA, 2008 SN Goodman, R Royall, Evidence and Scientific Research, AJPH 78, 1568, 1988 SN Goodman, Toward Evidence-Based Medical Statistics. 1: The P Value Fallacy, Ann Intern Med. 130, 995, 1999 C Poole, Low P-Values or Narrow Confidence Intervals: Which are more Durable? Epidemiology 12, 291,

Previous lecturers Alain Moren Paolo DAncona Lisa King Ágnes Hajdu Preben Aavitsland Doris Radun Manuel Dehnert 30