P value and confidence intervals

Slides:



Advertisements
Similar presentations
Pitfalls of Hypothesis Testing + Sample Size Calculations.
Advertisements

Chapter 10, part D. IV. Inferences about differences between two population proportions You will have two population proportions, p1 and p2. The true.
Hypothesis Testing making decisions using sample data.
Hypothesis Testing An introduction. Big picture Use a random sample to learn something about a larger population.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Inference Sampling distributions Hypothesis testing.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
AP Statistics – Chapter 9 Test Review
Section 7.1 Hypothesis Testing: Hypothesis: Null Hypothesis (H 0 ): Alternative Hypothesis (H 1 ): a statistical analysis used to decide which of two competing.
Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Introduction to Hypothesis Testing CJ 526 Statistical Analysis in Criminal Justice.
Ch. 9 Fundamental of Hypothesis Testing
Introduction to Testing a Hypothesis Testing a treatment Descriptive statistics cannot determine if differences are due to chance. A sampling error occurs.
Statistical hypothesis testing – Inferential statistics I.
Statistical Inference Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.
AM Recitation 2/10/11.
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Overview Definition Hypothesis
Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Testing Hypotheses Tuesday, October 28. Objectives: Understand the logic of hypothesis testing and following related concepts Sidedness of a test (left-,
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Statistical Inference Decision Making (Hypothesis Testing) Decision Making (Hypothesis Testing) A formal method for decision making in the presence of.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
A Broad Overview of Key Statistical Concepts. An Overview of Our Review Populations and samples Parameters and statistics Confidence intervals Hypothesis.
1 Lecture 19: Hypothesis Tests Devore, Ch Topics I.Statistical Hypotheses (pl!) –Null and Alternative Hypotheses –Testing statistics and rejection.
Agresti/Franklin Statistics, 1 of 122 Chapter 8 Statistical inference: Significance Tests About Hypotheses Learn …. To use an inferential method called.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Hypothesis and Test Procedures A statistical test of hypothesis consist of : 1. The Null hypothesis, 2. The Alternative hypothesis, 3. The test statistic.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Statistical Inference An introduction. Big picture Use a random sample to learn something about a larger population.
Issues concerning the interpretation of statistical significance tests.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Hypothesis Testing. “Not Guilty” In criminal proceedings in U.S. courts the defendant is presumed innocent until proven guilty and the prosecutor must.
A review of key statistical concepts. An overview of the review Populations and parameters Samples and statistics Confidence intervals Hypothesis testing.
Introduction to hypothesis testing Hypothesis testing is about making decisions Is a hypothesis true or false? Ex. Are women paid less, on average, than.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Introduction to Testing a Hypothesis Testing a treatment Descriptive statistics cannot determine if differences are due to chance. Sampling error means.
Hypothesis Testing. Statistical Inference – dealing with parameter and model uncertainty  Confidence Intervals (credible intervals)  Hypothesis Tests.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Hypothesis Tests for 1-Proportion Presentation 9.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
Ch06 Hypothesis Testing.
Statistics for Managers Using Microsoft® Excel 5th Edition
Logic of Hypothesis Testing
Lecture Nine - Twelve Tests of Significance.
FINAL EXAMINATION STUDY MATERIAL III
Inference and Tests of Hypotheses
Chapter 9 Hypothesis Testing: Single Population
Hypothesis Tests for a Population Mean,
Statistical Tests - Power
Chapter 9 Hypothesis Testing.
Daniela Stan Raicu School of CTI, DePaul University
Hypothesis Testing.
Hypothesis Testing.
Significance Tests: The Basics
Hypothesis Testing A hypothesis is a claim or statement about the value of either a single population parameter or about the values of several population.
Psych 231: Research Methods in Psychology
Testing Hypotheses I Lesson 9.
Chapter 9 Hypothesis Testing: Single Population
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

P value and confidence intervals Imre Janszky Faculty of Medicine NTNU

” The statistical education of scientists emphasizes a flawed approach to data analysis that should have been discarded long ago. This defective method is statistical significance testing. ... It has produced countless misinterpretations of data that are often amusing for their folly, but also hair-raising in view of the serious consequences.” Kenneth Rothman

Overview P value, what is it, what is it not, misconceptions and problems with p value Statistical significance and hypothesis testing, misconceptions and problems with statistical significance History of statistical significance Confidence intervals and estimation

P value and hypothesis testing We need : A sample from a population Set up a null hypothesis concerning the population Set up an alternative hypothesis concerning the population Contrast the data with the null hypothesis

Example – body weight of men and women A sample from the big population of men and women The null hypothesis: women and men have the same body weight, ∆weight = 0 Alternative hypothesis: women and men have a different body weight, ∆weight ≠ 0

P value- What is it? A measure of the consistency between the data obtained in a sample and a null hypothesis concerning the big population A simplified definition: P value usually refers to the probability, assuming that the null hypothesis is true, that the data obtained in a study would demonstrate an association as far from the null-hypothesis as, or farther than, what was actually obtained

Example – body weight of men and women We observed that in our sample, ∆weight=5 kg, i.e., on average, men are heavier than women by 5 kg P=0.01 Using the simplified definition: If in the big population of men and women ∆weight=0, then the probability to obtain a difference of +5 kg or higher than that or to obtain a difference of -5kg or lower than that between men and women in our sample is 1%

P value - What is it not? ≠ the probability that the null hypothesis is true ≠ the probability that a finding is just by chance ≠ strength of the association ≠ degree of uncertainty

Statistical significance, hypothesis testing Based on a predefined cut off for p value Typically, the null hypothesis is rejected if p<0.05 (and the results are called statistically significant) and not rejected if p>0.05 (and called statistically non-significant) Type I error is the incorrect rejection of a true null hypothesis Type II error is the incorrect lack of rejection of a false null hypothesis

Problems with statistical significance Court analogy: the defendant is assumed to be innocent until it is proved otherwise/ the null hypotheses is assumed to be true until we can reject it However, decisions in medicine are not be based on a p value from a single study In contrast with the courts, “gray zones” are acceptable Very common misconception: lack of significance is an evidence for lack of effect Loss of important information with simple dichotomization

History of statistical significance 1. First use of statistical significance (John Venn, 1888): the mean height of 2,315 criminals differs from the mean height of 8,585 members of the general adult population by about two inches. The chance to observe this is very low, many billions to one, if there would be no difference. Therefore, Venn concluded, it is likely that the difference is real, or statistically significant

History of statistical significance 2. Modern use of statistical significance was developed in business applications needing decisions based on single results At the beginning there was no universal agreement on what should be the cut-off, or whether an universal cut-off is needed at all

History of statistical significance 3. Developers of hypothesis testing (Egon Pearson and Jerzy Neyman) emphasized that the cut off values should be chosen based on the concrete situation Roland Fisher published a very influential statistical textbook:Statistical Methods for Research Workers (first edition in 1925)

History of statistical significance 4. In his book, Fisher published helping aids (distribution tables for different test statistics) for the calculation of selected p values, including 0.05 He also stated: ”Personally, the writer prefers to set a low standard of significance at the 5 per cent point, and ignore entirely all results which fail to reach this level. ”

Why statistical significance become so widespread? Relatively easy calculation (it was easier to claim statistical significance vs. no significance than calculating a concrete p value) Feeling of objectivity Simplified interpretation for the researcher and for the readers Easy writing False impression of certainty

Estimation and CI In contrast to hypothesis testing, we might be interested in estimating a certain population parameter (e.g. ∆weight in the previous example) It is a quantitative rather than qualitative approach We provide the single most likely value (point estimate) and the uncertainty around it (confidence intervals, CI)

Estimation and CI 95% CI= over unlimited repetitions of the study the 95% confidence interval will contain the true population parameter with a frequency of no less than 95% If a 95% CI does not include the value for the null hypothesis (0 for continuous measures, e.g. ∆weight, risk or rate difference, but 1 for relative measures, e.g. risk ratio, incidence rate ratio, relative risk, odds ratio) then the finding is statistically significant

Estimation and CI However, it is much more important that the point estimate and its CI allows you to consider the effect size and its uncertainty separately

Summary P value is often misinterpreted We shall avoid “statistical significance” as a simple dichotomy of study results Use of confidence intervals is preferable over p values