Nemours Biomedical Research Statistics March 19, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.

Slides:



Advertisements
Similar presentations
Hypothesis Testing making decisions using sample data.
Advertisements

Last Time (Sampling &) Estimation Confidence Intervals Started Hypothesis Testing.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Introduction to Statistics
Confidence Intervals © Scott Evans, Ph.D..
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Elementary hypothesis testing
Basic Elements of Testing Hypothesis Dr. M. H. Rahbar Professor of Biostatistics Department of Epidemiology Director, Data Coordinating Center College.
Normality,Sampling & Hypothesis Testing and sample size estimation Jobayer Hossain, PhD Larry Holmes, Jr, PhD October 23, 2008 RESEARCH STATISTICS.
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Nemours Biomedical Research Statistics March 26, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Inference about Population Parameters: Hypothesis Testing
AM Recitation 2/10/11.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
1 STATISTICAL HYPOTHESES AND THEIR VERIFICATION Kazimieras Pukėnas.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
More About Significance Tests
BPS - 3rd Ed. Chapter 141 Tests of Significance: The Basics.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
Introduction to Statistical Inference Probability & Statistics April 2014.
Comparing Two Population Means
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Jan 17,  Hypothesis, Null hypothesis Research question Null is the hypothesis of “no relationship”  Normal Distribution Bell curve Standard normal.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Essential Statistics Chapter 131 Introduction to Inference.
INTRODUCTION TO INFERENCE BPS - 5th Ed. Chapter 14 1.
CHAPTER 14 Introduction to Inference BPS - 5TH ED.CHAPTER 14 1.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.
Agresti/Franklin Statistics, 1 of 122 Chapter 8 Statistical inference: Significance Tests About Hypotheses Learn …. To use an inferential method called.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
HYPOTHESIS TESTING. Statistical Methods Estimation Hypothesis Testing Inferential Statistics Descriptive Statistics Statistical Methods.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Statistics 101 Chapter 10 Section 2. How to run a significance test Step 1: Identify the population of interest and the parameter you want to draw conclusions.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 8 First Part.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
BPS - 3rd Ed. Chapter 141 Tests of significance: the basics.
1 When we free ourselves of desire, we will know serenity and freedom.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
AP Statistics Section 11.1 B More on Significance Tests.
Statistical Inference Drawing conclusions (“to infer”) about a population based upon data from a sample. Drawing conclusions (“to infer”) about a population.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Tests of Significance: Stating Hypothesis; Testing Population Mean.
1 Probability and Statistics Confidence Intervals.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
A short introduction to epidemiology Chapter 6: Precision Neil Pearce Centre for Public Health Research Massey University Wellington, New Zealand.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Hypothesis Tests for 1-Proportion Presentation 9.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
More on Inference.
Chapter 5 STATISTICAL INFERENCE: ESTIMATION AND HYPOTHESES TESTING
More on Inference.
Essential Statistics Introduction to Inference
Basic Practice of Statistics - 3rd Edition Introduction to Inference
Presentation transcript:

Nemours Biomedical Research Statistics March 19, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility

Nemours Biomedical Research Sampling Variability and Standard Error If we repeat an experiment or measurement on the same number of subjects, the statistic varies as sample varies. This variability is known sampling variability Standard error (SE) measures the sampling variability or the precision of an estimate. –It indicates how precisely one can estimate a population value from a given sample. –For a large sample, approximately 68% of times sample estimate will be within one SE of the population value.

Nemours Biomedical Research Statistical inference is the process by which we acquire information about populations from samples. Two types of estimates for making inferences: –Point estimation: Calculates statistic and then uses for guessing population parameter. –Interval estimation: Calculates an interval using sample data with in which parameter lie with a certain probability. Statistical Inference Sample Population

Nemours Biomedical Research Point estimator Sample distribution Parameter ? Population distribution A point estimate draws inference about a population by estimating the value of an unknown parameter using a single value or a point. Point Estimation

Nemours Biomedical Research Interval estimator Sample distribution An interval estimator draws inferences about a population by estimating the value of an unknown parameter using an interval. Population distribution Parameter Interval Estimation

Nemours Biomedical Research Hypothesis Testing (Quantitative)

Nemours Biomedical Research Hypothesis Testing Hypothesis –Statement or belief about some characteristic of a population such as it’s mean or standard deviation Types of Hypotheses –Null Hypothesis (H 0 ): The hypothesis that is assumed to be true unless contradicted by data observed in a sample –Alternative Hypothesis(H 1 ): The hypothesis one must assumed if the H 0 is rejected. H 0 & H 1 are complementary Normally want to reject H 0 in favor of H 1

Nemours Biomedical Research Test of hypothesis Goal: Keep ,  reasonably small

Nemours Biomedical Research Test of hypothesis Level of SignificanceLevel of Significance: The probability of type I error is known as the level of significance. Power (1-β): The probability of rejecting H 0 when alternative hypothesis is true at a fixed level of significance (α). That is, the probability of rejecting null hypothesis for a specified value of an alternative hypothesis. –Power of a test is a function of sample size and the parameter of interest. –We calculate power for a particular value of the parameter in alternative hypothesis. –An increased sample size increases power of a test.

Nemours Biomedical Research Test of hypothesis We aim to make inferences controlling both type I and type II errors. The reduction in one results in an increase in the other The consequence of type I error seems to be more severe than that of type II error. That’s why we choose a test that minimizes type II error (maximize the power of the test) keeping type I error at a fixed low level (say 0.05).

Nemours Biomedical Research Test of hypothesis Reasons for estimating power prior to conducting a research study: To determine the sample size required for a sufficient power to correctly detect a significant difference.

Nemours Biomedical Research Test of hypothesis Increasing Power: If power of a test is too small than following steps can increase power: Increase the sample size Increase the significance level (α ) Reduce the variability Enlarge the effect size

Nemours Biomedical Research P-Value P-value is the probability, assuming H 0 is true, that the test statistic would take a value as extreme or more extreme than that actually observed. P value is a statement of the likelihood that the the null hypothesis is true. The smaller the P-value, the stronger the evidence against H 0 provided by the data. A p-value of 0.05 implies that there is a 1 in 20 chance of a Type I error, i.e., rejecting H 0 when it is actually true.

Nemours Biomedical Research Confidence Interval A confidence interval (CI) is an interval within which the value of the parameter lies with a specified probability CI measures the precision of an estimate A large sampling variability leads a wide interval reflecting the uncertainty of the estimate A 95% CI implies that if one repeats a study 100 times, the true measure of association will lie inside the CI in 95 out of 100 measures. If a parameter does not lie within 95% CI, indicates significance at 5% level of significance

Nemours Biomedical Research Confidence Interval (CI) point estimate  (measure of how confident we want to be)  (standard error) What effect does larger sample size have on the confidence interval? It reduces standard error and makes CI narrower indicating more precision of estimate

Nemours Biomedical Research Steps in hypothesis Testing Hypothesis testing steps: 1. Specify null (H 0 ) and alternative (H 1 ) hypotheses 2. Select significance level (alpha) - say 0.05 or Calculate test statistic – e.g. t, F, Chi-square 4. Calculate probability value (p-value) or confidence Interval (CI)? 5. Describing the result and statistic in an understandable way.

Nemours Biomedical Research Procedures for sample size calculation Select primary variables of interest and formulate of hypotheses Determine/estimate standard deviation (if numeric) or proportion (if categorical) Decide a tolerance level of significance (  ) Determine test statistic to use Determine desired power or confidence level Determine effect size -- a scientifically or clinically meaningful difference

Nemours Biomedical Research Factors affecting sample size The larger the standard deviation, the larger the sample size needs to be. The smaller the likelihood of a Type I error, the larger the sample size must be The greater the amount of power desired, the larger the sample size must be The smaller the expected effect, the larger the sample size must be

Nemours Biomedical Research Example: Sample size calculation Aim of the study: To test the hypothesis that kids with a parental history of obesity are at a higher risk of being obese themselves. Suppose prevalence of obesity is 2% among US kids, whereas it is 5% among US kids with a parental history. How many kids should we recruit to detect this difference, ( =.03) with a 90% power using a two-sided test with α =.05? Calculation shows that  341 kids are needed for this study.

Nemours Biomedical Research Sample size calculation in R power.t.test: For a hypothesis test of mean of single or two groups. E.g. power.t.test( power=0.87, delta=1, sd=1, sig.level=0.05) (in one sample delta is the difference null hypothesis and alternative hypothesis and in two-sample delta is the true difference of means of two groups). Calculated n  21. power.prop.test: For a hypothesis test of a proportion power.prop.test(power=.80, p1=.5, p2=.75, sig.level=0.05) (in one sample: p1 is the null and p2 is alternative and for two samples p1 and p2 proportion for two groups. Calculated n  58.

Nemours Biomedical Research Useful links for sample size Calculation

Nemours Biomedical Research Power calculation in R Some R functions for power calculation (same functions can be used for sample size calculation): Power.t.test: For a hypothesis test of mean of single or two groups. E.g. power.t.test( n=20, delta=1, sd=1, sig.level=0.05) (in one sample delta is the difference null hypothesis and alternative hypothesis and in two-sample delta is the true difference of means of two groups). Calculated power is about Power.prop.test: For a hypothesis test of a proportion Power.prop.test(n=50, p1=.5, p2=.75, sig.level=0.05) (in one sample: p1 is the null and p2 is alternative and for two samples p1 and p2 proportion for two groups. Calculated power is about 0.74.

Nemours Biomedical Research Rcmdr Demo: p-value and Confidence Interval (CI) calculation Let us calculate p-value and 95% CI for mean of the variable age in our default data set Load the csv data in R as well as in Rcmdr Select menu ‘statistics’ from Rcmdr and then select ‘single.sample t- test’ from drop down menu and specify null hypothesis (say mean=80) and confidence level (.95) and select alternative hypothesis from single.sample t-test’ window.The resulting R output is pasted below: data: Dataset$age t = , df = 59, p-value = alternative hypothesis: true mean is not equal to percent confidence interval: sample estimates: mean of x

Nemours Biomedical Research Rcmdr Demo: p-value and Confidence Interval (CI) calculation Let us calculate the 95% CI and p-value for the proportion of female or male kids in the default data set. Load the csv data in R as well as in Rcmdr Select menu ‘statistics’ from Rcmdr and then select ‘proportions’ from drop down menu and then specify null hypothesis and confidence level and select alternative hypothesis and test type from ‘single sample proportion’ window.The R output is pasted below: null probability 0.6 X-squared = 2.5, df = 1, p-value = alternative hypothesis: true p is not equal to percent confidence interval: sample estimates: p 0.5