RDPStatistical Methods in Scientific Research - Lecture 41 Lecture 4 Sample size determination 4.1 Criteria for sample size determination 4.2 Finding the.

Slides:



Advertisements
Similar presentations
Sample size estimation
Advertisements

Hypothesis testing Another judgment method of sampling data.
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
Beyond Null Hypothesis Testing Supplementary Statistical Techniques.
Significance Testing Chapter 13 Victor Katch Kinesiology.
Sample size computations Petter Mostad
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Hypothesis Testing After 2 hours of frustration trying to fill out an IRS form, you are skeptical about the IRS claim that the form takes 15 minutes on.
Topic 2: Statistical Concepts and Market Returns
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Experimental Evaluation
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
BCOR 1020 Business Statistics
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
5-3 Inference on the Means of Two Populations, Variances Unknown
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Quantitative Business Methods for Decision Making Estimation and Testing of Hypotheses.
Sample Size Determination Ziad Taib March 7, 2014.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Chapter 15 Nonparametric Statistics
Hypothesis Testing:.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
More About Significance Tests
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
RMTD 404 Lecture 8. 2 Power Recall what you learned about statistical errors in Chapter 4: Type I Error: Finding a difference when there is no true difference.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
Jan 17,  Hypothesis, Null hypothesis Research question Null is the hypothesis of “no relationship”  Normal Distribution Bell curve Standard normal.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Elementary Statistical Methods André L. Souza, Ph.D. The University of Alabama Lecture 22 Statistical Power.
RDPStatistical Methods in Scientific Research - Lecture 11 Lecture 1 Interpretation of data 1.1 A study in anorexia nervosa 1.2 Testing the difference.
LECTURE 19 THURSDAY, 14 April STA 291 Spring
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.
STA Lecture 251 STA 291 Lecture 25 Testing the hypothesis about Population Mean Inference about a Population Mean, or compare two population means.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
통계적 추론 (Statistical Inference) 삼성생명과학연구소 통계지원팀 김선우 1.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
3-1 MGMG 522 : Session #3 Hypothesis Testing (Ch. 5)
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
MPS/MSc in StatisticsAdaptive & Bayesian - Lect 41 Lecture 4 Sample size reviews 4.1A general approach to sample size reviews 4.2Binary data 4.3Normally.
BPS - 5th Ed. Chapter 151 Thinking about Inference.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
This Week Homework 2 parts 1 and 2 due by 5pm today. Answer keys posted tonight. Midterm 2 on Thursday, bring 8 ½ by 11 inch note sheet. No calculator.
WS 2007/08Prof. Dr. J. Schütze, FB GW KI 1 Hypothesis testing Statistical Tests Sometimes you have to make a decision about a characteristic of a population.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
Inference for distributions: - Comparing two means.
Sample Size Needed to Achieve High Confidence (Means)
Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Inferential Statistics Psych 231: Research Methods in Psychology.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Chapter 8: Inference for Proportions
Presentation transcript:

RDPStatistical Methods in Scientific Research - Lecture 41 Lecture 4 Sample size determination 4.1 Criteria for sample size determination 4.2 Finding the sample size 4.3 Some simple variations 4.4 Further considerations

RDPStatistical Methods in Scientific Research - Lecture Criteria for sample size determination Suppose that we are to conduct an investigation comparing populations,  A and  B Sample A comprises n A units of observation from  A Sample B comprises n B units of observation from  B Suppose that n A = n B and that n = n A + n B The responses will be quantitative, and the analysis will use a t-test How should we choose n?

RDPStatistical Methods in Scientific Research - Lecture 43 Let  A  = mean response for  A  B  = mean response for  B Null hypothesis is H 0 :  A  =  B From the data, we will obtain the sample means and and sample standard deviations S A and S B for groups A and B Once we have the data, we can:  Reject H 0 and say that  A  >  B  Reject H 0 and say that  A  <  B  Not reject H 0

RDPStatistical Methods in Scientific Research - Lecture 44 When n A = n B = n/2, the t-statistic is where t will tend to be positive if  A  >  B,  negative if  A  <  B and close to zero if  A  =  B

RDPStatistical Methods in Scientific Research - Lecture 45 We will:  Reject H 0 and say that  A  >  B  if t  k  Reject H 0 and say that  A  <  B if t   k  Not reject H 0 if  k < t < k Say  A  B  k 0 k t Now we need to find both n and k

RDPStatistical Methods in Scientific Research - Lecture 46 Suppose that, in truth,  A  =  B This does not mean that we will observe nor t = 0 In fact, we may observe t  k or t   k, just by chance This means that we might reject H 0 when H 0 is true This is called type I error

RDPStatistical Methods in Scientific Research - Lecture 47 Suppose that, in truth,  A  =  B  +  where  > 0, and  is of a magnitude that would be scientifically worth detecting We may still observe t  k by chance This means that we might fail to reject H 0 when H 0 is false This is called type II error

RDPStatistical Methods in Scientific Research - Lecture 48 The probability that t  k or t   k, when  A  =  B, is called the risk of type I error, and is denoted by  (This is for a two-sided alternative: the probability that t  k, when  A  =  B, is the risk of type I error for a one-sided alternative and is equal to  /2) The probability that t  k, when  A  =  B +  is called the risk of type II error, and is denoted by  The probability that t  k, when  A  =  B +  is called the power, and is equal to 1  

RDPStatistical Methods in Scientific Research - Lecture 49 Reducing type I error Increase k – make it difficult to reject H 0 Increasing power Decrease k – make it easy to reject H 0 Reducing type I error and increasing power simultaneously Increase n – this will make the study more informative, but it will cost more

RDPStatistical Methods in Scientific Research - Lecture Finding the sample size Suppose that the true standard deviation within each of the populations  A and  B is  Then t  Z where Z follows the normal distribution, with standard deviation 1 When  A  =  B, Z has mean 0 When  A  =  B + , Z has mean  n/(2  )

RDPStatistical Methods in Scientific Research - Lecture 411 Specify that the type I risk of error (two-sided) should be  : P( Z  k or Z   k :  A  =  B ) =  (1) Under H 0, Z is normally distributed with mean 0 and st dev 1 k is the value exceeded by a normal (0, 1) random variable with prob  /2

RDPStatistical Methods in Scientific Research - Lecture 412 Specify that the type II risk of error should be  : P( Z  k :  A  =  B +  ) =  (2) Under H 0, Z is normally distributed with mean  n/(2  ) and st dev 1 k   n/(2  ) is the value exceeded by a normal (0, 1) random variable with prob 1 

RDPStatistical Methods in Scientific Research - Lecture 413 For  = 0.05 and 1 –  = 0.90, we have k = and k   n/(2  ) =  Thus Power: 1  Type I error: 

RDPStatistical Methods in Scientific Research - Lecture 414 Sample size increases:  as  increases  as  decreases  as  decreases  as 1  increases

RDPStatistical Methods in Scientific Research - Lecture 415 Unequal randomisation The power of a study depends on which, for equal sample sizes is equal to For n E = Rn C, n = Rn C + n C and so 4.3 Some simple variations

RDPStatistical Methods in Scientific Research - Lecture 416 Unequal randomisation So, the overall sample size is multiplied by the factor and n E by F E and n C by F C, where R F FEFE FCFC

RDPStatistical Methods in Scientific Research - Lecture 417 Unknown standard deviation The sample size formula depends on guessing  If this guess is smaller than the truth, the sample size will be too small and the study underpowered If this guess is larger than the truth, the sample size will be too large and the sample size unnecessarily large A more accurate calculation can be based on the t-distribution rather than the normal, but this makes little difference and does not overcome the dependence on 

RDPStatistical Methods in Scientific Research - Lecture 418 Unknown standard deviation Often, the final analysis will be based on a linear model, not just a t-test The formulae given can still be used, but  is now the residual standard deviation (the SD about the fitted model) Fitting the right factors will reduce the residual standard deviation, and so the sample size will also be reduced  but you have to guess what  will be in advance!

RDPStatistical Methods in Scientific Research - Lecture 419 Sample size for estimation The sample size can be determined to give a confidence interval of specified width W The 95% confidence interval for  =  A   B is of the form when sample sizes are large (Lecture 1, Slide 24) When n A = n B = n/2, this has length

RDPStatistical Methods in Scientific Research - Lecture 420 Sample size for estimation We need to set which means that

RDPStatistical Methods in Scientific Research - Lecture 421 Binary data For R = 1,  = 0.05 and 1 –  = 0.90, we have where p C is the anticipated success rate in  C, and p E the improved rate in  E to be detected with power 1 

RDPStatistical Methods in Scientific Research - Lecture 422 Examples for binary data: R = 1,  = 0.05 and 1 –  = 0.90 pCpC pEpE  n

RDPStatistical Methods in Scientific Research - Lecture 423 Binary data This approach is based on the log-odds ratio Many other approximate formulae exist All give similar answers when sample sizes are large: exact calculations can be made for small sample sizes

RDPStatistical Methods in Scientific Research - Lecture Further considerations Setting the values for  and  The standard scientific convention is to ensure that  will be small, and allow any risks to be taken with  For example, if an SD or a control success rate is underestimated at the design stage, the study will be underpowered – the analysis maintains the type I error  at the cost of losing power  is the community’s risk of being given a false conclusion  is the scientist’s risk of not proving his/her point

RDPStatistical Methods in Scientific Research - Lecture 425 Exceptions If the scientist wishes to prove the null hypothesis (equivalence testing)  then  should be kept small, while  can be inflated if necessary In a pilot study, preliminary to a larger confirmatory study  type I errors can be rectified in the next study, but type II errors will mean that the next study is not conducted at all

RDPStatistical Methods in Scientific Research - Lecture 426 Finally:  Many more sample size formulae exist – see Machin et al. (1997)  Software also exists: nQuery advisor, PASS  Ensure that the sample size formula used matches the intended final analysis  In complicated situations, the whole study can be simulated on the computer in advance to determine its power