Chapter 9 Tests of Hypothesis Single Sample Tests The Beginnings – concepts and techniques Chapter 9A.

Chapter 9 Tests of Hypothesis Single Sample Tests The Beginnings – concepts and techniques Chapter 9A

9-1.1 Statistical Hypotheses Some Definitions Statistical Hypothesis - An assertion about a population parameter or distribution. Test of hypothesis – arriving at a decision to reject or not reject a hypothesis based upon a sample from the population. Null hypothesis – usually the hypothesis of no difference. The assertion that the researcher usually wants to reject. H o :  =  0 Alternate Hypothesis – the assertion that is accepted if the null hypothesis is rejected. The assertion that the researcher generally wants to prove. H 1 :    0

Hypothesis Test on a Population Mean Two-Sided Test: One-Sided Tests:

Test of Hypothesis If the information in a sample is consistent with the null hypothesis, then we will conclude that the null hypothesis cannot be rejected; If this information is inconsistent with the null hypothesis, we will conclude that the hypothesis is false and reject the null hypothesis in favor of the alternate hypothesis. Critical Region – the set of values for the test statistic that results in rejecting the null hypothesis. The test statistic is calculated from the sample; i.e. a sample statistic

What can go wrong? H 0 true H 0 false Do not Reject H 0 correct Type II decision error Reject H 0 Type I correct (accept H 1 )error decision

How Likely are the errors? Type I Error – Incorrectly Rejecting a True Hypothesis  = P(Type I error) (1-  probability of not rejecting a true hypothesis Type II Error – Incorrectly Accepting a False Hypothesis  = P(Type II error) Power of test (1-  ) - probability of correctly rejecting the null when the alternative is true. The probability of a type I error is called the significance level of the test.

Our Very First Hypothesis Test Professor Notso Brite believes that his mean driving time to the campus from his home is 50 minutes while Dean Nowet Ah disagrees with him believing that it takes him, on the average more than 50 minutes. It is known that the standard deviation of his driving time is 2.5 minutes and driving time is normally distributed. humor me here For the next 10 days, Professor Brite records his driving time with Can we accept Dean Nowet Ah’s assertion that the mean driving time must be greater than 50 minutes?

The Hypothesis H 0 :  = 50 minutes H 1 :  > 50 minutes (one-tailed test) Given:  = 2.5 minutes n = 10

More Probability of a Type I Error Let’s set the probability of a Type I error =.05, Then P(Type I error) = P(reject H 0 |H 0 is correct) =  =.05 H 0 :  = 50 minutes H 1 :  > 50 minutes

What about the Type II Error? Incorrectly Accepting a False Hypothesis P(Type II error) = P(not rejecting H 0 |H 1 is correct) =  H 0 :  = 50 minutes H 1 :  > 50 minutes But professor, that probability depends upon the true value of the population mean under the alternate hypothesis.

More about Type II Errors Say the true mean is 51:

The Situation Graphically Displayed X c = 51.3  0 = 50;  1 = 51 Prob =.6479 Prob =.05

More Graphical Display X c = 51.3 Prob =.05  0 = 50;  1 = 52 Prob =.1878

The Operating Characteristic (OC) Curve

The Power of the Test The power is computed as 1 - , and power can be interpreted as the probability of correctly rejecting a false null hypothesis. We often compare statistical tests by comparing their power properties.

The Power of the Test Power of test (1-  ) - probability of correctly rejecting the null hypothesis when the alternative is true.

The Prob-Value H 0 :  = 50 minutes H 1 :  > 50 minutes (one-tailed test) Given:  = 2.5, n = 10

The Prob-Value 51.7 P-Value =.0157 51.3  =.05

Sample Size Determination For 2-tailed test:

Sample Size in Action What sample size is need if the level of significance is one percent and the probability of rejecting the null hypothesis if the true mean is 52 is 95 percent?

A Two-Tailed Test H 0 :  = 50 minutes H 1 :  = 50 minutes (two-tailed test) Given:  = 2.5 minutes n = 10

More Probability of a Type I Error Let’s set the probability of a Type I error =.05, Then

Probability of a Type II Error  1 = 52:  1 = 49:

A Two-Tailed Prob-Value Reject H 0 if  .0314

A two-sided confidence interval – a study in comparison Given:  = 2.5 minutes and n = 10 95% confidence interval: A 95% confidence interval identifies a set of acceptable hypotheses at the 5% level of significance. A mean of 50 lies outside the interval and is therefore rejected.

Confidence Intervals and Hypothesis Tests – together at last

9-2 Tests on the Mean of a Normal Distribution, Variance Known We wish to test : The test statistic is : Reject H 0 if the observed value of the test statistic z 0 is either : z 0 > z  /2 or z 0 < -z  /2 Fail to reject H 0 if -z  /2 < z 0 < z  /2

9-2 Tests on the Mean of a Normal Distribution, Variance Known

Alternately

Points to Ponder When we are considering Type II errors (beta), we use the distribution of the test statistic under the alternative hypothesis. When we are considering Type I errors (alpha), we use the distribution of the test statistic under the null hypothesis.

Statistical versus Practical Significance statistical significance says nothing about the importance of the difference there may be statistically significant difference between two values with no practical difference mean of 50.4 driving minutes versus 49.7 driving minutes large sample sizes will identify a difference There may no statistically significant difference between two values but there is a significant practical difference mean of.20 mm in the diameter of a ball-bearing versus.18 mm

Interactions -- alpha, beta, sample size Alpha/beta tradeoffs. Lower alpha value means a larger beta value. Power of a test is (1-beta). Lower alpha implies we are reluctant to risk rejecting a true hypothesis. But it means we must risk accepting a false one. Only way to improve both is to increase the sample size. N  and  then  N  and  then  N  then:  and 

On the selection of the level of significance Convention is to use.01 or.05 Consider practical consequences of making a type I or II error Consider power of the test and sample size Large N – small difference will be statistically significant – use small  (.01 -.001) Small N – large differences may not be detected – use large  (.05 -.10) Consider “true” difference Type I versus Type II errors Use the P-value and let the reader decide “I’m just reporting the facts; you decide”

General Procedures for Hypothesis Tests 1.Identify the parameter of interest. 2.State the null hypothesis – H 0. 3.Specify the alternative – H 1. 4.Choose the significance level – alpha – risk of Type I error. 5.Determine the appropriate test statistic. 6.State the rejection region for the statistic. 7.Compute the sample quantities (i.e. from the experiment or measurement) and substitute into the equation for test statistic. 8.Decide whether to reject H 0.

A Little Philosophy If we reject the null hypothesis: either H 1 is true or we were extremely unlucky and hit on the 5 percent of the samples that fall in the critical region We go with the odds and reject the null If we fail to reject the null (assume x-bar = 11.1) H 0 is still left standing at the end of the test The alternative hypothesis is what we wish to prove and believe to be correct The sample supports H 1 but the test does not allow us to reject H 0 Therefore, we conclude that the evidence does not allow us to reject H 0 stopping short of saying we accept H 0. Consider the following: Consider H 0:  = 10 H 1  > 0

Large Sample Test In most situations, the population variance is unknown and the population may not be well modeled as a normal distribution If n is large (n >40), the sample standard deviation, s, can be substituted for  with little effect appealing to the central limit theorem Exact tests where the population is normal,  2 is unknown, and n is small results in t-distribution.

A Little Recap Tests on a mean, variance known, normal population or large sample size (CLT) H 0 :  =  0 H 1 :    0 H 0 :  =  0 H 1 :  >  0 H 0 :  =  0 H 1 :  <  0

Next Time Time Permitting

Chapter 9 Tests of Hypothesis Single Sample Tests The Beginnings – concepts and techniques Chapter 9A.

Similar presentations

Presentation on theme: "Chapter 9 Tests of Hypothesis Single Sample Tests The Beginnings – concepts and techniques Chapter 9A."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Chapter 9 Tests of Hypothesis Single Sample Tests The Beginnings – concepts and techniques Chapter 9A.

Similar presentations

Presentation on theme: "Chapter 9 Tests of Hypothesis Single Sample Tests The Beginnings – concepts and techniques Chapter 9A."— Presentation transcript:

Similar presentations

About project

Feedback