Presentation on theme: "Hypothesis Testing Dhon G. Dungca, PIE, M.Eng’g.."— Presentation transcript:
Hypothesis Testing Dhon G. Dungca, PIE, M.Eng’g.
Hypothesis Testing Hypothesis – is an assumption about a population or an assertion about the possible value of a population parameter. Hypothesis Testing – is a statistical process of determining whether a hypothesis made is reasonable or not, based upon sample evidence.
Hypothesis Testing Example: –Hypothesis : “The mean of a population is 50” –Test: We decide to take 40 samples and determine x to test if our hypothesis is valid.
Steps in Hypothesis Testing Step 1: Formulate the Null Hypothesis –Denoted by H 0 –Is the hypothesis we hope to reject –This is the hypothesis used for testing and is the starting point of the testing process –Must always express the idea of a no significant difference or relationship –It is a precise statement of equality, such as =10. –We always first consider that the null hypothesis is true unless proven otherwise by sample evidence.
Steps in Hypothesis Testing Assuming k to be any constant, we have the following possible forms of the null hypothesis: –H 0 : = K –H 0 : = K As a precaution, we never make hypothesis tests regarding the statistic. Hence, the null hypothesis H 0 : x = K is not acceptable. We only hypothesize about the value of the population.
Steps in Hypothesis Testing Step 2: Formulate the Alternative Hypothesis –Denoted by H 1 –Is opposite the null hypothesis –It specifies existence of a difference or a relationship –The acceptance of H 1 would mean that the difference between the statistic and population parameter hypothesized is too great for us to allow the acceptance of H 0.
Steps in Hypothesis Testing Assuming k to be any constant, we have the following possible forms of the alternative hypothesis: –H 1 : = K –H 1 : = K –H 1 : K –H 1 : K If H1 is in the form of H 1 : = K, we have a two-tailed test.
Steps in Hypothesis Testing H 0 : = K H 1 : = K Two-tail Test H 0 is rejected H 0 is not rejected
Steps in Hypothesis Testing H 0 : = K H 1 : < K One-tail Test H 0 is rejectedH 0 is not rejected
Steps in Hypothesis Testing H 0 : = K H 1 : > K One-tail Test H 0 is rejectedH 0 is not rejected
Steps in Hypothesis Testing Step 3: Specify the level of significance, –Specifies the area within H 1 is accepted. –1- is the Level of confidence – is the Level of significance – divides the graph into 2 regions, the region of the acceptance of H 0 and the region of the acceptance of H 1 (critical region). –It is customary to use an of 0.05 or 0.01.
Level of Significance H 0 : = K H 1 : = K Two-tail Test 1 - /2 H 0 is not rejected H 0 is rejected
Level of Significance H 0 : = K H 1 : < K One-tail Test 1 - H 0 is rejected H 0 is not rejected
Level of Significance H 0 : = K H 1 : > K One-tail Test 1 - H 0 is not rejected H 0 is rejected
Steps in Hypothesis Testing Step 4: Decide which sampling distribution to choose –Determine the appropriate test statistic whose sampling distribution is known under the assumption that H 0 is true. –Parameter TestedTest Statistic Used z or t x 2
Steps in Hypothesis Testing Step 5: Determine the critical value and the critical region –The critical value is the dividing point between the acceptance and rejection points. –The critical region is the region within which H 0 is rejected. –The critical region (CR) depends upon the value of the level of significance selected in Step 3.
Critical Regions H 0 : = K H 1 : = K Two-tail Test H 0 is rejected H 0 is not rejected 1 - /2 -Z /2 Z 1- /2 CR: Z < -Z /2 Z > Z 1- /2 H 0 is rejected
Critical Regions H 0 : = K H 1 : < K One-tail Test H 0 is not rejected 1 - -Z CR: Z < -Z H 0 is rejected
Critical Regions H 0 : = K H 1 : > K One-tail Test H 0 is not rejected 1 - Z 1- CR: Z > Z 1- H 0 is rejected
Steps in Hypothesis Testing Step 6: Compute the value of the statistic –Using the data from a sample of size n, compute the value of the test statistic z, t or x 2 whichever was selected in Step 4 using the appropriate formula for each case. Step 7: State the conclusion and make the decision –Determine whether the test statistic is in the critical region or not. If the test statistic is in the CR, we accept H 1. Otherwise, H 0 is selected.
Test of Means Gives us an indication of the true average of a population.
Test of Means (Summary) H 0 : = K H 1 : = K, K : 0 CR: Z Z 1- Z > Z 1- /2 t t t > t /2
Test of Means (Summary) Computations: Used if n 30 z = x - or z = x - / n s / n Used if n < 30 t = x - s / n Conclusion:
Example 1: A new process will be installed if its mean processing time is at most 20 minutes. The new procedure was tried. In a random sample of 50 trials, an average processing time of 22.2 minutes with a standard deviation of 4.3 minutes was obtained. At a level of significance =0.05, should the new process be installed?
Answer: H 0 : = 20 (install the new process) H 1 : > 20 (do not install) : 0.05 CR: z > 1.645 Computations: z = 22.2 – 20 4.3 / 50 = 3.62 Conclusion: Since z is greater than 1.645, we reject H 0. We should not install the new process. 1.645 3.62 H 0 is not rejectedH 0 is rejected
Example 2: A company that makes chocolates claims that the mean weight of a bag of chocolates is 240 grams with a standard deviation of 20.5 grams. Using a 0.05 significance level, would you agree with the company if a random sample of 50 bags of chocolates was found to have a mean weight of 230 grams?
Answer: H 0 : = 240 H 1 : 240 : 0.05 CR: z < -1.96 z > 1.96 Computations: z = 230-240 20.5 / 50 = -3.45 Conclusion: Reject Ho. The claim of the manufacturer is not true 1.96 -3.45 H 0 is accepted H 1 is accepted -1.96 H 1 is accepted
Example 3: The Edison Electric Institute has published figures on the annual number of kilowatt-hours expended by various home appliances. It is claimed that a vacuum cleaner expends an average of 46 kilowatt-hours per year. If a random sample of 12 homes included in a planned study indicates that vacuum cleaners expend an average of 42 kilowatt-hours with a standard deviation of 11.9 kilowatt-hours, does this suggest at the 0.05 level of significance that vacuum cleaners expend, on the average, less than 46 kilowatt-hours annually?
Answer: H 0 : = 46 H 1 : < 46 : 0.05 CR: t < -1.796 w/ v = n-1 degrees of freedom Computations: t = 42 - 46 11.9 / 12 = -1.16 Conclusion: Do not reject Ho. -1.16 H 0 is accepted H 1 is accepted -1.796
Example 4: A new process for producing synthetic diamonds can be operated at a profitable level only if the average weight of the diamonds is greater than 0.5K. To evaluate the profitability of the process, 6 diamonds are generated with a mean and a standard deviation of 0.53 and 0.0559 respectively. Do the 6 diamonds’ measurements present sufficient evidence to indicate that the average weight of the diamond produced by the process is in excess of 0.5K?
Answer: H 0 : = 0.5 H 1 : > 0.5 : 0.05 CR: t > 2.015 w/ v = 5 degrees of freedom Computations: t = 0.53 – 0.5.0559 / 6 = 1.32 Conclusion: Do not reject Ho. 1.32 H 0 is accepted H 1 is accepted 2.015
Test of Variance Gives us the degree of dispersion of the population in consideration
Test of Variance (Summary) H 0 : 2 = K H 1 : 2 = K, 2 K : 0 CR: x 2 x 2 x 2 > x 2 /2 Computations: x 2 = (n-1) S 2 2 Conclusion: Accept Ho
Test of Variance (Summary) Accept Ho Accept H 1 x 2 < x 2 1- x 2 > x 2 x 2 < x 2 1- /2 x 2 > x 2 /2 x 2 1- x2x2 x 2 1- /2 x 2 /2
Example 5: In the past, scores in DIFFCAL test have been found to be normally distributed with a variance of 100. A class of 29 students now taught by an outstanding teacher whose method of teaching is expected to have a wider dispersion of test scores. Suppose that the class can be considered to be a random sample and suppose that the variance of student scores in the final examinations is found to be 150; can we conclude that this instructor’s method of teaching results in a wider dispersion at the 5% level of significance?
Answer: H 0 : 2 = 100 (no wider dispersion) H 1 : 2 > 100 (there is wider dispersion) : 0.05 CR: x 2 > 41.337 (x 2 0.05 at v = 29-1 = 28) degrees of freedom Computations: x 2 = (29-1)(150) 100 = 42 Conclusion: Reject Ho. Teaching method results in a wider dispersion 42 Accept Ho Accept H 1 41.337
Example 6: The volume of containers of a particular lubricant is known to be normally distributed with a variance of 0.03 liter. Test the hypothesis that 2 = 0.03 against the alternative that 2 0.03 for the random sample of 10 containers if their variance is 0.06. Use 0.01 level of significance.
Answer: H 0 : 2 = 0.03 H 1 : 2 0.03 : 0.01 CR: x 2 < 1.735 (x 2 0.995 at v = 10-1 = 9) x 2 > 23.589 (x 2 0.005 at v = 10-1 = 9) Computations: x 2 = (10-1)(0.06) 0.03 = 18 Conclusion: Do not reject Ho. 18 Accept Ho Accept H 1 23.5891.735 Accept H 1
Example 7: Past experience indicates that the time for high school seniors to complete a standardized test is normal random variable with a standard deviation of 6 minutes. Test the hypothesis that = 6 against the alternative that < 6 if a random sample of 20 high school seniors has a standard deviation s = 4.51. Use a 0.05 level of significance.
Answer: H 0 : 2 = 36 H 1 : 2 < 36 : 0.05 CR: x 2 < 10.117 (x 2 0.95 at v = 20-1 = 19) Computations: x 2 = (20-1)(20.34) 36 = 10.74 Conclusion: Do not reject Ho. 10.74 Accept Ho 10.117 Accept H 1
Extra Challenge 1: For some years now, a certain hospital has been using Formula for patients known to have a certain type of disease. It has been proven that the mean recovery time of patients under Formula is equal to 20 days with a standard deviation of 4 days. A new type of treatment is now available. This new type of treatment is branded as Formula . A random sample of 25 patients was created with this new formula and showed a mean recovery time of 18 days with a standard deviation of 3.5 days. The hospital is not willing to gamble on new formula unless it is proven to be better than the old formula as far as mean recovery time and variability are concerned. Decide whether the hospital should employ the new formula at the 5% level of significance. (Hint: 2 tests)
Extra Challenge 2: The specifications for a component of a particular item provide that the mean length should not be less than 4 cm with a standard deviation 0.45 cm. A random sample of 50 components drawn from a very large shipment showed a mean length of 3.95 cm and a standard deviation of 0.45 cm. If you were the manufacturer of this item who had received this shipment, would you accept it? Assume 5% level of significance. What if the standard deviation is 0.55 cm?