Presentation is loading. Please wait.

Presentation is loading. Please wait.

ANOVA.

Similar presentations


Presentation on theme: "ANOVA."— Presentation transcript:

1 ANOVA

2 ANOVA - What is it? Analysis of variance.
A method for splitting the total variation of a data into meaningful components that measure different sources of variation

3 One-Way Classification (equal samples)
Assumption Random samples of size n are selected from each of the k populations. The k populations are independent and normally distributed with means 𝝁 𝟏 , 𝝁 𝟐 ,⋯ 𝝁 𝒌 and common variance 𝝈 𝟐

4 Ho: & Ha: Ho: 𝝁 𝟏 = 𝝁 𝟐 =⋯ =𝝁 𝒌 Ha: at least two of the means are not equal

5 Critical Region and ANOVA Table [equal samples]
𝑓> 𝑓 𝛼 [𝑘−1,𝑘(𝑛−1)]

6 Computational Formulas

7 EXAMPLE A company has three manufacturing plants, and company officials want to determine whether there is a difference in the average age of workers at the three locations. The following data are the ages of five randomly selected workers at each plant. Perform a one-way ANOVA to determine whether there is a significant difference in the mean ages of the workers at the three plants. Use 0.01level of significance.

8

9 Between Groups = Column Means Within Groups = Error

10 EXAMPLE

11 Between Groups = Column Means Within Groups = Error

12 Critical Region and ANOVA Table [unequal samples]
If the sample size for the k populations are 𝑛 1 , 𝑛 2 ,…, 𝑛 𝑘 then the critical region is given 𝑓> 𝑓 𝛼 𝑘−1,𝑁−𝑘 where 𝑁= 𝑖=1 𝑘 𝑛 𝑖

13 Computational Formulas

14 Example It is suspected that higher-priced automobiles are assembled with greater care than lower-priced automobiles. To investigate whether there is any basis for this feeling, a large luxury model A, a medium-size sedan B, and a subcompact hatchback C were compared for defects when they arrived at the dealer’s showroom. All cars were manufactured by the same company. The number of defects for several of the three models are recorded.

15 Test the hypothesis at 0.05 level of significance that the average number of defects is the same for the three models. MODEL A B C 4 5 8 7 1 6 3 9 TOTAL 23 21 36 80

16

17 EXAMPLE A milk company has four machines that fill gallon jugs with milk. The quality control manager is interested in determining whether the average fill for these machines is the same. The following data represent random samples of fill measures (in quarts) for 19 jugs of milk filled by the different machines. Use 𝛼=0.01to test the hypotheses. Discuss the business implications of your findings.

18 MACHINE 1 MACHINE 2 MACHINE 3 MACHINE 4 4.05 3.99 3.97 4 4.01 4.02 3.98 4.04 3.95

19 Tukey’s Honestly Significant Difference Test : (HSD)
Equal Samples 𝐻𝑆𝐷= 𝑞 𝛼,𝑘,𝑘(𝑛−1) 𝑀𝑆𝐸 𝑛 Unequal Samples 𝐻𝑆𝐷= 𝑞 𝛼,𝑘,𝑘(𝑛−1) 𝑀𝑆𝐸 𝑛 𝑟 𝑛 𝑠 If 𝒙 𝒓 − 𝒙 𝒔 >𝑯𝑺𝑫 then 𝝁 𝒓 is is significantly different from 𝝁 𝒔

20 Example (Milk)

21 ROWS COLUMNS 1 2 ⋯ j c 𝑥 11 𝑥 12 𝑥 1𝑗 𝑥 1𝑐 𝑇 1. 𝑥 1. 𝑥 21 𝑥 22 𝑇 2.
TOTAL MEANS 1 2 j c 𝑥 11 𝑥 12 𝑥 1𝑗 𝑥 1𝑐 𝑇 1. 𝑥 1. 𝑥 21 𝑥 22 𝑇 2. 𝑥 2. i 𝑥 𝑖1 𝑥 𝑖2 𝑥 𝑖𝑗 𝑥 𝑖𝑐 𝑇 𝑖. 𝑥 𝑖. r 𝑥 𝑟1 𝑥 𝑟2 𝑥 𝑟𝑗 𝑥 𝑟𝑐 𝑇 𝑟. 𝑥 𝑟. 𝑇 .1 𝑇 .2 𝑇 .𝑗 𝑇 .𝑐 𝑇 .. MEAN 𝑥 .1 𝑥 .2 𝑥 .𝑗 𝑥 .𝑐 𝑥 ..

22 Two-Way ANOVA (w/o replication)
We wish to test the following hypotheses: Ho: The row means are all equal H1: The row means are significantly different Ho: The column means are all equal H1: The column means are significantly different

23 Computational Formulas
𝑆𝑆𝑇= 𝑖=1 𝑟 𝑗=1 𝑐 𝑥 𝑖𝑗 2 − 𝑇 𝑟𝑐 𝑆𝑆𝑅= 1 𝑐 𝑖=1 𝑟 𝑇 𝑖. 2 − 𝑇 𝑟𝑐 𝑆𝑆𝐶= 1 𝑟 𝑗=1 𝑐 𝑇 .𝑗 2 − 𝑇 𝑟𝑐 𝑆𝑆𝐸=𝑆𝑆𝑇−𝑆𝑆𝐶−𝑆𝑆𝑅

24 Example 4 The yields of three types of wheat using four different kinds of fertilizer were recorded and are shown on the next page: Test the hypothesis at the 0.05 level of significance that there is no difference in the average yield of wheat when different kinds of fertilizer are used. Also, test the hypothesis that there is no difference in the average yield of the three varieties of wheat.

25 Example 4

26 Two-Way ANOVA (with Replication)
We wish to test the following hypotheses: Ho: The row means are all equal H1: The row means are significantly different Ho: The column means are all equal H1: The column means are significantly different Ho: There is no significant interaction effect. H1: There is a significant interaction effect.

27 Computational Formulas

28 Example 5 Aside from testing the difference in the yields according to fertilizer and variety of wheat, try to determine if there is a significant interaction effect on the two variables, given the following data set. Use a 0.05 level of significance.

29 Example 5 ANOVA : PLBautista


Download ppt "ANOVA."

Similar presentations


Ads by Google