Presentation is loading. Please wait.

Presentation is loading. Please wait.

Anthony J Greene1 ANOVA: Analysis of Variance 1-way ANOVA.

Similar presentations


Presentation on theme: "Anthony J Greene1 ANOVA: Analysis of Variance 1-way ANOVA."— Presentation transcript:

1 Anthony J Greene1 ANOVA: Analysis of Variance 1-way ANOVA

2 Anthony J Greene2 ANOVA I.What is Analysis of Variance 1.The F-ratio 2.Used for testing hypotheses among more than two means 3.As with t-test, effect is measured in numerator, error variance in the denomenator 4.Partitioning the Variance II.Different computational concerns for ANOVA 1.Degrees Freedom for Numerator and Denominator 2.No such thing as a negative value III.Using Table B.4 IV.The Source Table V.Hypothesis testing

3 Anthony J Greene3 M2M2 M3M3 M1M1

4 4 ANOVA Analysis of Variance Hypothesis testing for more than 2 groups For only 2 groups t 2 (n) = F (1,n)

5 Anthony J Greene5 BASIC IDEA As with the t-test, the numerator expresses the differences among the dependent measure between experimental groups, and the denominator is the error. If the effect is enough larger than random error, we reject the null hypothesis. M 1 = 1 M 2 = 5 M 3 = 1 Is the Effect Variability Large Compared to the Random Variability Grp 1 Grp 2 Grp 3 Effect V Random V =

6 Anthony J Greene6 BASIC IDEA If the differences accounted for by the manipulation are low (or zero) then F = 1 If the effects are twice as large as the error, then F = 3, which generally indicates an effect.

7 Anthony J Greene7 Sources of Variance

8 Anthony J Greene8 Why Is It Called Analysis of Variance? Aren’t We Interested In Means, Not Variance? Most statisticians do not know the answer to this question? If we’re interested in differences among means why do an analysis of variance? The misconception is that it compares  1 2 to  2 2. No The comparison is between effect variance (differences in group means) to random variance.

9 Anthony J Greene9 Learning Under Three Temperature Conditions T is the treatment total, G is the Grand total M2M2 M3M3 M1M1

10 Anthony J Greene10 Computing the Sums of Squares

11 Anthony J Greene11 How Variance is Partitioned This simply disregards group membership and computes an overall SS Variability Between and Within Groups is Included Keep in mind the general formula for SS M 1 = 1 M 2 = 5 M 3 = 1 Grp 1 Grp 2 Grp 3

12 Anthony J Greene12 How Variance is Partitioned Imagine there were no individual differences at all. The SS for all scores would measure only the fact that there were group differences. Grp 1 Grp 2 Grp 3 Keep in mind the general formula for SS 151151151151151151151151151151 M 1 = 1 M 2 = 5 M 3 = 1T 1 = 5 T 2 = 25 T 3 = 5

13 Anthony J Greene13 How Variance is Partitioned SS computed within a column removes the mean. Thus summing the SS’s for each column computes the overall variability except for the mean differences between groups. M 1 = 1 M 2 = 5 M 3 = 1 Grp 1 Grp 2 Grp 3 Keep in mind the general formula for SS 0-1 1-1 3-1 1-1 0-1 4-5 3-5 6-5 3-5 4-5 1-1 2-1 2-1 0-1 0-1

14 Anthony J Greene14 How Variance is Partitioned M 1 = 1 M 2 = 5 M 3 = 1 Grp 1 Grp 2 Grp 3 0-1 1-1 3-1 1-1 0-1 4-5 3-5 6-5 3-5 4-5 1-1 2-1 2-1 0-1 0-1

15 Anthony J Greene15 Computing Degrees Freedom df between is k-1, where k is the number of treatment groups (for the prior example, 3, since there were 3 temperature conditions) df within is N-k, where N is the total number of ns across groups. Recall that for a t-test with two independent groups, df was 2n-2? 2n was all the subjects N and 2 was the number of groups, k.

16 Anthony J Greene16 Computing Degrees Freedom

17 Anthony J Greene17 How Degrees Freedom Are Partitioned N-1 = (N - k) + (k - 1) N-1 = N - k + k – 1

18 Anthony J Greene18 Partitioning The Sums of Squares

19 Anthony J Greene19 Computing An F-Ratio

20 Anthony J Greene20 Consult Table B-4 Take a standard normal distribution, square each value, and it looks like this

21 Anthony J Greene21 Table B-4

22 Anthony J Greene22 Two different F-curves

23 Anthony J Greene23 ANOVA: Hypothesis Testing

24 Anthony J Greene24 Basic Properties of F-Curves Property 1: The total area under an F-curve is equal to 1. Property 2: An F-curve starts at 0 on the horizontal axis and extends indefinitely to the right, approaching, but never touching, the horizontal axis as it does so. Property 3: An F-curve is right skewed.

25 Anthony J Greene25 Finding the F-value having area 0.05 to its right

26 Anthony J Greene26 Assumptions for One-Way ANOVA 1.Independent samples: The samples taken from the populations under consideration are independent of one another. 2.Normal populations: For each population, the variable under consideration is normally distributed. 3.Equal standard deviations: The standard deviations of the variable under consideration are the same for all the populations.

27 Anthony J Greene27 Learning Under Three Temperature Conditions M 1 = 1 M 2 = 5 M 3 = 1

28 Anthony J Greene28 Learning Under Three Temperature Conditions M 1 = 1 M 2 = 5 M 3 = 1 Is the Effect Variability Large Compared to the Random Variability

29 Anthony J Greene29 Learning Under Three Temperature Conditions

30 Anthony J Greene30 Learning Under Three Temperature Conditions

31 Anthony J Greene31 Learning Under Three Temperature Conditions

32 Anthony J Greene32 Learning Under Three Temperature Conditions

33 Anthony J Greene33 Learning Under Three Temperature Conditions M2M2 M3M3 M1M1

34 Anthony J Greene34 Learning Under Three Temperature Conditions Σ X 2 = 106 191191 16 9 36 9 16 144144 M2M2 M3M3 M1M1

35 Anthony J Greene35 Learning Under Three Temperature Conditions M2M2 M3M3 M1M1

36 36 Learning Under Three Temperature Conditions M2M2 M3M3 M1M1

37 Calculating the F statistic Ss total = X 2 -G 2 /N = 46 SS between = SS between = 30 SS total = Ss between + SS within Ss within = 16

38 Anthony J Greene38 Distribution of the F-Statistic for One-Way ANOVA Suppose the variable under consideration is normally distributed on each of k populations and that the population standard deviations are equal. Then, for independent samples from the k populations, the variable has the F-distribution with df = (k – 1, n – k) if the null hypothesis of equal population means is true. Here n denotes the total number of observations.

39 Anthony J Greene39 ANOVA Source Table for a one-way analysis of variance

40 Anthony J Greene40 The one-way ANOVA test for k population means (Slide 1 of 3) Step 1The null and alternative hypotheses are H o :  1 =  2 =  3 = …=  k H a : Not all the means are equal Step 2Decide On the significance level,  Step 3The critical value of F , with df = (k - 1, N - k), where N is the total number of observations.

41 Anthony J Greene41 The one-way ANOVA test for k population means (Slide 2 of 3)

42 Anthony J Greene42 The one-way ANOVA test for k population means (Slide 3 of 3) Step 4Obtain the three sums of squares, STT, STTR, and SSE Step 5Construct a one-way ANOVA table: Step 6If the value of the F-statistic falls in the rejection region, reject H 0 ;

43 Anthony J Greene43 Post Hocs H 0 :  1 =  2 =  3 = …=  k Rejecting H 0 means that not all means are equal. Pairwise tests are required to determine which of the means are different. One problem is for large k. For example with k = 7, 21 means must be compared. Post-Hoc tests are designed to reduce the likelihood of groupwise type I error.

44 Anthony J Greene44 Criterion for deciding whether or not to reject the null hypothesis

45 Anthony J Greene45 One-Way ANOVA controllow dosehigh dose 015 138 346 014 117 A researcher wants to test the effects of St. John’s Wort, an over the counter, herbal anti-depressant. The measure is a scale of self-worth. The subjects are clinically depressed patients. Use α = 0.01

46 Anthony J Greene46 One-Way ANOVA controllow dosehigh dose 015 138G=45 346 014 117 T 1 =5T 2 =10T 3 =30 Compute the treatment totals, T, and the grand total, G

47 Anthony J Greene47 One-Way ANOVA controllow dosehigh dose 015 138G=45 346N=15 014k=3 117 T 1 =5T 2 =10T 3 =30 n 1 =5n 2 =5n 3 =5 Count n for each treatment, the total N, and k

48 Anthony J Greene48 One-Way ANOVA controllow dosehigh dose 015 138G=45 346N=15 014k=3 117 T 1 =5T 2 =10T 3 =30 n 1 =5n 2 =5n 3 =5 M 1 =1M 2 =2M 3 =6 Compute the treatment means

49 Anthony J Greene49 One-Way ANOVA controllow dosehigh dose 015 138G=45 346N=15 014k=3 117 T 1 =5T 2 =10T 3 =30 n 1 =5n 2 =5n 3 =5 M 1 =1M 2 =2M 3 =6 SS=6SS=8SS=10 Compute the treatment SSs (0-1) 2 =1 (1-1) 2 =0 (3-1) 2 =4 (0-1) 2 =1 (1-1) 2 =0 sum

50 Anthony J Greene50 One-Way ANOVA controllow dosehigh dose 015 138G=45 346N=15 014k=3 117  X 2 = 229 T 1 =5T 2 =10T 3 =30 n 1 =5n 2 =5n 3 =5 M 1 =1M 2 =2M 3 =6 SS=6SS=8SS=10 Compute all X 2 s and sum them

51 Anthony J Greene51 One-Way ANOVA controllow dosehigh dose 015 138G=45 346N=15 014k=3 117  X 2 = 229 T 1 =5T 2 =10T 3 =30 SS Total =94 n 1 =5n 2 =5n 3 =5 M 1 =1M 2 =2M 3 =6 SS=6SS=8SS=10 Compute SS Total SS Total =  X 2 – G 2 / N

52 Anthony J Greene52 One-Way ANOVA controllow dosehigh dose 015 138G=45 346N=15 014k=3 117  X 2 = 229 T 1 =5T 2 =10T 3 =30 SS Total =94 n 1 =5n 2 =5n 3 =5 SS Within =24 M 1 =1M 2 =2M 3 =6 SS 1 =6SS 2 =8SS 3 =10 Compute SS Within SS Within =  SS i

53 Anthony J Greene53 One-Way ANOVA controllow dosehigh dose 015 138G=45 346N=15 014k=3 117  X 2 = 229 T 1 =5T 2 =10T 3 =30 SS Total =94 n 1 =5n 2 =5n 3 =5 SS Within =24 M 1 =1M 2 =2M 3 =6 d.f. Within =12 SS 1 =6SS 2 =8SS 3 =10 d.f. Between =2 d.f. Total =14 Determine d.f.s d.f. Within =N-k d.f. Between =k-1 d.f. Total =N-1 Note that (N-k)+(k-1)=N-1

54 Anthony J Greene54 One-Way ANOVA controllow dosehigh dose 015 138G=45 346N=15 014k=3 117  X 2 = 229 T 1 =5T 2 =10T 3 =30 SS Total =94 n 1 =5n 2 =5n 3 =5 SS Within =24 M X1 =1M X2 =2M X3 =6 d.f. Within =12 SS 1 =6SS 2 =8SS 3 =10 d.f. Between =2 d.f. Total =14 Ready to move it to a source table

55 Anthony J Greene55 One-Way ANOVA Compute the missing values SourceSSdfMSF Between702 Within2412 Total9414

56 Anthony J Greene56 One-Way ANOVA Compute the missing values SourceSSdfMSF Between70235 Within24122 Total9414

57 Anthony J Greene57 One-Way ANOVA Compute the missing values SourceSSdfMSF Between7023517.5 Within24122 Total9414

58 Anthony J Greene58 One-Way ANOVA 1.Compare your F of 17.5 with the critical value at 2,12 degrees of freedom,  = 0.01: 6.93 2.reject H 0 SourceSSdfMSF Between7023517.5 Within24122 Total9414

59 Anthony J Greene59 One-Way ANOVA Low Medium High 269 4410 358 03 2121 6666 8989 Students want to know if studying has an impact on a 10-point statistics quiz, so they divided into 3 groups: low studying (0-5hrs./wk), medium studying (6-15 hrs./wk) and high studying (16+ hours/week). At α=0.01, does the amount of studying impact quiz scores?

60 Anthony J Greene60 One-Way ANOVA low medium high 269 4410G=96 358 0310 2121 6666 8989 T 1 =12T 2 =30T 3 =54 Compute the treatment totals, T, and the grand total, G

61 Anthony J Greene61 One-Way ANOVA lowmedium high 269 4410G=96 358N=18 0310k=3 2121 6666 8989 T 1 =12T 2 =30T 3 =54 n 1 =6n 2 =6n 3 =6 Count n for each treatment, the total N, and k

62 Anthony J Greene62 One-Way ANOVA lowmedium high 269 4410G=96 358N=18 0310k=3 2121 6666 8989 T 1 =12T 2 =30T 3 =54 n 1 =6n 2 =6n 3 =6 M 1 =2M 2 =5M 3 =9 Compute the treatment means

63 Anthony J Greene63 One-Way ANOVA low medium high 269 4410G=96 358N=18 0310k=3 2121 6666 8989 T 1 =12T 2 =30T 3 =30 n 1 =6n 2 =6n 3 =6 M 1 =2M 2 =5M 3 =9 SS=10SS=8SS=10 Compute the treatment SSs (2-2) 2 =0 (4-2) 2 =4 (3-2) 2 =1 (0-2) 2 =4 (2-2) 2 =0 (1-2) 2 =1 sum

64 Anthony J Greene64 One-Way ANOVA lowmedium high 269 4410G=96 358N=18 0310k=3 2121 6666 8989  X 2 =682 T 1 =12T 2 =30T 3 =54 n 1 =6n 2 =6n 3 =6 M 1 =2M 2 =5M 3 =9 SS=10SS=8SS=10 Compute all X 2 s and sum them

65 Anthony J Greene65 One-Way ANOVA lowmedium high 269 4410G=96 358N=18 0310k=3 2121 6666 8989  X 2 = 682 T 1 =12T 2 =30T 3 =54 SS Total =170 n 1 =6n 2 =6n 3 =6 M 1 =2M 2 =5M 3 =9 SS=10SS=8SS=10 Compute SS Total SS Total =  X 2 – G 2 / N

66 Anthony J Greene66 One-Way ANOVA lowmedium high 269 4410G=96 358N=18 0310k=3 2121 6666 8989  X 2 = 682 T 1 =12T 2 =30T 3 =54 SS Total =170 n 1 =6n 2 =6n 3 =6 SS Within =28 M 1 =2M 2 =5M 3 =9 SS 1 =10SS 2 =8SS 3 =10 Compute SS Within SS Within =  SS i

67 Anthony J Greene67 One-Way ANOVA lowmedium high 269 4410G=90 358N=18 0310k=3 2121 6666 8989  X 2 = 682 T 1 =12T 2 =30T 3 =54 SS Total =170 n 1 =6n 2 =6n 3 =6 SS Within =28 M 1 =2M 2 =5M 3 =9 d.f. Within =15 SS 1 =10SS 2 =8SS 3 =10 d.f. Between =2 Determine d.f.s d.f. Within =N-k d.f. Between =k-1 d.f. Total =N-1 Note that (N-k)+(k-1)=N-1

68 Anthony J Greene68 One-Way ANOVA Fill in the values you have SourceSSdfMSF Between2 Within2815 Total17017

69 Anthony J Greene69 One-Way ANOVA Compute the missing values SourceSSdfMSF Between14227137.97 Within28151.87 Total17017

70 Anthony J Greene70 One-Way ANOVA 1.Compare your F of 37.97 with the critical value at 2,15 degrees of freedom,  = 0.01: 6.36 2.reject H 0 SourceSSdfMSF Between14227137.97 Within28151.87 Total17017


Download ppt "Anthony J Greene1 ANOVA: Analysis of Variance 1-way ANOVA."

Similar presentations


Ads by Google