ANOVA: A Test of Analysis of Variance

Slides:



Advertisements
Similar presentations
Analysis of Variance Chapter 12 McGraw-Hill/Irwin
Advertisements

Chapter 25 Paired Samples and Blocks
Chi-square and F Distributions
Inference on Proportions. What are the steps for performing a confidence interval? 1.Assumptions 2.Calculations 3.Conclusion.
BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
CHAPTER 25: One-Way Analysis of Variance Comparing Several Means
CHAPTER 25: One-Way Analysis of Variance: Comparing Several Means ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner.
Confidence Interval and Hypothesis Testing for:
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Testing means, part III The two-sample t-test. Sample Null hypothesis The population mean is equal to  o One-sample t-test Test statistic Null distribution.
Classical Regression III
PSY 307 – Statistics for the Behavioral Sciences
Statistics Are Fun! Analysis of Variance
Statistics 101 Class 9. Overview Last class Last class Our FAVORATE 3 distributions Our FAVORATE 3 distributions The one sample Z-test The one sample.
Testing the Difference Between Means (Small Independent Samples)
8-5 Testing a Claim About a Standard Deviation or Variance This section introduces methods for testing a claim made about a population standard deviation.
Hypothesis Testing :The Difference between two population mean :
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Chapter 12: Analysis of Variance
T Test for One Sample. Why use a t test? The sampling distribution of t represents the distribution that would be obtained if a value of t were calculated.
The Chi-Square Distribution 1. The student will be able to  Perform a Goodness of Fit hypothesis test  Perform a Test of Independence hypothesis test.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
Hypothesis Testing with Two Samples
Education 793 Class Notes T-tests 29 October 2003.
STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Student’s t-distributions. Student’s t-Model: Family of distributions similar to the Normal model but changes based on degrees-of- freedom. Degrees-of-freedom.
Hypothesis Testing with One Sample Chapter 7. § 7.3 Hypothesis Testing for the Mean (Small Samples)
Copyright © 2010, 2007, 2004 Pearson Education, Inc Chapter 12 Analysis of Variance 12.2 One-Way ANOVA.
1 Section 9-4 Two Means: Matched Pairs In this section we deal with dependent samples. In other words, there is some relationship between the two samples.
Chapter 12 Analysis of Variance. An Overview We know how to test a hypothesis about two population means, but what if we have more than two? Example:
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Example 10.17:
Tests of Hypotheses Involving Two Populations Tests for the Differences of Means Comparison of two means: and The method of comparison depends on.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
While you wait: Enter the following in your calculator. Find the mean and sample variation of each group. Bluman, Chapter 121.
9.2 Testing the Difference Between Two Means: Using the t Test
10.5 Testing Claims about the Population Standard Deviation.
The Analysis of Variance. One-Way ANOVA  We use ANOVA when we want to look at statistical relationships (difference in means for example) between more.
Testing Differences between Means, continued Statistics for Political Science Levin and Fox Chapter Seven.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
11.1 Inference for the Mean of a Population.  Perform a one sample t-test for the mean of a population  Perform a matched pairs t-test.
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
Inferences Concerning Variances
AP Statistics.  If our data comes from a simple random sample (SRS) and the sample size is sufficiently large, then we know that the sampling distribution.
Chapters 22, 24, 25.
T tests comparing two means t tests comparing two means.
While you wait: Enter the following in your calculator. Find the mean and sample variation of each group. Bluman, Chapter 121.
Chapter 10 Section 5 Chi-squared Test for a Variance or Standard Deviation.
AP Statistics Chapter 24 Notes “Comparing Two Sample Means”
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Statistical Inferences for Population Variances
Chapter 9 Hypothesis Testing.
Unit 8 Section 7.5.
Testing the Difference Between Two Means
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Lecture Slides Elementary Statistics Twelfth Edition
Math 4030 – 10a Tests for Population Mean(s)
Basic Practice of Statistics - 5th Edition
Chapter 8 Hypothesis Testing with Two Samples.
HYPOTHESIS TESTING FOR Variance and standard deviation
Elementary Statistics
Elementary Statistics: Picturing The World
Hypothesis Tests for a Standard Deviation
Comparing Two Populations
Hypothesis Testing and Confidence Intervals
Homework: pg. 693 #4,6 and pg. 698#11,12 4.) A. µ=the mean gas mileage for Larry’s car on the highway Ho: µ=26 mpg Ha: µ>26 mpg B.
Lecture Slides Elementary Statistics Twelfth Edition
Statistical Inference for the Mean: t-test
STA 291 Summer 2008 Lecture 21 Dustin Lueker.
Presentation transcript:

ANOVA: A Test of Analysis of Variance By Harry Lee and Manik Kuchroo

What is the ANOVA Test? Remember the 2-Mean T-Test? For example: A salesman in car sales wants to find the difference between two types of cars in terms of mileage: Mid-Size Vehicles Sports Utility Vehicles

Car Salesman’s Sample The salesman took an independent SRS from each population of vehicles: Level n Mean StDev Mid-size 28 27.101 mpg 2.629 mpg SUV 26 20.423 mpg 2.914 mpg If a 2-Mean TTest were done on this data: T = 8.15 P-value = ~0

Level n Mean StDev Midsize 28 27.101 mpg 2.629 mpg What if the salesman wanted to compare another type of car, Pickup Trucks in addition to the SUV’s and Mid-size vehicles? Level n Mean StDev Midsize 28 27.101 mpg 2.629 mpg SUV 26 20.423 mpg 2.914 mpg Pickup 8 23.125 mpg 2.588 mpg

In a 2-Mean TTest, we see if the This is an example of when we would use the ANOVA Test. In a 2-Mean TTest, we see if the difference between the 2 sample means is significant. The ANOVA is used to compare multiple means, and see if the difference between multiple sample means is significant.

Let’s Compare the Means… Yes, we see that no two of these confidence intervals overlap, therefore the means are significantly different. This is the question that the ANOVA test answers mathematically. Do these sample means look significantly different from each other?

More Confidence Intervals What if the confidence intervals were different? Would these confidence intervals be significantly different? Significant Not Significant

ANOVA Test Hypotheses H0: µ1 = µ2 = µ3 (All of the means are equal) HA: Not all of the means are equal For Our Example: H0: µMid-size = µSUV = µPickup The mean mileages of Mid-size vehicles, Sports Utility Vehicles, and Pickup trucks are all equal. HA: Not all of the mean mileages of Mid-size vehicles, Sports Utility Vehicles, and Pickup trucks are equal.

F Statistic Like any other test, the ANOVA test has its own test statistic The statistic for ANOVA is called the F statistic, which we get from the F Test The F statistic takes into consideration: number of samples taken (I) sample size of each sample (n1, n2, …, nI) means of the samples ( 1, 2, …, I) standard deviations of each sample (s1, s2, …, sI)

Explaining the F-Statistic The F statistic determines if the variation between sample means is significant This is what we are doing when we look at the 95% confidence intervals.

Another Look at the CI’s From this picture, we can see that the variation between sample means is greater than the variation in each sample; therefore, F is large.

F Statistic Equation Rewritten as a formula, the F Statistic looks like this: Means (Squared) Weighing Weighing Standard Deviations (Squared)

The F Statistic

Degrees of Freedom The ANOVA test has 2 degrees of freedom: N-I (Total number sampled – Number of Groups) I-1 (Number of Groups – 1) Some sample distributions with different degrees of freedom:

How About Our Example: Data: Level n Mean StDev Midsize 28 27.101 mpg 2.629 mpg SUV 26 20.423 mpg 2.914 mpg Pickup 8 23.125 mpg 2.588 mpg F value = 40.05 P-value = ~0 (Found from a table or using the Fcdf calculator command).

Conditions As useful as the ANOVA test is, we can only use it if a number of conditions are met: We must take an independent SRS from each population that we sample All populations have the same standard deviation. (No population’s standard deviation is double another’s) All of the populations must be normally distributed

Testing the Conditions The salesman had originally taken independent SRS’s. The second condition is fulfilled since no sample has more than twice the standard deviation of any other. To test the third condition, whether the populations being sampled are normally shaped, we must look at the histograms of each sample:

Therefore, all of the conditions are fulfilled. Sample Histograms All of the histograms appear to be relatively normally shaped. Therefore, all of the conditions are fulfilled.

Try a Problem Researchers are trying to see if the English AP scores from four different Massachusetts private schools are different. From each school, a random sample of students in the past year was taken and compared. Here are the results from the samples:

Results School n Mean StDev BB&N 23 4.3 0.4 Roxbury Latin 25 3.9 0.6 Winsor 26 4.2 0.3 Belmont Hill 29 3.1 0.3 Is there any significant difference between these schools’ AP English scores? (Assume that the populations are normally distributed)

Hypotheses H0: = µBB&N µRL = µWinsor = µBelHill The mean AP English Test scores in BB&N, Roxbury Latin, Winsor, and Belmont Hill are all the same. HA: The mean AP English Test scores in BB&N, Roxbury Latin, Winsor, and Belmont Hill are not all the same.

Conditions Random samples taken All of the standard deviations are the same No standard deviation is more than twice any other. All of the populations are normally distributed

Doing out the F Statistic

F Curve Plug the F statistic into the F distribution (df = 3, 99). The shaded area has a p-value of nearly 0.

Interpretation Since all the conditions were met, we have conclusive evidence (df = 3,99, p = 0) to reject the null hypothesis that the mean AP English Test scores in BB&N, Roxbury Latin, Winsor, and Belmont Hill are all the same.

Thanks For Watching A special thanks to Mr. Coons for all the help and advice.