Chapter 13 Analysis of Variance (ANOVA). ANOVA can be used to test for differences between three or more means. The hypotheses for an ANOVA are always:

Slides:



Advertisements
Similar presentations
Analysis of Variance (ANOVA) ANOVA can be used to test for the equality of three or more population means We want to use the sample results to test the.
Advertisements

1 Chapter 10 Comparisons Involving Means  1 =  2 ? ANOVA Estimation of the Difference between the Means of Two Populations: Independent Samples Hypothesis.
1 1 Slide Slides by JOHN LOUCKS St. Edward’s University.
Chapter 10 Comparisons Involving Means
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2009, Econ-2030 Applied Statistics-Dr Tadesse Chapter 10: Comparisons Involving Means n Introduction to Analysis of Variance n Analysis of.
Analysis of Variance: Inferences about 2 or More Means
PSY 307 – Statistics for the Behavioral Sciences
Chapter 17 Analysis of Variance
1 Pertemuan 13 Analisis Ragam (Varians) - 2 Matakuliah: I0272 – Statistik Probabilitas Tahun: 2005 Versi: Revisi.
1 Pertemuan 10 Analisis Ragam (Varians) - 1 Matakuliah: I0262 – Statistik Probabilitas Tahun: 2007 Versi: Revisi.
ANOVA Single Factor Models Single Factor Models. ANOVA ANOVA (ANalysis Of VAriance) is a natural extension used to compare the means more than 2 populations.
AM Recitation 2/10/11.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 3 月 30 日 第八週:變異數分析與實驗設計.
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 13 Experimental Design and Analysis of Variance nIntroduction to Experimental Design.
1 1 Slide Analysis of Variance Chapter 13 BA 303.
12-1 Chapter Twelve McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
Lesson 2. ANOVA Analysis of Variance One Way ANOVA One variable is measured for many different treatments (population)_ Null Hypothesis: all population.
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
Chapter 19 Analysis of Variance (ANOVA). ANOVA How to test a null hypothesis that the means of more than two populations are equal. H 0 :  1 =  2 =
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Lecture 9-1 Analysis of Variance
Previous Lecture: Phylogenetics. Analysis of Variance This Lecture Judy Zhong Ph.D.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
© 2006 by Thomson Learning, a division of Thomson Asia Pte Ltd.. 1 Slide Slide Slides Prepared by Juei-Chao Chen Fu Jen Catholic University Slides Prepared.
ANalysis Of VAriance can be used to test for the equality of three or more population means. H 0 :  1  =  2  =  3  = ... =  k H a : Not all population.
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
DSCI 346 Yamasaki Lecture 4 ANalysis Of Variance.
1 Pertemuan 19 Analisis Varians Klasifikasi Satu Arah Matakuliah: I Statistika Tahun: 2008 Versi: Revisi.
Rancangan Acak Lengkap ( Analisis Varians Klasifikasi Satu Arah) Pertemuan 16 Matakuliah: I0184 – Teori Statistika II Tahun: 2009.
Chapter 11 Analysis of Variance
Chapter 11 Created by Bethany Stubbe and Stephan Kogitz.
CHAPTER 3 Analysis of Variance (ANOVA) PART 1
Pertemuan 17 Analisis Varians Klasifikasi Satu Arah
CHAPTER 3 Analysis of Variance (ANOVA) PART 1
Chapter 10 Two-Sample Tests and One-Way ANOVA.
i) Two way ANOVA without replication
Applied Business Statistics, 7th ed. by Ken Black
CHAPTER 3 Analysis of Variance (ANOVA)
Statistics Analysis of Variance.
CHAPTER 12 ANALYSIS OF VARIANCE
Statistics for Business and Economics (13e)
Econ 3790: Business and Economic Statistics
Chapter 11 Analysis of Variance
Analysis of Variance Objective
Chapter 15 Analysis of Variance
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

Chapter 13 Analysis of Variance (ANOVA)

ANOVA can be used to test for differences between three or more means. The hypotheses for an ANOVA are always: H 0 :  1 =  2 =... =  k (where k is the number of groups) H a : Not all of the population means are equal

Analysis of Variance: Assumptions For each population the response variable is normally distributed The variance of the response variable,  2, is the same for all of the populations The observations must be independent

Analysis of Variance (ANOVA) The ANOVA hypothesis test is based on a comparison of the variation between groups (treatments) and within groups (treatments).

Between vs. Within Variation FamilyCarsAttribute 1Attribute 2 11AX 21AY 31AZ 42BX 52BY 62BZ 73CX 83CY 93CZ

Between vs. Within Variation Group AGroup BGroup C Case 1: All variation is due to differences between groups

Between vs. Within Variation Group XGroup YGroup Z Case 2: All variation is due to differences within groups

Analysis of Variance (ANOVA) If the variation is primarily due to differences between groups then we would conclude the means are different and reject H 0. If the variation is primarily due to differences within the groups then we would conclude the means are the same and accept H 0.

Analysis of Variance (ANOVA) The relative sizes of between and within group variation are measured by comparing two estimates of the variance. ns x̄ 2 is used to estimate n  x̄ 2 and  2. If the means are equal ns x̄ 2 will be an unbiased estimator of  2. If the means are not equal ns x̄ 2 will overestimate  2.

Sampling Distribution of Given H 0 is True   Sample means are close together because they are drawn from the same sampling distribution when H 0 is true. ANOVA

Sampling Distribution of Given H 0 is False 33 33 11 11 22 22 Sample means come from different sampling distributions and are not as close together when H 0 is false. ANOVA

Analysis of Variance (ANOVA) The second way of estimating the population variance is to find the average of the variances of the different groups. This approach provides an unbiased estimate regardless whether or not the null hypothesis is true.

Analysis of Variance (ANOVA) If we take the ratio of the two approaches we have a measure that has an expected value of 1 if the null hypothesis is true. It will be larger than 1 if the null hypothesis is false.

Analysis of Variance (ANOVA) If the null hypothesis is true and the conditions for conducting the ANOVA test are met then the sampling distribution of the ratio is an F distribution with k-1 degrees of freedom in the numerator and n T – k degrees of freedom in the denominator.

F Distribution 

As before  is the probability of rejecting H 0 when it is true (the probability of making a Type I error). F  is the critical value such that an area equal to  lies in the upper-most tail. For example, with 5 degrees of freedom in the denominator and 10 degrees of freedom in the numerator, an F value of 4.74 would capture an area equal to 0.05 in the tail.

ANOVA Hypothesis Test The steps for conducting an ANOVA hypothesis test are the same as for conducting a hypothesis test of the mean: 1.State the hypotheses 2.State the rejection rule 3.Calculate the test statistic 4.State the result of the test and its implications

ANOVA Table Typically when we do the test we organize the calculations in table with a specific format: Source of variation Sum of Squares Degrees of freedom Mean Square F TreatmentsSSTRk-1MSTRF ErrorSSEn T -kMSE TotalSSTn T -1

Between-Treatments Estimate of Population Variance Denominator is the degrees of freedom associated with SSTR Numerator is called the sum of squares due to treatments (SSTR) The estimate of  2 based on the variation of the sample means is called the mean square due to treatments and is denoted by MSTR.

Between-Treatments Estimate of Population Variance Assume there are the same number of observations in each group so that n j = n.

The estimate of  2 based on the variation of the sample observations within each sample is called the mean square error and is denoted by MSE. Within-Treatments Estimate of Population Variance s 2 Denominator is the degrees of freedom associated with SSE Numerator is called the sum of squares due to error (SSE)

Assume there are the same number of observations in each group so that n j = n and n T = nk. Within-Treatments Estimate of Population Variance

ANOVA Table With the entire data set as one sample, the formula for computing the total sum of squares, SST, is: SST divided by its degrees of freedom n T -1 is the overall sample variance that would be obtained if we treated the entire set of observations as one data set.

Source of Variation Sum of Squares Degrees of Freedom Mean Square F Treatments Error Total k - 1 n T - 1 SSTR SSE SST n T - k SST is partitioned into SSTR and SSE. SST’s degrees of freedom (d.f.) are partitioned into SSTR’s d.f. and SSE’s d.f. ANOVA Table

ABC Assume we are interested in finding if the average number of cars owned is different for three different towns. Five people are interviewed in each town. Assume  =.05 ANOVA Example

Analysis of Variance (ANOVA) H 0 :  1 =  2 =  3 H a : Not all of the population means are equal Reject H 0 if: F > F  F > 3.89 Given k - 1 = = 2 df in the numerator and n T - k = = 12 df in the denominator

ABC Mean= ANOVA Example = ( )/3 = 9/3 = 3

Analysis of Variance (ANOVA) SSTR = 5(2-3) 2 + 5(2.2-3) 2 + 5(4.8-3) 2 = 24.4 SSE = (0-2) 2 + (1-2) 2 + (6-2) 2 + (2-2) 2 + (1-2) 2 + (2-2.2) 2 + (3-2.2) 2 + (4-2.2) 2 + (2-2.2) 2 + (0-2.2) 2 + (12-4.8) 2 + (0-4.8) 2 + (2-4.8) 2 + (6-4.8) 2 + (4-4.8) 2 =115.6

ANOVA Table Source of variation Sum of Squares Degrees of freedom Mean Square F Treatments Error Total14014

Anova: Single Factor SUMMARY GroupsCountSumAverageVariance Column Column Column ANOVA Source of VariationSSdfMSFP-valueF crit Between Groups Within Groups Total14014

ABC Assume we are interested in finding if the average number of bedrooms per home is different for three different towns. Data on four houses were collected in each town. Assume  =.05 Graded Homework P. 401, #7 (just do a hypothesis test)