ANOVA: Single Factor Models



ANOVA
ANOVA (ANalysis Of VAriance) is a natural extension of the two-sample comparison of means, used to compare the means of more than 2 populations.
Basic Question: Even if the true means of, say, 4 populations were equal (i.e. μ1 = μ2 = μ3 = μ4), we cannot expect the sample means (x̄1, x̄2, x̄3, x̄4) to be equal. So when we get different values for the x̄'s,
– How much is due to randomness?
– How much is due to the fact that we are sampling from different populations with possibly different μj's?
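The basic question can be illustrated with a small simulation. This is only a sketch: the true mean (75), the standard deviation (10), the sample sizes, and the seed are all assumed values chosen for illustration. Four samples are drawn from populations with the same true mean, and their sample means still come out different.

```python
import random
from statistics import mean

random.seed(0)  # fixed seed so the run is repeatable

# Assumed setup: four normal populations that share the SAME true mean and sd.
TRUE_MEAN, TRUE_SD = 75.0, 10.0
sample_sizes = [7, 8, 6, 5]  # illustrative sample sizes

# Draw one random sample from each population.
samples = [[random.gauss(TRUE_MEAN, TRUE_SD) for _ in range(n)]
           for n in sample_sizes]
sample_means = [mean(s) for s in samples]

# The sample means differ even though the population means are equal.
for j, xbar in enumerate(sample_means, start=1):
    print(f"sample {j}: n = {len(samples[j - 1])}, mean = {xbar:.2f}")
```

Here all of the spread among the sample means is due to randomness, which is exactly the variability that ANOVA has to separate from real differences among the μj's.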

ANOVA TERMINOLOGY
Response Variable (y)
– What we are measuring
Experimental Units
– The individual units that we will measure
Factors
– Independent variables whose values can change to affect the outcome of the response variable, y
Levels of Factors
– Values of the factors
Treatments
– The combinations of the levels of the factors applied to an experimental unit

Example
We want to know how combinations of different amounts of water (1 ac-ft, 3 ac-ft, 5 ac-ft) and different fertilizers (A, B, C) affect crop yields.
Response variable
– Crop yield (bushels/acre)
Experimental unit
– Each acre that receives a treatment
Factors (2)
– Water and fertilizer
Levels (3 for Water; 3 for Fertilizer)
– Water: 1, 3, 5; Fertilizer: A, B, C
Treatments (9 = 3x3)
– 1A, 3A, 5A, 1B, 3B, 5B, 1C, 3C, 5C
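The treatments in this example are every combination of one water level with one fertilizer level. A minimal sketch of that enumeration follows; the variable names are made up for illustration.

```python
from itertools import product

water_levels = [1, 3, 5]             # acre-feet of water
fertilizer_levels = ["A", "B", "C"]  # fertilizer types

# One treatment = one water level combined with one fertilizer level.
treatments = [f"{w}{f}" for f, w in product(fertilizer_levels, water_levels)]

print(len(treatments), treatments)
# 9 ['1A', '3A', '5A', '1B', '3B', '5B', '1C', '3C', '5C']
```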

Single Factor ANOVA: Basic Assumptions
If we focus on only one factor (e.g. fertilizer type in the previous example), this is called single factor ANOVA.
– In this case, levels and treatments are the same thing since there are no combinations between factors.
Assumptions for Single Factor ANOVA
1. Each population in the comparison has a normal distribution
2. The standard deviations of the populations (although unknown) are assumed to be equal (i.e. σ1 = σ2 = ... = σk for k populations)
3. Sampling is:
– Random
– Independent

Example
The university would like to know if the delivery mode of the introductory statistics class affects performance in the class, as measured by the scores on the final exam. The class is given in four different formats:
– Lecture
– Text Reading
– Videotape
– Internet
The final exam scores from random samples of students from each of the four teaching formats were recorded.

Samples

Summary
– There is a single factor under observation: teaching format
– There are k = 4 different treatments (or levels of teaching format)
– The numbers of observations (experimental units) are n1 = 7, n2 = 8, n3 = 6, n4 = 5
– Total number of observations, n = 26

Why aren't all the x̄'s the same?
– There is variability due to the different treatments: Between Treatment Variability (Treatment)
– There is variability due to randomness within each treatment: Within Treatment Variability (Error)
BASIC CONCEPT
If the average Between Treatment Variability is "large" compared to the average Within Treatment Variability, we can reasonably conclude that there really are differences among the population means (i.e. at least one μj differs from the others).

Basic Questions
Given this basic concept, the natural questions are:
– What is "variability" due to treatment and due to error, and how are they measured?
– What is "average variability" due to treatment and due to error, and how are they measured?
– What is "large"? How much larger than the observed average variability due to error does the observed average variability due to treatment have to be before we are convinced that there are differences in the true population means (the μ's)?

How Is "Total" Variability Measured?
Variability is defined as the Sum of Square Deviations (from the grand mean). So,
SST (Total Sum of Squares)
– Sum of Squared Deviations of all observations from the grand mean
SSTr (Between Treatment Sum of Squares)
– Sum of Square Deviations Due to Different Treatments
SSE (Within Treatment Sum of Squares)
– Sum of Square Deviations Due to Error
SST = SSTr + SSE
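The identity SST = SSTr + SSE can be checked numerically. The sketch below uses made-up exam scores, not the data from the slides; only the group sizes (7, 8, 6, 5) follow the example, and the variable names are illustrative.

```python
from statistics import mean

# HYPOTHETICAL final exam scores, invented for illustration only.
scores = {
    "Lecture":   [82, 75, 91, 68, 77, 85, 73],
    "Text":      [70, 88, 64, 79, 81, 72, 90, 66],
    "Videotape": [78, 69, 84, 71, 80, 76],
    "Internet":  [65, 83, 74, 87, 71],
}

all_scores = [x for group in scores.values() for x in group]
grand_mean = mean(all_scores)

# Total: every observation against the grand mean.
sst = sum((x - grand_mean) ** 2 for x in all_scores)
# Between treatments: each treatment mean against the grand mean,
# weighted by the number of observations in that treatment.
sstr = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in scores.values())
# Within treatments: every observation against its own treatment mean.
sse = sum((x - mean(g)) ** 2 for g in scores.values() for x in g)

print(f"SST = {sst:.3f}, SSTr + SSE = {sstr + sse:.3f}")
```

The two printed values agree up to floating-point rounding, whatever scores are used.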

How is "Average" Variability Measured?
"Average" Variability is measured in Mean Square Values (MSTr and MSE)
– Found by dividing SSTr and SSE by their respective degrees of freedom
ANOVA TABLE
Variability                SS      DF      Mean Square (MS)
Between Tr. (Treatment)    SSTr    k-1     MSTr = SSTr/DFTr
Within Tr. (Error)         SSE     n-k     MSE = SSE/DFE
TOTAL                      SST     n-1
Degrees of freedom: DFTr = # treatments - 1; DFT = # observations - 1; DFE = DFT - DFTr
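The table can be assembled mechanically once SSTr, SSE, k and n are known. The following is a sketch only: the function name `anova_table` is hypothetical, and the sums of squares passed in (300 and 2200) are invented numbers chosen to make the arithmetic easy to follow.

```python
def anova_table(sstr, sse, k, n):
    """Return the rows of a single-factor ANOVA table as a list of dicts."""
    df_tr = k - 1         # DFTr = # treatments - 1
    df_t = n - 1          # DFT  = # observations - 1
    df_e = df_t - df_tr   # DFE  = DFT - DFTr = n - k
    return [
        {"source": "Treatment", "SS": sstr, "DF": df_tr, "MS": sstr / df_tr},
        {"source": "Error", "SS": sse, "DF": df_e, "MS": sse / df_e},
        {"source": "Total", "SS": sstr + sse, "DF": df_t, "MS": None},
    ]

# Invented sums of squares, with k = 4 treatments and n = 26 observations.
rows = anova_table(sstr=300.0, sse=2200.0, k=4, n=26)
for row in rows:
    ms = "" if row["MS"] is None else f"{row['MS']:.2f}"
    print(f"{row['source']:<10}{row['SS']:>10.2f}{row['DF']:>5}{ms:>10}")
```

With k = 4 and n = 26 the degrees of freedom come out as 3, 22 and 25, matching k-1, n-k and n-1 in the table.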

Formula for Calculating SST
SST is calculated just like the numerator of the variance, assuming all (26) entries come from one population.
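In symbols, the usual way to write this is shown below. The notation is an assumption, not taken from the slides: x_{ij} is the i-th observation in treatment j, n_j is the number of observations in treatment j, k is the number of treatments, and \bar{\bar{x}} is the grand mean of all n observations.

```latex
SST = \sum_{j=1}^{k} \sum_{i=1}^{n_j} \left( x_{ij} - \bar{\bar{x}} \right)^2 ,
\qquad
\bar{\bar{x}} = \frac{1}{n} \sum_{j=1}^{k} \sum_{i=1}^{n_j} x_{ij}
```

Dividing SST by n - 1 would give the ordinary sample variance of the pooled observations.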

Formula for Calculating SSTr (Between Treatment Variability)
Replace all entries within each treatment by the treatment mean. Now all the variability is between (not within) treatments.
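Written out, replacing each of the n_j entries in treatment j by the treatment mean gives the standard expression below. The notation is assumed for illustration: \bar{x}_j is the sample mean of treatment j, n_j its number of observations, k the number of treatments, and \bar{\bar{x}} the grand mean.

```latex
SSTr = \sum_{j=1}^{k} \sum_{i=1}^{n_j} \left( \bar{x}_j - \bar{\bar{x}} \right)^2
     = \sum_{j=1}^{k} n_j \left( \bar{x}_j - \bar{\bar{x}} \right)^2
```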

Formula for Calculating SSE (Within Treatment Variability)
SSE is the difference between SST and SSTr: SSE = SST - SSTr
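SSE can also be computed directly from the deviations within each treatment, which is a useful check on the subtraction. The notation below is assumed: x_{ij} is the i-th observation in treatment j, \bar{x}_j and s_j^2 are the sample mean and sample variance of treatment j, n_j is its number of observations, and k is the number of treatments.

```latex
SSE = SST - SSTr
    = \sum_{j=1}^{k} \sum_{i=1}^{n_j} \left( x_{ij} - \bar{x}_j \right)^2
    = \sum_{j=1}^{k} (n_j - 1)\, s_j^2
```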

Can We Conclude a Difference Among the 4 Teaching Formats?
We conclude that at least one population mean differs from the others if the average between treatment variability is large compared to the average within treatment variability, that is, if MSTr/MSE is "large".
The ratio of the two measures of variability for these normally distributed random variables has an F distribution, and the F-statistic (= MSTr/MSE) is compared to a critical F-value from an F distribution with:
– Numerator degrees of freedom = DFTr
– Denominator degrees of freedom = DFE
If the ratio of MSTr to MSE (the F-statistic) exceeds the critical F-value, we can conclude that at least one population mean differs from the others.

Can We Conclude Different Teaching Formats Affect Final Exam Scores?
The F-test
H0: μ1 = μ2 = μ3 = μ4
HA: At least one μj differs from the others
Select α = .05.
Reject H0 (Accept HA) if: F = MSTr/MSE > F.05,DFTr,DFE = F.05,3,22
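The decision rule can be sketched in a few lines. Two things here are assumptions: the exam scores are made up (the slide data is not reproduced in this transcript), and the critical value 3.05 is the approximate .05 upper-tail value for 3 and 22 degrees of freedom as read from a standard F table.

```python
from statistics import mean

# HYPOTHETICAL scores for the four formats (sizes 7, 8, 6, 5), for illustration.
groups = [
    [82, 75, 91, 68, 77, 85, 73],      # Lecture
    [70, 88, 64, 79, 81, 72, 90, 66],  # Text Reading
    [78, 69, 84, 71, 80, 76],          # Videotape
    [65, 83, 74, 87, 71],              # Internet
]

k = len(groups)                  # number of treatments
n = sum(len(g) for g in groups)  # total number of observations
grand_mean = mean(x for g in groups for x in g)

sstr = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
sse = sum((x - mean(g)) ** 2 for g in groups for x in g)

mstr = sstr / (k - 1)  # average between treatment variability
mse = sse / (n - k)    # average within treatment variability
f_stat = mstr / mse

F_CRIT = 3.05  # approximate F(.05; 3, 22) from a standard F table (assumed)
reject_h0 = f_stat > F_CRIT

print(f"F = {f_stat:.3f}, critical value = {F_CRIT}, reject H0: {reject_h0}")
```

With these made-up scores the treatment means are close together relative to the spread within each group, so F falls below the critical value and H0 is not rejected.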

Hand Calculations for the F-test
Cannot conclude there is a difference among the μj's

Excel Approach

EXCEL OUTPUT
The p-value is greater than .05, so we cannot conclude that there are differences among the means.

REVIEW
ANOVA Situation and Terminology
– Response variable, Experimental Units, Factors, Levels, Treatments, Error
Basic Concept
– If the "average variability" between treatments is "a lot" greater than the "average variability" due to error, conclude that at least one mean differs from the others.
Single Factor Analysis
– By Hand
– By Excel