 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.

Slides:

Advertisements

Similar presentations

Statistical Techniques I

Advertisements

BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.

Hypothesis Testing Steps in Hypothesis Testing:

Chapter 12 ANALYSIS OF VARIANCE.

Analysis and Interpretation Inferential Statistics ANOVA

Ka-fu Wong © 2003 Chap Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.

1 1 Slide © 2009, Econ-2030 Applied Statistics-Dr Tadesse Chapter 10: Comparisons Involving Means n Introduction to Analysis of Variance n Analysis of.

Experimental Design & Analysis

Statistics Are Fun! Analysis of Variance

Lesson #23 Analysis of Variance. In Analysis of Variance (ANOVA), we have: H 0 :  1 =  2 =  3 = … =  k H 1 : at least one  i does not equal the others.

Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications

Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson10-1 Lesson 10: Analysis of Variance.

Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.

AM Recitation 2/10/11.

HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.

Chapter 13 – 1 Chapter 12: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Errors Testing the difference between two.

Analysis of Variance or ANOVA. In ANOVA, we are interested in comparing the means of different populations (usually more than 2 populations). Since this.

12-1 Chapter Twelve McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.

1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.

1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)

1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.

Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.

Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.

12-1 Chapter Twelve McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.

Chapter 10 Analysis of Variance.

1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.

Chapter 19 Analysis of Variance (ANOVA). ANOVA How to test a null hypothesis that the means of more than two populations are equal. H 0 :  1 =  2 =

Chapter 12 Analysis of Variance. An Overview We know how to test a hypothesis about two population means, but what if we have more than two? Example:

Testing Differences in Population Variances

© Copyright McGraw-Hill 2000

Analysis of Variance (One Factor). ANOVA Analysis of Variance Tests whether differences exist among population means categorized by only one factor or.

Previous Lecture: Phylogenetics. Analysis of Variance This Lecture Judy Zhong Ph.D.

Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.

Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.

Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.

Random samples of size n 1, n 2, …,n k are drawn from k populations with means  1,  2,…,  k and with common variance  2. Let x ij be the j-th measurement.

12-1 Chapter Twelve McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.

Inferences Concerning Variances

Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal

Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal

CHAPTER 12 ANALYSIS OF VARIANCE Prem Mann, Introductory Statistics, 7/E Copyright © 2010 John Wiley & Sons. All right reserved.

Statistics for Political Science Levin and Fox Chapter Seven

1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.

Significance Tests for Regression Analysis. A. Testing the Significance of Regression Models The first important significance test is for the regression.

Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.

CHAPTER 7: TESTING HYPOTHESES Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.

 What is Hypothesis Testing?  Testing for the population mean  One-tailed testing  Two-tailed testing  Tests Concerning Proportions  Types of Errors.

Analysis of Variance. The F Distribution Uses of the F Distribution – test whether two samples are from populations having equal variances – to compare.

Chapter 9 Introduction to the t Statistic

Analysis of Variance Chapter 12 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.

Chapter 13 Analysis of Variance (ANOVA). ANOVA can be used to test for differences between three or more means. The hypotheses for an ANOVA are always:

Chapter 11 Created by Bethany Stubbe and Stephan Kogitz.

Analysis of Variance . Chapter 12.

Characteristics of F-Distribution

Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing

i) Two way ANOVA without replication

Basic Practice of Statistics - 5th Edition

Statistics Analysis of Variance.

Post Hoc Tests on One-Way ANOVA

Post Hoc Tests on One-Way ANOVA

CHAPTER 12 ANALYSIS OF VARIANCE

Econ 3790: Business and Economic Statistics

Analysis of Variance.

Chapter 10 – Part II Analysis of Variance

Statistical Inference for the Mean: t-test

Quantitative Methods ANOVA.

Week ANOVA Four.

Presentation transcript:

 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss the general idea of analysis of variance.  Organize data into an ANOVA table.  Conduct a test of hypothesis among three or more treatment means. Our Objectives

 The F Distribution.  Comparing Two Population Variances.  The ANOVA Test-One Way.  Inferences about Pairs of Treatment Means. Analysis of Variance

 A probability distribution used to test whether two samples are from populations having equal variances.  Also used when comparing several population means simultaneously using a technique called analysis of variance (ANOVA).  In both above situations, the populations must follow a normal distribution. The F Distribution

 There is a family of F distributions.  Each determined by 2 parameters: the degrees of freedom (df) in the numerator and the df in the denominator  Ex: df=(19,6)  Its shape changes as the df change Characteristics of the F Distribution

 The distribution is continuous.  It cannot be negative.  It is positively skewed, or skewed to the right  The long tail of the distribution is to the right-hand side  As the df in both numerator and denominator increases, the F- distribution tends to a Normal distribution  It is asymptotic.  Appendix B.4 calculates the Critical Values for a 0.05 and 0.01 level of significance (α). Characteristics of the F Distribution

α = 0.05 Critical Value = 3.87 Accept H 0 For an F(6,7), the area after 3.87 is 0.05.

 We assume that the ratio of the sample SDs follow an F distribution with n 1 -1 and n 2 -1 degrees of freedom. Comparing Two Population Variances. H 0 : σ 1 2 = σ 2 2 H 1 : σ 1 2 ≠ σ 2 2

Hypothesis Testing (Two-Tailed) H 0 : σ 1 2 = σ 2 2 H 1 : σ 1 2 ≠ σ 2 2 α/2 Critical Value Accept H 0  This is a Two-tailed test. So divide the significance level in half (α/2).  Select samples: n 1 observations from the first population and n 2 from the second.  Test statistic for comparing two variances is F o s 1 squared and s 2 squared are samples variances. o Use n 1 -1 and n 2 -1 degrees of freedom in Table in App B.4. o The larger sample variance is placed in numerator.

Hypothesis Testing (Two-Tailed) H 0 : σ 1 2 = σ 2 2 H 1 : σ 1 2 ≠ σ 2 2 α/2 Critical Value Accept H 0 If F is less than the Critical Value, then we fail to reject H 0  By putting the larger variance value in the numerator, we are forcing the F ratio to be at least 1. This allows us to always use the right tail of distribution.

Hypothesis Testing (One-Tailed) H 0 : σ 1 2 ≤ σ 2 2 H 1 : σ 1 2 > σ 2 2 α Critical Value Accept H 0 If F is less than the Critical Value, then we fail to reject H 0

Example 4, page 412. First Population: A random sample of five observations resulted in a standard deviation of 12. Second Population: A random sample of seven observations resulted in a standard deviation of 7. At the 0.01 significance level, is there more variation in the first population? H 0 : σ 1 2 ≤ σ 2 2 H 1 : σ 1 2 > σ 2 2

Example 4, page Critical Value=9.15 Accept H 0 F is less than the Critical Value, then we fail to reject H 0 H 0 : σ 1 2 ≤ σ 2 2 H 1 : σ 1 2 > σ 2 2 Calculating the Critical Value: o α=0.01 o one-tailed test o df numerator = 5-1 = 4 o df denominator = 7-1 = 6 o Using App B.4, CV=9.15

We also use F statistic in ANOVA to compare three or more population means to determine whether they could be equal.  Assumptions for using ANOVA:  Populations follow normal distribution.  Populations have equal SD.  Populations are independent.  Is there a difference in the means between populations (treatments)? ANOVA (Analysis of Variance)

Consider, ANOVA (Analysis of Variance) H 0 : μ 1 = μ 2 = μ 3 = … = μ n H 1 : At least two of the means are not equal

Example A manager of a regional financial center wishes to compare the productivity, as measured by the number of customers served, among three employees. Four days are randomly selected and the number of customers served by each employee is recorded. Is there a difference in the mean number of customers served? WolfWhiteKorosa

Comparing productivity of 3 employees (different population means)

If the population means are the same

The ANOVA Test  Objective: to determine whether the various sample means came from a single population or populations with different means.  How does ANOVA work?  we compare sample means through their variances.  Use the ANOVA assumption that the populations SD’s are equal.  Estimate the population variance two ways:  If ratio is equal to 1, we conclude that population means are the same. If quite different from 1, conclude that population means are different.  ANOVA tests were first developed for applications in agriculture.  This is reflected in the use of the term treatment to identify the different populations being studied.

 State the null and alternate hypothesis.  Given the significance level α and using the F distribution, find the Critical Value.  Identify the Accept and Reject Regions.  Compute the statistic F using an ANOVA table.  Check if the statistic F falls in Accept or Reject Region. Hypothesis testing using ANOVA H 0 : μ 1 = μ 2 = μ 3 = … = μ n H 1 : At least two of the means are not equal

F statistic: The first estimate of the population variance is based on the treatments, that is, the difference between the means. The second estimate of the population variance is the estimate within the treatments. The F statistic

 Let be the overall grand mean.  Let be the mean for treatment c.  Compute three measures of variance (SS total, SSE and SST)  SS total: The sum of the squared error between each sample and the overall mean.  SSE: The sum of the squared error between each sample and its treatment (group) mean. ANOVA (Computing F).

 SST: The sum of the square error of each treatment mean and the overall mean weighted by the number of observations in each treatment.  SS total = SST + SSE ANOVA (Computing F).

SST WolfWhiteKoros a

ANOVA (Computing F). SSE WolfWhiteKorosa (55-56) 2 (66-70) 2 (47-48) 2 (54-56) 2 (76-70) 2 (51-48) 2 (59-56) 2 (67-70) 2 (46-48) 2 (56-56) 2 (71-70) 2 (48-48) 2 Sum = 14Sum=62Sum=14

ANOVA (Computing F). SS total WolfWhiteKorosa (55-58) 2 (66-58) 2 (47-58) 2 (54-58) 2 (76-58) 2 (51-58) 2 (59-58) 2 (67-58) 2 (46-58) 2 (56-58) 2 (71-58) 2 (48-58) 2

ANOVA (Computing F). ANOVA Table Sum of Squares Degrees of freedom Mean SquareF SSTk-1SST/(k-1)=MSTMST/MSE SSEn-kSSE/(n-k)=MSE SS totaln-1 k = the number of treatments n = the overall sample size k-1 = degrees of freedom in the numerator n-k = degrees of freedom in the denominator

ANOVA (Computing F). ANOVA Table Sum of Squares Degrees of freedom Mean SquareF 992k-1=2992/2=496496/10= n-k=990/9= k =3 n = 12

Example 8, page 421. T1: T2: T3: Use a 0.05 significance level. H 0 : μ 1 = μ 2 = μ 3 H 1 : At least two of the means are not equal

Example 8, page 421. Using a 0.05 significance level. => Critical Value = 3.89 H 0 : μ 1 = μ 2 = μ 3 H 1 : At least two of the means are not equal k = the number of treatments = 3 n = the overall sample size = 15 k-1 = degrees of freedom in the numerator = 2 n-k = degrees of freedom in the denominator = Critical Value = 3.89 Accept H 0 F(2,12)

Example 8, page 421. SSE = (9-9.67) 2 + (7-9.67) 2 + ( ) 2 + (9-9.67) 2 + ( ) 2 + ( ) 2 + (13-15) 2 + (20-15) 2 + (14-15) 2 + (13-15) 2 + ( ) 2 + ( ) 2 + ( ) 2 + ( ) 2 + ( ) 2 => SSE = SST = 6( ) 2 + 4( ) 2 + 5( ) 2 = SS total = =

ANOVA Table Sum of Squares Degrees of freedom Mean SquareF SST=70.4k-1 =2SST/(k-1)=MST = 35.2 MST/MSE = 5.12 SSE=82.53n-k = 15-3 =12SSE/(n-k)=MSE = 6.88 SS total= n-1=14 Example 8, page > CV= 3.89, so we reject the null hypothesis. At least two of the treatments population means are not the same.

Inferences about Pairs of Treatment Means  When we reject the null hypothesis and conclude that all the treatment (or population) means are not equal, we may want to know which treatment means differ.  Students opinions ex.: if the students opinion do differ, the question is: Between which groups do the treatment means differ?  The simplest way to answer this question is through confidence intervals.

 To check if there is a difference between μ 1 and μ 2, we construct a Confidence interval using, is the mean of 1 st sample; is the mean of 2 nd sample; t is obtained from Appendix B.2 with n-k degrees of freedom; MSE is obtained from ANOVA table [SSE/(n-k)]; n 1 and n 2 are number of observations in 1 st and 2 nd sample. Inferences about Pairs of Treatment Means (Confidence interval for the difference in treatment means)

 We compute confidence interval limits.  If confidence interval includes zero, we conclude that there is no difference between the treatment means.  If CI does not include zero, we conclude that there is a difference between the treatment means. Inferences about Pairs of Treatment Means

Example 11, page 425. T1: T2: T3: Use a 0.05 significance level. H 0 : μ 1 = μ 2 = μ 3 H 1 : At least two of the means are not equal

Example 11, page 425. Using a 0.05 significance level. => Critical Value = 4.26 H 0 : μ 1 = μ 2 = μ 3 H 1 : At least two of the means are not equal k = the number of treatments = 3 n = the overall sample size = 12 k-1 = degrees of freedom in the numerator = 2 n-k = degrees of freedom in the denominator = Critical Value = 4.26 Accept H 0 F(2,9)

Example 8, page 421. SS total = =

ANOVA Table Sum of Squares Degrees of freedom Mean SquareF SST=107.2k-1 =2SST/(k-1)=MST = 53.6 MST/MSE = SSE=9.47n-k = 15-3 =9SSE/(n-k)=MSE = 1.05 SS total= n-1=14 Example 8, page > CV =4.26, so we reject the null hypothesis. At least two of the treatments population means are not the same.

Example 8, page 421. Can we conclude that treatment 1 and treatment 2 differ using a 95% level of confidence? Confidence Interval: (t has n-k = 9 df => t=2.262) CI does not include 0, so we can conclude that T1 and T2 have different means.