Chapter 14: Analysis of Variance One-way ANOVA Lecture 8

Slides:



Advertisements
Similar presentations
Chapter 14 ANOVA 1.
Advertisements

Design of Experiments and Analysis of Variance
1 1 Slide © 2009, Econ-2030 Applied Statistics-Dr Tadesse Chapter 10: Comparisons Involving Means n Introduction to Analysis of Variance n Analysis of.
ANALYSIS OF VARIANCE.
Analysis of Variance Chapter Introduction Analysis of variance compares two or more populations of interval data. Specifically, we are interested.
Chapter 3 Analysis of Variance
Lecture 10 Inference about the difference between population proportions (Chapter 13.6) One-way analysis of variance (Chapter 15.2)
Lecture 12 One-way Analysis of Variance (Chapter 15.2)
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
QNT 531 Advanced Problems in Statistics and Research Methods
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
1 1 Slide Analysis of Variance Chapter 13 BA 303.
Analysis of Variance Chapter 12 Introduction Analysis of variance compares two or more populations of interval data. Specifically, we are interested.
Analysis of Variance ( ANOVA )
Analysis of Variance ST 511 Introduction n Analysis of variance compares two or more populations of quantitative data. n Specifically, we are interested.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
© Copyright McGraw-Hill CHAPTER 12 Analysis of Variance (ANOVA)
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
Testing Hypotheses about Differences among Several Means.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Marketing Research Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Chapter 13: Inferences about Comparing Two Populations Lecture 8b Date: 15 th November 2015 Instructor: Naveen Abedin.
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Chapter 14: Analysis of Variance One-way ANOVA Lecture 9a Instructor: Naveen Abedin Date: 24 th November 2015.
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 13 Analysis of Variance (ANOVA). ANOVA can be used to test for differences between three or more means. The hypotheses for an ANOVA are always:
Chapter 11 Analysis of Variance
CHAPTER 3 Analysis of Variance (ANOVA) PART 1
Analysis of Variance . Chapter 12.
Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
Keller: Stats for Mgmt & Econ, 7th Ed Analysis of Variance
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Chapter 4. Inference about Process Quality
i) Two way ANOVA without replication
Applied Business Statistics, 7th ed. by Ken Black
Statistics Analysis of Variance.
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Post Hoc Tests on One-Way ANOVA
Post Hoc Tests on One-Way ANOVA
Statistics for Business and Economics (13e)
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Econ 3790: Business and Economic Statistics
Comparing Three or More Means
Chapter 11 Analysis of Variance
Analysis of Variance (ANOVA)
Chapter 11: Introduction to Hypothesis Testing Lecture 5b
Chapter 11: Inference for Distributions of Categorical Data
Ch10 Analysis of Variance.
One-Way Analysis of Variance
Chapter 13 Group Differences
Chapter 13: Inferences about Comparing Two Populations Lecture 7a
Chapter 11: Introduction to Hypothesis Testing Lecture 5c
Chapter 13: Inferences about Comparing Two Populations
Facts from figures Having obtained the results of an investigation, a scientist is faced with the prospect of trying to interpret them. In some cases the.
Chapter 12: Introduction to Analysis of Variance
Chapter 26 Comparing Counts Copyright © 2009 Pearson Education, Inc.
Chapter 10 Introduction to the Analysis of Variance
Chapter 26 Comparing Counts.
Chapter 15 Analysis of Variance
Chapter 10 – Part II Analysis of Variance
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

Chapter 14: Analysis of Variance One-way ANOVA Lecture 8 Instructor: Naveen Abedin

Introduction So far, we have seen how the t-test for “difference in two population means” can be conducted to make inferences about comparing two populations. If we are interested in making statistical inferences about two or more populations, we can use a technique called the Analysis of Variance (ANOVA)

Introduction (cont.) The Analysis of Variance technique determines whether differences exist between population means. The procedure however works by analyzing the sample variances. The procedure was first applied in an experiment to determine whether different treatments of fertilizers produced different crop yields. So this procedure is designed primarily to determine whether there are significant differences between treatment means. Therefore, ANOVA analyzes the variance of sample data to determine whether we can infer that population means differ. When ANOVA is applied to Independent Samples, it is called One-Way Analysis of Variance.

Set up Consider k populations from which samples have been independently collected: Let us call the collection of this population j (j = 1, 2, 3…..k). This makes the collection of population parameters and . For each sample we can then calculate and

Example 1 The following example has been taken from pg 520 of the textbook: Do ownership of stocks vary by age? A financial analyst wanting to test this theory gathered a sample of 366 American households, and recorded the age of the head of the household, and asked them what proportion of its financial assets are invested in stock markets. The analyst then split the record according to four age categories. These age categories are referred to as different treatments.

Example 1: STEP 1 Step 1: Set up the test design Here we are dealing with four populations, so we have four parameters: The null hypothesis represents the scenario where there are no differences between population means: The ANOVA determines whether there is enough statistical evidence to show that the null hypothesis is false, so the alternative hypothesis will always be:

Example 1: STEP 2 Step 2: Calculate the test statistic Suppose that your study of interest has k different treatment populations. You have extracted a sample from each of these k populations, and each sample observation if xij. Total number of observations in each sample is ni. Grand mean is the addition of all observations from all samples divided by the total number of observations from all samples

Example 1: STEP 2 (cont.) For this study, the different components are: Response variable: percentage of assets invested in stocks (i.e. what we are observing) Experimental units: heads of households (the subject interviewed in our samples) Factor: age. There are 4 factor levels (the variable used to disaggregate our data)

Example 1: STEP 2 (Part 1) If the null hypothesis is true, the population means would all be equal, and so we would expect the sample means to be close to one another. If the alternative hypothesis is true, we would expect to observe large differences between sample means. The statistic that measures the proximity (closeness) of the sample means to each other is called the between-treatments variation. It is calculated using SST – Sum of Squares for Treatments. Sum of squares for treatments measures the variation that exists between the treatments, i.e. the variations between the different age categories.

Example 1: STEP 2 (Part 1 cont.) If the sample means are close to each other, then all of the sample means would be close to the grand mean, and as a result SST would be small. This goes by the intuition that if the sample means were all equal to one another, then SST = 0. A small value of SST would imply that the null hypothesis is probably true. the grand mean is not simply the average of all the sample means. it is adding up all the observations from all the samples and dividing by the total number of observations collected from all the samples.

Example 1: STEP 2 (Part 1 cont.) If large differences exist between the sample means, then at least some sample means differ considerably from the grand mean producing a large SST. And if SST is significantly large, then we can reject the null hypothesis in favor of alternative hypothesis. Thus the question is, what deems SST to be a significantly large figure.

Example 1: STEP 2 (Part 2) So the question we are now facing is how large does SST have to be for us to justify rejecting the null hypothesis? To answer this, we need to know how much variation exists within each sample, i.e. how much variation exists in the percentage of assets owned within each sample. This is measured by the within-treatments variation, denoted by SSE – Sum of Squares for Error. SSE provides a measure of the amount of variation in the response variable that is not caused by the treatments, e.g. other factors besides the age of household head that may influence the amount of stocks purchased, such as income, occupation, family size etc. All of these other sources of variation are lumped together under a single category called error. This source of variation is measured by the Sum of Squares Error. *remember response variable in our study is the amount of stocks owned

Example 1: STEP 2 (Part 2 cont.)

Example 1: STEP 2 (Part 2 cont.)

Example 1: STEP 2 (Part 3) The next step is to calculate the mean squares: The Mean Square for Treatments is computed by dividing SST by the number of treatments (k) minus 1. The Mean Square for Error is determined by dividing SSE by the total sample size (n) minus the number of treatments (k)

Example 1: STEP 2 (Part 4) Finally, the test statistic can now be calculated: The test statistic follows the F-distribution with v1 = k – 1 and v2 = n – k degrees of freedom, where k is the number of treatments and n is the number of observation in total in the entire study.

Example 1: STEP 3 and 4 Step 3: Calculate the Rejection Region The F-statistic is calculated to determine whether the value of SST is large enough to reject the null hypothesis. SST and F statistic are positively related, so a large enough test statistic will ensure if there is sufficient evidence present to reject the null hypothesis The null hypothesis is rejected if the test statistic, F is higher than Fα, k – 1, n – k . Step 4: The Decision: We have sufficient evidence to reject the null hypothesis as F = 2.79

ANOVA table The results of the analysis of variance are usually reported in an analysis of variance (ANOVA) table.

ANOVA table (cont.) For Example 1, the ANOVA table is: SS (Total) = SST + SSE SS (Total) / n – 1 = sample variance

Summary The analysis of variance tests to determine whether there is evidence of differences between two or more population means. (The t-test done in Chapter 13 can only be used to compare two population means only). SST: This measures the variation attributed to the differences between the treatment means (variation of each sample mean to the grand mean). If this is statistically significantly high, then we can reject the null hypothesis. SSE: This measures the variation within samples (differences attributed to other causes besides the factor). It represents the variations that is unexplained by the different treatments.