Introduction to Analysis of Variance (ANOVA)

Slides:



Advertisements
Similar presentations
Multiple-choice question
Advertisements

© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 12: Analysis of Variance: Differences among Means of Three or More Groups.
Smith/Davis (c) 2005 Prentice Hall Chapter Thirteen Inferential Tests of Significance II: Analyzing and Interpreting Experiments with More than Two Groups.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Independent Sample T-test Formula
Lecture 10 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
ANOVA Analysis of Variance: Why do these Sample Means differ as much as they do (Variance)? Standard Error of the Mean (“variance” of means) depends upon.
Experimental Design & Analysis
Analysis of Variance: Inferences about 2 or More Means
Chapter 3 Analysis of Variance
PSY 307 – Statistics for the Behavioral Sciences
Intro to Statistics for the Behavioral Sciences PSYC 1900
Lecture 9: One Way ANOVA Between Subjects
One-way Between Groups Analysis of Variance
Anthony J Greene1 ANOVA: Analysis of Variance 1-way ANOVA.
Statistics for the Social Sciences
Comparing Several Means: One-way ANOVA Lesson 14.
Statistical Methods in Computer Science Hypothesis Testing II: Single-Factor Experiments Ido Dagan.
The t-test Inferences about Population Means when population SD is unknown.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Repeated ANOVA. Outline When to use a repeated ANOVA How variability is partitioned Interpretation of the F-ratio How to compute & interpret one-way ANOVA.
ANOVA Chapter 12.
The basic idea So far, we have been comparing two samples
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
ANOVA Greg C Elvers.
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
Chapter 13: Introduction to Analysis of Variance
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
© Copyright McGraw-Hill CHAPTER 12 Analysis of Variance (ANOVA)
PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Factor Analysis of Variance (ANOVA)
One-way Analysis of Variance 1-Factor ANOVA. Previously… We learned how to determine the probability that one sample belongs to a certain population.
Between-Groups ANOVA Chapter 12. >When to use an F distribution Working with more than two samples >ANOVA Used with two or more nominal independent variables.
Testing Hypotheses about Differences among Several Means.
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
I. Statistical Tests: A Repetive Review A.Why do we use them? Namely: we need to make inferences from incomplete information or uncertainty þBut we want.
Statistics for the Social Sciences Psychology 340 Fall 2013 Tuesday, October 15, 2013 Analysis of Variance (ANOVA)
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
One-Way Analysis of Variance
Comparing Three or More Means ANOVA (One-Way Analysis of Variance)
Statistics for the Social Sciences Psychology 340 Fall 2012 Analysis of Variance (ANOVA)
One-Way ANOVA ANOVA = Analysis of Variance This is a technique used to analyze the results of an experiment when you have more than two groups.
General Linear Model 2 Intro to ANOVA.
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Analysis of Variance (One Factor). ANOVA Analysis of Variance Tests whether differences exist among population means categorized by only one factor or.
Previous Lecture: Phylogenetics. Analysis of Variance This Lecture Judy Zhong Ph.D.
1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.
Three Broad Purposes of Quantitative Research 1. Description 2. Theory Testing 3. Theory Generation.
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
Psy 230 Jeopardy Related Samples t-test ANOVA shorthand ANOVA concepts Post hoc testsSurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
Research Methods and Data Analysis in Psychology Spring 2015 Kyle Stephenson.
Introduction to ANOVA Research Designs for ANOVAs Type I Error and Multiple Hypothesis Tests The Logic of ANOVA ANOVA vocabulary, notation, and formulas.
Chapter 9 Introduction to the Analysis of Variance Part 1: Oct. 22, 2013.
Analysis of Variance 11/6. Comparing Several Groups Do the group means differ? Naive approach – Independent-samples t-tests of all pairs – Each test doesn't.
O A post-hoc test is needed after we complete an ANOVA in order to determine which groups differ from each other. o Do not conduct a post-hoc test unless.
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.
ANOVA: Why analyzing variance to compare means?.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 10 Introduction to the Analysis.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 4 Investigating the Difference in Scores.
Between-Groups ANOVA Chapter 12. Quick Test Reminder >One person = Z score >One sample with population standard deviation = Z test >One sample no population.
CHAPTER 10: ANALYSIS OF VARIANCE(ANOVA) Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Inferential Statistics Psych 231: Research Methods in Psychology.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Chapter 12 Introduction to Analysis of Variance
Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
Inferential Statistics
Introduction to ANOVA.
Psych 231: Research Methods in Psychology
Presentation transcript:

Introduction to Analysis of Variance (ANOVA) One-Way ANOVA Introduction to Analysis of Variance (ANOVA)

What is ANOVA? ANOVA is short for ANalysis Of VAriance Used with 3 or more groups to test for MEAN DIFFS. E.g., caffeine study with 3 groups: No caffeine Mild dose Jolt group Level is value, kind or amount of IV Treatment Group is people who get specific treatment or level of IV Treatment Effect is size of difference in means

Rationale for ANOVA (1) We have at least 3 means to test, e.g., H0: 1 = 2 = 3. Could take them 2 at a time, but really want to test all 3 (or more) at once. Instead of using a mean difference, we can use the variance of the group means about the grand mean over all groups. Logic is just the same as for the t-test. Compare the observed variance among means (observed difference in means in the t-test) to what we would expect to get by chance.

Rationale for ANOVA (2) Suppose we drew 3 samples from the same population. Our results might look like this: Note that the means from the 3 groups are not exactly the same, but they are close, so the variance among means will be small.

Rationale for ANOVA (3) Suppose we sample people from 3 different populations. Our results might look like this: Note that the sample means are far away from one another, so the variance among means will be large.

Rationale for ANOVA (4) Suppose we complete a study and find the following results (either graph). How would we know or decide whether there is a real effect or not? To decide, we can compare our observed variance in means to what we would expect to get on the basis of chance given no true difference in means.

Review When would we use a t-test versus 1-way ANOVA? In ANOVA, what happens to the variance in means (between cells) if the treatment effect is large?

Rationale for ANOVA We can break the total variance in a study into meaningful pieces that correspond to treatment effects and error. That’s why we call this Analysis of Variance. Definitions of Terms Used in ANOVA: The Grand Mean, taken over all observations. The mean of any level of a treatment. The mean of a specific level (1 in this case) of a treatment. The observation or raw data for the ith person.

A treatment or IV effect The ANOVA Model A treatment effect is the difference between the overall, grand mean, and the mean of a cell (treatment level). Error is the difference between a score and a cell (treatment level) mean. The ANOVA Model: An individual’s score A treatment or IV effect The grand mean is + Error +

A treatment or IV effect The ANOVA Model The grand mean A treatment or IV effect Error The graph shows the terms in the equation. There are three cells or levels in this study. The IV effect and error for the highest scoring cell is shown.

ANOVA Calculations Sums of squares (squared deviations from the mean) tell the story of variance. The simple ANOVA designs have 3 sums of squares. The total sum of squares comes from the distance of all the scores from the grand mean. This is the total; it’s all you have. The within-group or within-cell sum of squares comes from the distance of the observations to the cell means. This indicates error. The between-cells or between-groups sum of squares tells of the distance of the cell means from the grand mean. This indicates IV effects.

Computational Example: Caffeine on Test Scores G1: Control G2: Mild G3: Jolt Test Scores 75=79-4 80=84-4 70=74-4 77=79-2 82=84-2 72=74-2 79=79+0 84=84+0 74=74+0 81=79+2 86=84+2 76=74+2 83=79+4 88=84+4 78=74+4 Means 79 84 74 SDs (N-1) 3.16

Total Sum of Squares G1 75 79 16 Control 77 4 M=79 SD=3.16 81 83 G2 80 SD=3.16 81 83 G2 80 1 M=84 82 9 84 25 86 49 88 G3 70 M=74 72 74 76 78 Sum 370 Total Sum of Squares

In the total sum of squares, we are finding the squared distance from the Grand Mean. If we took the average, we would have a variance.

Within Sum of Squares G1 75 79 16 Control 77 4 M=79 SD=3.16 81 83 G2 SD=3.16 81 83 G2 80 84 M=84 82 86 88 G3 70 74 M=74 72 76 78 Sum 120 Within Sum of Squares

Within sum of squares refers to the variance within cells Within sum of squares refers to the variance within cells. That is, the difference between scores and their cell means. SSW estimates error.

Between Sum of Squares G1 79 Control M=79 SD=3.16 G2 84 25 M=84 G3 74 Control M=79 SD=3.16 G2 84 25 M=84 G3 74 M=74 Sum 250 Between Sum of Squares

The between sum of squares relates the Cell Means to the Grand Mean The between sum of squares relates the Cell Means to the Grand Mean. This is related to the variance of the means.

ANOVA Source Table (1) Source SS df MS F Between Groups 250 k-1=2 250/2= 125 =MSB F = MSB/MSW = 125/10 =12.5 Within Groups 120 N-k= 15-3=12 120/12 = 10 = MSW Total 370 N-1=14

ANOVA Source Table (2) df – Degrees of freedom. Divide the sum of squares by degrees of freedom to get MS, Mean Squares, which are population variance estimates. F is the ratio of two mean squares. F is another distribution like z and t. There are tables of F used for significance testing.

The F Distribution

F Table – Critical Values Numerator df: dfB dfW 1 2 3 4 5 5 5% 1% 6.61 16.3 5.79 13.3 5.41 12.1 5.19 11.4 5.05 11.0 10 5% 4.96 10.0 4.10 7.56 3.71 6.55 3.48 5.99 3.33 5.64 12 5% 4.75 9.33 3.89 6.94 3.49 5.95 3.26 3.11 5.06 14 5% 4.60 8.86 3.74 6.51 3.34 5.56 5.04 2.96 4.70

Review What are critical values of a statistics (e.g., critical values of F)? What are degrees of freedom? What are mean squares? What does MSW tell us?

Review 6 Steps Set alpha (.05). State Null & Alternative H0: H1: not all  are =. Calculate test statistic: F=12.5 Determine critical value F.05(2,12) = 3.89 Decision rule: If test statistic > critical value, reject H0. Decision: Test is significant (12.5>3.89). Means in population are different.

Post Hoc Tests If the t-test is significant, you have a difference in population means. If the F-test is significant, you have a difference in population means. But you don’t know where. With 3 means, could be A=B>C or A>B>C or A>B=C. We need a test to tell which means are different. Lots available, we will use 1.

Tukey HSD (1) Use with equal sample size per cell. HSD means honestly significant difference.  is the Type I error rate (.05). Is a value from a table of the studentized range statistic based on alpha, dfW (12 in our example) and k, the number of groups (3 in our example). Is the mean square within groups (10). Is the number of people in each group (5). MSW Result for our example. From table

Tukey HSD (2) To see which means are significantly different, we compare the observed differences among our means to the critical value of the Tukey test. The differences are: 1-2 is 79-84 = -5 (say 5 to be positive). 1-3 is 79-74 = 5 2-3 is 84-74 = 10. Because 10 is larger than 5.33, this result is significant (2 is different than 3). The other differences are not significant. Review 6 steps.

Review What is a post hoc test? What is its use? Describe the HSD test. What does HSD stand for?

Test Another name for mean square is _________. standard deviation sum of squares treatment level variance

Test When do we use post hoc tests? a. after a significant overall F test b. after a nonsignificant overall F test c. in place of an overall F test d. when we want to determine the impact of different factors