Statistics Used in One-way Analysis of Variance BUSI 6480 Lecture 2.

Slides:



Advertisements
Similar presentations
Prepared by Lloyd R. Jaisingh
Advertisements

Hypothesis Testing Steps in Hypothesis Testing:
Independent t -test Features: One Independent Variable Two Groups, or Levels of the Independent Variable Independent Samples (Between-Groups): the two.
Chapter 11 Analysis of Variance
Chapter 12 ANALYSIS OF VARIANCE.
© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.
Statistics for Managers Using Microsoft® Excel 5th Edition
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
Chapter 11 Analysis of Variance
Analysis of Variance. Experimental Design u Investigator controls one or more independent variables –Called treatment variables or factors –Contain two.
Statistics Are Fun! Analysis of Variance
Chapter Topics The Completely Randomized Model: One-Factor Analysis of Variance F-Test for Difference in c Means The Tukey-Kramer Procedure ANOVA Assumptions.
Chapter 3 Analysis of Variance
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 17 Analysis of Variance
Copyright ©2011 Pearson Education 11-1 Chapter 11 Analysis of Variance Statistics for Managers using Microsoft Excel 6 th Global Edition.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
Chap 10-1 Analysis of Variance. Chap 10-2 Overview Analysis of Variance (ANOVA) F-test Tukey- Kramer test One-Way ANOVA Two-Way ANOVA Interaction Effects.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 11-1 Chapter 11 Analysis of Variance Statistics for Managers using Microsoft Excel.
Chapter 12: Analysis of Variance
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Inferential Statistics: SPSS
CHAPTER 3 Analysis of Variance (ANOVA) PART 1
© 2003 Prentice-Hall, Inc.Chap 11-1 Analysis of Variance IE 340/440 PROCESS IMPROVEMENT THROUGH PLANNED EXPERIMENTATION Dr. Xueping Li University of Tennessee.
Comparing Two Population Means
© 2002 Prentice-Hall, Inc.Chap 9-1 Statistics for Managers Using Microsoft Excel 3 rd Edition Chapter 9 Analysis of Variance.
PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Factor Analysis of Variance (ANOVA)
ANOVA (Analysis of Variance) by Aziza Munir
Testing Multiple Means and the Analysis of Variance (§8.1, 8.2, 8.6) Situations where comparing more than two means is important. The approach to testing.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th Lesson Analysis of Variance.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
Inferential Statistics
The Completely Randomized Design (§8.3)
ANOVA Assumptions 1.Normality (sampling distribution of the mean) 2.Homogeneity of Variance 3.Independence of Observations - reason for random assignment.
Lecture 9-1 Analysis of Variance
1 Always be mindful of the kindness and not the faults of others.
Previous Lecture: Phylogenetics. Analysis of Variance This Lecture Judy Zhong Ph.D.
1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.
Hypothesis Testing. Why do we need it? – simply, we are looking for something – a statistical measure - that will allow us to conclude there is truly.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Business Statistics: A First Course (3rd Edition)
Chapter 4 Analysis of Variance
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
IE241: Introduction to Design of Experiments. Last term we talked about testing the difference between two independent means. For means from a normal.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 10, Slide 1 Two-Sample Tests and One-Way ANOVA Chapter 10.
CHAPTER 10 ANOVA - One way ANOVa.
Handout Six: Sample Size, Effect Size, Power, and Assumptions of ANOVA EPSE 592 Experimental Designs and Analysis in Educational Research Instructor: Dr.
Analysis of Variance STAT E-150 Statistical Methods.
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.
Statistical Experiments
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Chapter 11 Analysis of Variance
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Statistics for Managers Using Microsoft Excel 3rd Edition
CHAPTER 3 Analysis of Variance (ANOVA)
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Elementary Statistics
Econ 3790: Business and Economic Statistics
Chapter 11 Analysis of Variance
Single-Factor Studies
Single-Factor Studies
Reasoning in Psychology Using Statistics
Statistics for the Social Sciences
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

Statistics Used in One-way Analysis of Variance BUSI 6480 Lecture 2

Design of Experiments: A historical note Two spoonfuls of vinegar three times a day (and 4 other treatments for scurvy) lost out to oranges and lemons in what Wikipedia credits as an "early" designed experiment. James Lind

Design of Experiments: A historical note In 1747, while serving as surgeon on HM Bark Salisbury, James Lind, the ship's surgeon, carried out a controlled experiment to develop a cure for scurvy. Lind selected 12 men from the ship, all suffering from scurvy, and divided them into six pairs, giving each group different additions to their basic diet for a period of two weeks. The treatments were all remedies that had been proposed at one time or another. They were:  A quart of cider every day  Twenty five gutts of elixir vitriol three times a day upon an empty stomach,vitriol  One half-pint of seawater every day  A mixture of garlic, mustard, and horseradish in a lump the size of a nutmeg  Two spoonfuls of vinegar three times a day  Two oranges and one lemon every day.

Design of Experiments: A historical note The men who had been given citrus fruits recovered dramatically within a week. One of them returned to duty after 6 days and the other became nurse to the rest. The others experienced some improvement, but nothing was comparable to the citrus fruits, which were proved to be substantially superior to the other treatments. In this study his subjects' cases "were as similar as I could have them", that is he provided strict entry requirements to reduce extraneous variation. The men were paired, which provided replication. From a modern perspective, the main thing that is missing is randomized allocation of subjects to treatments.

ANOVA: Testing hypothesized values of  2  A standard assumption is that the data come from a normally distributed population. The groups are assumed to have equal population variances.  H 0 :  2 =  0 2 H A :  2   0 2  H 0 :  2   0 2 H A :  2 >  0 2   2 =(n-1) /   2 where is the sample variance and   2 is the hypothesized variance.

F statistic for ANOVA  Ratio of two chi-square statistics F = (  v1 2 /v 1 )/ (  v2 2 / v 2 ) where v 1 and v 2 are the numerator and denominator degrees of freedom. E(F) = v 2 /(v 2 – 2) Thus, expected value of F is approximately one.

Model for One-way Anova Y ij =  +  j +  i(j) where  j is a fixed effect, i = 1,..., n; j = 1,..., p H 0 :  1 =  1 =... =  p = 0 H 1 :  j  0 for some j Y ij = + ( - ) + ( Y ij - ) Score Grand TreatmentError mean effect effect = SST = SSB + SSW

Notation for Sums of Squares = [AS] /np = [Y] /n = [A] The Letter A represents all observations belonging to a level of treatment A. The notation [A] means to square the sum of Y’s within each treatment level and divide by the number of observations within each level of treatment A. The letters AS represent the treatment and subject within the treatment level. The notation [AS] means to square the sum of observations within each subject treatment level. Note that there is only one observation within each subject treatment level. The letter Y represents all observations of the dependent variable. The notation [Y] means to square the sum of all response observations and divide by the number of responses.

Sums of Squares using Symbols  SST = [AS] –[Y]  SSB = [A] – [Y]  SSW = [AS] – [A] Three terms are used in computing the sums of squares for a one-way ANOVA

Expected Values of Error Terms  E(  i(j) ) =   = 0  E(  i(j) ) = n   = 0  E(  i(j) 2 ) = n     E(  i(j) ) 2 = n     E(  i(j) 2 ) = np   2  E(  i(j) ) 2 = np   2

Expected Value of Mean Sum of Squares for the Fixed Effects CR-p Design  E(MSB) =   2 +  E(MSW) =   2  E(F) ≈ E(MSB)/E(MSW) = (   2 + ) /   2  If H 0 is true and all  j = 0, then E(F) ≈ 1. What is the E(SSB)? What is the E(SSW)?

Expected Value of Mean Sum of Squares for the Random Effects CR-p Design  E(MSB) =   2 + n   2  E(MSW) =   2  E(F) ≈ E(MSB)/E(MSW) = (   2 + n   2 ) /   2  If H 0 is true and all   2 = 0, then E(F) ≈ 1.  Remember for a random effects model  i is a random variable with mean 0 and variance   2. What is the E(SSB)? What is the E(SSW)?

How do you know what ratio of sums of squares to form for the F test?  By finding the expected Mean Sum of Squares, the F statistic can be correctly computed. This will become handy as the designs become more complicated.  The expected values of the mean squares for the fixed and random effects model lead to the same ratios of mean sums of squares for the CR-p design. This will not always be true for more complex designs.

Assumptions for CR-p F Assumptions 1. Data come from normally distributed populations. 2. Observations within cells are random or at least observations are randomly assigned to cells. (Cells are determined by treatment levels.) 3. Numerator and denominator of F statistic are independent. 4. Numerator and denominator are estimates of the same population variance,   2, when H 0 is true.

Model Assumptions for CR-p  The model Y ij =  +  j +  i(j) contains all the sources of variation that affect Y ij.  The experiment contains all the treatment levels of interest.  The error effect,  i(j) is (a) independent of other error terms, (b) normally distributed within each treatment level, ( c) mean is equal to zero, and (d) variance is constant (   2 ) across treatment level.

Testing for Homogeneity of Variance  H 0 :  1 2 =  2 2 = … =  p 2

Levene Test / Brown-Forsythe Test for testing for homogeneity of variance  Levene's test is an alternative to the Bartlett test. It is less sensitive than the Bartlett test to departures from normality. If there is strong evidence that the data do in fact come from a normal, or nearly normal distribution, then Bartlett's test has better performance.Bartlett test  Levene’s test: replace each observation by the absolute value of the deviation of the observation from the group mean and run a one-way ANOVA.  Brown-Forsythe test: modify Levene’s test by using the deviation of each observation from the group median instead of the group mean.

HOV: Homogeneity of Variance If the null hypothesis is rejected use the Welch or Brown-Forsythe test ANOVA test on the means. Assume 5% significance level as the default value.

Transformation of the Dependent Variable to Achieve HOV or Normality – more effective with unequal group sizes.

Rules of Thumb for Transformations of Y

Plotting Group Variances to Determine Transformation. Plot the variance of the group (treatment level) by the group mean (x-axis). Draw a straight line (least squares line) through the points and find the slope, . Use p = 1-  to determine the transformation of the form: y = x p Round off p and use the closest transformation listed.

Another method for selecting the transformation Use the Smallest Range Criteria

Kruskal Wallis: Nonparametric Counterpart to the One-way ANOVA

Guidelines for Alternative Tests

Run the following Data using SAS and SPSS (select HOV and Welch options) one way ANOVA  /* SAS Commands ***/  DM "Log;Clear;OUT;Clear;" ;  Data mydata;  Input Treat1 Treat2 Treat3;  datalines;         ;

Create one column of responses and another column with the grouping variable  Data Treat1;  set mydata;  resp = Treat1;  Data Treat2;  set mydata;  resp = Treat2;  Data Treat3;  set mydata;  resp = Treat3;  Data myAnovaData;  Set Treat1 Treat2 Treat3;  If _N_ <= 21 then Level = 3;  If _N_ <= 14 then Level = 2;  If _N_ <= 7 then Level = 1;  Keep resp Level;  proc print data = myAnovaData;  proc export data=myAnovaData outfile='d:MyAnovaDatainColformat.dat' dbms=dlm replace;

SAS proc to get Welch and Levine HOV test  proc glm data = myAnovaData;  class Level;  model resp = Level;  means Level / hovtest welch ;  run;  quit;

SPSS Analyze > Compare Means > One-Way ANOVA

SPSS Options for the One-Way ANOVA Test for Equal Group Variances Test for Means assuming unequal variances