Copyright (c) Bani K. Mallick1 STAT 651 Lecture #13.

Slides:



Advertisements
Similar presentations
BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
Advertisements

CHAPTER 25: One-Way Analysis of Variance Comparing Several Means
Chapter 14, part D Statistical Significance. IV. Model Assumptions The error term is a normally distributed random variable and The variance of  is constant.
Inference for Regression
Analysis of Variance (ANOVA) ANOVA can be used to test for the equality of three or more population means We want to use the sample results to test the.
Econ 140 Lecture 81 Classical Regression II Lecture 8.
Design of Experiments and Analysis of Variance
Copyright (c) Bani Mallick1 Stat 651 Lecture 5. Copyright (c) Bani Mallick2 Topics in Lecture #5 Confidence intervals for a population mean  when the.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #17.
© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.
Lecture 23: Tues., Dec. 2 Today: Thursday:
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #18.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture 9.
Part I – MULTIVARIATE ANALYSIS
Copyright (c) Bani Mallick1 STAT 651 Lecture 7. Copyright (c) Bani Mallick2 Topics in Lecture #7 Sample size for fixed power Never, ever, accept a null.
Copyright (c) Bani Mallick1 STAT 651 Lecture 10. Copyright (c) Bani Mallick2 Topics in Lecture #10 Comparing two population means using rank tests Comparing.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #16.
Copyright (c) Bani Mallick1 Lecture 4 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #4 Probability The bell-shaped (normal) curve Normal probability.
Lecture 9: One Way ANOVA Between Subjects
Two Groups Too Many? Try Analysis of Variance (ANOVA)
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #20.
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide Are the Means of Several Groups Equal? Ho:Ha: Consider the following.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #15.
Lecture 12 One-way Analysis of Variance (Chapter 15.2)
Copyright (c) Bani K. Mallick1 STAT 651 Lecture # 12.
Copyright (c) Bani Mallick1 STAT 651 Lecture # 11.
Copyright (c) Bani K. mallick1 STAT 651 Lecture #14.
5-3 Inference on the Means of Two Populations, Variances Unknown
Two-Way Analysis of Variance STAT E-150 Statistical Methods.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Chapter 12: Analysis of Variance
F-Test ( ANOVA ) & Two-Way ANOVA
QNT 531 Advanced Problems in Statistics and Research Methods
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
 The idea of ANOVA  Comparing several means  The problem of multiple comparisons  The ANOVA F test 1.
CHAPTER 14 MULTIPLE REGRESSION
+ Chapter 12: Inference for Regression Inference for Linear Regression.
ANOVA One Way Analysis of Variance. ANOVA Purpose: To assess whether there are differences between means of multiple groups. ANOVA provides evidence.
ANOVA (Analysis of Variance) by Aziza Munir
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
Between-Groups ANOVA Chapter 12. >When to use an F distribution Working with more than two samples >ANOVA Used with two or more nominal independent variables.
Randomized completely Block design(RCBD). In this design we are interested in comparing t treatment by using b blocks.
ANOVA Conceptual Review Conceptual Formula, Sig Testing Calculating in SPSS.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Comparing Three or More Means ANOVA (One-Way Analysis of Variance)
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
General Linear Model 2 Intro to ANOVA.
ANOVA: Analysis of Variance.
1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.
Principles of Biostatistics ANOVA. DietWeight Gain (grams) Standard910 8 Junk Food Organic Table shows weight gains for mice on 3 diets.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.3 Two-Way ANOVA.
Chapter 10 The t Test for Two Independent Samples
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
CHAPTER 27: One-Way Analysis of Variance: Comparing Several Means
Copyright (c) Bani K. Mallick1 STAT 651 Lecture 6.
CHAPTER 27: One-Way Analysis of Variance: Comparing Several Means
Analysis of Variance STAT E-150 Statistical Methods.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Copyright (c) Bani Mallick1 STAT 651 Lecture 8. Copyright (c) Bani Mallick2 Topics in Lecture #8 Sign test for paired comparisons Wilcoxon signed rank.
EDUC 200C Section 9 ANOVA November 30, Goals One-way ANOVA Least Significant Difference (LSD) Practice Problem Questions?
Chapters Way Analysis of Variance - Completely Randomized Design.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Basic Practice of Statistics - 5th Edition
I. Statistical Tests: Why do we use them? What do they involve?
Presentation transcript:

Copyright (c) Bani K. Mallick1 STAT 651 Lecture #13

Copyright (c) Bani K. Mallick2 Topics in Lecture #13 Multiple comparisons, especially Fisher’s Least Significant Difference Residuals as a means of checking the normality assumption

Copyright (c) Bani K. Mallick3 Book Sections Covered in Lecture #13 Chapter 8.4 (Residuals) Chapter 9.4 (Fisher’s) Chapter 9.1 (the idea of multiple comparisons)

Copyright (c) Bani K. Mallick4 Lecture 12 Review: ANOVA Suppose we form three populations on the basis of body mass index (BMI): BMI 28 This forms 3 populations We want to know whether the three populations have the same mean caloric intake, or if their food composition differs.

Copyright (c) Bani K. Mallick5 Lecture 12 Review: ANOVA One procedure that is often followed is to do a preliminary test to see whether there are any differences among the populations Then, once you conclude that some differences exist, you allow somewhat more informality in deciding where those differences manifest themselves The first step is the ANOVA F-test

Copyright (c) Bani K. Mallick6 Lecture 12 Review: ANOVA The distance of the data to the overall mean is TSS = (Corrected) Total Sum of Squares This has degrees of freedom

Copyright (c) Bani K. Mallick7 Lecture 12 Review: ANOVA The sum of squares between groups Corrected Model) is It has t-1 degrees of freedom, so the number of populations is the degrees of freedom between groups + 1.

Copyright (c) Bani K. Mallick8 Lecture 12 Review: ANOVA The distance of the observations to their sample means is This is the Sum of Squares for Error It has degrees of freedom

Copyright (c) Bani K. Mallick9 Lecture 12 Review: ANOVA Next comes the F-statistic It is the ratio of the mean square for the corrected model to the mean square for error Large values indicate rejection of the null hypothesis

Copyright (c) Bani K. Mallick10 Lecture 12 Review: ANOVA The F-statistic is compared to the F- distribution with t-1 and degrees of freedom. See Table 8,which lists the cutoff points in terms of . If the F-statistic exceeds the cutoff, you reject the hypothesis of equality of all the means. SPSS gives you the p-value (significance level) for this test

Copyright (c) Bani K. Mallick11 Lecture 12 Review: ANOVA The F-statistic is compared to the F- distribution with df 1 = t-1 and degrees of freedom. For example if you have 3 populations, 6 observations for each population, then there are 18 total observations. The degrees of freedom are 2 and 15. If you want a type I error of 5%, look at df 1 = 2, df 2 = 15,  =.05 to get a critical value of 3.68: try this out!

Copyright (c) Bani K. Mallick12 Lecture 12 Review: ANOVA If the populations have a common variance  2, the Mean squared error estimates it. You take the square root of the MSE to estimate 

Copyright (c) Bani K. Mallick13 Lecture 12 Review: ANOVA The critical value of 2 and 181 df for an F-test at Type I error 0.05 is about 3.05 Hence F > 3.05, so the p-value is < 0.05

Copyright (c) Bani K. Mallick14 ANOVA in SPSS “Analyze”, “General Linear Model”, “Univariate” “Fixed factor” = the variable defining the populations Always “Save” unstandardized residuals “Posthoc”: Move factor to right and click on LSD

Copyright (c) Bani K. Mallick15 ANOVA Table

Copyright (c) Bani K. Mallick16 Fisher’s Least Significant Distance (LSD) Suppose that we determine that there are at least some differences among t population means. Fisher’s Least Significant Difference is one way to tell which ones are different The main reason to use it is convenience: all comparisons can be done with the click of a mouse It does not guarantee longer or shorter confidence intervals

Copyright (c) Bani K. Mallick17 Fisher’s Least Significant Distance (LSD) For example, suppose there are t = 3 populations. The null hypothesis is The alternative is: But this does not tell you which populations are different, only that some are

Copyright (c) Bani K. Mallick18 Fisher’s Least Significant Distance (LSD) The null hypothesis is The alternative is: There are 4 possibilities: Fishers LSD is a way of getting this directly

Copyright (c) Bani K. Mallick19 Fisher’s LSD We have done an ANOVA, and now we want to compare two specific populations. Fisher’s LSD differs from our usual 2- population comparisons in two features: The degrees of freedom (n T - t) not n 1 +n 2 -2 The pooled standard deviation (square root of MSE = SSE/(n T - t), not s P

Copyright (c) Bani K. Mallick20 Review: Comparing Two Populations If you can reasonably believe that the population sd’s are nearly equal, it is customary to pick the equal variance assumption and estimate the common standard deviation by

Copyright (c) Bani K. Mallick21 Comparing Two Populations: Usual and Fisher LSD Usual Fisher

Copyright (c) Bani K. Mallick22 ROS Data ROS data has three groups: Fish oil diet, Fish- like oil diet, and Corn Oil We want to compare their responses to butyrate

Copyright (c) Bani K. Mallick23 ANOVA ROS data, log scale. What do you see?

Copyright (c) Bani K. Mallick24 ANOVA ROS data, log scale. What do you see? Maybe different variances, but sample sizes are small

Copyright (c) Bani K. Mallick25 ANOVA ROS data, log scale. No major changes in means?

Copyright (c) Bani K. Mallick26 ANOVA ROS data has three groups: Fish oil diet, Fish- like oil diet, and Corn Oil What was the total sample size? n = 30 Tests of Between-Subjects Effects Dependent Variable: log(Butyrate) - log(Control ) 5.188E-02 a E E E Source Corrected Model Intercept DIETGRP Error Total Corrected Total Type III Sum of SquaresdfMean SquareFSig. R Squared =.015 (Adjusted R Squared = -.058) a.

Copyright (c) Bani K. Mallick27 ANOVA ROS data: any evidence that the population means are different in their change after butyrate exposure?

Copyright (c) Bani K. Mallick28 ANOVA ROS data: any evidence that the population means are different in their change after butyrate exposure? No, the p-value is 0.818! This matches the box plots

Copyright (c) Bani K. Mallick29 ROS Data Testing for Normality in ANOVA I use the General Linear Model to define these residuals Form the residuals, which are simply the differences of the data with their group sample mean Then do a q-q plot Useful if you have many groups with a small number of observations per group

Copyright (c) Bani K. Mallick30 ANOVA Here is the Q-Q plot. How’s it look?

Copyright (c) Bani K. Mallick31 ROS Data Testing for Normality in ANOVA: Illustrate saving residuals: “general linear model”, “univariate”, “save” (select “unstandardized” to create the residual variable ) Illustrate q-q- plot on residuals Illustrate editing a chart object to change titles and the like

Copyright (c) Bani K. Mallick32 ROS Data Fisher’s LSD. Note how all p-values are > Multiple Comparisons Dependent Variable: log(Butyrate) - log(Control) LSD 6.825E E E E E E (J) Diet Group Fish oil diet Corn oil diet FAEE oil diet Corn oil diet FAEE oil diet Fish oil diet (I) Diet Group FAEE oil diet Fish oil diet Corn oil diet Mean Difference (I-J)Std. Error Pvalues Sig.Lower BoundUpper Bound 95% Confidence Interval Based on observed means.

Copyright (c) Bani K. Mallick33 ROS Data: Compare Fish to Corn oil Mean for fish – mean for corn = Multiple Comparisons Dependent Variable: log(Butyrate) - log(Control) LSD 6.825E E E E E E (J) Diet Group Fish oil diet Corn oil diet FAEE oil diet Corn oil diet FAEE oil diet Fish oil diet (I) Diet Group FAEE oil diet Fish oil diet Corn oil diet Mean Difference (I-J)Std. Error Pvalues Sig.Lower BoundUpper Bound 95% Confidence Interval Based on observed means.

Copyright (c) Bani K. Mallick34 ROS Data: Compare Fish to Corn oil Mean for fish – mean for corn = Standard error = Multiple Comparisons Dependent Variable: log(Butyrate) - log(Control) LSD 6.825E E E E E E (J) Diet Group Fish oil diet Corn oil diet FAEE oil diet Corn oil diet FAEE oil diet Fish oil diet (I) Diet Group FAEE oil diet Fish oil diet Corn oil diet Mean Difference (I-J)Std. Error Pvalues Sig.Lower BoundUpper Bound 95% Confidence Interval Based on observed means.

Copyright (c) Bani K. Mallick35 ROS Data: Compare Fish to Corn oil Mean for fish – mean for corn = Standard error = CI (95%) = Multiple Comparisons Dependent Variable: log(Butyrate) - log(Control) LSD 6.825E E E E E E (J) Diet Group Fish oil diet Corn oil diet FAEE oil diet Corn oil diet FAEE oil diet Fish oil diet (I) Diet Group FAEE oil diet Fish oil diet Corn oil diet Mean Difference (I-J)Std. Error Pvalues Sig.Lower BoundUpper Bound 95% Confidence Interval Based on observed means.

Copyright (c) Bani K. Mallick36 ROS Data: Compare Fish to Corn oil Mean for fish – mean for corn = Standard error = CI (95%) = to.3596 Multiple Comparisons Dependent Variable: log(Butyrate) - log(Control) LSD 6.825E E E E E E (J) Diet Group Fish oil diet Corn oil diet FAEE oil diet Corn oil diet FAEE oil diet Fish oil diet (I) Diet Group FAEE oil diet Fish oil diet Corn oil diet Mean Difference (I-J)Std. Error Pvalues Sig.Lower BoundUpper Bound 95% Confidence Interval Based on observed means.

Copyright (c) Bani K. Mallick37 Concho Water Snake Illustration A numerical example will help illustrate this idea. I’ll consider comparing tail lengths of female Concho Water Snakes with age classes 2,3, and 4. Sample sizes Sample sd: Sample means:

Copyright (c) Bani K. Mallick38 Female Concho Water Snakes, Ages 2-4, Tail Length

Copyright (c) Bani K. Mallick39 Female Concho Water Snakes, Ages 2-4, Tail Length

Copyright (c) Bani K. Mallick40 Female Concho Water Snakes, Ages 2-4, Tail Length: are they different in population means?

Copyright (c) Bani K. Mallick41 Concho Water Snake Example Multiple Comparisons Dependent Variable: Tail Length LSD * * * * * * (J) Age (I) Age Mean Difference (I-J)Std. ErrorSig.Lower BoundUpper Bound 95% Confidence Interval Based on observed means. The mean difference is significant at the.05 level. *.

Copyright (c) Bani K. Mallick42 Concho Water Snake Illustration: Hand Calculations Sample size factor for comparing the age groups Sample mean difference

Copyright (c) Bani K. Mallick43 Concho Water Snake Illustration n T – t = 34 degrees of freedom for error MSE = ,  = 0.05 = 9.76 to 33.10: compare with output

Copyright (c) Bani K. Mallick44 Female Concho Water Snakes, Ages 2-4, Tail Length We need a method that allows for non- normal data!