T-T ESTS AND A NALYSIS OF V ARIANCE Jennifer Kensler.

Slides:



Advertisements
Similar presentations
LISA Short Course: A Tutorial in t-tests and ANOVA using JMP Laboratory for Interdisciplinary Statistical Analysis Anne Ryan Assistant Professor of Practice.
Advertisements

BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
Independent t -test Features: One Independent Variable Two Groups, or Levels of the Independent Variable Independent Samples (Between-Groups): the two.
Lecture 6 Outline – Thur. Jan. 29
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.
ANOVA notes NR 245 Austin Troy
Lecture 6 Outline: Tue, Sept 23 Review chapter 2.2 –Confidence Intervals Chapter 2.3 –Case Study –Two sample t-test –Confidence Intervals Testing.
BCOR 1020 Business Statistics
Chapter Goals After completing this chapter, you should be able to:
Test statistic: Group Comparison Jobayer Hossain Larry Holmes, Jr Research Statistics, Lecture 5 October 30,2008.
Final Review Session.
Experimental Design Terminology  An Experimental Unit is the entity on which measurement or an observation is made. For example, subjects are experimental.
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Lecture 9: One Way ANOVA Between Subjects
Analysis of Variance & Multivariate Analysis of Variance
Student’s t statistic Use Test for equality of two means
T-T ESTS AND A NALYSIS OF V ARIANCE Jennifer Kensler.
T-T ESTS AND A NALYSIS OF V ARIANCE Jennifer Kensler.
Laboratory for Interdisciplinary Statistical Analysis Anne Ryan Virginia Tech.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Business Statistics: Communicating with Numbers By Sanjiv Jaggia.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
LISA Short Course Series R Statistical Analysis Ning Wang Summer 2013 LISA: R Statistical AnalysisSummer 2013.
Chapter 12: Analysis of Variance
F-Test ( ANOVA ) & Two-Way ANOVA
Statistical Analysis Statistical Analysis
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Comparing Three or More Means 13.
T-tests and ANOVA using JMP Kristopher Patton April 7, 2015 * institute-state-university-virginia-tech/
Comparing Two Population Means
R EGRESSION Jennifer Kensler. Laboratory for Interdisciplinary Statistical Analysis Collaboration From our website request a meeting for personalized.
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Experimental Design and Analysis of Variance Chapter 10.
ANOVA (Analysis of Variance) by Aziza Munir
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
Business Statistics: Contemporary Decision Making, 3e, by Black. © 2001 South-Western/Thomson Learning 11-1 Business Statistics, 3e by Ken Black Chapter.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
Two-Sample Inference Procedures with Means. Of the following situations, decide which should be analyzed using one-sample matched pair procedure and which.
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
Comparing Three or More Means ANOVA (One-Way Analysis of Variance)
Education 793 Class Notes Presentation 10 Chi-Square Tests and One-Way ANOVA.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Nonparametric Statistics
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Analysis of Variance (One Factor). ANOVA Analysis of Variance Tests whether differences exist among population means categorized by only one factor or.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.1 One-Way ANOVA: Comparing.
1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.
Text Exercise 12.2 (a) (b) (c) Construct the completed ANOVA table below. Answer this part by indicating what the f test statistic value is, what the appropriate.
12-1 Chapter Twelve McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Business Statistics: A First Course (3rd Edition)
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
T-T ESTS AND A NALYSIS OF V ARIANCE Jennifer Kensler July 13, 2010 Fralin Auditorium, Virginia Tech This presentation is annotated. Please click on the.
Copyright c 2001 The McGraw-Hill Companies, Inc.1 Chapter 11 Testing for Differences Differences betweens groups or categories of the independent variable.
Kin 304 Inferential Statistics Probability Level for Acceptance Type I and II Errors One and Two-Tailed tests Critical value of the test statistic “Statistics.
Analysis of Variance STAT E-150 Statistical Methods.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 11 Testing for Differences Differences betweens groups or categories of the independent.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Experimental Design and Analysis of Variance Chapter 11.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 4 Investigating the Difference in Scores.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
C HAPTER 11 C OMPARING TWO POPULATIONS OR T REATMENTS How can the data from two independent populations or treatments be evaluated to determine causation?
i) Two way ANOVA without replication
Comparing Three or More Means
Chapter 10 – Part II Analysis of Variance
ANOVA: Analysis of Variance
Presentation transcript:

T-T ESTS AND A NALYSIS OF V ARIANCE Jennifer Kensler

Laboratory for Interdisciplinary Statistical Analysis Virginia Tech’s source for expert statistical analysis since 1948 Walk-In Consulting: Monday—Friday* 12-2PM for questions <30 minutes * Mon—Thurs in summer * We help with research—not class projects or homework Collaboration: Personalized statistical advice Great advice right now: Meet with LISA before collecting your data Short Courses: Designed to help graduate students apply statistics in their research

Laboratory for Interdisciplinary Statistical Analysis Virginia Tech’s source for expert statistical analysis since 1948 Walk-In Consulting: Monday—Friday* 12-2PM for questions <30 minutes * Mon—Thurs in summer * We help with research—not class projects or homework Collaboration: Personalized statistical advice Great advice right now: Meet with LISA before collecting your data Short Courses: Designed to help graduate students apply statistics in their research

T-T ESTS AND A NALYSIS OF V ARIANCE

O NE S AMPLE T-T EST 5

Used to test whether the population mean is different from a specified value. Example: Is the mean height of 12 year old girls greater than 60 inches? 6

S TEP 1: F ORMULATE THE H YPOTHESES The population mean is not equal to a specified value. H 0 : μ = μ 0 H a : μ ≠ μ 0 The population mean is greater than a specified value. H 0 : μ = μ 0 H a : μ > μ 0 The population mean is less than a specified value. H 0 : μ = μ 0 H a : μ < μ 0 7

S TEP 2: C HECK THE A SSUMPTIONS The sample is random. The population from which the sample is drawn is either normal or the sample size is large. 8

S TEPS 3-5 Step 3: Calculate the test statistic: Where Step 4: Calculate the p-value based on the appropriate alternative hypothesis. Step 5: Write a conclusion. 9

I RIS E XAMPLE A researcher would like to know whether the mean sepal width of a variety of irises is different from 3.5 cm. The researcher randomly measures the sepal width of 50 irises. Step 1: Hypotheses H 0 : μ = 3.5 cm H a : μ ≠ 3.5 cm 10

JMP Steps 2-4: JMP Demonstration Analyze  Distribution Y, Columns: Sepal Width Test Mean Specify Hypothesized Mean:

JMP O UTPUT Step 5 Conclusion: The mean sepal width is not significantly different from 3.5 cm. 12

T WO S AMPLE T-T EST 13

T WO S AMPLE T-T EST Two sample t-tests are used to determine whether the population mean of one group is equal to, larger than or smaller than the population mean of another group. Example: Is the mean cholesterol of people taking drug A lower than the mean cholesterol of people taking drug B? 14

S TEP 1: F ORMULATE THE H YPOTHESES The population means of the two groups are not equal. H 0 : μ 1 = μ 2 H a : μ 1 ≠ μ 2 The population mean of group 1 is greater than the population mean of group 2. H 0 : μ 1 = μ 2 H a : μ 1 > μ 2 The population mean of group 1 is less than the population mean of group 2. H 0 : μ 1 = μ 2 H a : μ 1 < μ 2 15

S TEP 2: C HECK THE A SSUMPTIONS The two samples are random and independent. The populations from which the samples are drawn are either normal or the sample sizes are large. The populations have the same standard deviation. 16

S TEPS 3-5 Step 3: Calculate the test statistic where Step 4: Calculate the appropriate p-value. Step 5: Write a Conclusion. 17

T WO S AMPLE E XAMPLE A researcher would like to know whether the mean sepal width of setosa irises is different from the mean sepal width of versicolor irises. Step 1 Hypotheses: H 0 : μ setosa = μ versicolor H a : μ setosa ≠ μ versicolor 18

JMP Steps 2-4: JMP Demonstration: Analyze  Fit Y By X Y, Response: Sepal Width X, Factor: Species 19

JMP O UTPUT Step 5 Conclusion: There is strong evidence (p- value < ) that the mean sepal widths for the two varieties are different. 20

P AIRED T-T EST 21

P AIRED T-T EST The paired t-test is used to compare the means of two dependent samples. Example: A researcher would like to determine if background noise causes people to take longer to complete math problems. The researcher gives 20 subjects two math tests one with complete silence and one with background noise and records the time each subject takes to complete each test. 22

S TEP 1: F ORMULATE THE H YPOTHESES The population mean difference is not equal to zero. H 0 : μ difference = 0 H a : μ difference ≠ 0 The population mean difference is greater than zero. H 0 : μ difference = 0 H a : μ difference > 0 The population mean difference is less than a zero. H 0 : μ difference = 0 H a : μ difference < 0 23

S TEP 2: C HECK THE ASSUMPTIONS The sample is random. The data is matched pairs. The differences have a normal distribution or the sample size is large. 24

S TEPS 3-5 Where d bar is the mean of the differences and s d is the standard deviations of the differences. Step 4: Calculate the p-value. Step 5: Write a conclusion. Step 3: Calculate the test Statistic: 25

P AIRED T-T EST E XAMPLE A researcher would like to determine whether a fitness program increases flexibility. The researcher measures the flexibility (in inches) of 12 randomly selected participants before and after the fitness program. Step 1: Formulate a Hypothesis H 0 : μ After - Before = 0 H a : μ After - Before > 0 26

P AIRED T-T EST E XAMPLE Steps 2-4: JMP Analysis: Create a new column of After – Before Analyze  Distribution Y, Columns: After – Before Test Mean Specify Hypothesized Mean: 0 27

JMP O UTPUT Step 5 Conclusion: There is not evidence that the fitness program increases flexibility. 28

O NE -W AY A NALYSIS OF V ARIANCE 29

O NE -W AY ANOVA ANOVA is used to determine whether three or more populations have different distributions. A B C Medical Treatment 30

ANOVA S TRATEGY The first step is to use the ANOVA F test to determine if there are any significant differences among means. If the ANOVA F test shows that the means are not all the same, then follow up tests can be performed to see which pairs of means differ. 31

O NE -W AY ANOVA M ODEL In other words, for each group the observed value is the group mean plus some random variation. 32

O NE -W AY ANOVA H YPOTHESIS Step 1: We test whether there is a difference in the means. 33

S TEP 2: C HECK ANOVA A SSUMPTIONS The samples are random and independent of each other. The populations are normally distributed. The populations all have the same variance. The ANOVA F test is robust to the assumptions of normality and equal variances. 34

S TEP 3: ANOVA F T EST Compare the variation within the samples to the variation between the samples. A B C A B C Medical Treatment 35

ANOVA T EST S TATISTIC Variation within groups small compared with variation between groups → Large F Variation within groups large compared with variation between groups → Small F 36

MSG The mean square for groups, MSG, measures the variability of the sample averages. SSG stands for sums of squares groups. 37

MSE Mean square error, MSE, measures the variability within the groups. SSE stands for sums of squares error. 38

S TEPS 4-5 Step 4: Calculate the p-value. Step 5: Write a conclusion. 39

ANOVA E XAMPLE A researcher would like to determine if three drugs provide the same relief from pain. 60 patients are randomly assigned to a treatment (20 people in each treatment). Step 1: Formulate the Hypotheses H 0 : μ Drug A = μ Drug B = μ Drug C H a : The μ i are not all equal. 40

S TEPS 2-4 JMP demonstration Analyze  Fit Y By X Y, Response: Pain X, Factor: Drug 41

JMP O UTPUT AND C ONCLUSION Step 5 Conclusion: There is strong evidence that the drugs are not all the same. 42

F OLLOW -U P T EST The p-value of the overall F test indicates that the level of pain is not the same for patients taking drugs A, B and C. We would like to know which pairs of treatments are different. One method is to use Tukey’s HSD (honestly significant differences). 43

T UKEY T ESTS Tukey’s test simultaneously tests JMP demonstration Oneway Analysis of Pain By Drug  Compare Means  All Pairs, Tukey HSD for all pairs of factor levels. Tukey’s HSD controls the overall type I error. 44

JMP O UTPUT The JMP output shows that drugs A and C are significantly different. 45

T WO -W AY A NALYSIS OF V ARIANCE 46

T WO -W AY ANOVA We are interested in the effect of two categorical factors on the response. We are interested in whether either of the two factors have an effect on the response and whether there is an interaction effect. An interaction effect means that the effect on the response of one factor depends on the level of the other factor. 47

I NTERACTION 48

T WO -W AY ANOVA M ODEL 49

T WO -W AY ANOVA E XAMPLE We would like to determine the effect of two alloys (low, high) and three cooling temperatures (low, medium, high) on the strength of a wire. JMP demonstration Analyze  Fit Model Y: Strength Highlight Alloy and Temp and click Macros  Factorial to Degree 50

JMP O UTPUT 51 Conclusion: There is strong evidence of an interaction between alloy and temperature.

A NALYSIS OF C OVARIANCE 52

A NALYSIS O F C OVARIANCE (ANCOVA) Covariates are variables that may affect the response but cannot be controlled. Covariates are not of primary interest to the researcher. We will look at an example with two covariates, the model is 53

ANCOVA E XAMPLE Consider the one-way ANOVA example where we tested whether the patients receiving different drugs reported different levels of pain. Perhaps age and gender may influence the pain. We can use age and gender as covariates. JMP demonstration Analyze  Fit Model Y: Pain Add: Drug Age Gender 54

JMP O UTPUT 55

C ONCLUSION The one sample t-test allows us to test whether the population mean of a group is equal to a specified value. The two-sample t-test and paired t-test allow us to determine if the population means of two groups are different. ANOVA and ANCOVA methods allow us to determine whether the population means of several groups are statistically different. 56

SAS AND SPSS For information about using SAS and SPSS to do ANOVA:

R EFERENCES Fisher’s Irises Data (used in one sample and two sample t-test examples). Flexibility data (paired t-test example): Michael Sullivan III. Statistics Informed Decisions Using Data. Upper Saddle River, New Jersey: Pearson Education, 2004: