Download presentation

Presentation is loading. Please wait.

Published byCelina Backhus Modified about 1 year ago

1
Chapter 10 Estimation and Hypothesis Testing II: Independent and Paired Sample T-Test

2
1. Describe the difference between an independent and a dependent sample 2. Describe the difference between and uses of an independent sample t-test and a paired sample t-test 3. List the assumptions for the independent sample t-test 4. Test a hypothesis using two independent samples 5. Construct confidence intervals for the difference in means for the independent sample t-test 6. List the assumptions for the paired sample t-test 7. Test a hypothesis using two dependent samples 8. Construct confidence intervals for the difference in means for the paired sample t-test © 2012 McGraw-Hill Ryerson Ltd.2 Learning Objectives:

3
Independent samples are considered to be independent of one another, if they consist of different individuals and the selection of the individuals in the first group does not influence the selection of the second group. For example, you want to determine whether grade 9 students involved in afterschool extracurricular activities have different grade point averages than grade 9 students who are not involved in after-school extracurricular activities. © 2012 McGraw-Hill Ryerson Ltd.3 LO1 Independent Samples

4
Dependent samples are considered to be dependent on one another if they consist of the same individuals and/or the selection of the individuals in the first group determines the selection of the second group. For example, you want to know if a certain diet is effective at reducing LDL cholesterol levels (the bad kind of cholesterol). You randomly select 20 individuals and measure their LDL cholesterol level. You then have the 20 individuals participate in the diet for three months. At the end of the three months, you measure the LDL cholesterol levels of the same 20 individuals. You now have two samples (or groups). © 2012 McGraw-Hill Ryerson Ltd.4 LO1 Dependent Samples

5
Independent sample t-test: ◦ When we have two samples that are independent from each other and we want to compare the means of the two samples, we use a independent sample t-test. Paired sample t-test: ◦ When we have two samples that are dependent on each other and we want to compare the means of the two samples, we use a paired sample t-test. © 2012 McGraw-Hill Ryerson Ltd.5 LO2 Independent and Paired Sample t-test

6
The variables must be measured at either the interval or ratio level of measurement. The two groups from which you collect the data must be independent of one another. The data must be normally distributed. The variance in the population must be equal for both groups, which means they are not statistically significantly different. This is also called the assumption of homogeneity of variances. © 2012 McGraw-Hill Ryerson Ltd.6 LO3 Assumptions of the Independent Sample t-test

7
The general formula for the independent sample t-test is: Technically, the formula for the independent sample t-test is: Since we are testing the null hypothesis where μ 1 = μ 2, and in the null hypothesis μ 1 -μ 2 =0, we set the term (μ 1 -μ 2 ) to zero. Since is the same as writing we often leave the term (μ 1 -μ 2,), which equals 0, out of the equation. © 2012 McGraw-Hill Ryerson Ltd.7 LO4 Independent Sample t-test for Two Means

8
© 2012 McGraw-Hill Ryerson Ltd.8 LO4 Independent Sample t-test for Two Means The point estimator for the difference in means is still. However, as we are assuming that σ = σ, we label this common variance σ 2, and estimate it by pooling the two sample variances. The pooled estimate of the variance is given by: For an independent sample t-test, the degrees of freedom are calculated as df = (n 1 + n 2 – 2)

9
© 2012 McGraw-Hill Ryerson Ltd.9 LO4 Independent Sample t-test for Two Means The pooled variance is the weighted average of the two sample variances. So that the group with more observations is more reliable, and gets a higher weighting. The standard error for a difference in two means is given in (9.2). As we assume that the two population variances are equal, we estimate them both with the common pooled variance s 2 p. So the standard error becomes:

10
© 2012 McGraw-Hill Ryerson Ltd.10 LO5 Independent Sample t-test for Two Means The confidence interval for (μ 1 – μ 2 ) will use the t- distribution for small sample sizes. The degrees of freedom for s 2 1 is (n1 – 1) and s 2 2 is (n2 – 1). When we pool the information we have (n 1 – 1) + (n 2 – 1) = (n 1 + n 2 – 2) = f Where f is the degrees of freedom. The 95% confidence interval for (μ 1 – μ 2 ) will be the point estimator plus or minus the t-table value times the standard error.

11
© 2012 McGraw-Hill Ryerson Ltd.11 LO5 Independent Sample t-test for Two Means Then, a 95% confidence interval for μ 1 – μ 2 Where the degrees of freedom for the t-distribution is f = (n 1 + n 2 – 2).

12
Recall the 5 steps of hypothesis testing: ◦ Define the Null and Alternative Hypothesis. ◦ Define the Sampling Distribution and Critical Values ◦ Calculate the Test Statistic Using the Sample Data ◦ Make the Decision Regarding the Hypothesis ◦ Interpret the results © 2012 McGraw-Hill Ryerson Ltd.12 LO4 Test of Hypotheses

13
If the data is collected using two independent samples then we use the corresponding t-test. Again the hypotheses are: H 0 : µ 1 = µ 2 H a : µ 1 ≠ µ 2 We can also perform one sided tests if we have prior knowledge of what we would be looking for. © 2012 McGraw-Hill Ryerson Ltd.13 LO4 Test of Hypotheses

14
© 2012 McGraw-Hill Ryerson Ltd.14 Test of Hypotheses LO4 We first calculate the pooled sample variance The t-test for the difference in two means (independent sample case)

15
We then reject the null hypothesis of equal means if the t-value is too large or too small. The degrees of freedom associated with the two sample t-test is f = n 1 + n We look up the t- value with in the right tail and we get the value t 0.025:f. The null hypothesis of equal means is rejected if |t| > t 0.025:f where t 0.025:f is the table value with f=n 1 +n 2 -2 degrees of freedom. © 2012 McGraw-Hill Ryerson Ltd.15 Test of Hypotheses LO4

16
"There is no criminal type" according to Dr. Charles Goring, Deputy Medical Officer of H.M. Prison, London. He measured facial characteristics on 3000 convicts according to The New York times, November 2, The article explains that there was no significant difference between facial measurements of convicts compared to the general population. A similar data set was taken in © 2012 McGraw-Hill Ryerson Ltd.16 LO4 Independent Sample t-test for Two Means – Example

17
The following data represents left ear measurements on convicts at Parkhurst prison in There are two groups used here representing Ordinary Murderers and Other Criminals Ordinary murderers Other Criminals © 2012 McGraw-Hill Ryerson Ltd.17 Independent Sample t-test for Two Means – Example LO4

18
For this data the summary statistics are: Griffiths, G.B. (1904). Measurements of One Hundred and Thirty Criminals. Biometrika, 3, Sexually AbusedComparison Group Sample Means Sample Variances Sample Sizesn 1 = 10n 2 = 10 Population Means 1 2 Population Variances © 2012 McGraw-Hill Ryerson Ltd.18 Independent Sample t-test for Two Means – Example LO4

19
© 2012 McGraw-Hill Ryerson Ltd.19 Independent Sample t-test for Two Means – Example LO4

20
© 2012 McGraw-Hill Ryerson Ltd.20 Independent Sample t-test for Two Means – Example LO5

21
© 2012 McGraw-Hill Ryerson Ltd.21 Independent Sample t-test for Two Means – Example (alternative calculation) LO5

22
The variables must be measured at either the interval or ratio level of measurement. The two groups must be dependent on one another. The differences between the two samples must be relatively normally distributed. © 2011 McGraw-Hill Ryerson Ltd.22 LO6 Assumptions for the Paired Sample t-test

23
© 2012 McGraw-Hill Ryerson Ltd.23 LO7 Paired Sample t-test for Two Means

24
© 2012 McGraw-Hill Ryerson Ltd.24 LO7 Paired Sample t-test for Two Means

25
In the example we compared the means of two types of criminals. The subjects were not related in any way. We consider them to be independent samples. The variance parameter measures the variability in ear measurements of people, and can be quite large as people are highly variable. When we compare two groups of people, we can reduce the variability in the data by using pairs of individuals. So we see toothpaste commercials comparing two brands of toothpaste and measuring the number of cavities over a six month period. © 2012 McGraw-Hill Ryerson Ltd.25 LO7 Paired Sample t-test for Two Means

26
They could use two groups of subjects and randomly assign them to a brand of toothpaste. But they normally pick sets of twins and randomly assigned one of each twin to a brand of toothpaste. In this example, the variance measures variability in twins rather than variability in people. We expect the variability to be much less. We can choose twins, or pairs of subjects matched up by demographical measurements such as age, sex etc. The idea is to control for differences due to a number of other factors and to reduce the standard errors of our estimates. © 2012 McGraw-Hill Ryerson Ltd.26 LO7 Paired Sample t-test for Two Means

27
© 2012 McGraw-Hill Ryerson Ltd.27 LO7 Paired Sample t-test for Two Means

28
© 2012 McGraw-Hill Ryerson Ltd.28 LO7 Paired Sample t-test for Two Means

29
© 2012 McGraw-Hill Ryerson Ltd.29 LO8 Paired Sample t-test for Two Means

30
Consider the left and right ear measurements on the "other criminals" in the example. Note ∑d = 8 and ∑d 2 = 84 SubjectRight ear (x 1 )Left Ear (x 2) ) Difference d=x 1 - x © 2012 McGraw-Hill Ryerson Ltd.30 LO7 Paired Sample t-test for Two Means – Example

31
© 2012 McGraw-Hill Ryerson Ltd.31 LO7,8 Paired Sample t-test for Two Means – Example

32
The independent sample t-test and the paired sample t-test are useful methods for testing hypotheses with two sample means. ◦ Hypotheses with two sample means and are considered independent use the independent sample t-test. ◦ Testing two sample means from dependent sample use the paired sample t-test. Our research often involves comparing more than two groups so we need to use what is called an analysis of variance (ANOVA). © 2012 McGraw-Hill Ryerson Ltd.32 Conclusion

Similar presentations

© 2017 SlidePlayer.com Inc.

All rights reserved.

Ads by Google