Comparing Two Population Means

Slides:



Advertisements
Similar presentations
Section 9.3 Inferences About Two Means (Independent)
Advertisements

Confidence Interval and Hypothesis Testing for:
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 10-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Chapter 10 Two-Sample Tests
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 10 Hypothesis Testing:
Chapter 10 Two-Sample Tests
Copyright ©2011 Brooks/Cole, Cengage Learning Analysis of Variance Chapter 16 1.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
PSY 307 – Statistics for the Behavioral Sciences
MARE 250 Dr. Jason Turner Hypothesis Testing II To ASSUME is to make an… Four assumptions for t-test hypothesis testing: 1. Random Samples 2. Independent.
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
Chap 11-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 11 Hypothesis Testing II Statistics for Business and Economics.
BCOR 1020 Business Statistics
Chapter Goals After completing this chapter, you should be able to:
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 9-1 Introduction to Statistics Chapter 10 Estimation and Hypothesis.
1/45 Chapter 11 Hypothesis Testing II EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008.
A Decision-Making Approach
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 10-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
T-Tests Lecture: Nov. 6, 2002.
Chapter 11: Inference for Distributions
Chapter 9 Hypothesis Testing.
Ch. 9 Fundamental of Hypothesis Testing
Chapter 10, sections 1 and 4 Two-sample Hypothesis Testing Test hypotheses for the difference between two independent population means ( standard deviations.
5-3 Inference on the Means of Two Populations, Variances Unknown
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests Basic Business Statistics 10 th Edition.
Week 9 October Four Mini-Lectures QMM 510 Fall 2014.
Chapter 9 Comparing Means
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
AM Recitation 2/10/11.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Hypothesis testing – mean differences between populations
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Education 793 Class Notes T-tests 29 October 2003.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
Ch 11 – Inference for Distributions YMS Inference for the Mean of a Population.
Inference for a Single Population Proportion (p).
Chapter 10 Comparing Two Means Target Goal: I can use two-sample t procedures to compare two means. 10.2a h.w: pg. 626: 29 – 32, pg. 652: 35, 37, 57.
Week 111 Power of the t-test - Example In a metropolitan area, the concentration of cadmium (Cd) in leaf lettuce was measured in 7 representative gardens.
1 Objective Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means.
Inference for distributions: - Comparing two means IPS chapter 7.2 © 2006 W.H. Freeman and Company.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 10-1 Chapter 2c Two-Sample Tests.
10-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 10 Two-Sample Tests Statistics for Managers using Microsoft Excel 6 th.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests and One-Way ANOVA Business Statistics, A First.
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
Industrial Statistics 2
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
Chap 9-1 Two-Sample Tests. Chap 9-2 Two Sample Tests Population Means, Independent Samples Means, Related Samples Population Variances Group 1 vs. independent.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
Two sample problems:  compare the responses in two groups  each group is a sample from a distinct population  responses in each group are independent.
1 Chapter 9 Inferences from Two Samples 9.2 Inferences About Two Proportions 9.3 Inferences About Two Means (Independent) 9.4 Inferences About Two Means.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 10 Hypothesis Testing:
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Two-Sample Tests Statistics for Managers Using Microsoft.
Lesson Comparing Two Means. Knowledge Objectives Describe the three conditions necessary for doing inference involving two population means. Clarify.
AP Statistics Chapter 24 Comparing Means.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Comparing the Means of Two Dependent Populations.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
AP Statistics. Chap 13-1 Chapter 13 Estimation and Hypothesis Testing for Two Population Parameters.
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 10-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Chapter 7 Inference Concerning Populations (Numeric Responses)
Objectives (PSLS Chapter 18) Comparing two means (σ unknown)  Two-sample situations  t-distribution for two independent samples  Two-sample t test 
Chapter 10: The t Test For Two Independent Samples.
Inference for a Single Population Proportion (p)
What are their purposes? What kinds?
Presentation transcript:

Comparing Two Population Means

Two kinds of studies or experiments There are two general research strategies that can be used to compare the two populations of interest: The two samples could be drawn from two independent populations (e.g. women and men, or patients taking drug A vs. those taking drug B) between subjects or independent samples design The two sets of data could come from related populations (e.g. “before treatment” and “after treatment”, patient taking drug A and the SAME patient taking drug B at a later date, or subjects in different treatment groups are meaningfully matched according to some criteria) within subjects or dependent samples/matched pairs design We will focus on the independent samples case first.

Comparing Two Population Means: Using Independent Samples OBSERVATIONAL STUDY or EXPERIMENT? Observational Comparative Study A research study in which two or more groups are compared with respect to some measurement or response. The groups, determined by their natural characteristics, are merely “observed.”

Observational Comparative Study Population 1 m1 & s1 Population 2 m2 & s2 m1 vs. m2 To make inferences use: Hypothesis test, CI for difference in means, effect size (d) Sample size = n1 Calculate: Sample size = n2

Example 1: Normal Human Body Temperature (females vs. males) Is the normal human body temperature the same for females and males? To answer this question two samples of size 65 are independently drawn from these two populations. Let’s assume that our research hypothesis is that, for what ever reason, females have a higher mean body temperature than males.

Comparing Two Population Means: Using Independent Samples Comparative Experiment A study in which two or more groups (see later) are randomly assigned to a “treatment” to see how the treatment affects some response. If each experimental unit has the same chance of receiving any treatment, then the experiment is called a completely randomized design.

Comparative Experiment Population Randomly assign Treatment 1 m1 & s1 Treatment 2 m2 & s2 m1 vs. m2 To make inferences use: Hypothesis test, CI for difference in means, effect size (d) Sample size = n1 Calculate: Sample size = n2

All possible questions in statistical notation In general, we can always compare two means by seeing how their difference (m1 – m2) compares to 0:

Hypotheses For testing equality Ho: m1 = m2 or (m1 – m2) = 0 Note: If we wanted to establish that one mean was say e.g. at least 10 units larger than the other we could replace 0 in these statements by 10. In general to establish a difference of at least D units then we replace 0 by D. For testing equality Ho: m1 = m2 or (m1 – m2) = 0 The possible alternatives are: HA: m1 > m2 or (m1 – m2) > 0 (upper-tail) HA: m1 < m2 or (m1 – m2) < 0 (lower-tail) HA: 1  2 or (m1 – m2)  0 (two-tailed)

Example 1: Normal Human Body Temperature (females vs. males) Recall our research hypothesis is the females have a higher mean body temperature than males, therefore we have… mF = mean body temperature for females mM= mean body temperature for males Ho: mF = mM or equivalently (mF – mM) = 0 HA: mF > mM or equivalently (mF – mM) > 0

Test Statistic The basic form of the two-sample t test statistic is... which assuming the following assumptions are satisfied has an approximate t-distribution (df (see 3 below)). Both populations are approximately normally distributed. This assumption can be relaxed when both sample sizes (n1 and n2) are “large”. Random, independent samples were drawn from the two populations of interest. The df depends on how the standard error of the difference is estimated.

Test Statistic The standard error of the difference in sample means is calculated to different ways depending on whether or not we assume the population variances are equal. i.e., Can we assume ? Yes, use pooled t-test No, use Welch’s t-test

Assuming equal population variances (pooled t-test) Estimate the standard error of the difference using the common pooled variance : Estimate of the common variance (s2) where Then the sampling distribution is a t-distribution with n1+n2-2 degrees of freedom (df). Rule O’ Thumb: Assume variances are equal only if neither sample standard deviation is more than twice that of the other sample standard deviation.

If variances of the measurements of the two groups are not equal (Welch’s t-test)... Estimate the standard error of the difference as: Then the sampling distribution is an approximate t distribution with a complicated formula for d.f. Always round down!

Quantifying the Size of the Difference in Population Means (m1 – m2) To quantify the size of the effect, i.e. the difference in the population means we use… Confidence Interval for (m1 – m2) (estimate) + (table value) SE(estimate)  basic form Effect Size (d)

Example 1: Normal Human Body Temperature (females vs. males) STEP 1) State Hypotheses mF = mean body temperature for females mM= mean body temperature for males Ho: mF = mM or equivalently (mF – mM) = 0 HA: mF > mM or equivalently (mF – mM) > 0

Example 1: Normal Human Body Temperature (females vs. males) STEP 2) Determine Test Criteria a) Choose a=.05 as a Type I Error is of little consequence. b) Use two-sample t-test, either pooled t-test or Welch’s t-test. As to which form to use we need to examine our data in terms of the equality of population variances. If uncertain use Welch’s!

Example 1: Normal Human Body Temperature (females vs. males) STEP 3) Collect Data and Compute Test Statistic Take independent samples from the two populations and examine the resulting data.

Example 1: Normal Human Body Temperature (females vs. males) Oneway Analysis > Means/Anova/Pooled t Populations appear normally distributed and our sample sizes are “large” (> 30). Few mild outliers for in sample for females Sample mean for females appears to slightly larger than that for males Variation appears to be similar for both samples. The sample standard deviation for females (sF = .74) is larger than that for males (sM = .70) although it is not twice as large, thus we assume pop. variances are equal.

Example 1: Normal Human Body Temperature (females vs. males) STEP 3) Collect Data and Compute Test Statistic Because it seems reasonable to assume the population variances are approximately equal we will use a pooled t-test.

Pooled t-test calculations (FYI) For the body temperature example Assuming equal variances, the pooled estimate of common variance is So the standard error of the difference in sample means is

Pooled t-test calculations (FYI) For the body temperature example Computing test statistic gives t = 2.28 P-value = .0121

Example 1: Normal Human Body Temperature (females vs. males) STEP 4) Compute p-value The p-value = .0121, which indicates we have a 1.21% chance of observing a difference in sample means this large by chance variation alone if in fact the population means were equal. STEP 5) Make Decision and Interpret Because p-value < .05 we reject Ho and conclude that the mean normal body temperature for females is larger than that for males.

Example 1: Normal Human Body Temperature (females vs. males) STEP 6) Quantify Significant Findings Construct a 95% CI for (mF – mM) (98.39 – 98.10) + (1.98)(.127) = (.039o , .541o) We estimate the normal mean body temperature for women is between .039o F to .541o F larger than the normal mean body temperature for men.

Example 1: Normal Human Body Temperature (females vs. males) STEP 6) Quantify Significant Findings Effect Size (d) Thus effect size is moderate at best with % overlap of the two body temperature distributions being around 72.6%. Furthermore, the absolute difference represented by the lower confidence limit (LCL = .039 deg F) hardly seems of any physiological importance.

Power and Sample Size Issues (equal variance case) Power is a function of: The sample sizes (n1 and n2), as the sample sizes increase so does power. The smaller the common variance (s2) the larger the power. The larger the difference (D) to detect the greater the power. As always, as a increases the power increases.

Power and Sample Size Issues Common standard deviation to both populations/groups D = difference in population means which makes alternative true Common sample size (n1 = n2 = n) Probability of rejecting Ho when difference in means is D.

Power and Sample Size Issues Error Std. Dev = common standard deviation (s) = .721 for body temperature example. Difference in Means = D = |m1 – m2| = .29 which is the difference in the samples for body temperature example. Alpha = P(Type I Error) = .05 For body temperature example we have a power of 1 – b = .6240.

Power and Sample Size Issues Error Std. Dev = common standard deviation (s) = .721 for body temperature example. Difference in Means = D = |m1 – m2| = .29 which is the difference in the samples for body temperature example. Alpha = P(Type I Error) = .05

Example 3: Gestational age of babies born to women with preeclampsia We wish to compare the mean gestational age (in weeks) of babies born to women with preeclampsia during pregnancy vs. those who had normal pregnancies. Data: Preeclampsia: 38, 32, 42, 30, 38,35, 32, 38, 39, 29, 29, 32 Normal: 40, 41, 38, 40, 40, 39, 39, 41, 41, 40, 40, 40

Example 3: Gestational age of babies born to women with preeclampsia The sample standard deviations control the slope of the reference lines, the line for preeclamptics is over 4 times steeper. The SD for the preeclamptic group is over 4 times larger!!

Example 3: Gestational age of babies born to women with preeclampsia Formally testing equality of variances: Test Statistic:

Example 3: Gestational age of babies born to women with preeclampsia i.e. the sample variance for the preeclamptic group is 23.89 times larger than the sample variance for the normal pregnancy group. The p-value comes from an a F-dist. with numerator df = 11, denominator df = 11. We have strong evidence against equality of the variance of the response for these two populations of women (p < .0001) therefore we should not use a pooled t-Test!

Let’s say our observed value for F was F0 = 23.89 F-distribution When H0 is true, the F-ratio ~ F (df1,df2) For example, consider the F-distribution with 11 and 11 df 10 20 30 40 0.0 0.2 0.4 0.6 0.8 F-distribution Then the P-value < .0001 Let’s say our observed value for F was F0 = 23.89

Example 3: Gestational age of babies born to women with preeclampsia Oneway Analysis > t Test (i.e. non-pooled t–Test) We strong evidence that the mean gestational age of babies born to preeclamptic mothers is lower than that for babies born to women with normal pregnancies (p = .0013). Furthermore, we estimate that the mean gestational age for babies born to preeclamptic mothers is between 2.59 and 8.24 weeks less than the mean gestational age for babies born to mothers with normal pregnancies.

Summary If in doubt, use non-pooled t-Test. Be sure to quantify significant differences with a CI for (m1 – m2) or other measure of effect size. Assess normality of both samples. When sample sizes are small, non-normality presents a problem and we should use test procedures that do not require normality (i.e. nonparametric tests, we’ll see these later).