Chapter 19: Two-Sample Problems STAT 1450. Connecting Chapter 18 to our Current Knowledge of Statistics ▸ Remember that these formulas are only valid.

Slides:



Advertisements
Similar presentations
Chapter 18: Inference about One Population Mean STAT 1450.
Advertisements

Lecture 6 Outline – Thur. Jan. 29
Section 9.3 Inferences About Two Means (Independent)
Introduction Comparing Two Means
5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Variance Chapter 16.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Significance Testing Chapter 13 Victor Katch Kinesiology.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Significance Tests Chapter 13.
Chapter 9: Inferences for Two –Samples
Chapter 10 Two-Sample Tests
Copyright ©2011 Brooks/Cole, Cengage Learning Analysis of Variance Chapter 16 1.
PSY 307 – Statistics for the Behavioral Sciences
Chapter 19: Two-Sample Problems
Testing the Difference Between Means (Small Independent Samples)
Chapter 11: Inference for Distributions
PSY 307 – Statistics for the Behavioral Sciences
CHAPTER 19: Two-Sample Problems
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests Basic Business Statistics 10 th Edition.
PSY 307 – Statistics for the Behavioral Sciences
1 Chapter 9 Inferences from Two Samples In this chapter we will deal with two samples from two populations. The general goal is to compare the parameters.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Lesson Comparing Two Means.
Ch 11 – Inference for Distributions YMS Inference for the Mean of a Population.
Comparing 2 population parameters Chapter 13. Introduction: Two Sample problems  Ex: How do small businesses that fail differ from those that succeed?
AP STATISTICS LESSON 11 – 2 (DAY 1) Comparing Two Means.
More About Significance Tests
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
Comparing Two Population Means
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 9-2 Inferences About Two Proportions.
Chapter 10 Comparing Two Means Target Goal: I can use two-sample t procedures to compare two means. 10.2a h.w: pg. 626: 29 – 32, pg. 652: 35, 37, 57.
Week 111 Power of the t-test - Example In a metropolitan area, the concentration of cadmium (Cd) in leaf lettuce was measured in 7 representative gardens.
Chapter 11 Inference for Distributions AP Statistics 11.1 – Inference for the Mean of a Population.
CHAPTER 18: Inference about a Population Mean
1 Happiness comes not from material wealth but less desire.
For 95 out of 100 (large) samples, the interval will contain the true population mean. But we don’t know  ?!
Chapter 10 Inferences from Two Samples
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.2.
AP Exam Prep: Essential Notes. Chapter 11: Inference for Distributions 11.1Inference for Means of a Population 11.2Comparing Two Means.
Chap 9-1 Two-Sample Tests. Chap 9-2 Two Sample Tests Population Means, Independent Samples Means, Related Samples Population Variances Group 1 vs. independent.
7. Comparing Two Groups Goal: Use CI and/or significance test to compare means (quantitative variable) proportions (categorical variable) Group 1 Group.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Two-Sample Tests Statistics for Managers Using Microsoft.
BPS - 3rd Ed. Chapter 161 Inference about a Population Mean.
Essential Statistics Chapter 161 Inference about a Population Mean.
Lesson Comparing Two Means. Knowledge Objectives Describe the three conditions necessary for doing inference involving two population means. Clarify.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Estimating with Confidence Section 11.1 Estimating a Population Mean.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
MATB344 Applied Statistics I. Experimental Designs for Small Samples II. Statistical Tests of Significance III. Small Sample Test Statistics Chapter 10.
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 10-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
1 Pertemuan 09 & 10 Pengujian Hipotesis Mata kuliah : A Statistik Ekonomi Tahun: 2010.
Comparing Two Means Ch. 13. Two-Sample t Interval for a Difference Between Two Means.
Essential Statistics Chapter 171 Two-Sample Problems.
+ Unit 5: Estimating with Confidence Section 8.3 Estimating a Population Mean.
Chapter 7 Inference Concerning Populations (Numeric Responses)
Two Sample Problems  Compare the responses of two treatments or compare the characteristics of 2 populations  Separate samples from each population.
+ Chapter 8 Estimating with Confidence 8.1Confidence Intervals: The Basics 8.2Estimating a Population Proportion 8.3Estimating a Population Mean.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.Copyright © 2010 Pearson Education Section 9-3 Inferences About Two Means:
CHAPTER 19: Two-Sample Problems ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
AP Statistics Chapter 11 Section 2. TestConfidence IntervalFormulasAssumptions 1-sample z-test mean SRS Normal pop. Or large n (n>40) Know 1-sample t-test.
16/23/2016Inference about µ1 Chapter 17 Inference about a Population Mean.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Chapter 8 Hypothesis Testing with Two Samples.
CHAPTER 21: Comparing Two Means
Hypothesis tests for the difference between two means: Independent samples Section 11.1.
Lesson Comparing Two Means.
Basic Practice of Statistics - 3rd Edition Two-Sample Problems
Essential Statistics Two-Sample Problems - Two-sample t procedures -
Presentation transcript:

Chapter 19: Two-Sample Problems STAT 1450

Connecting Chapter 18 to our Current Knowledge of Statistics ▸ Remember that these formulas are only valid when appropriate simple conditions apply! 19.0 Two-Sample Problems Population Parameter Point Estimate Confidence Interval Test Statistic μ (σ known) μ (σ unknown)s

Connecting Chapter 19 to our Current Knowledge of Statistics ▸ Matched pairs were covered at the end of Chapter 18. A common situation requiring matched pairs is when before-and-after measurements are taken on individual subjects. ▸ Example: Prices for a random sample of tickets to a 2008 Katy Perry concert were compared with the ticket prices (for the same seats) to her 2013 concert..  The data could be consolidated into 1 column of differences in ticket prices.  A test of significance, or, a confidence interval would then occur for “1 sample of data.” 19.0 Two-Sample Problems

The Two-Sample Problems ▸ Two-sample problems require us to compare:  the response to two treatments - or -  the characteristics of two populations. ▸ We have a separate sample from each treatment or population The Two-Sample Problem

Two-Sample Problems ▸ The end of Chapter 18 described inference procedures for the mean difference in two measurements on one group of subjects (e.g., pulse rates for 12 students before-and-after listening to music). ▸ Given our answer from above, and the likelihood that each sample has different sample sizes, variances, etc… Chapter 19 focuses on the difference in means for 2 different groups. Population Parameter Point Estimate Confidence Interval Test Statistic 19.1 The Two-Sample Problem

Sampling Distribution of Two Sample Means 19.2 Comparing Two Population Means

Sampling Distribution of Two Sample Means ▸ The following table stems from the above comment on standard error and statistical theory Comparing Two Population Means VariableParameterPoint EstimatePopulation Standard Deviation Standard Error x1x1 11 11 x2x2 22 22 Diff = x 1 - x 2  1 -  2

Example: SSHA Scores ▸ The Survey of Study Habits and Attitudes (SSHA) is a psychological test designed to measure various academic behaviors (motivation, study habits, attitudes, etc…) of college students. Scores on the SSHA range from 0 to 200. The data for random samples 17 women (**the outlier from the original data set was removed**) and 20 men yielded the following summary statistics. ▸ Is there a difference in SSHA performance based upon gender? 19.2 Comparing Two Population Means

Example: SSHA Scores ▸ Summary statistics for the two groups are below:  There is a difference in these two groups. The women’s average was 17 points > than the men’s average. Group Sample Mean Sample Standard Deviation Sample Size Women** Men Comparing Two Population Means

Example: SSHA Scores ▸ Summary statistics for the two groups are below:  There is a difference in these two groups. The women’s average was 17 points > than the men’s average.  Yet, the standard deviations are larger than this sample difference, and the sample sizes are about the same. Group Sample Mean Sample Standard Deviation Sample Size Women** Men Comparing Two Population Means

Example: SSHA Scores Group Sample Mean Sample Standard Deviation Sample Size Women** Men Comparing Two Population Means

Example: SSHA Scores Group Sample Mean Sample Standard Deviation Sample Size Women** Men Comparing Two Population Means

The Two-sample t Procedures: Derived ▸ Now that we have a point estimate and a formula for the standard error, we can determine the confidence interval for the difference in two population means. ChapterParameter of InterestPoint Estimate Standard Error Confidence Interval 18  (σ unknown; 1-sample) 19 μ 1 - μ 2 (σ 1, σ 2 unknown; 2-samples) pt. estimate ± t*(standard error) 19.3 Two-Sample t Procedures

The Two-sample t Procedures: Derived ▸ Now that we have a point estimate and a formula for the standard error, we can determine the confidence interval for the difference in two population means. ChapterParameter of InterestPoint Estimate Standard Error Confidence Interval 18  (σ unknown; 1-sample) 19 μ 1 - μ 2 (σ 1, σ 2 unknown; 2-samples) 19.3 Two-Sample t Procedures

The Two-sample t Procedures: Derived 19.3 Two-Sample t Procedures ChapterParameter of Interest Point Estimate Standard Error Test Statistic 18 μ (σ unknown; 1-sample) 19  1 - μ 2 (σ 1, σ 2 unknown; 2-samples) pt. estimate –  0 standard error Note: H 0 for our purposes will be that  1 =  2 ; which is equivalent to there being a mean difference of ‘0.’

The Two-sample t Procedures: Derived 19.3 Two-Sample t Procedures ChapterParameter of Interest Point Estimate Standard Error Test Statistic 18 μ (σ unknown; 1-sample) 19  1 - μ 2 (σ 1, σ 2 unknown; 2-samples) Note: H 0 for our purposes will be that  1 =  2 ; which is equivalent to their being a mean difference of ‘0.’

The Two-sample t Procedures ▸ Now we can complete the table from earlier: t* is the critical value for confidence level C for the t distribution with df = smaller of (n 1 -1) and (n 2 -1). Find P-values from the t distribution with df = smaller of (n 1 -1) and (n 2 -1). Population Parameter Point EstimateConfidence IntervalTest Statistic 19.3 Two-Sample t Procedures

The Two-sample t Procedures ▸ Now we can complete the table from earlier: t* is the critical value for confidence level C for the t distribution with df = smaller of (n 1 -1) and (n 2 -1). Find P-values from the t distribution with df = smaller of (n 1 -1) and (n 2 -1). Population Parameter Point EstimateConfidence IntervalTest Statistic 19.3 Two-Sample t Procedures

The Two-sample t Procedures ▸ Now we can complete the table from earlier: t* is the critical value for confidence level C for the t distribution with df = smaller of (n 1 -1) and (n 2 -1). Find P-values from the t distribution with df = smaller of (n 1 -1) and (n 2 -1). Population Parameter Point EstimateConfidence IntervalTest Statistic 19.3 Two-Sample t Procedures

The Two-sample t Procedures: Confidence Intervals 19.3 Two-Sample t Procedures

The Two-sample t Procedures: Significance Tests 19.3 Two-Sample t Procedures

Conditions for Inference Comparing Two- Sample Means and Robustness of t Procedures ▸ The general structure of our necessary conditions is an extension of the one-sample cases.  Simple Random Samples:  Do we have 2 simple random samples?  Population : Sample Ratio:  The samples must be independent and from two large populations of interest Two-Sample Problems

Conditions for Inference Comparing Two- Sample Means and Robustness of t Procedures  Large enough sample: Both populations will be assumed to be from a Normal distribution and  when the sum of the sample sizes is less than 15, t procedures can be used if the data close to Normal (roughly symmetric, single peak, no outliers)? If there is clear skewness or outliers then, do not use t.  when the sum of the sample sizes is between 15 and 40, t procedures can be used except in the presences of outliers or strong skewness.  when the sum of the sample sizes is at least 40, the t procedures can be used even for clearly skewed distributions Two-Sample Problems

Conditions for Inference Comparing Two- Sample Means and Robustness of t Procedures ▸ Note: In practice it is enough that the two distributions have similar shape with no strong outliers. The two-sample t procedures are even more robust against non-Normality than the one-sample procedures Two-Sample Problems

Example: SSHA Scores ▸ The summary statistics for the SSHA scores for random samples of men and women are below. There was neither significant skewness, nor, strong outliers, in either data set. Use this information to construct a 90% confidence interval for the mean difference Two-Sample t Procedures Group Sample Mean Sample Standard Deviation Sample Size Women Men

Example: 90% CI for SSHA Scores 1. Components Do we have two simple random samples? Yes. It was stated. Large enough population: sample ratio? Yes. N W > 20*17 = 340 N M > 20*20 = 400 (Independence) Large enough sample? Yes. n W + n M =37 < 40 but outlier has been removed. No skewness. Steps for Success- Constructing Confidence Intervals for     . 1.Confirm that the 3 key conditions are satisfied (SRS?, N:n?, t-distribution?) One-Sample t Confidence Intervals

Example: 90% CI for SSHA Scores 18.3 One-Sample t Confidence Intervals

Example: 90% CI for SSHA Scores 18.3 One-Sample t Confidence Intervals

Example: 90% CI for SSHA Scores 18.3 One-Sample t Confidence Intervals

Example: 90% CI for SSHA Scores 18.3 One-Sample t Confidence Intervals

Example: 90% CI for SSHA Scores 18.3 One-Sample t Confidence Intervals

Example: 90% CI for SSHA Scores 18.3 One-Sample t Confidence Intervals

Example: 90% CI for SSHA Scores 18.3 One-Sample t Confidence Intervals

Example: 90% CI for SSHA Scores 18.3 One-Sample t Confidence Intervals

Example: 90% CI for SSHA Scores 18.3 One-Sample t Confidence Intervals

Example: 90% CI for SSHA Scores 18.3 One-Sample t Confidence Intervals

Example: SSHA Scores ▸ Let’s continue with this example by now conducting a test of significance for the mean difference in SSHA by gender at  = Two-Sample t Procedures Group Sample Mean Sample Standard Deviation Sample Size Women Men

Example: SSHA Scores 19.3 Two-Sample t Procedures

Example: SSHA Scores 19.3 Two-Sample t Procedures

Example: SSHA Scores 19.3 Two-Sample t Procedures

Example: SSHA Scores 19.3 Two-Sample t Procedures

Example: SSHA Scores 19.3 Two-Sample t Procedures

Example: SSHA Scores Plan: f) Sketch the region(s) of “extremely unlikely” test statistics Two-Sample t Procedures

Example: SSHA Scores Solve: a)Check the conditions for the test you plan to use.  Two Simple Random Samples?  Large enough population: sample ratios?  Large enough samples? 19.3 Two-Sample t Procedures

Example: SSHA Scores Solve: a)Check the conditions for the test you plan to use.  Two Simple Random Samples? Yes. Stated as a random sample.  Large enough population: sample ratios? Yes. Both populations are arbitrarily large; much greater than, N W > 20*17 = 340; N M > 20*20 = 400  Large enough samples? Yes. n W + n M =37 < 40 outlier has been removed. No skewness Two-Sample t Procedures

Example: SSHA Scores Solve: b)Calculate the test statistic c)Determine (or approximate) the P-Value Two-Sample t Procedures

Example: SSHA Scores Solve: b)Calculate the test statistic c)Determine (or approximate) the P-Value Two-Sample t Procedures

Example: SSHA Scores Solve: b)Calculate the test statistic c)Determine (or approximate) the P-Value DF =  < 1.96 < 2.12 .05 < P-value <.10 P-value 19.3 Two-Sample t Procedures

Example: SSHA Scores Conclude: a) Make a decision about the null hypothesis (Reject H 0 or Fail to reject H 0 ) Two-Sample t Procedures

Example: SSHA Scores Conclude: a) Make a decision about the null hypothesis (Reject H 0 or Fail to reject H 0 ). Because the approximate P-value is smaller than 0.10, we reject the null hypothesis. b) Interpret the decision in the context of the original claim Two-Sample t Procedures

Example: SSHA Scores Conclude: a) Make a decision about the null hypothesis (Reject H 0 or Fail to reject H 0 ). Because the approximate P-value is smaller than 0.10, we reject the null hypothesis. b) Interpret the decision in the context of the original claim. There is enough evidence (at  =.10) that there is a difference in the mean SSHA score between men and women Two-Sample t Procedures

▸ JMP  Enter the quantitative data into one of the columns.  In the next column, enter an abridged description of the categorical variable associated with each row of quantitative data. (Note: Pay attention to the spelling and capitalization of the abridged descriptions.)  Analyze  Fit Y by X.  “Click-and-Drag” (the quantitative variable) into the ‘Y, Response’ box. “Click-and- Drag” (the categorical variable) into the ‘X, Factor’ box. Click on OK.  Click on the red upside-down triangle next to the title “Oneway Analysis of …”  Proceed to ‘Means and Std Dev.’  Click on the red upside-down triangle next to the title “Oneway Analysis of …”  Proceed to ‘t Test.’ 19.3 Two-Sample t Procedures

▸ JMP  Enter the quantitative data into one of the columns.  In the next column, enter an abridged description of the categorical variable associated with each row of quantitative data. ▸ (Note: Pay attention to the spelling and capitalization of the abridged descriptions.)  Analyze  Fit Y by X.  “Click-and-Drag” (the quantitative variable) into the ‘Y, Response’ box.  “Click-and-Drag” (the categorical variable) into the ‘X, Factor’ box. Click on OK.  Click on the red upside-down triangle next to the title “Oneway Analysis of …”  Proceed to ‘Means and Std Dev.’  Click on the red upside-down triangle next to the title “Oneway Analysis of …”  Proceed to ‘t Test.’ 19.3 Two-Sample t Procedures

SSHA Scores (via Technology) ▸ Use technology to compute a 98% confidence interval for the mean difference in SSHA scores between women and men. ▸ Use technology to conduct the test of significance for the mean difference in SSHA scores at  = Two-Sample t Procedures

98% confidence interval for the mean difference in SSHA scores between women & men Two-Sample t Procedures

98% confidence interval for the mean difference in SSHA scores between women & men Two-Sample t Procedures

Test of Significance for the mean difference in SSHA scores between women & men Two-Sample t Procedures

Test of Significance for the mean difference in SSHA scores between women & men Two-Sample t Procedures

Example: SSHA Scores ▸ Technology output for Two Sample Means: 19.3 Two-Sample t Procedures

Closing Caveats and Comments ▸ The two-sample t statistic has an approximate (but accurate) t distribution. The approximate distribution of the two-sample t has an elaborate degrees of freedom computation (p.480). Computers use this formula in determining degrees of freedom. ▸ We will use Option 2 (p.470). This has df= smaller of (n 1 -1) and (n 2 -1). ▸ Because of the above fact, output in JMP (or other software packages) might have different df and p-values from manual analyses Two-Sample t Procedures

Closing Caveats and Comments ▸ We will not use “pooled” two-sample procedures. This assumes that the population variance is known and is equal for both variables. Use of our “Option 1” for two-sample t procedures yields more accurate results than the “pooled t.” The only caveat is when the sample sizes are equal; then our results and the “pooled t” would be equal. ▸ Do not use two-sample t procedures for inference regarding standard deviations. The F-test is more appropriate in those cases Two-Sample t Procedures

Closing Caveats and Comments ▸ Practitioners prefer having equal sample sizes for the two groups when possible. ▸ Exercises for this chapter will all assume that the SRS is from a Normal distribution Two-Sample t Procedures

Five-Minute Summary ▸ List at least 3 concepts that had the most impact on your knowledge of two-sample problems. _________________________________________