Chapter 10 Two-Sample Tests.

Slides:



Advertisements
Similar presentations
Business and Economics 9th Edition
Advertisements

Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 10-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Chapter 10 Two-Sample Tests
Chapter 8 Estimation: Additional Topics
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 10 Hypothesis Testing:
Chapter 10 Two-Sample Tests
Pengujian Hipotesis Beda Nilai Tengah Pertemuan 20 Matakuliah: I0134/Metode Statistika Tahun: 2007.
© 2002 Prentice-Hall, Inc.Chap 8-1 Statistics for Managers using Microsoft Excel 3 rd Edition Chapter 8 Two Sample Tests with Numerical Data.
Chap 11-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 11 Hypothesis Testing II Statistics for Business and Economics.
Chapter Goals After completing this chapter, you should be able to:
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 9-1 Introduction to Statistics Chapter 10 Estimation and Hypothesis.
Chapter Topics Comparing Two Independent Samples:
1/45 Chapter 11 Hypothesis Testing II EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008.
Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.
A Decision-Making Approach
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 10-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
© 2002 Prentice-Hall, Inc.Chap 8-1 Statistics for Managers using Microsoft Excel 3 rd Edition Kafla 8 Próf fyrir tvö úrtök (Ekki þýtt)
Chapter 11 Hypothesis Tests and Estimation for Population Variances
© 2004 Prentice-Hall, Inc.Chap 10-1 Basic Business Statistics (9 th Edition) Chapter 10 Two-Sample Tests with Numerical Data.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Two Sample Tests Statistics for Managers Using Microsoft.
Basic Business Statistics (9th Edition)
Chapter 10, sections 1 and 4 Two-sample Hypothesis Testing Test hypotheses for the difference between two independent population means ( standard deviations.
1/49 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 9 Estimation: Additional Topics.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests Basic Business Statistics 10 th Edition.
Hypothesis Testing – Two Samples
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 11-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th & 7 th Lesson Hypothesis Testing for Two Population Parameters.
Chapter 10 Two-Sample Tests and One-Way ANOVA
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 10-1 Chapter 2c Two-Sample Tests.
10-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 10 Two-Sample Tests Statistics for Managers using Microsoft Excel 6 th.
Pengujian Hipotesis Dua Populasi By. Nurvita Arumsari, Ssi, MSi.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests and One-Way ANOVA Business Statistics, A First.
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
Industrial Statistics 2
Chap 9-1 Two-Sample Tests. Chap 9-2 Two Sample Tests Population Means, Independent Samples Means, Related Samples Population Variances Group 1 vs. independent.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th Lesson Hypothesis Tests for One and Two Population Variances.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 10 Hypothesis Testing:
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Two-Sample Tests Statistics for Managers Using Microsoft.
Chapter 10 Statistical Inferences Based on Two Samples Statistics for Business (Env) 1.
Chap 10-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 10 Hypothesis Tests for.
Business Statistics: A First Course (3rd Edition)
Comparing Sample Means
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 10, Slide 1 Two-Sample Tests and One-Way ANOVA Chapter 10.
AP Statistics. Chap 13-1 Chapter 13 Estimation and Hypothesis Testing for Two Population Parameters.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 10-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
10-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 10 Two-Sample Tests Statistics for Managers using Microsoft Excel 6 th.
Hypothesis Testing – Two Means(Small, Independent Samples)
Chapter 9 Estimation: Additional Topics
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Statistics for Managers using Microsoft Excel 3rd Edition
Chapter 11 Hypothesis Testing II
Chapter 10 Two Sample Tests
Comparing Two Proportions
Estimation & Hypothesis Testing for Two Population Parameters
Chapter 11 Hypothesis Testing II
Chapter 8 Hypothesis Testing with Two Samples.
Data Mining 2016/2017 Fall MIS 331 Chapter 2 Sampliing Distribution
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Statistics for Business and Economics
Hypothesis Tests for a Population Mean in Practice
Data Mining 2016/2017 Fall MIS 331 Chapter 2 Sampliing Distribution
Chapter 11 Hypothesis Tests and Estimation for Population Variances
Chapter 8 Estimation: Additional Topics
Chapter 10 Hypothesis Tests for One and Two Population Variances
Chapter 10 Two-Sample Tests
Chapter 9 Estimation: Additional Topics
Chapter 8 Estimation: Additional Topics
Presentation transcript:

Chapter 10 Two-Sample Tests

Learning Objectives In this chapter, you learn: How to use hypothesis testing for comparing the difference between The means of two independent populations The means of two related populations The proportions of two independent populations The variances of two independent populations

Two-Sample Tests Two-Sample Tests DCOVA Population Means, Independent Samples Population Means, Related Samples Population Proportions Population Variances Examples: Group 1 vs. Group 2 Same group before vs. after treatment Proportion 1 vs. Proportion 2 Variance 1 vs. Variance 2

Difference Between Two Means DCOVA Population means, independent samples Goal: Test hypothesis or form a confidence interval for the difference between two population means, μ1 – μ2 * σ1 and σ2 unknown, assumed equal The point estimate for the difference is X1 – X2 σ1 and σ2 unknown, not assumed equal

Difference Between Two Means: Independent Samples DCOVA Different data sources Unrelated Independent Sample selected from one population has no effect on the sample selected from the other population Population means, independent samples * Use Sp to estimate unknown σ. Use a Pooled-Variance t test. σ1 and σ2 unknown, assumed equal Use S1 and S2 to estimate unknown σ1 and σ2. Use a Separate-variance t test σ1 and σ2 unknown, not assumed equal

Hypothesis Tests for Two Population Means DCOVA Two Population Means, Independent Samples Lower-tail test: H0: μ1  μ2 H1: μ1 < μ2 i.e., H0: μ1 – μ2  0 H1: μ1 – μ2 < 0 Upper-tail test: H0: μ1 ≤ μ2 H1: μ1 > μ2 i.e., H0: μ1 – μ2 ≤ 0 H1: μ1 – μ2 > 0 Two-tail test: H0: μ1 = μ2 H1: μ1 ≠ μ2 i.e., H0: μ1 – μ2 = 0 H1: μ1 – μ2 ≠ 0

Hypothesis tests for μ1 – μ2 DCOVA Two Population Means, Independent Samples Lower-tail test: H0: μ1 – μ2  0 H1: μ1 – μ2 < 0 Upper-tail test: H0: μ1 – μ2 ≤ 0 H1: μ1 – μ2 > 0 Two-tail test: H0: μ1 – μ2 = 0 H1: μ1 – μ2 ≠ 0 a a a/2 a/2 -ta ta -ta/2 ta/2 Reject H0 if tSTAT < -ta Reject H0 if tSTAT > ta Reject H0 if tSTAT < -ta/2 or tSTAT > ta/2

Hypothesis tests for µ1 - µ2 with σ1 and σ2 unknown and assumed equal DCOVA Assumptions: Samples are randomly and independently drawn Populations are normally distributed or both sample sizes are at least 30 Population variances are unknown but assumed equal Population means, independent samples * σ1 and σ2 unknown, assumed equal σ1 and σ2 unknown, not assumed equal

Hypothesis tests for µ1 - µ2 with σ1 and σ2 unknown and assumed equal (continued) DCOVA The pooled variance is: The test statistic is: Where tSTAT has d.f. = (n1 + n2 – 2) Population means, independent samples * σ1 and σ2 unknown, assumed equal σ1 and σ2 unknown, not assumed equal

Confidence interval for µ1 - µ2 with σ1 and σ2 unknown and assumed equal DCOVA Population means, independent samples The confidence interval for μ1 – μ2 is: Where tα/2 has d.f. = n1 + n2 – 2 * σ1 and σ2 unknown, assumed equal σ1 and σ2 unknown, not assumed equal

Pooled-Variance t Test Example DCOVA You are a financial analyst for a brokerage firm. Is there a difference in dividend yield between stocks listed on the NYSE & NASDAQ? You collect the following data: NYSE NASDAQ Number 21 25 Sample mean 3.27 2.53 Sample std dev 1.30 1.16 Assuming both populations are approximately normal with equal variances, is there a difference in mean yield ( = 0.05)?

Pooled-Variance t Test Example: Calculating the Test Statistic (continued) H0: μ1 - μ2 = 0 i.e. (μ1 = μ2) H1: μ1 - μ2 ≠ 0 i.e. (μ1 ≠ μ2) DCOVA The test statistic is:

Pooled-Variance t Test Example: Hypothesis Test Solution DCOVA Reject H0 Reject H0 H0: μ1 - μ2 = 0 i.e. (μ1 = μ2) H1: μ1 - μ2 ≠ 0 i.e. (μ1 ≠ μ2)  = 0.05 df = 21 + 25 - 2 = 44 Critical Values: t = ± 2.0154 Test Statistic: .025 .025 -2.0154 2.0154 t 2.040 Decision: Conclusion: Reject H0 at a = 0.05 There is evidence of a difference in means.

Pooled-Variance t Test Example: Confidence Interval for µ1 - µ2 DCOVA Since we rejected H0 can we be 95% confident that µNYSE > µNASDAQ? 95% Confidence Interval for µNYSE - µNASDAQ Since 0 is less than the entire interval, we can be 95% confident that µNYSE > µNASDAQ

Hypothesis tests for µ1 - µ2 with σ1 and σ2 unknown, not assumed equal DCOVA Assumptions: Samples are randomly and independently drawn Populations are normally distributed or both sample sizes are at least 30 Population variances are unknown and cannot be assumed to be equal Population means, independent samples σ1 and σ2 unknown, assumed equal * σ1 and σ2 unknown, not assumed equal

Hypothesis tests for µ1 - µ2 with σ1 and σ2 unknown and not assumed equal (continued) DCOVA The test statistic is: Population means, independent samples σ1 and σ2 unknown, assumed equal tSTAT has d.f. ν = * σ1 and σ2 unknown, not assumed equal

Separate-Variance t Test Example DCOVA You are a financial analyst for a brokerage firm. Is there a difference in dividend yield between stocks listed on the NYSE & NASDAQ? You collect the following data: NYSE NASDAQ Number 21 25 Sample mean 3.27 2.53 Sample std dev 1.30 1.16 Assuming both populations are approximately normal with unequal variances, is there a difference in mean yield ( = 0.05)?

Separate-Variance t Test Example: Calculating the Test Statistic (continued) H0: μ1 - μ2 = 0 i.e. (μ1 = μ2) H1: μ1 - μ2 ≠ 0 i.e. (μ1 ≠ μ2) DCOVA The test statistic is: Use degrees of freedom = 40

Separate-Variance t Test Example: Hypothesis Test Solution DCOVA Reject H0 Reject H0 H0: μ1 - μ2 = 0 i.e. (μ1 = μ2) H1: μ1 - μ2 ≠ 0 i.e. (μ1 ≠ μ2)  = 0.05 df = 40 Critical Values: t = ± 2.021 Test Statistic: .025 .025 -2.021 2.021 t 2.019 Decision: Conclusion: Fail To Reject H0 at a = 0.05 There is insufficient evidence of a difference in means.

Related Populations The Paired Difference Test DCOVA Tests Means of 2 Related Populations Paired or matched samples Repeated measures (before/after) Use difference between paired values: Eliminates Variation Among Subjects Assumptions: Both Populations Are Normally Distributed Or, if not Normal, use large samples Related samples Di = X1i - X2i

Related Populations The Paired Difference Test (continued) DCOVA The ith paired difference is Di , where Related samples Di = X1i - X2i The point estimate for the paired difference population mean μD is D : The sample standard deviation is SD n is the number of pairs in the paired sample

The Paired Difference Test: Finding tSTAT DCOVA The test statistic for μD is: Paired samples Where tSTAT has n - 1 d.f.

The Paired Difference Test: Possible Hypotheses DCOVA Paired Samples Lower-tail test: H0: μD  0 H1: μD < 0 Upper-tail test: H0: μD ≤ 0 H1: μD > 0 Two-tail test: H0: μD = 0 H1: μD ≠ 0 a a a/2 a/2 -ta ta -ta/2 ta/2 Reject H0 if tSTAT < -ta Reject H0 if tSTAT > ta Reject H0 if tSTAT < -ta/2 or tSTAT > ta/2 Where tSTAT has n - 1 d.f.

The Paired Difference Confidence Interval DCOVA The confidence interval for μD is Paired samples where

Paired Difference Test: Example DCOVA Assume you send your salespeople to a “customer service” training workshop. Has the training made a difference in the number of complaints? You collect the following data:  D = Di n = -4.2 Number of Complaints: (2) - (1) Salesperson Before (1) After (2) Difference, Di C.B. 6 4 - 2 T.F. 20 6 -14 M.H. 3 2 - 1 R.K. 0 0 0 M.O. 4 0 - 4 -21

Paired Difference Test: Solution DCOVA Has the training made a difference in the number of complaints (at the 0.01 level)? Reject Reject H0: μD = 0 H1: μD  0 /2 /2  = .01 D = - 4.2 - 4.604 4.604 - 1.66 t0.005 = ± 4.604 d.f. = n - 1 = 4 Decision: Do not reject H0 (tstat is not in the reject region) Test Statistic: Conclusion: There is not a significant change in the number of complaints.

The Paired Difference Confidence Interval -- Example DCOVA The confidence interval for μD is: Since this interval contains 0 cannot be 99% confident that μD doesn’t = 0 D = -4.2, SD = 5.67

Two Population Proportions DCOVA Goal: test a hypothesis or form a confidence interval for the difference between two population proportions, π1 – π2 Population proportions Assumptions: n1 π1  5 , n1(1- π1)  5 n2 π2  5 , n2(1- π2)  5 The point estimate for the difference is

Two Population Proportions DCOVA In the null hypothesis we assume the null hypothesis is true, so we assume π1 = π2 and pool the two sample estimates Population proportions The pooled estimate for the overall proportion is: where X1 and X2 are the number of items of interest in samples 1 and 2

Two Population Proportions (continued) DCOVA The test statistic for π1 – π2 is a Z statistic: Population proportions where

Hypothesis Tests for Two Population Proportions DCOVA Population proportions Lower-tail test: H0: π1  π2 H1: π1 < π2 i.e., H0: π1 – π2  0 H1: π1 – π2 < 0 Upper-tail test: H0: π1 ≤ π2 H1: π1 > π2 i.e., H0: π1 – π2 ≤ 0 H1: π1 – π2 > 0 Two-tail test: H0: π1 = π2 H1: π1 ≠ π2 i.e., H0: π1 – π2 = 0 H1: π1 – π2 ≠ 0

Hypothesis Tests for Two Population Proportions (continued) Population proportions DCOVA Lower-tail test: H0: π1 – π2  0 H1: π1 – π2 < 0 Upper-tail test: H0: π1 – π2 ≤ 0 H1: π1 – π2 > 0 Two-tail test: H0: π1 – π2 = 0 H1: π1 – π2 ≠ 0 a a a/2 a/2 -za za -za/2 za/2 Reject H0 if ZSTAT < -Za Reject H0 if ZSTAT > Za Reject H0 if ZSTAT < -Za/2 or ZSTAT > Za/2

Hypothesis Test Example: Two population Proportions DCOVA Is there a significant difference between the proportion of men and the proportion of women who will vote Yes on Proposition A? In a random sample, 36 of 72 men and 35 of 50 women indicated they would vote Yes Test at the .05 level of significance

Hypothesis Test Example: Two population Proportions (continued) DCOVA The hypothesis test is: H0: π1 – π2 = 0 (the two proportions are equal) H1: π1 – π2 ≠ 0 (there is a significant difference between proportions) The sample proportions are: Men: p1 = 36/72 = 0.50 Women: p2 = 35/50 = 0.70 The pooled estimate for the overall proportion is:

Hypothesis Test Example: Two population Proportions (continued) DCOVA Reject H0 Reject H0 The test statistic for π1 – π2 is: .025 .025 -1.96 1.96 -2.20 Decision: Reject H0 Conclusion: There is significant evidence of a difference in the proportion of men and women who will vote yes. Critical Values = ±1.96 For  = .05

Confidence Interval for Two Population Proportions DCOVA Population proportions The confidence interval for π1 – π2 is:

Confidence Interval forTwo Population Proportions -- Example DCOVA The 95% confidence interval for π1 – π2 is: Since this interval does not contain 0 can be 95% confident the two proportions are different.

Testing for the Ratio Of Two Population Variances DCOVA Hypotheses FSTAT * Tests for Two Population Variances H0: σ12 = σ22 H1: σ12 ≠ σ22 S12 / S22 H0: σ12 ≤ σ22 H1: σ12 > σ22 F test statistic Where: S12 = Variance of sample 1 (the larger sample variance) n1 = sample size of sample 1 S22 = Variance of sample 2 (the smaller sample variance) n2 = sample size of sample 2 n1 –1 = numerator degrees of freedom n2 – 1 = denominator degrees of freedom

The F Distribution DCOVA The F critical value is found from the F table There are two degrees of freedom required: numerator and denominator The larger sample variance is always the numerator When In the F table, numerator degrees of freedom determine the column denominator degrees of freedom determine the row df1 = n1 – 1 ; df2 = n2 – 1

Finding the Rejection Region DCOVA H0: σ12 = σ22 H1: σ12 ≠ σ22 H0: σ12 ≤ σ22 H1: σ12 > σ22 F /2 Reject H0 Do not reject H0 Fα/2 F  Fα Reject H0 Do not reject H0 Reject H0 if FSTAT > Fα/2 Reject H0 if FSTAT > Fα

F Test: An Example DCOVA You are a financial analyst for a brokerage firm. You want to compare dividend yields between stocks listed on the NYSE & NASDAQ. You collect the following data: NYSE NASDAQ Number 21 25 Mean 3.27 2.53 Std dev 1.30 1.16 Is there a difference in the variances between the NYSE & NASDAQ at the  = 0.05 level?

F Test: Example Solution DCOVA Form the hypothesis test: H0: σ21 = σ22 (there is no difference between variances) H1: σ21 ≠ σ22 (there is a difference between variances) Find the F critical value for  = 0.05: Numerator d.f. = n1 – 1 = 21 –1 =20 Denominator d.f. = n2 – 1 = 25 –1 = 24 Fα/2 = F.025, 20, 24 = 2.33

F Test: Example Solution DCOVA (continued) The test statistic is: H0: σ12 = σ22 H1: σ12 ≠ σ22 /2 = .025 F Do not reject H0 Reject H0 FSTAT = 1.256 is not in the rejection region, so we do not reject H0 F0.025=2.33 Conclusion: There is not sufficient evidence of a difference in variances at  = .05

Chapter Summary In this chapter we discussed Comparing two independent samples Performed pooled-variance t test for the difference in two means Performed separate-variance t test for difference in two means Formed confidence intervals for the difference between two means Comparing two related samples (paired samples) Performed paired t test for the mean difference Formed confidence interval for the mean difference

Chapter Summary Comparing two population proportions (continued) Comparing two population proportions Performed Z-test for two population proportions Formed confidence intervals for the difference between two population proportions Performing an F test for the ratio of two population variances

Chapter 10 On Line Topic -- Effect Size

Learning Objectives In this topic, you learn: The impact sample size can have on statistical significance How to classify the effect size of a difference When it is appropriate to consider the effect size in addition to statistical significance

The Same Difference Is Insignificant With Sample Sizes of 50 But Significant With Sample Sizes of 5000 DCOVA The Difference In Download Time For Web Page Design A & Design B n1=n2=50 n1=n2=5000 Difference = 0.96 – 0.90 = 0.06 p-value > 0.05 p-value < 0.05

One method to accomplish this is to measure the effect size. With The Growth of Big Data The Likelihood of Finding A Statistically Significant but Practically Unimportant Result Increases DCOVA Need to find a way to measure the importance of a result not just its statistical significance. One method to accomplish this is to measure the effect size. Can use a confidence interval to describe the uncertainty in the size of the result.

Should this difference be considered small, medium, or large? Can Be 95% Confident The Mean Download Time Difference Between Design A & Design B if Between 0.0527 & 0.0673 DCOVA Should this difference be considered small, medium, or large?

A Standard Way To Classify The Standardized Effect Size DCOVA A standardized effect size of 0.2 is classified as small A standardized effect size of 0.5 is classified as medium A standardized effect size of 0.8 or more is classified as large

Standardized Effect Confidence Interval Formulas For The Standardized Effect Size & An Associated Confidence Interval For The Difference In Two Population Means DCOVA Standardized Effect Confidence Interval Where: ESB = sample estimate of Bonett’s Standardized Effect Size Za/2 = upper tail critical value of the standardized normal distribution S1 = sample standard deviation from population 1 S2 = sample standard deviation from population 2 nM = mean sample size where

The Standardized Effect Size For The Web Page Download Data Is Classified As Small With At Least 95% Confidence DCOVA Point Estimate Of The Standardized Effect Size Is 0.3233 Can Be 95% Confident Standardized Effect Size Is Between 0.2841 and 0.3625

Can Also Find The Standardized Effect Size For The Difference In Two Proportions DCOVA There are multiple alternative measures available. One way to measure the effect size is to construct a confidence interval estimate of the difference between the two proportions.

Two Proportions Standardized Effect Size Example DCOVA Web designers tested a new call to action button on their web page. Every visitor to the web page was randomly shown the original button or the new button. Of the 4883 visitors who saw the original button, 421 took action. Of the 5,698 visitors who saw the new button, 787 took action.

Two Proportions Standardized Effect Size Example (continued) DCOVA Can be 95% confident the difference between the proportions (Original button – New button) is between -0.0638 and -0.0400. Thus the effect size is between 0.0400 and 0.0638.

Topic Summary In this topic we discussed: The impact sample size can have on statistical significance How to classify the effect size of a difference When it is appropriate to consider the effect size in addition to statistical significance