Towson University - J. Jung

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 9 Inferences Based on Two Samples.
Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
1 Chapter 12 Inference About One Population Introduction In this chapter we utilize the approach developed before to describe a population.In.
Chapter 9 Chapter 10 Chapter 11 Chapter 12
1 Inference about Comparing Two Populations Chapter 13.
Announcements Homework 2 will be posted on the web by tonight.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 13 Inference About Comparing Two Populations.
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
Lecture Inference for a population mean when the stdev is unknown; one more example 12.3 Testing a population variance 12.4 Testing a population.
1 Inference about Comparing Two Populations Chapter 13.
1 Chapter 12 Inference About a Population 2 Introduction In this chapter we utilize the approach developed before to describe a population.In this chapter.
Inference about a Mean Part II
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Inferences About Process Quality
1 Inference About a Population Variance Sometimes we are interested in making inference about the variability of processes. Examples: –Investors use variance.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
5-3 Inference on the Means of Two Populations, Variances Unknown
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Chapter 12 Section 1 Inference for Linear Regression.
Economics 173 Business Statistics Lecture 8 Fall, 2001 Professor J. Petry
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Chapter 6: Sampling Distributions
AM Recitation 2/10/11.
Chapter 9.3 (323) A Test of the Mean of a Normal Distribution: Population Variance Unknown Given a random sample of n observations from a normal population.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Economics 173 Business Statistics Lecture 6 Fall, 2001 Professor J. Petry
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Economics 173 Business Statistics Lecture 7 Fall, 2001 Professor J. Petry
Chapter 13 Inference About Comparing Two Populations.
1 Inference about Two Populations Chapter Introduction Variety of techniques are presented to compare two populations. We are interested in:
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 12 Inference About A Population.
Chapter 13 Inference About Comparing Two Populations.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Large sample CI for μ Small sample CI for μ Large sample CI for p
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
1 Inference about Two Populations Chapter Introduction Variety of techniques are presented whose objective is to compare two populations. We.
Section A Confidence Interval for the Difference of Two Proportions Objectives: 1.To find the mean and standard error of the sampling distribution.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
1 Economics 173 Business Statistics Lectures 5 & 6 Summer, 2001 Professor J. Petry.
Chapter 9 Inferences Based on Two Samples: Confidence Intervals and Tests of Hypothesis.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
Chapter 12 Inference About One Population. We shall develop techniques to estimate and test three population parameters.  Population mean   Population.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Economics 173 Business Statistics
Chapter 6: Sampling Distributions
One-Sample Inference for Proportions
Chapter 4. Inference about Process Quality
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Chapter 9: Inferences Involving One Population
Inference about Two Populations
Chapter 6: Sampling Distributions
Inference about Comparing Two Populations
Math 4030 – 10a Tests for Population Mean(s)
Econ Roadmap Focus Midterm 1 Focus Midterm 2 Focus Final Intro
Chapter 9 Hypothesis Testing.
Confidence Interval Estimation and Statistical Inference
Econ 3790: Business and Economics Statistics
Inference About Comparing Two Populations
Introduction to Estimation
Chapter 13: Inferences about Comparing Two Populations Lecture 7a
Chapter 13: Inferences about Comparing Two Populations
Interval Estimation Download this presentation.
Presentation transcript:

Towson University - J. Jung Chapters 1. Introduction 2. Graphs 3. Descriptive statistics 4. Basic probability 5. Discrete distributions 6. Continuous distributions 7. Central limit theorem 8. Estimation 9. Hypothesis testing 10. Two-sample tests 13. Linear regression 14. Multivariate regression Chapter 10 Two Sample Tests 11/8/2018 Towson University - J. Jung

Comparing Two Populations Previously we looked at techniques to estimate and test parameters for one population: Population Mean: µ Population Proportion: 𝜋 We will still consider these parameters when we are looking at two populations, however our interest will now be:  The difference between two means.  The ratio of two variances.  The difference between two proportions.

Difference between Two Means In order to test and estimate the difference between two population means, we draw random samples from each of two populations. Initially, we will consider independent samples, that is, samples that are completely unrelated to one another. (Likewise, we consider for Population 2) Sample: n1 Population 1 Parameters: Statistics:

Difference between Two Means Because we are comparing two population means, we use the statistic: which is an unbiased and consistent estimator of: µ1- µ2

Sampling Distribution of Differences of Means 𝑋 1 − 𝑋 2 is normally distributed if the original populations are normal –or– approximately normal if the populations are non-normal and the sample sizes are large (n1, n2 > 30)!! (Remember condition for CLT) The expected value of 𝑋 1 − 𝑋 2 is: 𝜇 1 − 𝜇 2 The variance of 𝑋 1 − 𝑋 2 is: 𝜎 1 2 𝑛 1 + 𝜎 2 2 𝑛 2 and the standard error is : 𝜎 1 2 𝑛 1 + 𝜎 2 2 𝑛 2

Making Inferences About μ1-μ2 Since 𝑋 1 − 𝑋 2 is normally distributed if the original populations are normal –or– approximately normal if the populations are non-normal and the sample sizes are large, then: is a N(0,1) (or approximately normal) random variable. We could use this to build the test statistic and the confidence interval estimator for 𝜇 1 − 𝜇 2 .

Making Inferences About 𝜇 1 − 𝜇 2 …except that, in practice, the z statistic is rarely used since the population variances are unknown. Instead we use a t-statistic. We consider two cases for the unknown population variances: when we believe they are equal: 𝝈 𝟏 𝟐 = 𝝈 𝟐 𝟐 when we believe they are not equal: 𝝈 𝟏 𝟐 ≠ 𝝈 𝟐 𝟐 ??

Test Statistic for 𝝁 𝟏 − 𝝁 𝟐 (equal variances: 𝝈 𝟏 𝟐 = 𝝈 𝟐 𝟐 ) Calculate – the pooled variance estimator as… …and use it here: degrees of freedom

CI Estimator for μ1-μ2 (equal variances) The confidence interval estimator for μ1-μ2 when the population variances are equal is given by: pooled variance estimator degrees of freedom

Test Statistic for 𝝁 𝟏 − 𝝁 𝟐 (unequal variances: 𝝈 𝟏 𝟐 ≠ 𝝈 𝟐 𝟐 ) The test statistic for μ1-μ2 when the population variances are unequal is given by: Likewise, the confidence interval estimator is: degrees of freedom

Which test to use? Which test statistic do we use? Equal variance or unequal variance? Whenever there is insufficient evidence that the variances are unequal, it is preferable to perform the equal variances t-test. This is so, because for any two given samples: The number of degrees of freedom for the equal variances case The number of degrees of freedom for the unequal variances case ≥ Larger numbers of degrees of freedom have the same effect as having larger sample sizes ≥

Example Millions of investors buy mutual funds choosing from thousands of possibilities. Some funds can be purchased directly from banks or other financial institutions while others must be purchased through brokers, who charge a fee for this service. This raises the question, can investors do better by buying mutual funds directly than by purchasing mutual funds through brokers.

Example To help answer this question a group of researchers randomly sampled the annual returns from mutual funds that can be acquired directly and mutual funds that are bought through brokers and recorded the net annual returns, which are the returns on investment after deducting all relevant fees. Can we conclude at the 5% significance level that directly-purchased mutual funds outperform mutual funds bought through brokers?

Example To answer the question we need to compare the population of returns from direct and the returns from broker- bought mutual funds. The data are obviously interval (we've recorded real numbers). This problem objective - data type combination tells us that the parameter to be tested is the difference between two means µ1- µ2.

Example The hypothesis to be tested is that the mean net annual return from directly-purchased mutual funds (µ1) is larger than the mean of broker-purchased funds (µ2). Hence the alternative hypothesis is H1: µ1- µ2 > 0 and H0: µ1- µ2 = 0 To decide which of the t-tests of µ1 - µ2 to apply we conduct the F-test of σ12/ σ22 . (Not covered in class, so just roll with it for now)

Example Assume F-test concluded that there is not enough evidence to infer that the population variances differ. It follows that we must apply the equal-variances t-test of µ1- µ2

Example

Example: Result The value of the test statistic is 2.29. The one-tail p-value is .0122. We observe that the p-value of the test is small (and the test statistic falls into the rejection region). As a result we conclude that there is sufficient evidence to infer that on average directly-purchased mutual funds outperform broker-purchased mutual funds

Confidence Interval Estimator Suppose we wanted to compute a 95% confidence interval estimate of the difference between mean caloric intake for consumers and non-consumers of high-fiber cereals. The unequal-variances estimator is

Confidence Interval Estimator

Confidence Interval Estimator We estimate that the return on directly purchased mutual funds is on average between .39 and 5.43 percentage points larger than broker-purchased mutual funds.