Presentation on theme: "Inferences based on TWO samples"— Presentation transcript:
1Inferences based on TWO samples New concept: Independent versus dependent samplesComparing two population means: Independent samplingComparing two population means: Dependent sampling
2Inferences About Two Means In the previous chapter we used one sample to make inferences about a single population. Very often we are interested in comparing two populations.1) Is the average midterm grade in Stat higher than the average midterm grade in Stat ?2) Is the average grade in Quiz #1 higher than Quiz #2 in this section of Introductory Statistics?
3Inferences about Two Means Each sample is an example of testing a claim between two populations. However, there is a fundamental difference between 1) and 2).In # 2) the samples are not independent where as in # 1), they are.Why? 1. Different people in each class Same people writing different test.
4Recognizing independent versus dependent samples 1. Is the average midterm grade in Stat higher than the average midterm grade in Stat ? Independent samples2. Is the average grade in Quiz #1 higher than Quiz #2 in this section of Introductory Statistics? Dependent samples
5Definition. Independent and Dependent Samples Two samples are independent if the sample selected from one population is not related to the sample selected from the other population.If one sample is related to the other, the samples are dependent. With dependent samples we get two values for each person, sometimes called paired-samples.
6We consider first the case of two dependent (or paired) samples Calculations are very similar to those in the previous chapter for a CI or Test of Hypothesis involving one sample
7Organize work using a table Sample 1Sample 2Difference1x1y1d1=x1 - y12x2y2d2=x2 - y23x3y3d3=x3 - y3…..l.….…..nxnyndn=xn -yn
8Organize work using a table Sample 1Sample 2Difference1x1y1d1=x1 - y12x2y2d2=x2 - y23x3y3d3=x3 - y3…..l.….…..nxnyndn=xn -ynCan now use the methods of the previous chapter to find a confidence interval for the population mean of the difference d between x1 and x2.
10The (1-a)*100% confidence interval for the mean difference md is Confidence Interval for the Mean Difference (Dependent Samples: Paired Data )The (1-a)*100% confidence interval for the mean difference md is
11Test Statistic for the Mean Difference (Dependent Samples) For n<30 the appropriate test statistic for testing the mean difference between paired samples is with n-1 degrees of freedom.For n>30 then we use ‘z’
12We now turn to the more challenging case of independent samples
13Testing Claims about the Mean Difference (Independent Samples) When making claims about the mean difference between independent samples a different procedure is used than that for dependent/paired samples.Again there are different procedures for large (n>30) samples and small samples (n<30).In the small sample case, we must assume that both populations are normal and have equal variances.
14ExampleSuppose we wish to compare two brands of 9-volt batteries, Brand 1 and Brand 2. Specifically, we would like to compare the mean life for the population of batteries of Brand 1, 1, and the mean life for the population of batteries of Brand 2, 2. To obtain a meaningful comparison we shall estimate the difference of the two population means by picking samples from the two populations.
15For Brand 1 a sample of size 64 was chosen. From the data a point estimate for 1, would be From the data a point estimate for 2 would be 7.78.It would therefore be natural for us to take as a point estimate for (1-2) to be hours.
16Point Estimator (Independent Samples) The estimate is the best point estimator of (1-2).Having found a point estimate, our next goal is to determine a confidence interval for it.
17Point Estimator (Independent Samples) To construct a confidence interval for (1-2) we need to know the distribution of its point estimator.The distribution of is normal with mean (1-2) and standard deviationwhere n1 is the size of sample 1, n2 is the size of sample 2.
18Confidence Interval for Difference in two Means Confidence Interval for Difference in two Means (Large samples or known variance)
19Example: Life span of Batteries Let a = .05 so we are looking for the 95% confidence interval for the mean difference.What conclusion can you draw from the above?
20Example: Life span of Batteries Let a = .05 so we are looking for the 95% confidence interval for the mean difference.We are 95 percent certain that the difference is negative. Thus, weare 95% certain that
21Test Statistic for Two Means: Independent and large samples
22Example: Life span of Batteries Hypothesis Testing. I claim that the two brands of batteries do not have the same life span. Using a 5% level of significance, test this claim.
23Example: Life span of Batteries HypothesisSample DataTest Statistic
24Example: Life span of Batteries Critical RegionDecision The test statistic lies in the critical region, therefore we reject H0. The samples provide sufficient evidence to claim that the Batteries do indeed have different life spans.
25ExerciseShow that we would have rejected the null hypothesis even if we had used level of significance .008 (instead of .05. Thus…We conclude that the mean battery lives ARE different (p = .008)
26Overview Comparing Two Populations: Mean (Small Dependent (paired) Samples)Asumptions: Samples are random plus eith n>=30 or the population of differences is approximately normalMean (Large Independent Samples)Assumptions: Both samples are randomly chosen plus both sample sizes >= 30.
27NOTEIn the case of SMALL independent samples, one must use the t-distribution plus additional conditions must be satisfied AND one must use what is called a pooled estimate of the variance.
28You are not responsible for this material NOTEIn the case of SMALL independent samples, one must use the t-distribution plus additional conditions must be satisfied AND one must use what is called a pooled estimate of the variance.You are not responsible for this material