Week121 Robustness of the two-sample procedures The two sample t-procedures are more robust against nonnormality than one-sample t-procedures. When the.

Slides:



Advertisements
Similar presentations
BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
Advertisements

Introduction Comparing Two Means
Confidence Interval and Hypothesis Testing for:
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Significance Testing Chapter 13 Victor Katch Kinesiology.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Significance Tests Chapter 13.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Chapter 9: Inferences for Two –Samples
Copyright ©2011 Brooks/Cole, Cengage Learning Analysis of Variance Chapter 16 1.
Hypothesis Testing. Introduction Always about a population parameter Attempt to prove (or disprove) some assumption Setup: alternate hypothesis: What.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Chapter 2 Simple Comparative Experiments
Chapter 11: Inference for Distributions
Chapter 9 Hypothesis Testing.
1 (Student’s) T Distribution. 2 Z vs. T Many applications involve making conclusions about an unknown mean . Because a second unknown, , is present,
5-3 Inference on the Means of Two Populations, Variances Unknown
CHAPTER 19: Two-Sample Problems
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
AM Recitation 2/10/11.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Hypothesis testing – mean differences between populations
1/2555 สมศักดิ์ ศิวดำรงพงศ์
+ DO NOW What conditions do you need to check before constructing a confidence interval for the population proportion? (hint: there are three)
AP STATISTICS LESSON 11 – 2 (DAY 1) Comparing Two Means.
More About Significance Tests
Lecture 3: Review Review of Point and Interval Estimators
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 9: Testing a Claim Section 9.3a Tests About a Population Mean.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
Comparing Two Population Means
Chapter 9: Testing Hypotheses
One Sample Inf-1 If sample came from a normal distribution, t has a t-distribution with n-1 degrees of freedom. 1)Symmetric about 0. 2)Looks like a standard.
Week 111 Power of the t-test - Example In a metropolitan area, the concentration of cadmium (Cd) in leaf lettuce was measured in 7 representative gardens.
Chapter 11 Inference for Distributions AP Statistics 11.1 – Inference for the Mean of a Population.
CHAPTER 20: Inference About a Population Proportion ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture.
Inference for distributions: - Comparing two means IPS chapter 7.2 © 2006 W.H. Freeman and Company.
CHAPTER 18: Inference about a Population Mean
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 Happiness comes not from material wealth but less desire.
Week101 Decision Errors and Power When we perform a statistical test we hope that our decision will be correct, but sometimes it will be wrong. There are.
1 Section 9-4 Two Means: Matched Pairs In this section we deal with dependent samples. In other words, there is some relationship between the two samples.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Objectives (BPS chapter 19) Comparing two population means  Two-sample t procedures  Examples of two-sample t procedures  Using technology  Robustness.
Lecture 9 Chap 9-1 Chapter 2b Fundamentals of Hypothesis Testing: One-Sample Tests.
BPS - 3rd Ed. Chapter 161 Inference about a Population Mean.
1 Objective Compare of two population variances using two samples from each population. Hypothesis Tests and Confidence Intervals of two variances use.
Psych 230 Psychological Measurement and Statistics
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Estimating with Confidence Section 11.1 Estimating a Population Mean.
© The McGraw-Hill Companies, Inc., Chapter 10 Testing the Difference between Means, Variances, and Proportions.
BPS - 3rd Ed. Chapter 191 Comparing Two Proportions.
AP Statistics.  If our data comes from a simple random sample (SRS) and the sample size is sufficiently large, then we know that the sampling distribution.
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Difference Between Two Means.
Essential Statistics Chapter 171 Two-Sample Problems.
Inference for distributions: - Comparing two means.
+ Unit 5: Estimating with Confidence Section 8.3 Estimating a Population Mean.
+ Chapter 8 Estimating with Confidence 8.1Confidence Intervals: The Basics 8.2Estimating a Population Proportion 8.3Estimating a Population Mean.
 What is Hypothesis Testing?  Testing for the population mean  One-tailed testing  Two-tailed testing  Tests Concerning Proportions  Types of Errors.
Essential Statistics Chapter 191 Comparing Two Proportions.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Testing the Difference between Means, Variances, and Proportions
Testing the Difference between Means and Variances
Chapter 9 Hypothesis Testing.
Decision Errors and Power
CHAPTER 18: Inference about a Population Mean
Presentation transcript:

week121 Robustness of the two-sample procedures The two sample t-procedures are more robust against nonnormality than one-sample t-procedures. When the sizes of the two samples are equal and the distributions of the two samples being compared have similar shapes, probability values from the t table are quite accurate for a broad range of distribution when the sample sizes are as small as n 1 =n 2 =5. When the two population distribution have different shapes, larger samples are needed. In planning a two-sample study, choose equal sample sizes if you can. Example 7.16 on page 493 in IPS.

week122 Power of the two-sample t-test The two-sample t test is one of the most used statistical procedures. Unfortunately, because of inadequate planning, users frequently fail to find evidence for the effect they believe is true. Power calculations should be part of the planning of any statistical study. Information from a pilot study or previous research is needed. MINITAB can be used to calculate the power of the two- sample t-tests when the sample sizes are equal and the population standard deviations are equal. Commands: Stat > Power and sample size > 2 sample t

week123 Example Find the power of the two-sample t-test when the difference in means is 5, both samples are of size 20 and the common population std deviation is 6. Power and Sample Size 2-Sample t Test Testing mean 1 = mean 2 (versus not =) Calculating power for mean 1 = mean Alpha = 0.05 Sigma = 6 Sample Size Power Find the sample size (for each group) that will increase the power to Power and Sample Size 2-Sample t Test Testing mean 1 = mean 2 (versus not =) Calculating power for mean 1 = mean Alpha = 0.05 Sigma = 6 Sample Size Target Power Actual Power

week124 Comparing two proportions (two independent samples) A university financial aid office polled an SRS of undergraduate students to study their summer employment. Not all students were employed the previous summer. Here are the results for men and women: a) Is there evidence that the proportion of male students employed during the summer differs from the proportion of female students who were employed? State H 0 and H a, compute the test statistic, and give its P-value. MenWomen Employed Not employed Total

week125 The hypothesis to be tested are: H 0 : p 1 = p 2 vs H a : p 1 ≠ p 2. The test statistics is: where, Hence, z = and the P-value = 2·P(Z > |-5.07|) = 0 and so we reject H 0 at any significant level.

week126 (b) Give a 99% confidence interval for the difference between the proportions of male and female students who were employed during the summer. A C level CI for p 1 – p 2 is given by, substituting the values we get ( – )  (2.576)( ) = to MINITAB commands: Stat > Basic statistics > 2 Proportions

week127 MINITAB output for the above problem is given below: Test and Confidence Interval for Two Proportions Sample X N Sample p Estimate for p(1) - p(2): % CI for p(1)- p(2):( , ) Test for p(1) - p(2) = 0 (vs not = 0): Z = P-Value =

week128 Question from Final exam Dec 2000 A SRS of 600 voters in a western Canadian province were cross-classified by income level and political party of choice in an upcoming election, with the details shown below. Which of the following statements are true? Income levelAllianceLiberalNDPTotal Low Moderate High Very High Total

week129 I)The proportion of high income earners who are liberal supporters has an estimated variance equal to: II)The proportion of voters who are either NDP supporters or low income earners is estimated to be 155/600. III)A 95% CI for the difference between the proportion of high income earners who are Liberal supporters and the proportion of low income earners who are liberal supporters (to the number of decimal places displayed) is : 0.11  1.96  0.095

week1210 The F-test for equality of spread Suppose that we have 2 independent SRSs from two normal populations, a sample of size n 1 from N(  1,  1 ) and a sample of size n 2 from N(  2,  2 ). The population means and standard deviations are all unknown. The hypothesis of equal spread, H 0 : σ 1 = σ 2 vs H a : σ 1 ≠ σ 2, is tested by the F statistic. The F statistic is given by where and are the sample variances. This has F distribution with n and n degrees of freedom when H 0 is true. We usually take the test statistic to be the ratio between the largest variance and the smallest variance.

week1211 The F distributions are a family of distributions with two parameters, the degrees of freedom of the sample variances in the numerator and denominator of the F statistic. The numerator degrees of freedom are always mentioned first. Interchanging the degrees of freedom changes the distribution so the order is important. A brief notation will be F(j,k) for the F distribution with j degrees of freedom in the numerator and k in the denominator. The F distributions are not symmetric but are right-skewed. Because sample variances cannot be negative, the F statistic takes only positive values and the F distribution has no probability below 0. The peak of the F density curve is near 1; values far from 1 in either direction provide evidence against the H 0. To get the P-value we compare the value of the F statistic with the critical values of table E, then double the corresponding probability p. Robustness of the F-test F-test and other procedures for inference about variances are so lacking of robustness as to be of little use in practice.

week1212 Example Random samples of 8 and 10 observations were selected from populations 1 and 2 respectively. The corresponding sample variances were 7.4 and Do the data provide sufficient evidence to indicate a difference between the variances of the two populations. Answer: The test statistic is F = 12.7 / 7.4 =1.72 with df 9, 7. The P-value > 2·0.1 and so we can not reject H 0.

week1213 Questions from final Exam Dec 98 Students in a statistics course participated in a simple experiment. The students took their own pulse rate for 1 minute. They then were asked to flip a coin. If their coin came up heads, they were to run in the place for 1 minute. Then everyone took their own pulse rate again. Some other data were also collected. Description of the data set is given below. VariableDescription 1. PULSE1First pulse rate 2. PULSE2second pulse rate 3. RAN1 = ran in place, 2 = did not ran in place 4. SMOKES 1 = smokes regularly, 2 = does not smoke regularly 5. SEX1 = male, 2 = female 6. Activity Usual level of physical activity: 1 = slight 2 = moderate, 3 = a lot

week1214 Variable Ran N Mean Median TrMean StDev Pulse Variable Ran SE Mean Minimum Maximum Q1 Q3 Pulse (1)Suppose that we wished to test the null hypothesis that the mean PULSE2 rate for those who ran in place is equal to the mean PULSE2 rate for those who did not ran in place. Which of the following are true? I) It would be reasonable to use a test based on the t(90 d.f.) distribution. II) High power for the test implies that there is a high probability of concluding that the mean PULSE2 rate of those who ran in place is not the same as the mean PULSE2 rates of those who did not run in place, when in fact they are not the same. III) A type I error would be made if we conclude that the mean PULSE2 rate of those who ran in place was different than those who did not, then in fact they are the same.

week1215 Some more outputs: (2) Which of the following are true? I) PULSE1 rates appear to be normally distributed. II) PULSE1 rates appear to be left (negatively)-skewed. III) PULSE2 rates appear to be left (negatively)-skewed. IV) PULSE2 rates appear to be right (positively)-skewed.

week1216 Some more outputs: Two sample T for Pulse1 Smokes N Mean StDev SE Mean % CI for mu (1) - mu (2): ( -2.6, 8.8) T-Test mu (1) = mu (2) (vs not =): T = 1.08 P = 0.28 DF=39 Now we analyze only the 35 who ran in place: T Confidence Intervals Variable N Mean StDev SE Mean 90.0 % CI Pulse1-Pulse (-23.22, ) Two Sample T-Test and Confidence Interval Two sample T for Pulse1 vs Pulse2 N Mean StDev SE Mean Pulse Pulse % CI for mu Pulse1 - mu Pulse2: ( -25.2, -12.7) T-Test mu Pulse1 = mu Pulse2 (vs not =): T = P = DF = 55.

week1217 (3) Which of the following are true? I) A test of the null hypothesis: “Mean PULSE1 rate for nonsmokers = Mean PULSE1 rate for smokers” vs the alternative “Mean PULSE1 rate for nonsmokers is less than Mean PULSE1 rate for smokers” has p-value II) The 90% CI for the average change in pulse rate, for those who ran is ( -25.2, -12.7). III) If we calculated the 50% CI for the average change in pulse rate, for those who ran, it would include the value –12.5. (4) Continuing with the study above, which of the following statements are true? I) A test for a difference in std deviations of pulses (PULSE1) between the smokers and the nonsmokers would be based on F(27, 63) II) The sign test could be applied to analyze the difference posed in I) above, if we have doubts about the normality of PULSE1 measurements. III) The relation between activity level and gender should be analyzed by regression and correlation methods.