Week121 Robustness of the two-sample procedures The two sample t-procedures are more robust against nonnormality than one-sample t-procedures. When the.

week121 Robustness of the two-sample procedures The two sample t-procedures are more robust against nonnormality than one-sample t-procedures. When the sizes of the two samples are equal and the distributions of the two samples being compared have similar shapes, probability values from the t table are quite accurate for a broad range of distribution when the sample sizes are as small as n 1 =n 2 =5. When the two population distribution have different shapes, larger samples are needed. In planning a two-sample study, choose equal sample sizes if you can. Example 7.16 on page 493 in IPS.

week122 Power of the two-sample t-test The two-sample t test is one of the most used statistical procedures. Unfortunately, because of inadequate planning, users frequently fail to find evidence for the effect they believe is true. Power calculations should be part of the planning of any statistical study. Information from a pilot study or previous research is needed. MINITAB can be used to calculate the power of the two- sample t-tests when the sample sizes are equal and the population standard deviations are equal. Commands: Stat > Power and sample size > 2 sample t

week123 Example Find the power of the two-sample t-test when the difference in means is 5, both samples are of size 20 and the common population std deviation is 6. Power and Sample Size 2-Sample t Test Testing mean 1 = mean 2 (versus not =) Calculating power for mean 1 = mean 2 + 5 Alpha = 0.05 Sigma = 6 Sample Size Power 20 0.7285 Find the sample size (for each group) that will increase the power to 0.90. Power and Sample Size 2-Sample t Test Testing mean 1 = mean 2 (versus not =) Calculating power for mean 1 = mean 2 + 5 Alpha = 0.05 Sigma = 6 Sample Size Target Power Actual Power 32 0.9000 0.9068

week124 Comparing two proportions (two independent samples) A university financial aid office polled an SRS of undergraduate students to study their summer employment. Not all students were employed the previous summer. Here are the results for men and women: a) Is there evidence that the proportion of male students employed during the summer differs from the proportion of female students who were employed? State H 0 and H a, compute the test statistic, and give its P-value. MenWomen Employed718 593 Not employed 79 139 Total 797 732

week125 The hypothesis to be tested are: H 0 : p 1 = p 2 vs H a : p 1 ≠ p 2. The test statistics is: where, Hence, z = -5.07 and the P-value = 2·P(Z > |-5.07|) = 0 and so we reject H 0 at any significant level.

week126 (b) Give a 99% confidence interval for the difference between the proportions of male and female students who were employed during the summer. A C level CI for p 1 – p 2 is given by, substituting the values we get (0.9009 – 0.8101)  (2.576)(0.01795) = 0.0446 to 0.1370. MINITAB commands: Stat > Basic statistics > 2 Proportions

week127 MINITAB output for the above problem is given below: Test and Confidence Interval for Two Proportions Sample X N Sample p 1 593 732 0.810109 2 718 797 0.900878 Estimate for p(1) - p(2): -0.0907690 95% CI for p(1)- p(2):(-0.12595, -0.0555881) Test for p(1) - p(2) = 0 (vs not = 0): Z = -5.06 P-Value = 0.000.

week128 Question from Final exam Dec 2000 A SRS of 600 voters in a western Canadian province were cross-classified by income level and political party of choice in an upcoming election, with the details shown below. Which of the following statements are true? Income levelAllianceLiberalNDPTotal Low20403090 Moderate15010020270 High1106010180 Very High4015560 Total32021565600

week129 I)The proportion of high income earners who are liberal supporters has an estimated variance equal to: II)The proportion of voters who are either NDP supporters or low income earners is estimated to be 155/600. III)A 95% CI for the difference between the proportion of high income earners who are Liberal supporters and the proportion of low income earners who are liberal supporters (to the number of decimal places displayed) is : 0.11  1.96  0.095

week1210 The F-test for equality of spread Suppose that we have 2 independent SRSs from two normal populations, a sample of size n 1 from N(  1,  1 ) and a sample of size n 2 from N(  2,  2 ). The population means and standard deviations are all unknown. The hypothesis of equal spread, H 0 : σ 1 = σ 2 vs H a : σ 1 ≠ σ 2, is tested by the F statistic. The F statistic is given by where and are the sample variances. This has F distribution with n 1 - 1 and n 2 - 1 degrees of freedom when H 0 is true. We usually take the test statistic to be the ratio between the largest variance and the smallest variance.

week1211 The F distributions are a family of distributions with two parameters, the degrees of freedom of the sample variances in the numerator and denominator of the F statistic. The numerator degrees of freedom are always mentioned first. Interchanging the degrees of freedom changes the distribution so the order is important. A brief notation will be F(j,k) for the F distribution with j degrees of freedom in the numerator and k in the denominator. The F distributions are not symmetric but are right-skewed. Because sample variances cannot be negative, the F statistic takes only positive values and the F distribution has no probability below 0. The peak of the F density curve is near 1; values far from 1 in either direction provide evidence against the H 0. To get the P-value we compare the value of the F statistic with the critical values of table E, then double the corresponding probability p. Robustness of the F-test F-test and other procedures for inference about variances are so lacking of robustness as to be of little use in practice.

week1212 Example Random samples of 8 and 10 observations were selected from populations 1 and 2 respectively. The corresponding sample variances were 7.4 and 12.7. Do the data provide sufficient evidence to indicate a difference between the variances of the two populations. Answer: The test statistic is F = 12.7 / 7.4 =1.72 with df 9, 7. The P-value > 2·0.1 and so we can not reject H 0.

week1213 Questions from final Exam Dec 98 Students in a statistics course participated in a simple experiment. The students took their own pulse rate for 1 minute. They then were asked to flip a coin. If their coin came up heads, they were to run in the place for 1 minute. Then everyone took their own pulse rate again. Some other data were also collected. Description of the data set is given below. VariableDescription 1. PULSE1First pulse rate 2. PULSE2second pulse rate 3. RAN1 = ran in place, 2 = did not ran in place 4. SMOKES 1 = smokes regularly, 2 = does not smoke regularly 5. SEX1 = male, 2 = female 6. Activity Usual level of physical activity: 1 = slight 2 = moderate, 3 = a lot

week1214 Variable Ran N Mean Median TrMean StDev Pulse2 1 35 92.51 88.00 91.68 18.94 2 57 72.32 70.00 72.24 9.95 Variable Ran SE Mean Minimum Maximum Q1 Q3 Pulse2 1 3.20 58.00 140.00 76 106 2 1.32 50.00 94.00 66 79 (1)Suppose that we wished to test the null hypothesis that the mean PULSE2 rate for those who ran in place is equal to the mean PULSE2 rate for those who did not ran in place. Which of the following are true? I) It would be reasonable to use a test based on the t(90 d.f.) distribution. II) High power for the test implies that there is a high probability of concluding that the mean PULSE2 rate of those who ran in place is not the same as the mean PULSE2 rates of those who did not run in place, when in fact they are not the same. III) A type I error would be made if we conclude that the mean PULSE2 rate of those who ran in place was different than those who did not, then in fact they are the same.

week1215 Some more outputs: (2) Which of the following are true? I) PULSE1 rates appear to be normally distributed. II) PULSE1 rates appear to be left (negatively)-skewed. III) PULSE2 rates appear to be left (negatively)-skewed. IV) PULSE2 rates appear to be right (positively)-skewed.

week1216 Some more outputs: Two sample T for Pulse1 Smokes N Mean StDev SE Mean 1 28 75.0 13.5 2.6 2 64 71.94 9.70 1.2 95% CI for mu (1) - mu (2): ( -2.6, 8.8) T-Test mu (1) = mu (2) (vs not =): T = 1.08 P = 0.28 DF=39 Now we analyze only the 35 who ran in place: T Confidence Intervals Variable N Mean StDev SE Mean 90.0 % CI Pulse1-Pulse2 35 -18.91 15.05 2.54 (-23.22, -14.61 ) Two Sample T-Test and Confidence Interval Two sample T for Pulse1 vs Pulse2 N Mean StDev SE Mean Pulse1 35 73.6 11.4 1.9 Pulse2 35 92.5 18.9 3.2 90% CI for mu Pulse1 - mu Pulse2: ( -25.2, -12.7) T-Test mu Pulse1 = mu Pulse2 (vs not =): T = -5.06 P = 0.0000 DF = 55.

week1217 (3) Which of the following are true? I) A test of the null hypothesis: “Mean PULSE1 rate for nonsmokers = Mean PULSE1 rate for smokers” vs the alternative “Mean PULSE1 rate for nonsmokers is less than Mean PULSE1 rate for smokers” has p-value 0.14. II) The 90% CI for the average change in pulse rate, for those who ran is ( -25.2, -12.7). III) If we calculated the 50% CI for the average change in pulse rate, for those who ran, it would include the value –12.5. (4) Continuing with the study above, which of the following statements are true? I) A test for a difference in std deviations of pulses (PULSE1) between the smokers and the nonsmokers would be based on F(27, 63) II) The sign test could be applied to analyze the difference posed in I) above, if we have doubts about the normality of PULSE1 measurements. III) The relation between activity level and gender should be analyzed by regression and correlation methods.

Week121 Robustness of the two-sample procedures The two sample t-procedures are more robust against nonnormality than one-sample t-procedures. When the.

Similar presentations

Presentation on theme: "Week121 Robustness of the two-sample procedures The two sample t-procedures are more robust against nonnormality than one-sample t-procedures. When the."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Week121 Robustness of the two-sample procedures The two sample t-procedures are more robust against nonnormality than one-sample t-procedures. When the.

Similar presentations

Presentation on theme: "Week121 Robustness of the two-sample procedures The two sample t-procedures are more robust against nonnormality than one-sample t-procedures. When the."— Presentation transcript:

Similar presentations

About project

Feedback