Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.

Slides:



Advertisements
Similar presentations
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Advertisements

Chapter 16 Introduction to Nonparametric Statistics
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Chapter 12 Chi-Square Tests and Nonparametric Tests
Chapter 14 Analysis of Categorical Data
Chapter 15 Nonparametric Statistics
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 12-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Chapter 11 Nonparametric Tests Larson/Farber 4th ed.
11 Chapter Nonparametric Tests © 2012 Pearson Education, Inc.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Slide 1 Copyright © 2004 Pearson Education, Inc..
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Lecture Slides Elementary Statistics Twelfth Edition
Chapter 14: Nonparametric Statistics
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Correlation.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
14 Elements of Nonparametric Statistics
NONPARAMETRIC STATISTICS
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 9-2 Inferences About Two Proportions.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter 11 Nonparametric Tests.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Copyright © 2004 Pearson Education, Inc.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
© Copyright McGraw-Hill CHAPTER 13 Nonparametric Statistics.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
1 Nonparametric Statistical Techniques Chapter 17.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Slide 1 Copyright © 2004 Pearson Education, Inc..
CD-ROM Chap 16-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition CD-ROM Chapter 16 Introduction.
Nonparametric Statistics
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Nonparametric Statistics.
Lecture Slides Elementary Statistics Twelfth Edition
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Nonparametric Statistics.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 11 Multinomial Experiments and Contingency Tables 11-1 Overview 11-2 Multinomial Experiments:
1 Nonparametric Statistical Techniques Chapter 18.
Slide Slide 1 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-1 Overview Overview 10-2 Correlation 10-3 Regression-3 Regression.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Lecture Slides Elementary Statistics Twelfth Edition
Inferential Statistics Inferences from Two Samples
Lecture Slides Elementary Statistics Twelfth Edition
Lecture Slides Elementary Statistics Eleventh Edition
Lecture Slides Elementary Statistics Tenth Edition
Lecture Slides Elementary Statistics Twelfth Edition
Lecture Slides Elementary Statistics Eleventh Edition
Presentation transcript:

Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola

Slide Slide 2 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Chapter 13 Nonparametric Statistics 13-1Overview 13-2Sign Test 13-3Wilcoxon Signed-Ranks Test for Matched Pairs 13-4 Wilcoxon Rank-Sum Test for Two Independent Samples 13-5 Kruskal-Wallis Test 13-6 Rank Correlation 13-7Runs Test for Randomness

Slide Slide 3 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Section 13-1 Overview Created by Erin Hodgess, Houston, Texas Revised to accompany 10th Edition, Jim Zimmer, Chattanooga State, Chattanooga, TN

Slide Slide 4 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Definitions  Parametric tests have requirements about the nature or shape of the populations involved.  Nonparametric tests do not require that samples come from populations with normal distributions or have any other particular distributions. Consequently, nonparametric tests are called distribution-free tests. Overview

Slide Slide 5 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Advantages of Nonparametric Methods 1. Nonparametric methods can be applied to a wide variety of situations because they do not have the more rigid requirements of the corresponding parametric methods. In particular, nonparametric methods do not require normally distributed populations. 2. Unlike parametric methods, nonparametric methods can often be applied to categorical data, such as the genders of survey respondents. 3. Nonparametric methods usually involve simpler computations than the corresponding parametric methods and are therefore easier to understand and apply.

Slide Slide 6 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Disadvantages of Nonparametric Methods 1. Nonparametric methods tend to waste information because exact numerical data are often reduced to a qualitative form. 2. Nonparametric tests are not as efficient as parametric tests, so with a nonparametric test we generally need stronger evidence (such as a larger sample or greater differences) before we reject a null hypothesis.

Slide Slide 7 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Efficiency of Nonparametric Methods

Slide Slide 8 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Definitions Data are sorted when they are arranged according to some criterion, such as smallest to the largest or best to worst. A rank is a number assigned to an individual sample item according to its order in the sorted list. The first item is assigned a rank of 1, the second is assigned a rank of 2, and so on.

Slide Slide 9 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Handling Ties in Ranks Find the mean of the ranks involved and assign this mean rank to each of the tied items. Sorted Data Rank Mean is 3. Mean is 7.5. Preliminary Ranking

Slide Slide 10 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Section 13-2 Sign Test Created by Erin Hodgess, Houston, Texas Revised to accompany 10th Edition, Jim Zimmer, Chattanooga State, Chattanooga, TN

Slide Slide 11 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Key Concept The main objective of this section is to understand the sign test procedure, which involves converting data values to plus and minus signs, then testing for disproportionately more of either sign.

Slide Slide 12 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Definition Sign Test The sign test is a nonparametric (distribution free) test that uses plus and minus signs to test different claims, including: 1) Claims involving matched pairs of sample data; 2) Claims involving nominal data; 3) Claims about the median of a single population.

Slide Slide 13 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Basic Concept of the Sign Test The basic idea underlying the sign test is to analyze the frequencies of the plus and minus signs to determine whether they are significantly different.

Slide Slide 14 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Figure 13-1 Sign Test Procedure

Slide Slide 15 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Figure 13-1 Sign Test Procedure

Slide Slide 16 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Figure 13-1 Sign Test Procedure

Slide Slide 17 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Requirements 1. The sample data have been randomly selected. 2. There is no requirement that the sample data come from a population with a particular distribution, such as a normal distribution.

Slide Slide 18 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Notation for Sign Test x = the number of times the less frequent sign occurs n = the total number of positive and negative signs combined

Slide Slide 19 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Test Statistic For n  25 : x (the number of times the less frequent sign occurs) Critical values For n  25, critical x values are in Table A-7. For n > 25, critical z values are in Table A-2. z = For n > 25 : n ( x + 0.5) – n 2 2

Slide Slide 20 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Claims Involving Matched Pairs When using the sign test with data that are matched pairs, we convert the raw data to plus and minus signs as follows: 1.Subtract each value of the second variable from the corresponding value of the first variable. 2.Record only the sign of the difference found in step 1. Exclude ties: that is, any matched pairs in which both values are equal.

Slide Slide 21 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Key Concept Underlying This Use of the Sign Test If the two sets of data have equal medians, the number of positive signs should be approximately equal to the number of negative signs.

Slide Slide 22 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Yields of Corn from Different Seeds Use the data in Table 13-3 with a 0.05 significance level to test the claim that there is no difference between the yields from the regular and kiln-dried seed.

Slide Slide 23 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. H 0 : The median of the differences is equal to 0. H 1 : The median of the differences is not equal to 0.  = 0.05 x = minimum(7, 4) = 4 (From Table 13-3, there are 7 negative signs and 4 positive signs.) Critical value = 1 (From Table A-7 where n = 11 and  = 0.05) Example: Yields of Corn from Different Seeds Use the data in Table 13-3 with a 0.05 significance level to test the claim that there is no difference between the yields from the regular and kiln-dried seed.

Slide Slide 24 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. H 0 : The median of the differences is equal to 0. H 1 : The median of the differences is not equal to 0. With a test statistic of x = 4 and a critical value of 1, we fail to reject the null hypothesis of no difference. There is not sufficient evidence to warrant rejection of the claim that the median of the differences is equal to 0. Example: Yields of Corn from Different Seeds Use the data in Table 13-3 with a 0.05 significance level to test the claim that there is no difference between the yields from the regular and kiln-dried seed.

Slide Slide 25 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Claims Involving Nominal Data The nature of nominal data limits the calculations that are possible, but we can identify the proportion of the sample data that belong to a particular category. Then we can test claims about the corresponding population proportion p.

Slide Slide 26 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Gender Selection Of the 325 babies born to parents using the XSORT method of gender selection, 295 were girls. Use the sign test and a 0.05 significance level to test the claim that this method of gender selection has no effect. The procedures are for cases in which n > 25. Note that the only requirement is that the sample data are randomly selected. H 0 : p = 0.5 (the proportion of girls is 0.5) H 1 : p  0.5

Slide Slide 27 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Gender Selection Of the 325 babies born to parents using the XSORT method of gender selection, 295 were girls. Use the sign test and a 0.05 significance level to test the claim that this method of gender selection has no effect. Denoting girls by the positive sign (+) and boys by the negative sign (–), we have 295 positive signs and 30 negative signs. Test statistic x = minimum(295, 30) = 30 The test involves two tails.

Slide Slide 28 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Gender Selection Of the 325 babies born to parents using the XSORT method of gender selection, 295 were girls. Use the sign test and a 0.05 significance level to test the claim that this method of gender selection has no effect. n ( x + 0.5) – z = n 2 2 ( ) – z = = –14.64

Slide Slide 29 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Gender Selection Of the 325 babies born to parents using the XSORT method of gender selection, 295 were girls. Use the sign test and a 0.05 significance level to test the claim that this method of gender selection has no effect. With  = 0.05 in a two-tailed test, the critical values are z =  The test statistic z = is less than We reject the null hypothesis that p = 0.5. There is sufficient evidence to warrant rejection of the claim that the method of gender selection has no effect.

Slide Slide 30 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Gender Selection Of the 325 babies born to parents using the XSORT method of gender selection, 295 were girls. Use the sign test and a 0.05 significance level to test the claim that this method of gender selection has no effect. Figure 13.2

Slide Slide 31 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Claims About the Median of a Single Population The negative and positive signs are based on the claimed value of the median.

Slide Slide 32 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Body Temperature Use the temperatures for 12:00 A.M. on Day 2 in Data Set 2 in Appendix B. Use the sign test to test the claim that the median is less than 98.6°F. There are 68 subjects with temperatures below 98.6°F, 23 subjects with temperatures above 98.6°F, and 15 subjects with temperatures equal to 98.6°F. H 0 : Median is equal to 98.6°F. H 1 : Median is less than 98.6°F. Since the claim is that the median is less than 98.6°F. the test involves only the left tail.

Slide Slide 33 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Body Temperature Use the temperatures for 12:00 A.M. on Day 2 in Data Set 2 in Appendix B. Use the sign test to test the claim that the median is less than 98.6°F. Discard the 15 zeros. Use ( – ) to denote the 68 temperatures below 98.6°F, and use ( + ) to denote the 23 temperatures above 98.6°F. So n = 91 and x = 23

Slide Slide 34 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Body Temperature Use the temperatures for 12:00 A.M. on Day 2 in Data Set 2 in Appendix B. Use the sign test to test the claim that the median is less than 98.6°F. ( x + 0.5) – z = n 2 2 n ( ) – z = = – 4.61

Slide Slide 35 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Body Temperature Use the temperatures for 12:00 A.M. on Day 2 in Data Set 2 in Appendix B. Use the sign test to test the claim that the median is less than 98.6°F. We use Table A-2 to get the critical z value of – The test statistic of z = –4.61 falls into the critical region. We reject the null hypothesis. We support the claim that the median body temperature of healthy adults is less than 98.6°F.

Slide Slide 36 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Body Temperature Use the temperatures for 12:00 A.M. on Day 2 in Data Set 2 in Appendix B. Use the sign test to test the claim that the median is less than 98.6°F. Figure 13.3

Slide Slide 37 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Recap In this section we have discussed: Sign tests where data are assigned plus or minus signs and then tested to see if the number of plus and minus signs is equal. Sign tests can be performed on claims involving: Matched pairs Nominal data The median of a single population

Slide Slide 38 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Section 13-3 Wilcoxon Signed-Ranks Test for Matched Pairs Created by Erin Hodgess, Houston, Texas Revised to accompany 10th Edition, Jim Zimmer, Chattanooga State, Chattanooga, TN

Slide Slide 39 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Key Concept The Wilcoxon signed-ranks test uses ranks of sample data consisting of matched pairs. This test is used with a null hypothesis that the population of differences from the matched pairs has a median equal to zero.

Slide Slide 40 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. The Wilcoxon signed-ranks test is a nonparametric test that uses ranks of sample data consisting of matched pairs. It is used to test the null hypothesis that the population of differences has a median of zero. Definition H 0 : The matched pairs have differences that come from a population with a median equal to zero. H 1 :The matched pairs have differences that come from a population with a nonzero median.

Slide Slide 41 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. 1. The data consist of matched pairs that have been randomly selected. 2. The population of differences (found from the pairs of data) has a distribution that is approximately symmetric, meaning that the left half of its histogram is roughly a mirror image of its right half. (There is no requirement that the data have a normal distribution.) Wilcoxon Signed-Ranks Test Requirements

Slide Slide 42 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Notation T = the smaller of the following two sums: 1. The sum of the absolute values of the negative ranks of the nonzero differences d 2. The sum of the positive ranks of the nonzero differences d

Slide Slide 43 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Test Statistic for the Wilcoxon Signed-Ranks Test for Matched Pairs For n  30, the test statistic is T. z = For n > 30, the test statistic is 4 T – n ( n + 1) n(n +1) (2n +1) 24

Slide Slide 44 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Critical Values for the Wilcoxon Signed-Ranks Test for Matched Pairs For n  30, the critical T value is found in Table A-8. For n > 30, the critical z values are found in Table A-2.

Slide Slide 45 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Procedure for Finding the Value of the Test Statistic Step 1: For each pair of data, find the difference d by subtracting the second value from the first. Keep the signs, but discard any pairs for which d = 0. Step 2: Ignore the signs of the differences, then sort the differences from lowest to highest and replace the differences by the corresponding rank value. When differences have the same numerical value, assign to them the mean of the ranks involved in the tie. Step 3: Attach to each rank the sign difference from which it came. That is, insert those signs that were ignored in step 2. Step 4: Find the sum of the absolute values of the negative ranks. Also find the sum of the positive ranks.

Slide Slide 46 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Step 5: Let T be the smaller of the two sums found in Step 4. Either sum could be used, but for a simplified procedure we arbitrarily select the smaller of the two sums. Step 6: Let n be the number of pairs of data for which the difference d is not 0. Step 7: Determine the test statistic and critical values based on the sample size, as shown above. Step 8: When forming the conclusion, reject the null hypothesis if the sample data lead to a test statistic that is in the critical region - that is, the test statistic is less than or equal to the critical value(s). Otherwise, fail to reject the null hypothesis. Procedure for Finding the Value of the Test Statistic

Slide Slide 47 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Does the Type of Seed Affect Corn Growth? Use the data in Table 13-4 with the Wilcoxon signed-ranks test and 0.05 significance level to test the claim that there is no difference between the yields from the regular and kiln-dried seed.

Slide Slide 48 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Use the data in Table 13-4 with the Wilcoxon signed-ranks test and 0.05 significance level to test the claim that there is no difference between the yields from the regular and kiln-dried seed. Example: Does the Type of Seed Affect Corn Growth? H 0 : There is no difference between the times of the first and second trials. H 1 : There is a difference between the times of the first and second trials.

Slide Slide 49 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.  The ranks of differences in row four of the table are found by ranking the absolute differences, handling ties by assigning the mean of the ranks.  The signed ranks in row five of the table are found by attaching the sign of the differences to the ranks. Example: Does the Type of Seed Affect Corn Growth?  The differences in row three of the table are found by computing the first time – second time.

Slide Slide 50 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Does the Type of Seed Affect Corn Growth? Step 1: In Table 13- 4, the row of differences is obtained by computing this difference for each pair of data: d = yield from regular seed – yield from kiln- dried seed Step 2: Ignoring their signs, we rank the absolute differences from lowest to highest. Step 3: The bottom row of Table is created by attaching to each rank the sign of the corresponding differences. Calculate the Test Statistic

Slide Slide 51 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Does the Type of Seed Affect Corn Growth? Step 3 (cont.): If there really is no difference between the yields from the two types of seed (as in the null hypothesis), we expect the sum of the positive ranks to be approximately equal to the sum of the absolute values of the negative ranks. Step 4: We now find the sum of the absolute values of the negative ranks, and we also find the sum of the positive ranks. Step 3 (cont.): Calculate the Test Statistic

Slide Slide 52 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Does the Type of Seed Affect Corn Growth? Sum of absolute values of negative ranks: 51 (from ) Sum of positive ranks: 15 (from ) Step 5: Letting T be the smaller of the two sums found in Step 4, we find that T = 15. Step 6: Letting n be the number of pairs of data for which the difference d is not 0, we have n = 11. Calculate the Test Statistic Step 4 (cont.):

Slide Slide 53 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Does the Type of Seed Affect Corn Growth? Step 7: Because n = 11, we have n ≤ 30, so we use a test statistic of T = 15. From Table A- 8, the critical T = 11 (using n = 11 and  = 0.05 in two tails). Step 8: The test statistic T = 15 is not less than or equal to the critical value of 11, so we fail to reject the null hypothesis. It appears that there is no difference between yields from regular seed and kiln-dried seed. Step 7: Calculate the Test Statistic

Slide Slide 54 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Recap In this section we have discussed: The Wilcoxon signed-ranks test which uses matched pairs. The hypothesis is that the matched pairs have differences that come from a population with a median equal to zero.

Slide Slide 55 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Section Wilcoxon Rank-Sum Test for Two Independent Samples Created by Erin Hodgess, Houston, Texas Revised to accompany 10th Edition, Jim Zimmer, Chattanooga State, Chattanooga, TN

Slide Slide 56 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Key Concept The Wilcoxon signed-ranks test (Section 13-3) involves matched pairs of data. The Wilcoxon rank-sum test of this section involves two independent samples that are not related or somehow matched or paired.

Slide Slide 57 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Definition The Wilcoxon rank-sum test is a nonparametric test that uses ranks of sample data from two independent populations. It is used to test the null hypothesis that the two independent samples come from populations with equal medians. H 0 : The two samples come from populations with equal medians. H 1 : The two samples come from populations with different medians.

Slide Slide 58 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Basic Concept If two samples are drawn from identical populations and the individual values are all ranked as one combined collection of values, then the high and low ranks should fall evenly between the two samples.

Slide Slide 59 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Requirements 1. There are two independent samples of randomly selected data. 2. Each of the two samples has more than 10 values. 3. There is no requirement that the two populations have a normal distribution or any other particular distribution.

Slide Slide 60 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. n 1 = size of Sample 1 n 2 = size of Sample 2 R 1 = sum of ranks for Sample 1 R 2 = sum of ranks for Sample 2 R = same as R 1 (sum of ranks for Sample 1)  R = mean of the sample R values that is expected when the two populations have equal medians  R = standard deviation of the sample R values that is expected with two populations having equal medians Notation for the Wilcoxon Rank-Sum Test

Slide Slide 61 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Test Statistic for the Wilcoxon Rank-Sum Test R –  R z =z = RR  R = n 1 n 2 (n 1 + n ) 12 n 1 (n 1 + n ) 2 = RR where n 1 = size of the sample from which the rank sum R is found n 2 = size of the other sample R = sum of ranks of the sample with size n 1

Slide Slide 62 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Critical values can be found in Table A-2 (because the test statistic is based on the normal distribution). Critical Values for the Wilcoxon Rank-Sum Test

Slide Slide 63 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Procedure for Finding the Value of the Test Statistic 1. Temporarily combine the two samples into one big sample, then replace each sample value with its rank. 2. Find the sum of the ranks for either one of the two samples. 3. Calculate the value of the z test statistic as shown in the previous slide, where either sample can be used as ‘Sample 1’.

Slide Slide 64 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. The data in Table 13-5 are from Data Set 1 in Appendix B and use only the first 13 sample values for men and the first 12 sample values for women. The numbers in parentheses are their ranks beginning with a rank of 1 assigned to the lowest value of R 1 and R 2 at the bottom denote the sum of ranks. Example: BMI of Men and Women

Slide Slide 65 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: BMI of Men and Women Use the data in Table 13-5 with the Wilcoxon rank-sum test and a 0.05 significance level to test the claim that the median BMI of men is equal to the median BMI of women. The requirements of having two independent and random samples and each having more than 10 values are met. H 0 : Men and women have BMI values with equal medians H 1 : Men and women have BMI values with medians that are not equal

Slide Slide 66 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Use the data in Table 13-5 with the Wilcoxon rank-sum test and a 0.05 significance level to test the claim that the median BMI of men is equal to the median BMI of women. Example: BMI of Men and Women Procedures. 1. Rank all 25 BMI measurements combined. This is done in Table Find the sum of the ranks of either one of the samples. For men the sum of ranks is R = … = 187

Slide Slide 67 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: BMI of Men and Women Procedures (cont.). 3. Calculate the value of the z test statistic.

Slide Slide 68 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: BMI of Men and Women Use the data in Table 13-5 with the Wilcoxon rank-sum test and a 0.05 significance level to test the claim that the median BMI of men is equal to the median BMI of women. A large positive value of z would indicate that the higher ranks are found disproportionately in Sample 1, and a large negative value of z would indicate that Sample 1 had a disproportionate share of lower ranks.

Slide Slide 69 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: BMI of Men and Women Use the data in Table 13-5 with the Wilcoxon rank-sum test and a 0.05 significance level to test the claim that the median BMI of men is equal to the median BMI of women. We have a two tailed test (with  = 0.05), so the critical values are 1.96 and –1.96. The test statistic of 0.98 does not fall within the critical region, so we fail to reject the null hypothesis that men and women have BMI values with equal medians. It appears that BMI values of men and women are basically the same.

Slide Slide 70 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. The preceding example used only 13 of the 40 sample BMI values for men listed in Data Set 1 in Appendix B, and it used only 12 of the 40 BMI values for women. Do the results change if we use all 40 sample values for both men and women? The null and alternative hypotheses are the same. Example: BMI of Men and Women

Slide Slide 71 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. In the Minitab display below ETA1 and ETA2 denote the medians of the first and second samples, respectively. The rank sum for men is W = The P-value is (or after adjustment for ties). Example: BMI of Men and Women Minitab

Slide Slide 72 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Because the P-value is greater than α = 0.05, we fail to reject the null hypothesis. There is not sufficient evidence to warrant rejection of the claim that men and women have BMI values with equal medians. Example: BMI of Men and Women Minitab

Slide Slide 73 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Recap In this section we have discussed: The Wilcoxon Rank-Sum Test for Two Independent Samples. It is used to test the null hypothesis that the two independent samples come from populations with equal medians.

Slide Slide 74 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Section 13-5 Kruskal-Wallis Test Created by Erin Hodgess, Houston, Texas Revised to accompany 10th Edition, Jim Zimmer, Chattanooga State, Chattanooga, TN

Slide Slide 75 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Key Concept This section introduces the Kruskal- Wallis test, which uses ranks of data from three or more independent samples to test the null hypothesis that the samples come from populations with equal medians.

Slide Slide 76 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Definition. The Kruskal-Wallis test (also called the H test) is a nonparametric test that uses ranks of sample data from three or more independent populations. It is used to test the null hypothesis that the independent samples come from populations with the equal medians. Kruskal-Wallis Test H 0 : The samples come from populations with equal medians. H 1 : The samples come from populations with medians that are not all equal.

Slide Slide 77 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. We compute the test statistic H, which has a distribution that can be approximated by the chi-square (  2 ) distribution as long as each sample has at least 5 observations. When we use the chi-square distribution in this context, the number of degrees of freedom is k – 1, where k is the number of samples. Kruskal-Wallis Test

Slide Slide 78 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Kruskal-Wallis Test 1. We have at least three independent samples, all of which are randomly selected. 2. Each sample has at least 5 observations. 3. There is no requirement that the populations have a normal distribution or any other particular distribution. Requirements

Slide Slide 79 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. N = total number of observations in all observations combined k = number of samples R 1 = sum of ranks for Sample 1 n 1 = number of observations in Sample 1 For Sample 2, the sum of ranks is R 2 and the number of observations is n 2, and similar notation is used for the other samples. Kruskal-Wallis Test Notation

Slide Slide 80 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Kruskal-Wallis Test Test Statistic Critical Values 1. Test is right-tailed. 2. df = k – 1 (Because the test statistic H can be approximated by the  2 distribution, use Table A- 4).

Slide Slide 81 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Procedure for Finding the Value of the Test Statistic H 1 Temporarily combine all samples into one big sample and assign a rank to each sample value. 2.For each sample, find the sum of the ranks and find the sample size. 3.Calculate H by using the results of Step 2 and the notation and test statistic given on the preceding slide.

Slide Slide 82 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Procedure for Finding the Value of the Test Statistic H The test statistic H is basically a measure of the variance of the rank sums R 1, R 2, …, R k. If the ranks are distributed evenly among the sample groups, then H should be a relatively small number. If the samples are very different, then the ranks will be excessively low in some groups and high in others, with the net effect that H will be large.

Slide Slide 83 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Effects of Treatments on Poplar Tree Weights Table 13-6 lists weights of poplar trees given different treatments. (Numbers in parentheses are ranks.)

Slide Slide 84 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Use the data in Table 13-6 with the Kruskal-Wallis test to test the claim that the four samples come from populations with equal medians. Are requirements met? There are three or more independent and random samples. Each sample size is 5. (Requirement is at least 5.) Example: Effects of Treatments on Poplar Tree Weights H 0 : The populations of poplar tree weights from the four treatments have equal medians. H 1 : The four population medians are not all equal.

Slide Slide 85 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Use the data in Table 13-6 with the Kruskal-Wallis test to test the claim that the four samples come from populations with equal medians. Example: Effects of Treatments on Poplar Tree Weights The following statistics come from Table 13-6: n 1 = 5, n 2 = 5, n 3 = 5, n 4 = 5 N = 20 R 1 = 45, R 2 = 37.5, R 3 = 42.5, R 4 = 85

Slide Slide 86 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Use the data in Table 13-6 with the Kruskal-Wallis test to test the claim that the four samples come from populations with equal medians. Evaluate the test statistic.. Example: Effects of Treatments on Poplar Tree Weights

Slide Slide 87 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Use the data in Table 13-6 with the Kruskal-Wallis test to test the claim that the four samples come from populations with equal medians. Find the critical value.. Because each sample has at least five observations, the distribution of H is approximately a chi-square distribution. Example: Effects of Treatments on Poplar Tree Weights df = k – 1 = 4 – 1 = 3 α = 0.05 From Table A-4 the critical value =

Slide Slide 88 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Use the data in Table 13-6 with the Kruskal-Wallis test to test the claim that the four samples come from populations with equal medians. Example: Effects of Treatments on Poplar Tree Weights The test statistic is in the critical region, so we reject the null hypothesis of equal medians. At least one of the medians appears to be different from the others.

Slide Slide 89 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Recap In this section we have discussed: The Kruskal-Wallis Test is the non- parametric equivalent of ANOVA. It tests the hypothesis that three or more populations have equal means. The populations do not have to be normally distributed.

Slide Slide 90 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Section 13-6 Rank Correlation Created by Erin Hodgess, Houston, Texas Revised to accompany 10th Edition, Jim Zimmer, Chattanooga State, Chattanooga, TN

Slide Slide 91 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Key Concept This section describes the nonparametric method of rank correlation, which uses paired data to test for an association between two variables. In Chapter 10 we used paired sample data to compute values for the linear correlation coefficient r, but in this section we use ranks as a the basis for computing the rank correlation coefficient r s.

Slide Slide 92 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Rank Correlation Definition The rank correlation test (or Spearman’s rank correlation test) is a non-parametric test that uses ranks of sample data consisting of matched pairs. It is used to test for an association between two variables, so the null and alternative hypotheses are as follows (where ρ s denotes the rank correlation coefficient for the entire population): H o : ρ s = 0 (There is no correlation between the two variables.) H 1 : ρ s  0 (There is a correlation between the two variables.)

Slide Slide 93 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. 1.The nonparametric method of rank correlation can be used in a wider variety of circumstances than the parametric method of linear correlation. With rank correlation, we can analyze paired data that are ranks or can be converted to ranks. 2.Rank correlation can be used to detect some (not all) relationships that are not linear. Advantages Rank correlation has these advantages over the parametric methods discussed in Chapter 10:

Slide Slide 94 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Disadvantages A disadvantage of rank correlation is its efficiency rating of 0.91, as described in Section This efficiency rating shows that with all other circumstances being equal, the nonparametric approach of rank correlation requires 100 pairs of sample data to achieve the same results as only 91 pairs of sample observations analyzed through parametric methods, assuming that the stricter requirements of the parametric approach are met.

Slide Slide 95 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Figure 13-4 Rank Correlation for Testing H 0 :  s = 0

Slide Slide 96 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Figure 13-4 Rank Correlation for Testing H 0 :  s = 0

Slide Slide 97 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Requirements 1.The sample paired data have been randomly selected. 2.Unlike the parametric methods of Section 10-2, there is no requirement that the sample pairs of data have a bivariate normal distribution. There is no requirement of a normal distribution for any population.

Slide Slide 98 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Notation r s = rank correlation coefficient for sample paired data ( r s is a sample statistic)  s = rank correlation coefficient for all the population data (  s is a population parameter) n = number of pairs of data d = difference between ranks for the two values within a pair

Slide Slide 99 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Rank Correlation Test Statistic No ties: After converting the data in each sample to ranks, if there are no ties among ranks for either variable, the exact value of the test statistic can be calculated using this formula: Ties: After converting the data in each sample to ranks, if either variable has ties among its ranks, the exact value of the test statistic rs can be found by using Formula 10-1 with the ranks:

Slide Slide 100 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Critical values: If n  30, critical values are found in Table A-9. If n > 30, use Formula Rank Correlation Formula 13-1 where the value of z corresponds to the significance level. (For example, if  = 0.05, z – 1.96.)

Slide Slide 101 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Rankings of Colleges Use the data in Table 13-7 to determine if there is a correlation between the student rankings and the rankings of the magazine.

Slide Slide 102 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Rankings of Colleges Use the data in Table 13-7 to determine if there is a correlation between the student rankings and the rankings of the magazine. Since neither variable has ties in the ranks: H 0 :  s = 0 H 1 :  s  0

Slide Slide 103 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Rankings of Colleges Use the data in Table 13-7 to determine if there is a correlation between the student rankings and the rankings of the magazine. H 0 :  s = 0 H 1 :  s  0 From Table A-9 the critical values are  Because the test statistic of r s = does not exceed the critical value, we fail to reject the null hypothesis. There is not sufficient evidence to support a claim of a correlation between the rankings of the students and the magazine.

Slide Slide 104 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Assume that the preceding example is expanded by including a total of 40 colleges and that the test statistic r s is found to be If the significance level of  = 0.05, what do you conclude about the correlation? Example: Rankings of Colleges Large Sample Case Since n = 40 exceeds 30, we find the critical value from Formula 13-1

Slide Slide 105 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Assume that the preceding example is expanded by including a total of 40 colleges and that the test statistic r s is found to be If the significance level of  = 0.05, what do you conclude about the correlation? Example: Rankings of Colleges Large Sample Case The test statistic of r s = does not exceed the critical value of 0.314, so we fail to reject the null hypothesis. There is not sufficient evidence to support the claim of a correlation between students and the magazine.

Slide Slide 106 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. The data in Table are the numbers of games played and the last scores (in millions) of a Raiders of the Lost Ark pinball game. We expect that there should be an association between the number of games played and the pinball score. Example: Detecting a Nonlinear Pattern H 0 :  s = 0 H 1 :  s  0

Slide Slide 107 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. There are no ties among ranks of either list. Example: Detecting a Nonlinear Pattern

Slide Slide 108 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Detecting a Nonlinear Pattern Since n = 9 is less than 30, use Table A-9 Critical values are ± The sample statistic exceeds 0.700, so we conclude that there is significant evidence to reject the null hypothesis of no correlation. There appears to be correlation between the number of games played and the score.

Slide Slide 109 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Detecting a Nonlinear Pattern If the preceding example is done using the methods of Chapter 9, the linear correlation coefficient is r = This leads to the conclusion that there is not enough evidence to support the claim of a significant linear correlation, whereas the nonlinear test found that there was enough evidence. The Excel scatter diagram shows that there is a non-linear relationship that the parametric method would not have detected. Excel

Slide Slide 110 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Recap In this section we have discussed: Rank correlation which is the non-parametric equivalent of testing for correlation described in Chapter 10. It uses ranks of matched pairs to test for association. Sometimes rank correlation can detect non- linear correlation that the parametric test will not recognize.

Slide Slide 111 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Section 13-7 Runs Test for Randomness Created by Erin Hodgess, Houston, Texas Revised to accompany 10th Edition, Jim Zimmer, Chattanooga State, Chattanooga, TN

Slide Slide 112 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Key Concept This section introduces the runs test for randomness, which can be used to determine whether the sample data in a sequence are in a random order. This test is based on sample data that have two characteristics, and it analyzes runs of those characteristics to determine whether the runs appear to result from some random process, or whether the runs suggest that the order of the data is not random.

Slide Slide 113 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Runs Test for Randomness Definitions A run is a sequence of data having the same characteristic; the sequence is preceded and followed by data with a different characteristic or by no data at all. The runs test uses the number of runs in a sequence of sample data to test for randomness in the order of the data.

Slide Slide 114 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Fundamental Principles of the Run Test Reject randomness if the number of runs is very low or very high. Example: The sequence of genders FFFFFMMMMM is not random because it has only 2 runs, so the number of runs is very low. Example: The sequence of genders FMFMFMFMFM is not random because there are 10 runs, which is very high. It is important to note that the runs test for randomness is based on the order in which the data occur; it is not based on the frequency of the data.

Slide Slide 115 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Figure 13-5 Procedure for Runs Test for Randomness

Slide Slide 116 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Figure 13-5 Procedure for Runs Test for Randomness

Slide Slide 117 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Requirements 1. The sample data are arranged according to some ordering scheme, such as the order in which the sample values were obtained. 2. Each data value can be categorized into one of two separate categories (such as male/female).

Slide Slide 118 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Notation n 1 = number of elements in the sequence that have one particular characteristic (The characteristic chosen for n 1 is arbitrary.) n 2 = number of elements in the sequence that have the other characteristic G = number of runs

Slide Slide 119 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Test Statistic Test statistic is the number of runs G Critical Values Critical values are found in Table A-10. Runs Test for Randomness For Small Samples (n 1 ≤ 20 and n 2 ≤ 20) and  = 0.05:

Slide Slide 120 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Decision criteria Reject randomness if the number of runs G is: less than or equal to the smaller critical value found in Table A-10. or greater than or equal to the larger critical value found in Table A-10. Runs Test for Randomness For Small Samples (n 1 ≤ 20 and n 2 ≤ 20) and  = 0.05:

Slide Slide 121 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Test Statistic where and For Small Samples (n 1 ≤ 20 and n 2 ≤ 20) and  = 0.05: Runs Test for Randomness

Slide Slide 122 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Critical Values Critical values of z: Use Table A-2. Runs Test for Randomness For Large Samples (n 1 > 20 or n 2 > 20) or  ≠ 0.05:

Slide Slide 123 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Small Sample Genders of Bears Listed below are the genders of the first 10 bears from Data Set 6 in Appendix B. Use a 0.05 significance level to test for randomness in the sequence of genders. M M M M F F M M F F Separate the runs as shown below. M M M M F F M M F F 2nd run3rd run4th run1st run

Slide Slide 124 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Small Sample Genders of Bears M M M M F F M M F F 2nd run3rd run4th run1st run n 1 = total number of males = 6 n 2 = total number of females = 4 G = number of runs = 4 Because n 1 ≤ 20 and n 2 ≤ 20 and  = 0.05, the test statistic is G = 4

Slide Slide 125 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Example: Small Sample Genders of Bears M M M M F F M M F F 2nd run3rd run4th run1st run From Table A-10, the critical values are 2 and 9. Because G = 4 is not less than or equal to 2, nor is it greater than or equal to 9, we do not reject randomness. It appears the sequence of genders is random.

Slide Slide 126 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Refer to the rainfall amounts for Boston as listed in Data Set 10 in Appendix B. Is there sufficient evidence to support the claim that rain on Mondays is not random? D D D D R D R D D R D D R D D D R D D R R R D D D D R D R D R R R D R D D D R D D D R D R D D R D D D R H 0 : The sequence is random. H 1 : The sequence is not random. n 1 = number of Ds = 33 n 2 = number or Rs = 19 G = number of runs = 30 Example: Large Sample Boston Rainfall on Mondays

Slide Slide 127 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Since n 1 > 20, we must calculate z using the formulas: Example: Large Sample Boston Rainfall on Mondays

Slide Slide 128 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. The critical values are z = and The test statistic of z = 1.48 does not fall within the critical region, so we fail to reject the null hypothesis of randomness. The given sequence does appear to be random. Example: Large Sample Boston Rainfall on Mondays

Slide Slide 129 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Recap In this section we have discussed: The runs test for randomness which can be used to determine whether the sample data in a sequence are in a random order. We reject randomness if the number of runs is very low or very high.