Previous Lecture: Categorical Data Methods. Nonparametric Methods This Lecture Judy Zhong Ph.D.

Slides:



Advertisements
Similar presentations
Prepared by Lloyd R. Jaisingh
Advertisements

Chapter 16 Introduction to Nonparametric Statistics
Economics 105: Statistics Go over GH 11 & 12 GH 13 & 14 due Thursday.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Nonparametric Methods Chapter 15.
EPI 809 / Spring 2008 Chapter 9 Nonparametric Statistics.
Ordinal Data. Ordinal Tests Non-parametric tests Non-parametric tests No assumptions about the shape of the distribution No assumptions about the shape.
statistics NONPARAMETRIC TEST
Statistical Tests Karen H. Hagglund, M.S.
Lecture 10 Non Parametric Testing STAT 3120 Statistical Methods I.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Chapter 12 Chi-Square Tests and Nonparametric Tests
Chapter 14 Analysis of Categorical Data
Chapter 12 Chi-Square Tests and Nonparametric Tests
© 2002 Prentice-Hall, Inc.Chap 8-1 Statistics for Managers using Microsoft Excel 3 rd Edition Chapter 8 Two Sample Tests with Numerical Data.
Lesson #25 Nonparametric Tests for a Single Population.
Test statistic: Group Comparison Jobayer Hossain Larry Holmes, Jr Research Statistics, Lecture 5 October 30,2008.
Statistics 07 Nonparametric Hypothesis Testing. Parametric testing such as Z test, t test and F test is suitable for the test of range variables or ratio.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 17: Nonparametric Tests & Course Summary.
Bivariate Statistics GTECH 201 Lecture 17. Overview of Today’s Topic Two-Sample Difference of Means Test Matched Pairs (Dependent Sample) Tests Chi-Square.
Wilcoxon Tests What is the Purpose of Wilcoxon Tests? What are the Assumptions? How does the Wilcoxon Rank-Sum Test Work? How does the Wilcoxon Matched-
Statistics for Managers Using Microsoft® Excel 5th Edition
1 Distribution-free testing If the data are normally distributed, we may apply a z- test or t-test when the parameter of interest is . But what if this.
© 2004 Prentice-Hall, Inc.Chap 10-1 Basic Business Statistics (9 th Edition) Chapter 10 Two-Sample Tests with Numerical Data.
Basic Business Statistics (9th Edition)
Student’s t statistic Use Test for equality of two means
Biostatistics in Research Practice: Non-parametric tests Dr Victoria Allgar.
15-1 Introduction Most of the hypothesis-testing and confidence interval procedures discussed in previous chapters are based on the assumption that.
Nonparametrics and goodness of fit Petter Mostad
Chapter 15 Nonparametric Statistics
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 12-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
NONPARAMETRIC STATISTICS
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
Non-parametric Tests. With histograms like these, there really isn’t a need to perform the Shapiro-Wilk tests!
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter 14 Nonparametric Statistics. 2 Introduction: Distribution-Free Tests Distribution-free tests – statistical tests that don’t rely on assumptions.
Biostat 200 Lecture 7 1. Hypothesis tests so far T-test of one mean: Null hypothesis µ=µ 0 Test of one proportion: Null hypothesis p=p 0 Paired t-test:
Lesson Inferences about the Differences between Two Medians: Dependent Samples.
What are Nonparametric Statistics? In all of the preceding chapters we have focused on testing and estimating parameters associated with distributions.
Copyright © 2012 Pearson Education. Chapter 23 Nonparametric Methods.
© 2000 Prentice-Hall, Inc. Statistics Nonparametric Statistics Chapter 14.
© Copyright McGraw-Hill CHAPTER 13 Nonparametric Statistics.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Fall 2002Biostat Nonparametric Tests Nonparametric tests are useful when normality or the CLT can not be used. Nonparametric tests base inference.
Nonparametric Statistics. In previous testing, we assumed that our samples were drawn from normally distributed populations. This chapter introduces some.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Nonparametric Tests IPS Chapter 15 © 2009 W.H. Freeman and Company.
1 Nonparametric Statistical Techniques Chapter 17.
Nonparametric Statistics
Lesson 15 - R Chapter 15 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
GG 313 Lecture 9 Nonparametric Tests 9/22/05. If we cannot assume that our data are at least approximately normally distributed - because there are a.
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
Statistics in Applied Science and Technology Chapter14. Nonparametric Methods.
CD-ROM Chap 16-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition CD-ROM Chapter 16 Introduction.
1 Uses both direction (sign) and magnitude. Applies to the case of symmetric continuous distributions: Mean equals median. Wilcoxon Signed-Rank Test.
BPS - 5th Ed. Chapter 251 Nonparametric Tests. BPS - 5th Ed. Chapter 252 Inference Methods So Far u Variables have had Normal distributions. u In practice,
NON-PARAMETRIC STATISTICS
Nonparametric Statistics
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Nonparametric Statistics.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
1 Nonparametric Statistical Techniques Chapter 18.
1 Underlying population distribution is continuous. No other assumptions. Data need not be quantitative, but may be categorical or rank data. Very quick.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Nonparametric Statistics
Lesson Inferences about the Differences between Two Medians: Dependent Samples.
十二、Nonparametric Methods (Chapter 12)
Some Nonparametric Methods
Presentation transcript:

Previous Lecture: Categorical Data Methods

Nonparametric Methods This Lecture Judy Zhong Ph.D.

Nonparametric statistical methods Previously, the data were assumed to come from some underlying distribution (e.g. normal distribution). We will consider methods for statistical inference which do not depend upon knowledge of the functional form of the underlying probability distributions. They are “distribution-free”, no assumptions about the sample populations. Methods based on such assumptions are called parametric methods.

Nonparametric methods Do not require normality Use if  Sample size small  Data with outliers (strong deviations from normality) Two types of tests:  Permutation test  Rank-based tests

Ranks  Sometimes we wish to test a null hypothesis about a population mean, but if the sample size is small and we have non-normally distributed variables, the t-test may not be appropriate.  A powerful distribution-free tool is the use of ranks.  The ranks of an observations is the relative position of an observation’s magnitude compared to the rest of the sample.  When two or more observations have the same value (ties), the rank is assigned by computing the average of the ranks that would have been assigned to tied values and using this average as the common rank shared by each of the tied values.

Example  The ordered observations and ranks are as follows:  If we consider only continuous distributions (to avoid ties), the distribution of ranks does not depend on the particular continuous distribution of the sample.  In other words, rank based procedures are distribution-free.

Rank-based Tests Types Wilcoxon Signed Rank Test  one-sample or paired samples Wilcoxon Rank Sum Test  two independent samples Good for: Small n Ordinal data Data with outliers (strong deviations from normality)

Rank-based Tests Cardinal data: data are on a scale e.g., weight, height, blood pressure, body temperature Can compute means, variances, etc Ordinal data: data can be ordered, but do not have specific values e.g., high school, college, post graduate degree. Convenient to use ranks instead of numerical statistics

Types: One sample Paired samples Wilcoxon Signed Rank Test

Paired sample example: wages of paired tall and short men Steps: 1. For each of n sample items, compute the difference, D i, between two measurements 2. Ignore + and – signs and find the absolute values, |D i | 3. Omit zero differences, so sample size is n ’ 4. Assign ranks R i from 1 to n ’ (give average rank to ties) 5. Reassign + and – signs to the ranks R i 6. Compute the Wilcoxon test statistic W as the sum of the positive ranks

Wilcoxon Signed Rank Test x y d = x-y |d| Rank Signed rank W1 = Sum of positive ranks: 34 W2 = Sum of negative ranks: 21

Wilcoxon Signed Ranks Test Statistic The Wilcoxon signed ranks test statistic is the sum of the positive (or negative) ranks:

Wilcoxon Signed Rank Test: exact p-values For small n’, can compute exactly: p-value = 2 * P(W1 ≥ W1 obs ) = 2 * P(W2 ≤ W2 obs ) Can use R Can use Table 11 in the Appendix > x<-c(25.4,27.7,30.1,30.6,32.3,33.3,34.7,38.8,40.3,55.5) > y<-c(25.7,26.4,24.5,31.6,25.0,28.0,37.4,43.8,35.8,60.9) > wilcox.test(x, y, paired=TRUE) Wilcoxon signed rank test data: x and y V = 34, p-value = alternative hypothesis: true location shift is not equal to 0

Wilcoxon Rank Sum Test for Two independent samples

Wilcoxon Rank-Sum Test for Differences in 2 Medians Test two independent population medians Populations need not be normally distributed Distribution-free procedure Used for small samples, ordinal data, data with outliers, skewed data

Wilcoxon Rank-Sum Test: Small Samples Assign ranks to the combined n 1 + n 2 sample observations Smallest value rank = 1, largest value rank = n 1 + n 2 Assign average rank for ties Sum the ranks for each sample: R 1 and R 2

Sample data are collected on the capacity rates (% of capacity) for two factories. Are the median operating rates for two factories the same? For factory A, the rates are 71, 82, 77, 94, 88 For factory B, the rates are 85, 82, 92, 97 Test for equality of the population medians at the 0.05 significance level Wilcoxon Rank-Sum Test: Small Sample Example

CapacityRank Factory AFactory BFactory AFactory B Rank Sums: Tie in 3 rd and 4 th places Ranked Capacity values: (continued)

R 1 = 24.5 Wilcoxon Rank-Sum Test: Small Sample Example (continued) The sample sizes are: n 1 = 4 (factory B) n 2 = 5 (factory A) The level of significance is  =.05 R 2 = 20.5 Critical values from Table 12 Conclusion: NS > a<-c(71,82,77,94,88) > b<-c(85,82,92,97) > wilcox.test(a, b, paired=F) Wilcoxon rank sum test with continuity correction W = 5.5, p-value = alternative hypothesis: true location shift is not equal to 0

Summary: Nonparametric Tests Do not require normality Use if sample sizes small, ordinal data and/or data with outliers Rank-based tests one sample, paired samples: Wilcoxon Signed Rank Test two independent samples: Wilcoxon Rank Sum Test based on ranks of observations

Next Lecture: Regression and Correlation