SADC Course in Statistics Common Non- Parametric Methods for Comparing Two Samples (Session 20)

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

Prepared by Lloyd R. Jaisingh
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Addition Facts
Introduction to Study Skills & Research Methods (HL10040)
1 Session 8 Tests of Hypotheses. 2 By the end of this session, you will be able to set up, conduct and interpret results from a test of hypothesis concerning.
SADC Course in Statistics Analysis of Variance for comparing means (Session 11)
SADC Course in Statistics Estimating population characteristics with simple random sampling (Session 06)
The Poisson distribution
SADC Course in Statistics Further ideas concerning confidence intervals (Session 06)
SADC Course in Statistics Introduction to Non- Parametric Methods (Session 19)
SADC Course in Statistics Tests for Variances (Session 11)
Assumptions underlying regression analysis
SADC Course in Statistics Basic principles of hypothesis tests (Session 08)
SADC Course in Statistics Meaning and use of confidence intervals (Session 05)
SADC Course in Statistics Sampling weights: an appreciation (Sessions 19)
SADC Course in Statistics Importance of the normal distribution (Session 09)
SADC Course in Statistics Sampling design using the Paddy game (Sessions 15&16)
SADC Course in Statistics Comparing two proportions (Session 14)
SADC Course in Statistics Linking tests to confidence intervals (and other issues) (Session 10)
SADC Course in Statistics (Session 09)
SADC Course in Statistics A model for comparing means (Session 12)
SADC Course in Statistics Revision on tests for means using CAST (Session 17)
SADC Course in Statistics Revision on tests for proportions using CAST (Session 18)
Probability Distributions
Non-parametric tests, part A:
STATISTICAL INFERENCE ABOUT MEANS AND PROPORTIONS WITH TWO POPULATIONS
Hypothesis Test II: t tests
Elementary Statistics
Non-Parametric Statistics
Non-parametric statistics
Hypothesis Tests: Two Independent Samples
Comparison of 2 Population Means Goal: To compare 2 populations/treatments wrt a numeric outcome Sampling Design: Independent Samples (Parallel Groups)
Hypothesis Tests: Two Related Samples AKA Dependent Samples Tests
The Kruskal-Wallis H Test
Area under curves Consider the curve y = f(x) for x  [a, b] The actual area under the curve is units 2 The approximate area is the sum of areas.
Module 16: One-sample t-tests and Confidence Intervals
Module 17: Two-Sample t-tests, with equal variances for the two populations This module describes one of the most utilized statistical tests, the.
Addition 1’s to 20.
McGraw-Hill, Bluman, 7th ed., Chapter 9
Please enter data on page 477 in your calculator.
© The McGraw-Hill Companies, Inc., Chapter 10 Testing the Difference between Means and Variances.
Comparing Two Means.
Chapter 18: The Chi-Square Statistic
ABOUT TWO INDEPENDENT POPULATIONS
CHAPTER 15: Tests of Significance: The Basics Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
1 Chapter 20: Statistical Tests for Ordinal Data.
Chapter 14 Nonparametric Statistics
January Structure of the book Section 1 (Ch 1 – 10) Basic concepts and techniques Section 2 (Ch 11 – 15): Inference for quantitative outcomes Section.
Happiness comes not from material wealth but less desire. 1.
SADC Course in Statistics Comparing Means from Independent Samples (Session 12)
Statistics 07 Nonparametric Hypothesis Testing. Parametric testing such as Z test, t test and F test is suitable for the test of range variables or ratio.
© 2004 Prentice-Hall, Inc.Chap 10-1 Basic Business Statistics (9 th Edition) Chapter 10 Two-Sample Tests with Numerical Data.
Basic Business Statistics (9th Edition)
15-1 Introduction Most of the hypothesis-testing and confidence interval procedures discussed in previous chapters are based on the assumption that.
Mann-Whitney and Wilcoxon Tests.
Nonparametrics and goodness of fit Petter Mostad
Chapter 15 Nonparametric Statistics
Statistical Methods II
Non-parametric Dr Azmi Mohd Tamil.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Independent samples- Wilcoxon rank sum test. Example The main outcome measure in MS is the expanded disability status scale (EDSS) The main outcome measure.
Non-parametric Tests. With histograms like these, there really isn’t a need to perform the Shapiro-Wilk tests!
Parametric tests (independent t- test and paired t-test & ANOVA) Dr. Omar Al Jadaan.
Nonparametric Statistics. In previous testing, we assumed that our samples were drawn from normally distributed populations. This chapter introduces some.
Nonparametric Statistics
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 14 th February 2013.
Copyright © Cengage Learning. All rights reserved. 15 Distribution-Free Procedures.
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 20th February 2014  
Some Nonparametric Methods
Presentation transcript:

SADC Course in Statistics Common Non- Parametric Methods for Comparing Two Samples (Session 20)

To put your footer here go to View > Header and Footer 2 Learning Objectives At the end of this session, you will be able to Understand the type of logic behind common non-parametric tests for comparing two groups based on ranks Interpret and understand the commonly used Wilcoxon signed-rank test Appreciate some practical problems associated with the methods

To put your footer here go to View > Header and Footer 3 An illustrative example Paired-Samples 10 farmers recorded their crop yield (tonnes/hectare) before and after the use of a fertiliser. Has the use of fertiliser changed the yield? Data (after-before pair-wise differences): 0.02, 0.89, -0.06, 0.26, 0.83, 0.42, 0.80, -0.05, 0.64, 0.84 How can we address this question objectively?

To put your footer here go to View > Header and Footer 4 Start by plotting - Roughly symmetric distribution?

To put your footer here go to View > Header and Footer 5 Addressing the question … A paired t-test is often employed in such cases Recall this is simply a one-sample t-test applied to the pair-wise differences The procedure assumes the pair-wise differences are from a normal distribution

To put your footer here go to View > Header and Footer 6 Addressing the question Recall the t-test procedure is quite robust against departures from normality (c.f. Session 19) However, if we are concerned about the validity of the normal assumption we might use a non-parametric test that does not make this assumption

To put your footer here go to View > Header and Footer 7 Addressing the question One possibility is to use a sign test, to test H 0 : Population median difference, =0 vs. H 1 : Population median difference, 0 However, this procedure is inefficient as it effectively only utilises the signs (+/-) of the pair-wise differences Can more information be used?

To put your footer here go to View > Header and Footer 8 Yes, but at a price… We use the rank order of the pair-wise differences, but not the actual values This leads to the Wilcoxon-signed rank test Assumptions The pair-wise differences are not only independent, but are from a symmetric, but unspecified distribution Wilcoxon signed-rank test

To put your footer here go to View > Header and Footer 9 Back to the example Let us assume the distribution of pair-wise differences is symmetrically distributed Not unreasonable based on the plots Also, the sample median and mean are similar; 0.53 and 0.46 respectively

To put your footer here go to View > Header and Footer 10 Wilcoxon signed-rank test Difference Rank Sign Signed rank Rank the n=10 differences according to their magnitude Re-attach the signs to give signed-ranks: Notes Use average ranks for ties Zero differences are ignored in the above process (reducing the sample size)

To put your footer here go to View > Header and Footer 11 Wilcoxon signed-rank test LetT + =sum of +ve ranks = 50 T =sum of –ve ranks = 5 Take either T+ or T as a test statistic –T + + T =n(n+1)/2 Consider T +. A sufficiently small or large value is evidence to reject H 0 To obtain a p-value we compare T + with its null distribution –This is a symmetric discrete distribution A two-sided p-value is then Prob(T +5)+Prob(T +50)

To put your footer here go to View > Header and Footer 12 The p-value calculation Exact method –Cumbersome –Use appropriate software Large sample approximation –Approximate the null distribution of T + using a normal distribution –n>20 will usually give a reasonable approximation

To put your footer here go to View > Header and Footer 13 P-value calculation Using statistical software, e.g. Stata: sign | obs sum ranks positive | 8 50 negative | 2 5 all | Ho: diff = 0 z = Prob > |z| = Approximate p-value = From SPSS, exact p-value = 0.020

To put your footer here go to View > Header and Footer 14 Conclusions The p-value is small. Hence, there is evidence to reject H 0 The estimated median difference (after – before), 0.53, is significant There is evidence based on this study for a positive fertiliser effect

To put your footer here go to View > Header and Footer 15 Comments While the Wilcoxon signed-rank test makes less restrictive assumptions than the t-test there are still a number of major practical problems The symmetric assumption is still quite limiting, as many distributions are skewed Confidence intervals (CIs) –As with the sign test (Session 19) most software packages concentrate on the p-value rather than point estimates and confidence intervals

To put your footer here go to View > Header and Footer 16 Two independent samples For comparing independent samples, a t- test for independent samples is often used If we were concerned about the validity of the underlying assumptions we could employ a non-parametric method The Wilcoxon rank-sum test (or the equivalent Mann-Whitney U test) is a common choice –Once again this is based on ranks

To put your footer here go to View > Header and Footer 17 Concluding remarks Non-parametric tests may be used. However… –Their usefulness is often over-rated –The lack of confidence intervals is a major disadvantage Practical statistics is frequently more complicated than comparing two groups. In this case, t-test methodology naturally extends into more a more general modelling framework The non-parametric tests discussed do not naturally extend

To put your footer here go to View > Header and Footer 18