Mann Whitney U For comparison data. Using Mann Whitney U Non-parametric i.e. no assumptions are made about data fitting a normal distribution Is used.

Slides:



Advertisements
Similar presentations
TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Advertisements

C) between 18 and 27. D) between 27 and 50.
The Mann-Whitney U Test What you need to know
You have been given a mission and a code. Use the code to complete the mission and you will save the world from obliteration…
Advanced Piloting Cruise Plot.
Introductory Mathematics & Statistics for Business
Prepared by Lloyd R. Jaisingh
Chapter 1 The Study of Body Function Image PowerPoint
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
STATISTICS HYPOTHESES TEST (II) One-sample tests on the mean and variance Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National.
Chapter 1 Image Slides Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
1 2 Test for Independence 2 Test for Independence.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Addition Facts
Overview of Lecture Parametric vs Non-Parametric Statistical Tests.
C82MST Statistical Methods 2 - Lecture 2 1 Overview of Lecture Variability and Averages The Normal Distribution Comparing Population Variances Experimental.
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
Around the World AdditionSubtraction MultiplicationDivision AdditionSubtraction MultiplicationDivision.
Learning to show the remainder
Non-parametric tests, part A:
Solve Multi-step Equations
Break Time Remaining 10:00.
Elementary Statistics
Independent Measures T-Test Quantitative Methods in HPELS 440:210.
Nonparametric Test Distribution-Free Tests 1.No assumptions of normality 2.Focus on medians rather than means 3.Not affected by outliers 4.Des NOT really.
(This presentation may be used for instructional purposes)
Non-Parametric Statistics
Non-parametric statistics
ABC Technology Project
Chi-Square and Analysis of Variance (ANOVA)
Hypothesis Tests: Two Independent Samples
Twenty Questions Subject: Twenty Questions
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
1..
Quantitative Analysis (Statistics Week 8)
Adding Up In Chunks.
Lets play bingo!!. Calculate: MEAN Calculate: MEDIAN
Chapter 5 Test Review Sections 5-1 through 5-4.
GG Consulting, LLC I-SUITE. Source: TEA SHARS Frequently asked questions 2.
Addition 1’s to 20.
Please enter data on page 477 in your calculator.
25 seconds left…...
Copyright © Cengage Learning. All rights reserved.
U1A L1 Examples FACTORING REVIEW EXAMPLES.
Week 1.
Statistical Inferences Based on Two Samples
© The McGraw-Hill Companies, Inc., Chapter 10 Testing the Difference between Means and Variances.
We will resume in: 25 Minutes.
© The McGraw-Hill Companies, Inc., Chapter 12 Chi-Square.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Figure Essential Cell Biology (© Garland Science 2010)
Chapter Thirteen The One-Way Analysis of Variance.
Tutorial: Understanding the normal curve. Gauss Next mouse click.
PSSA Preparation.
Copyright © 2013 Pearson Education, Inc. All rights reserved Chapter 11 Simple Linear Regression.
Experimental Design and Analysis of Variance
1 Chapter 20: Statistical Tests for Ordinal Data.
Number Factors and Multiples Saturday, 09 September 2006 ©RSH.
Simple Linear Regression Analysis
Multiple Regression and Model Building
January Structure of the book Section 1 (Ch 1 – 10) Basic concepts and techniques Section 2 (Ch 11 – 15): Inference for quantitative outcomes Section.
Columbus State Community College
Spearman’s Rank For relationship data. Non-parametric i.e. no assumptions are made about data fitting a normal distribution You must have more than 5.
Statistics 07 Nonparametric Hypothesis Testing. Parametric testing such as Z test, t test and F test is suitable for the test of range variables or ratio.
Spearman’s Rank For relationship data.
Presentation transcript:

Mann Whitney U For comparison data

Using Mann Whitney U Non-parametric i.e. no assumptions are made about data fitting a normal distribution Is used to compare the medians of two sets of data It measures the overlap between the two data sets You must have between 6 and 20 replicates of data The data sets can have unequal numbers of replicates

Example of normally distributed data

When to use Mann-Whitney U-test Curve not normally distributed ie. non parametric Compares overlap between two data sets

The Equation U1U1 = n 1 x n 2 + ½ n 2 (n 2 + 1) -  R 2 U2U2 = n 1 x n 2 + ½ n 1 (n 1 + 1) -  R 1

The Equation U1U1 = n 1 x n 2 + ½ n 2 (n 2 + 1) -  R 2 U2U2 = n 1 x n 2 + ½ n 1 (n 1 + 1) -  R 1 Where: U1U1 =Mann - Whitney U for data set 1 n1n1 =Sample size of data set 1  R1 R1 =Sum of the ranks of data set 1 U2U2 =Mann - Whitney U for data set 2 n2n2 =Sample size of data set 2  R2 R2 =Sum of the ranks of data set 2

1. Establish the Null Hypothesis H 0 (this is always the negative form. i.e. there is no significant correlation between the variables) and the alternative hypothesis (H 1 ). Method H 0 - There is no significant difference between the variable at Site 1 and Site 2 H 1 - There is a significant difference between the variable at Site 1 and Site 2

2. Copy your data into the table below as variable x and variable y and label the data sets Rank 1 R 1 Data Set 1 Beech Hill (m) Data Set 2 Rushey Plain (m) Rank 2 R 2

Rank 1 R 1 Data Set 1 Beech Hill (m) Data Set 2 Rushey Plain (m) Rank 2 R 2 Start from the lowest and put the numbers in order: 16, 17, 18, 19, 20, 20, 21, 21, 22, 23, 23, 24, Treat both sets of data as one data set and rank them in increasing order (the lowest data value gets the lowest rank)

When you have data values of the same value, they must have the same rank. Take the ranks you would normally assign (5 and 6) and add them together (11) and divide the ranks between the data values(5.5) The lowest data value gets a rank of The same thing is done for all data values that are the same

When you have data values of the same value, they must have the same rank. Take the ranks you would normally assign (5 and 6) and add them together (11) and divide the ranks between the data values(5.5) The lowest data value gets a rank of The assigned ranks can then be put into the table

Rank 1 R Data Set 1 Beech Hill (m) Data Set 2 Rushey Plain (m) Rank 2 R Sum the ranks for each set of data (  R)  R 1 = = 68  R 2 = = 23

5. Calculate the number of samples in each data set (n) Count the number of samples in each of the data sets Rank 1 R Data Set 1 Beech Hill (m) Data Set 2 Rushey Plain (m) Rank 2 R n 1 = 7 n 2 = 6

U1U1 = n 1 x n 2 + ½ n 2 (n 2 + 1) -  R 2 U2U2 = n 1 x n 2 + ½ n 1 (n 1 + 1) -  R 1 It is a good idea to break the equations down into three bite size chunks that will then give you a very easy three figure sum U1U1 =n 1 x n 2 +½ n 2 (n 2 + 1)-  R 2 U2U2 =n 1 x n 2 +½ n 1 (n 1 + 1)-  R 1 6. Calculate the Values for U 1 and U 2 using the equations

U1U1 = n 1 x n 2 + ½ n 2 (n 2 + 1) -  R 2 (7x6) +3(6+1) -23 (7x6) +3(7) -23 U 1 = = 40 U 2 = = 2 (7x6) +3.5(8) -68 (7x6) +3.5(7+1) -68 U2U2 = n 1 x n 2 + ½ n 1 (n 1 + 1) -  R 1

The smallest U value isU 2 = 2 6. Compare the smallest U value against the table of critical values

Value of n n1n (at the 0.05 or 95% confidence level i.e. we are 95% confident our data was not due to chance) We us the values of n 1 and n 2 to find our critical value

Is 2 (our smallest U value) smaller or larger than 6 (our critical value from the Mann Whitney Table)? Smaller The smallest U value is less than the critical value; therefore the null hypothesis is rejected The alternative Hypothesis can be accepted – There is a significant difference between the tree heights of Beech Hill and Rushey Plain

Use the following data to calculate U values independently Rank 1 R 1 Velocity cm.s -1 Pools Velocity cm.s -1 Riffles Rank 2 R 2 Rank 1 R 1 Abundance of Gammarus pulex Pools Abundance of Gammarus pulex Riffles Rank 2 R 2 Abundance of Gammarus pulex in pools and riffles of an Exmoor stream

Key questions Is there a significant relationship? Which data value/s would you consider to be anomalous and why? What graph would you use to present this data?