How and why to use Spearman’s Rank… If you have done scattergraphs, Spearman’s Rank offers you the opportunity to use a statistical test to get a value.

Slides:



Advertisements
Similar presentations
1 Bayes’ Theorem. 2 Let’s consider an example. Say you have 31 people who play golf. One way to divide up the people is to put them in groups based on.
Advertisements

Means, variations and the effect of adding and multiplying Textbook, pp
Price Elasticity What is it all about ?. Situation 1 You have $15 to spend You must spend all your money In my “Econ goodies” shop the prices for my goods.
Probability Probability: what is the chance that a given event will occur? For us, what is the chance that a child, or a family of children, will have.
Adding and Subtracting FRACTIONS!!!!
Don’t worry!. Here is an example and reminders to help you with your homework.
QUICK MATH REVIEW & TIPS 2
Mr F’s Maths Notes Statistics 8. Scatter Diagrams.
Copyright © Cengage Learning. All rights reserved.
The Normal Distribution
Mr Barton’s Maths Notes
1 The Sample Mean rule Recall we learned a variable could have a normal distribution? This was useful because then we could say approximately.
APPLICATIONS OF DIFFERENTIATION 4. In Sections 2.2 and 2.4, we investigated infinite limits and vertical asymptotes.  There, we let x approach a number.
Solving Algebraic Equations
What are some of the ways you that that we can categorise numbers?
Perfect Squares
1.3 ORGANIZATIONAL PLANNING & DECISION MAKING INTRODUCTION TO DECISION TREES (HIGHER LEVEL CONTENT)
 Percentage Bar graphs are similar ways to pie graphs. They are used to show different amounts of related data.  They are construct using a single bar.
The Marriage Problem Finding an Optimal Stopping Procedure.
Simplifying Rational Expressions – Part I
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
Statistical Analysis to show Relationship Strength.
14 Elements of Nonparametric Statistics
1 Psych 5500/6500 Data Transformations Fall, 2008.
Mathematical Methods Wallands Community Primary School
Review Factoring Techniques for the Final Exam
The Normal Distribution The “Bell Curve” The “Normal Curve”
Jon Curwin and Roger Slater, QUANTITATIVE METHODS: A SHORT COURSE ISBN © Cengage Chapter 2: Basic Sums.
Correlation – Pearson’s. What does it do? Measures straight-line correlation – how close plotted points are to a straight line Takes values between –1.
Data analysis – Spearman’s Rank 1.Know what Spearman’s rank is and how to use it 2.Be able to produce a Spearman’s rank correlation graph for your results.
Probability Section 7.1.
30 marks 25% of final grade.  Is an outline of what this work is about.  Why are you doing this?  What is it you are going to do? (Hypothesis/Aims)
SQUARE ROOTS. This isn’t exactly true, but for the next 3 weeks: “Radical” means the same thing as “square root” *Side Note:
Warm-up Simplify. 5x x – a + 2b – (a – 2b) Multiply.
In Sections 2.2 and 2.4, we investigated infinite limits and vertical asymptotes.  There, we let x approach a number.  The result was that the values.
5.4 Factoring Greatest Common Factor,
Chapter 17: The binomial model of probability Part 3 AP Statistics.
Mr Barton’s Maths Notes Graphs 2. Quadratics and Cubics
Sampling  When we want to study populations.  We don’t need to count the whole population.  We take a sample that will REPRESENT the whole population.
11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.
Algebraic Thinking 5 th Grade Guided Instruction Finding Rules and Writing Equations For Patterns.
Multiplying Decimals Type your name and send: Next slide.
1 Research Methods in Psychology AS Descriptive Statistics.
Targeting that Grade C in Mathematics A Simplified Revision Guide St Edmund Campion Mathematics Department.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
The T-Test Are our results reliable enough to support a conclusion?
The Normal Approximation for Data. History The normal curve was discovered by Abraham de Moivre around Around 1870, the Belgian mathematician Adolph.
+ Mortality. + Starter for 10…. In pairs write on a post it note: One statistic that we use to measure mortality On another post it note write down: A.
I can factor trinomials with grouping.. Factoring Chart This chart will help you to determine which method of factoring to use. TypeNumber of Terms 1.
Examining difference: chi-squared (x 2 ). When to use Chi-Squared? Chi-squared is used to examine differences between what you actually find in your study.
IB DP Geography – IA. Is Decatur a “typical” Central Business District? Higher Level: test a maximum of 3 hypotheses Standard Level: test a maximum of.
Complex Numbers and Equation Solving 1. Simple Equations 2. Compound Equations 3. Systems of Equations 4. Quadratic Equations 5. Determining Quadratic.
Statistics for A2 Biology Standard deviation Student’s t-test Chi squared Spearman’s rank.
Year 5 - Numeracy Title page..
Step 1: Specify a null hypothesis
Spearman’s Rank correlation coefficient
Factoring x2 + bx + c ax2 + bx + c when a=1
10: Leisure at an International Scale: Sport
Using Statistical techniques in Geography
Chi Square (2) Dr. Richard Jackson
Using Data to Analyze Trends: Spearman’s Rank
How and why to use Spearman’s Rank…
Correlation and the Pearson r
Multiplying Mixed Numbers
Spearman’s Rank For relationship data.
Top 10 maths topics that GCSE students struggle with
Presentation transcript:

How and why to use Spearman’s Rank… If you have done scattergraphs, Spearman’s Rank offers you the opportunity to use a statistical test to get a value which can determine the strength of the relationship between two sets of data…

So how do we do it? This is the equation, and looks complicated, so let’s think carefully about how we can do this… The best way to do this would be through an example. If we were looking at Settlement patterns for a town’s CBD in Geography, we may wish to compare aspects of the town, such as whether the number of people in a zone affect the type of shops that locate there (i.e. – convenience shops) To do this, we would construct a table as shown overleaf… In the above, r s refers to the overall value or rank The equation has to be done before the value is taken away from 1 In the above equation, the sign means ‘the total of’ d 2 is the first thing we will try to establish in our ranked tables (see next slides) ‘n’ refers to the number of sites or values you will process – so if there were there 15 river sites, ‘n’ would be 15. If there were 20 pedestrian count zones, ‘n’ would be 20, and so on…

Zone Pedestrians Rank Convenience shops Rank (r) Difference (d) D2D Here we have laid out a table of each of the twelve zones in a town 2. Pedestrian counts for each zone here 3. Number of Convenience shops for each zone here 4. We now need to rank the data (two highlighted columns)– this is shown overleaf

Zone Pedestrians Rank Convenience shops Rank (r) Difference (d) D2D You will see here that on this example, the pedestrian counts have been ranked from highest to Lowest, with the Highest value (70) Being ranked as Number 1, the Lowest value (8) Being ranked as Number 12.

Zone Pedestrians Rank Convenience shops Rank (r) Difference (d) D2D So that was fairly easy… We need to now do the next column for Convenience shops too. But hang on! Now we have a problem… We have two values that are 8, so what do we do? The next two ranks would be 4 and 5; we add the two ranks together and divide it by two. So these two ranks would both be called 4.5

Zone Pedestrians Rank Convenience shops Rank (r) Difference (d) D2D This is normally the point where one of the biggest mistakes is made. Having gone from 4.5, students will often then rank the next value as 5. But they can’t! Why not? Because we have already used rank number 5! So we would need to go to rank 6 This situation is complicated further by the fact that the next two ranks are also tied. So we do the same again – add ranks 6 and 7 and divide it by 2 to get 6.5

Rank Rank (r) Having ranked both sets of data we now need to work out the difference (d) between the two ranks. To do this we would take the second rank away from the first. This is demonstrated on the next slide

Zone Pedestrians Rank Convenience shops Rank (r) Difference (d) The difference between the two ranks has now been established So what next? We need to square each of these d values… Don’t worry if you have any negative values here – when we square them (multiply them by themselves) they will become positives

Zone Pedestrians Rank Convenience shops Rank (r) Difference (d) D2D So, the first value squared would be 0.25 (-0.5 x -0.5)

So what do we with these ‘d 2 ’ figures? First we need to add all of the figures in this d 2 column together This gives us…. 32 Now we can think about doing the actual equation!

Firstly, let’s remind ourselves of the equation... In this equation, we know the total of d 2, which is 32 So the top part of our equation is… 6 x 32 We also know what ‘n’ is (the number of sites or zones - 12 in this case), so the bottom part of the equation is… (12x12x12) - 12

We can now do the equation… 6 x OK – so this gives us a figure of

This is the equation, which we will by now be sick of! I have circled the part of the equation that we have done… Remember that we need to take this value that we have calculated away from 1. Forgetting to do this is probably the second biggest mistake that people make! So… 1 – = 0.888

So we have our Spearman’s Rank figure….But what does it mean? Your value will always be between -1 and +1 in value. As a rough guide, our figure of demonstrates there is a fairly positive relationship. It suggests that where pedestrian counts are high, there are a high number of convenience shops Should the figure be close to -1, it would suggest that there is a negative relationship, and that as one thing increases, the other decreases.

However… Just looking at a line and making an estimation isn’t particularly scientific. To be more sure, we need to look in critical values tables to see the level of significance and strength of the relationship. This is shown overleaf…

N 0.05 level0.01 level This is a critical values table and the ‘n’ column shows the numbers of sites or zones you have studied. In our case, we looked at 12 zones. 2. If look across we can see there are two further columns – one labelled 0.05, the other The first, 0.05 means that if our figure exceeds the value, we can be sure that 95 times in 100 the figures occurred because a relationship exists, and not because of pure chance The second, 0.01, means that if our figure exceeds this value, we can be sure that 99 times in 100 the figures occcurred because a relationship exists, and did not occur by chance. We can see that in our example our figure of exceeds the value of at the 0.05 level and also comfortably exceeds value at the 0.01 level too.

In our example above, we can see that our figure of exceeds the values at both the 95% and 99% levels. The figure is therefore highly significant

Finally… You need to think how you can use this yourself…I would advise that you do scattergraphs for the same sets of data so that you have a direct comparison