Correlation Assumptions: You can plot a scatter graph You know what positive, negative and no correlation look like on a scatter graph
Correlation describes the strength of the relationship between two variables. Paired data is often known as bivariate data. In S1 we will look at ways of measuring the degree of linear association First establish whether a linear correlation exists using a scatter diagram. x x x x x x x xx x x x x x x x x x x x x x x x x x x x
Correlation describes the strength of the relationship between two variables. Paired data is often known as bivariate data. In S1 we will look at ways of measuring the degree of linear association First establish whether a linear correlation exists using a scatter diagram. x x x x x x x xx x x x x x x x x x x x x x x x x x x x x x x
x x x x x x x xx x x x x x x x x x x x x x x x x x x x x x x ① ② ③④ Correlation Positive (most in 1 st & 3 rd ) Negative (most in 2 nd & 4 th ) None Assuming you believe a linear relationship exists, we can calculate a measure of how strong it is.
Product Moment Correlation Coefficient (PMCC) Quadrant 1 st 2 nd 3 rd 4 th ① ② ③④ x x x x x x x xx x x x x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st 2 nd 3 rd 4 th ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st + 2 nd 3 rd 4 th ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd 3 rd 4 th ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd 3 rd 4 th ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd 3 rd 4 th ① ② ③④ x Complete the table…
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd - 3 rd 4 th ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd rd 4 th ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd rd 4 th ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd rd 4 th ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd rd - 4 th ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd rd -- 4 th ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd rd th ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd rd th ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd rd th + ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd rd th +- ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd rd th +-- ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd rd th +-- ① ② ③④ x
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd rd th +-- x If we sum the values… For this example, since most points are in 1 st & 3 rd quadrants, the total will be positive (hence positive correlation) A negative correlation would be overall negative No correlation would give a sum close to zero What would be the effect on the sum in the example above if we used a data set ten times bigger?
Product Moment Correlation Coefficient (PMCC) x x x x x x x xx x x x Quadrant 1 st nd rd th +-- x If we sum the values… For this example, since most points are in 1 st & 3 rd quadrants, the total will be positive (hence positive correlation) A negative correlation would be overall negative No correlation would give a sum close to zero What would be the effect on the sum if we changed the units, e.g. used cm instead of metres for a measurement?
Product Moment Correlation Coefficient (PMCC) If we sum the values… For this example, since most points are in 1 st & 3 rd quadrants, the total will be positive (hence positive correlation) A negative correlation would be overall negative No correlation would give a sum close to zero To eliminate these problems we use the following formula. This will always give a value between -1 and 1
NB. Don’t use PMCC if a different type of correlation exists, For example if points follow a clear curve
An easier version of the formula NB You are given all these formulas in the exam
Example Find PMCC
Find PMCC
Find PMCC
Example Find PMCC
Example Find PMCC
Example Find PMCC Complete the table and calculate the totals
Example Find PMCC
Example Find PMCC
Example Find PMCC
Example Find PMCC
Example Find PMCC
Example
Example Find PMCC
Example Find PMCC
Example of Q that can’t be done using the data function
s.f s.f.