Published by Aiden Barnes. Modified over 3 years ago.

1
Computing in Archaeology
Session 11. Correlation and regression analysis
© Richard Haddlesey www.medievalarchitecture.net

2
Lecture aims
To introduce correlation and regression techniques

3
The scattergram
In correlation, we are always dealing with paired scores, so the values of the two variables, taken together, are used to make a scattergram.

4
Example
Quantities of New Forest pottery recovered from sites at varying distances from the kilns

Site   Distance (km)   Quantity
1      4               98
2      20              60
3      32              41
4      34              47
5      24              62

5
Negative correlation
Here we can see that the quantity of pottery decreases as distance from the source increases.

6
Positive correlation
Here we see that the taller a pot, the wider the rim.

7
Curvilinear monotonic relation
Again, the further from the source, the smaller the quantity of artefacts, but the decrease follows a curve rather than a straight line.

8
Arched relationship (non-monotonic)
Here we see that the first molar increases with age and is then worn down as the animal gets older.

10
Scattergram
This shows us that scattergrams are the most important means of studying relationships between two variables.

11
REGRESSION
Regression differs from the other techniques we have looked at so far in that it is concerned not just with whether or not a relationship exists, or with the strength of that relationship, but with its nature.
In regression analysis we use an independent variable to estimate (or predict) the values of a dependent variable.

12
Regression equation
y = f(x)
y = y axis (in this case the dependent variable)
f = function (of x)
x = x axis

13
y = f(x)
y = x
y = 2x
y = x²

15
General linear equations
y = a + bx
where y is the dependent variable, x is the independent variable, and the coefficients a and b are constants, i.e. they are fixed for a given data set.

16
Therefore:
If x = 0 then the equation reduces to y = a, so a represents the point where the regression line crosses the y axis (the intercept).
The constant b defines the slope, or gradient, of the regression line.
Thus, for pottery quantity in relation to distance from the source, b represents the amount by which pottery quantity changes for each unit of distance from the source (a decrease here, so b is negative).
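As a minimal sketch of what the two constants mean, the Python below evaluates the fitted pottery line quoted later in these slides, y = 102.64 – 1.8x. The helper name `predict_quantity` is illustrative only and does not come from the lecture.

```python
# Coefficients of the line y = a + bx, taken from the fitted equation
# quoted later in the slides (y = 102.64 - 1.8x).
A = 102.64   # intercept: value of y when x = 0
B = -1.8     # slope: change in y for each extra km of distance

def predict_quantity(distance_km: float) -> float:
    """Estimate pottery quantity at a given distance (hypothetical helper)."""
    return A + B * distance_km

at_source = predict_quantity(0.0)    # x = 0, so y reduces to a
at_10_km = predict_quantity(10.0)    # estimate 10 km from the kilns
# Moving one km further out changes the estimate by exactly b.
change_per_km = predict_quantity(11.0) - predict_quantity(10.0)

print(at_source, at_10_km, change_per_km)
```

The intercept is the estimate at the kilns themselves, and the slope is the same wherever along the line the one-kilometre step is taken.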

17
y = a + bx

20
least-squares

24
y = a + bx

26
y = 102.64 – 1.8x
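The slides do not show the arithmetic behind this line, so the following is only a sketch of an ordinary least-squares fit in Python. It assumes the example table reads as five sites with distances 4, 20, 32, 34 and 24 km and quantities 98, 60, 41, 47 and 62; the function name `least_squares_fit` is hypothetical.

```python
# Pottery example data (assumed reading of the flattened example table).
distance_km = [4, 20, 32, 34, 24]
quantity = [98, 60, 41, 47, 62]

def least_squares_fit(xs, ys):
    """Return (a, b) for the least-squares line y = a + bx."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sum of squares about the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b = sxy / sxx              # slope
    a = mean_y - b * mean_x    # intercept: the line passes through the means
    return a, b

a, b = least_squares_fit(distance_km, quantity)
print(f"y = {a:.2f} + ({b:.2f})x")
```

Under that reading of the table, b comes out close to -1.80 and a close to 102.7. Rounding b to -1.8 before computing a gives 102.64, which agrees with the equation on the slide.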

29
CORRELATION
1 correlation coefficient, r, ranging from -1 to +1
2 significance

35
Levels of measurement:
nominal – in name only
ordinal – forming a sequence
interval – a sequence with fixed distances
ratio – fixed distances with a datum point

Product-Moment Correlation Coefficient: for interval and ratio data
Spearman's Rank Correlation Coefficient: for ordinal data

40
The Product-Moment Correlation Coefficient

41
Sample: 20 bronze spearheads (n = 20)
[Scattergram: length (cm) against width (cm)]

42
[Scattergram: length (cm) against width (cm), n = 20]
\[ r = \frac{n\sum xy - (\sum x)(\sum y)}{\sqrt{[n\sum x^2 - (\sum x)^2]\,[n\sum y^2 - (\sum y)^2]}} \]

45
\[ r = \frac{n\sum xy - (\sum x)(\sum y)}{\sqrt{[n\sum x^2 - (\sum x)^2]\,[n\sum y^2 - (\sum y)^2]}} = +0.67 \]
n = 20
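The formula above can be sketched in Python as follows. The spearhead measurements are not reproduced in these slides, so the example instead reuses the pottery data from the earlier example, again assuming the table reads as five distance and quantity pairs. The function name `product_moment_r` is illustrative.

```python
from math import sqrt

def product_moment_r(xs, ys):
    """Product-moment correlation coefficient from the raw sums."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)
    sum_y2 = sum(y * y for y in ys)
    numerator = n * sum_xy - sum_x * sum_y
    denominator = sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))
    return numerator / denominator

# Pottery example (assumed reading of the flattened example table).
distance_km = [4, 20, 32, 34, 24]
quantity = [98, 60, 41, 47, 62]

r = product_moment_r(distance_km, quantity)
print(f"r = {r:.2f}")
```

For the pottery data r is strongly negative (about -0.97), consistent with the negative correlation described on the earlier slide.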

46
Test of product moment correlation coefficient
H0: true correlation coefficient = 0
H1: true correlation coefficient ≠ 0
Assumptions: both variables approximately random
Sample statistics needed: n and r
Test statistic: TS = r
Table: product moment correlation coefficient table

54
n = 20, r = 0.67, p < 0.01
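The lecture looks r up directly in a table of critical values. As a cross-check only, the sketch below uses a different but equivalent route that is not in the slides: converting r to a t statistic with n - 2 degrees of freedom and comparing it with the standard two-tailed 1% critical value of t for 18 degrees of freedom (2.878). The function name `t_from_r` is hypothetical.

```python
from math import sqrt

def t_from_r(r: float, n: int) -> float:
    """Convert a correlation coefficient to a t statistic with n - 2 df."""
    return r * sqrt((n - 2) / (1 - r ** 2))

n, r = 20, 0.67            # values quoted on the slide
t = t_from_r(r, n)

# Two-tailed critical value of t at the 1% level for 18 degrees of
# freedom, taken from a standard t table (not from the slides).
T_CRITICAL_01 = 2.878
significant_at_1_percent = abs(t) > T_CRITICAL_01

print(f"t = {t:.2f}, significant at 1%: {significant_at_1_percent}")
```

The t statistic exceeds the critical value, which agrees with the slide's conclusion that p < 0.01.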

56
[Scattergram: length (cm) against width (cm)]

57
Spearman's Rank Correlation Coefficient (r_s)
H0: true correlation coefficient = 0
H1: true correlation coefficient ≠ 0
Assumptions: both variables at least ordinal
Sample statistics needed: n and r_s
Test statistic: TS = r_s
Table: Spearman's rank correlation coefficient table
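The slides do not give a formula for r_s. A minimal sketch, assuming no tied values, uses the usual rank-difference form r_s = 1 - 6Σd² / (n(n² - 1)), where d is the difference between the two ranks of each pair. The data are the pottery example from earlier (assumed reading of the table), and the function names `ranks` and `spearman_rs` are illustrative.

```python
def ranks(values):
    """Rank values from 1 (smallest) upward; assumes no tied values."""
    order = sorted(values)
    return [order.index(v) + 1 for v in values]

def spearman_rs(xs, ys):
    """Spearman's rank correlation via the rank-difference formula (no ties)."""
    n = len(xs)
    d_squared = sum((rx - ry) ** 2 for rx, ry in zip(ranks(xs), ranks(ys)))
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))

# Pottery example (assumed reading of the flattened example table).
distance_km = [4, 20, 32, 34, 24]
quantity = [98, 60, 41, 47, 62]

rs = spearman_rs(distance_km, quantity)
print(f"r_s = {rs:.2f}")
```

Because only the ranks are used, the same calculation applies to any pair of variables measured at the ordinal level or above. Data with ties would need average ranks and a different calculation.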
