Exploring Relationships: Correlations & Multiple Linear Regression Dr James Betts Developing Study Skills and Research Methods (HL20107)

Slides:



Advertisements
Similar presentations
Introduction to Study Skills & Research Methods (HL10040)
Advertisements

CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
Correlation and regression
Correlation Chapter 6. Assumptions for Pearson r X and Y should be interval or ratio. X and Y should be normally distributed. Each X should be independent.
Correlation and regression Dr. Ghada Abo-Zaid
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 12 Measures of Association.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
Lecture 3: Chi-Sqaure, correlation and your dissertation proposal Non-parametric data: the Chi-Square test Statistical correlation and regression: parametric.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Stat 217 – Week 10. Outline Exam 2 Lab 7 Questions on Chi-square, ANOVA, Regression  HW 7  Lab 8 Notes for Thursday’s lab Notes for final exam Notes.
SIMPLE LINEAR REGRESSION
Correlational Designs
Multiple Regression Research Methods and Statistics.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Correlation and Regression Quantitative Methods in HPELS 440:210.
Statistics Idiots Guide! Dr. Hamda Qotba, B.Med.Sc, M.D, ABCM.
Correlation & Regression
Inferential Statistics
Leedy and Ormrod Ch. 11 Gray Ch. 14
Lecture 16 Correlation and Coefficient of Correlation
Chapter 12 Correlation and Regression Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
Introduction to Linear Regression and Correlation Analysis
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Chapter 13: Inference in Regression
Hypothesis Testing Charity I. Mulig. Variable A variable is any property or quantity that can take on different values. Variables may take on discrete.
Learning Objective Chapter 14 Correlation and Regression Analysis CHAPTER fourteen Correlation and Regression Analysis Copyright © 2000 by John Wiley &
Statistics 11 Correlations Definitions: A correlation is measure of association between two quantitative variables with respect to a single individual.
L 1 Chapter 12 Correlational Designs EDUC 640 Dr. William M. Bauer.
Examining Relationships in Quantitative Research
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Chapter Sixteen Copyright © 2006 McGraw-Hill/Irwin Data Analysis: Testing for Association.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Examining Relationships in Quantitative Research
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
Correlation and Linear Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
Correlation & Regression Analysis
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
CORRELATION ANALYSIS.
Research Methodology Lecture No :26 (Hypothesis Testing – Relationship)
Chapter 13 Understanding research results: statistical inference.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Statistics Correlation and regression. 2 Introduction Some methods involve one variable is Treatment A as effective in relieving arthritic pain as Treatment.
Topics, Summer 2008 Day 1. Introduction Day 2. Samples and populations Day 3. Evaluating relationships Scatterplots and correlation Day 4. Regression and.
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Interpretation of Common Statistical Tests Mary Burke, PhD, RN, CNE.
Appendix I A Refresher on some Statistical Terms and Tests.
Chapter 13 Simple Linear Regression
32931 Technology Research Methods Autumn 2017 Quantitative Research Component Topic 4: Bivariate Analysis (Contingency Analysis and Regression Analysis)
Correlation, Bivariate Regression, and Multiple Regression
10.2 Regression If the value of the correlation coefficient is significant, the next step is to determine the equation of the regression line which is.
Non-Parametric Tests 12/1.
Non-Parametric Tests 12/1.
Non-Parametric Tests 12/6.
Regression Analysis.
Non-Parametric Tests.
Understanding Research Results: Description and Correlation
Correlation and Regression
CORRELATION ANALYSIS.
Non – Parametric Test Dr. Anshul Singh Thapa.
Unit XI: Data Analysis in nursing research
Presentation transcript:

Exploring Relationships: Correlations & Multiple Linear Regression Dr James Betts Developing Study Skills and Research Methods (HL20107)

Lecture Outline: Correlation Coefficients Coefficients of Determinations Prediction & Regression Multiple Linear Regression Assessment Details.

StatisticsDescriptiveInferentialCorrelational Relationships GeneralisingOrganising, summarising & describing data Significance

Correlation A measure of the relationship (correlation) between interval/ratio LOM variables taken from the same set of subjects A ratio which indicates the amount of concomitant variation between two sets of scores This ratio is expressed as a correlation coefficient (r): Perfect Negative Relationship Perfect Positive Relationship No Relationship + _ Strong ModerateWeak Strong ModerateWeak

Correlation Coefficient & Scatterplots Direction Variable X (e.g. VO 2 max). Variable Y (e.g. 10 km run time) Variable X (e.g. VO 2 max) Variable Y (e.g. Exercise Capacity).

Correlation Coefficient & Scatterplots Variable X (e.g. VO 2 max) Variable Y (e.g. Exercise Capacity). Variable X (e.g. Age) Variable Y (e.g. Strength) Form

Correlation Coefficient & Scatterplots Variable X (e.g. VO 2 max) Variable Y (e.g. Exercise Capacity). Significance Variable X (e.g. VO 2 max) Variable Y (e.g. 100 m Sprint time).

Correlation Coefficient & Scatterplots Variable X (e.g. VO 2 max) Variable Y (e.g. Exercise Capacity). Variable X (e.g. VO 2 max) Variable Y (e.g. 100 m sprint time). Significance

Methods of Calculating r Any method of calculating r requires: –Homoscedacity (i.e. equal scattering) –Linear data (curvilinear data requires eta η) Parametric data (i.e. raw data >ordinal LOM and either normal distribution or large sample) permits the use of Pearsons Product-Moment Correlation If raw data violates these assumptions then use Spearmans Rank Order Correlation instead.

X = Alcohol UnitsY = Skill ScoreX2X2 Y2Y2 XY Totals= Pearsons Product-Moment Correlation

r = n XY-( X)( Y) [n X 2 -( X) 2 ] [n Y 2 -( Y) 2 Pearsons Product-Moment Correlation

X = Alcohol UnitsY = Skill ScoreRank XRank YDD2D Total= Spearmans Rank-Order Correlation

r = D 2 n(n 2 -1)

SPSS Correlation Outputs

Coefficient of Determination (r 2 x 100) AKA variance explained, this figure denotes how much of the variance in Y can be explained/predicted by X e.g. to predict long jump distance (Y) from maximum sprint speed (X) r= 0.8 r 2 = 64% YX

Correlation versus Regression By attempting to predict one variable using another, we are now moving away from simple correlation and moving into the concept of regression Correlation = Regression =

Linear Regression The equation for a linear relationship can be expressed as: Y= a + bX - where: a = the y intercept; and b = the gradient Variable X (e.g. VO 2 max) Variable Y (e.g. Exercise Capacity).

SPSS Regression Output

Extrapolation versus Interpolation Variable X (e.g. VO 2 max) Variable Y (e.g. Exercise Capacity). Remember that the accuracy of your equation depends upon the linear relationship you observed ? Interpolation = Extrapolation =

Multiple Linear Regression We saw earlier how maximum sprint speed (X) can predict/explain 64% of variance in long jump distance (Y) Y X r 2 = 64% …but can Y be predicted any more effectively using more than one independent variable (i.e. X 1, X 2, X 3, etc)?

Multiple Linear Regression However, we can often predict Y effectively just using a specific subset of X variables (i.e. a reduced model) Y X1X1 X 2 Event Experience

Multiple Linear Regression Best Subset Selection Methods involve calculation of r for every possible combination of IVs Stepwise regression methods involve gradually either adding or removing variables and monitoring the impact of each action on r. –Standard methods add and remove variables –Forward selection methods begin with 1 IV and add more –Backwards elimination methods begin with all IVs and remove The order in which IVs are added/removed is critical as the variance explained solely by any one will be entirely dependent upon the presence of others.

SPSS Multiple Linear Regression Output

Summary: Exploring Relationships The relationship between two variables can be expressed as a correlation coefficient (r) The coefficient of determination (r 2 ) denotes the % of one variable that is explained by another Linear regression can provide an equation with which to predict one variable from another Multiple linear regression can potentially improve this prediction using multiple predictor variables.

Coursework Project (40 % overall grade) Your coursework will require you to address 2 out of 3 research scenarios that are available on the unit webpage For each of the 2 scenarios you will need to: –Perform a literature search in order to provide a comprehensive introduction to the research area –Identify the variables of interest and evaluate the research design which was adopted –Formulate and state appropriate hypotheses…

Contd… –Summarise descriptive statistics in an appropriate and well presented manner –Select the most appropriate statistical test with justification for your decision –Transfer the output of your inferential statistics into your word document –Interpret your results and discuss the validity and reliability of the study –Draw a meaningful conclusion (state whether hypotheses are accepted or rejected).

Coursework Details (see unit outline) 2000 words maximum (i.e for each) Any supporting SPSS data/outputs to be appended To be submitted on Thursday 6 th May Assessment Weighting Evaluation & Analysis (30 %) Reading & Research (20 %) Communication & Presentation (20 %) Knowledge (30 %)

Coursework Details All information relating to your coursework (including the relevant data files) are accessible via the module web page: Web address also referenced on shared area Electronic copy to be included with submission. Any further questions/problems can be raised in the CW revision lecture/labs after Easter

Timed Practical Computing Exercise (20 % overall grade) This test will involve analysis/interpretation of the resultant data assessed via short answer questions Practice session Wednesday 14 th April Duration = 80 min (2 groups) I will specific details after Easter.

both IVs unpaired Both IVs paired >2 variables 2 variables >2 groups 2 groups >2 observations 2 observations >1 observed frequency 1 observed frequency Looking for differences between categories/frequencies? (i.e. nominal LOM) Goodness of Fit χ 2 Looking for differences within the same group of subjects? (i.e. paired data) Looking for differences between 2 separate groups of subjects? (i.e. unpaired data) Looking for relationships? Looking for differences with >1 independent variable? Contingency χ 2 Paired t-test 1-way paired ANOVA Independent t-test 1-way unpaired ANOVA Pearsons r Multiple Linear Regression 2-way paired ANOVA 2-way unpaired ANOVA 1 IV paired 1 IV unpaired 2-way mixed model ANOVA Wilcoxon test Friedmans test Mann-Whitney test Kruskal Wallis test Spearmans r Post-Hoc Tests non-parametric Start Here If multiple DVs are involved then use MANOVA