September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
Correlation and Linear Regression.
Inference for Regression
Correlation and Linear Regression
Simple Linear Regression 1. Correlation indicates the magnitude and direction of the linear relationship between two variables. Linear Regression: variable.
Correlation Chapter 9.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 13-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Lecture 6: Multiple Regression
Examining Relationship of Variables  Response (dependent) variable - measures the outcome of a study.  Explanatory (Independent) variable - explains.
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter Topics Types of Regression Models
Linear Regression and Correlation Analysis
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
Linear Regression Example Data
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Pertemua 19 Regresi Linier
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Business Statistics - QBM117 Statistical inference for regression.
Correlation and Regression Analysis
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Chapter 7 Forecasting with Simple Regression
Slide 1 Testing Multivariate Assumptions The multivariate statistical techniques which we will cover in this class require one or more the following assumptions.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Regression and Correlation Methods Judy Zhong Ph.D.
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
How to Analyze Data? Aravinda Guntupalli. SPSS windows process Data window Variable view window Output window Chart editor window.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
Soc 3306a Multiple Regression Testing a Model and Interpreting Coefficients.
Soc 3306a Lecture 9: Multivariate 2 More on Multiple Regression: Building a Model and Interpreting Coefficients.
Chapter 9 Analyzing Data Multiple Variables. Basic Directions Review page 180 for basic directions on which way to proceed with your analysis Provides.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Statistical Methods Statistical Methods Descriptive Inferential
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
University of Warwick, Department of Sociology, 2012/13 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 5 Multiple Regression.
Lecture 10: Correlation and Regression Model.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Tatjana Meschede, Ph.D., Center for Social Policy,
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
Correlation & Regression Analysis
ANOVA, Regression and Multiple Regression March
Statistics for Managers Using Microsoft® Excel 5th Edition
Lecture 10 Introduction to Linear Regression and Correlation Analysis.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
BUSINESS MATHEMATICS & STATISTICS. Module 6 Correlation ( Lecture 28-29) Line Fitting ( Lectures 30-31) Time Series and Exponential Smoothing ( Lectures.
Chapter 12: Correlation and Linear Regression 1.
Chapter 13 Simple Linear Regression
Regression Analysis.
Inference for Least Squares Lines
Lecture 10 Regression Analysis
Bivariate & Multivariate Regression Analysis
Correlation, Bivariate Regression, and Multiple Regression
Statistics for Managers using Microsoft Excel 3rd Edition
Regression Analysis.
Understanding Research Results: Description and Correlation
Regression Analysis Week 4.
Product moment correlation
Presentation transcript:

September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses using SPSS and Excel David Patterson, College of Social Work The University of Tennessee

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 2 Regression Analysis of HMIS Data What is regression analysis? –AKA - Linear regression, Ordinary Least Squares (OLS) –Bivariate regression - measures the association or relationship between a dependent variable (DV) and an independent variable (IV). Estimates the measurable difference in the DV for each one-unit of change in an IV. –Multiple regression - Measures the relationship between a single DV and two or more IV.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 3 Regression Analysis of HMIS Data What can regression analysis tell us about HMIS data? –It can us understand possible causal relationship between certain outcomes (DV) and possible causal factors (IV). E.G., Length of stay prior to housing (DV) and age (IV1), duration of homelessness (IV2), and current income (IV3). Stated another way, how is length of stay prior to housing predicted independently by each IV and through their combined influence?

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 4 Regression Analysis of HMIS Data Challenges of regression analysis with HMIS data –Level of measurement Many HMIS variables are NOT continuous variables, required for the DV in multiple regression. Most are nominal, e.g. race, zip code, gender, disability status. –High levels of missing data in many variables Common in social services data sets Requires careful evaluation of extent and pattern of missing data. Selection and implementation of missing data procedure Added complexity with nominal level data

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 5 Regression Analysis of HMIS Data Assumptions of regression analysis –Normality - scores or observations obtained would be normally distributed in the population of interest. Assumed if sampling is random or includes random assignment. Generally not the case in HMIS data. (Explore with a frequency distribution). Note- Age is not quite normally distributed in this graph.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 6 Regression Analysis of HMIS Data Equality of Variances - Homoscedasticity –Points in the scatterplot of the residuals (difference between the observed and predicted values) are randomly distributed about a horizontal line from the mean of the residuals.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 7 Regression Analysis of HMIS Data Independence of Observations –Scores or observations are independent of each other. Independence means that the observations or values independently derived and that one event or value will not depend on another event or value. –Durbin-Watson statistic between 1.5 and 2.5

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 8 Regression Analysis of HMIS Data Linearity - a linear relationship exists between variables. Evaluate with a scatterplot of DV and IV.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 9 Regression Analysis of HMIS Data Two Methods Compared

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 10 Regression Analysis of HMIS Data Exploratory Research Question –Is there are relationship between duration of homelessness (days) and age, years of education, and weight? Method –Regression analysis with SPSS and Excel using two data sets. Intention –Demonstrate the utility of these two tools in regression analysis with HMIS data. –Demonstrate the challenges of regression analysis with HMIS data.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 11 Regression Analysis of HMIS Data Steps of SPSS Regression - N= Report downloaded from HMIS data system. 2.Data cleaned and file prepared in Excel. 3.Excel file opened in SPSS.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 12 Regression Analysis of HMIS Data Opening linear regression dialog window

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 13 Regression Analysis of HMIS Data Linear regression dialog window for stats

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 14 Regression Analysis of HMIS Data Linear regression dialog window for plots

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 15 Regression Analysis of HMIS Data Correlations measure the strength of the relationship between two variables. Correlation values range between 1.0 and The closer to zero, the weaker the correlation Note the weak correlations between the DV and each of the IV Mean = average value for each variable. Standard deviation = measures the dispersion of values from the mean. Together they describe the shape of the distribution for each variable

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 16 Regression Analysis of HMIS Data Results - While the results may be significant, are there problems with the model? Note the R Square value. R Square indicates the proportion of variation in the DV explained by the IV. In this model, an R Square of.031 means that the 3 IV account for very little of the variance in the DV. The fact that the model is statistically significant may be due to the large N (1550).

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 17 Regression Analysis of HMIS Data What do the plots tell us? Distribution of the DV is highly skewed. Departure from the straight line indicates the data are not normally distributed.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 18 Regression Analysis of HMIS Data Second SPSS regression analysis with smaller sample. –N= 626 –(Constrained data set limiting homelessness to > 1 month and < 1 year. Is the distribution normal or skewed? The F-stat used in regression to test the significance of the model, is quite robust to violations of the assumption of normality.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 19 Regression Analysis of HMIS Data Results Note the weak correlations between the DV and each of the IV. Note there is no statistically significant bivariate correlation between the DV and each of the IV.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 20 Regression Analysis of HMIS Data Note the regression model is not significant. The results suggest that for this sample age, years of education, and weight cannot be used to predict duration (days) of homelessness.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 21 Regression Analysis of HMIS Data What do the plots tell us? In this data set (N = 626), the distribution is much less skewed than the (N = 1550) data set. The data are more normally distributed than the (N = 1550) data set.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 22 Regression Analysis of HMIS Data Steps of Excel regression analysis 1.Report downloaded from HMIS data system. 2.Data cleaned and file prepared in Excel.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 23 Regression Analysis of HMIS Data Excel produced histograms to examine the shape of the distributions.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 24 Regression Analysis of HMIS Data Steps of Excel regression analysis –Can use the Chart Wizard to produce scatterplots to examine the bivariate correlation between the DV and the IV of the model. There is a weak correlation between the variables. The variables are not correlated.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 25 Regression Analysis of HMIS Data Steps of Excel regression analysis 1.Select Data Analysis under Tools in the menu bar. 2.If Data Analysis does not appear, then select Add- ins. The Analysis ToolPak.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 26 Regression Analysis of HMIS Data Steps of Excel regression analysis Specify the input range for the Y (IV) and the X (DV) variables Check boxes for all plots.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 27 Regression Analysis of HMIS Data Excel regression statistics are the same as the results from the second SPPS analysis.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 28 Regression Analysis of HMIS Data Excel Regression Plots Residual plots are used to check for regression assumptions. Significant patterns in the scatterplot suggest a violation of regression assumptions. Use to check for the Normality assumption.

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 29 Regression Analysis of HMIS Data StatSPSSExcel R Square F-stat Sig Durbin-Watson1.972Not reported Results Comparison

September 18-19, Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development 30 Spreadsheet Data Analysis Resources