Department of Cognitive Science Michael J. Kalsher Adv. Experimental Methods & Statistics PSYC 4310 / COGS 6310 Regression 1 PSYC 4310/6310 Advanced Experimental.

Slides:



Advertisements
Similar presentations
Questions From Yesterday
Advertisements

Lesson 10: Linear Regression and Correlation
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Correlation and regression Dr. Ghada Abo-Zaid
Understanding the General Linear Model
1 Lecture 2: ANOVA, Prediction, Assumptions and Properties Graduate School Social Science Statistics II Gwilym Pryce
Analysis of Economic Data
1 Lecture 2: ANOVA, Prediction, Assumptions and Properties Graduate School Social Science Statistics II Gwilym Pryce
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Bivariate Regression CJ 526 Statistical Analysis in Criminal Justice.
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Linear Regression and Linear Prediction Predicting the score on one variable.
Lecture 5: Simple Linear Regression
Multiple Regression Research Methods and Statistics.
1 Chapter 17: Introduction to Regression. 2 Introduction to Linear Regression The Pearson correlation measures the degree to which a set of data points.
Multiple Regression Dr. Andy Field.
Linear Regression.  Uses correlations  Predicts value of one variable from the value of another  ***computes UKNOWN outcomes from present, known outcomes.
Lecture 5 Correlation and Regression
CHAPTER 5 REGRESSION Discovering Statistics Using SPSS.
Correlation and Linear Regression
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Chapter 8: Bivariate Regression and Correlation
Example of Simple and Multiple Regression
Statistics for the Social Sciences Psychology 340 Fall 2013 Tuesday, November 19 Chi-Squared Test of Independence.
Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
Multiple Regression PSYC 4310 Advanced Experimental Methods and Statistics © 2013, Michael Kalsher.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Department of Cognitive Science Michael J. Kalsher PSYC 4310 COGS 6310 MGMT 6969 © 2015, Michael Kalsher Unit 1B: Everything you wanted to know about basic.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Statistics for the Social Sciences Psychology 340 Fall 2013 Correlation and Regression.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 19 Linear Patterns.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Regression Lesson 11. The General Linear Model n Relationship b/n predictor & outcome variables form straight line l Correlation, regression, t-tests,
Regression Analysis Relationship with one independent variable.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Environmental Modeling Basic Testing Methods - Statistics III.
Multiple Regression I 1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Multiple Regression Analysis (Part 1) Terry Dielman.
Correlation & Regression Analysis
Regression. Outline of Today’s Discussion 1.Coefficient of Determination 2.Regression Analysis: Introduction 3.Regression Analysis: SPSS 4.Regression.
Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Michael J. Kalsher PSYCHOMETRICS MGMT 6971 Regression 1 PSYC 4310 Advanced Experimental Methods and Statistics © 2014, Michael Kalsher.
Stats Methods at IC Lecture 3: Regression.
Multiple Regression.
Chapter 14 Introduction to Multiple Regression
Inference for Least Squares Lines
Correlation and Simple Linear Regression
Regression The basic problem Regression and Correlation
Linear Regression Prof. Andy Field.
Relationship with one independent variable
Chapter 15 Linear Regression
Correlation and Simple Linear Regression
CHAPTER 29: Multiple Regression*
Multiple Regression.
24/02/11 Tutorial 3 Inferential Statistics, Statistical Modelling & Survey Methods (BS2506) Pairach Piboonrungroj (Champ)
Correlation and Simple Linear Regression
Relationship with one independent variable
Simple Linear Regression and Correlation
Product moment correlation
Introduction to Regression
Chapter Thirteen McGraw-Hill/Irwin
Introduction to Regression
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Department of Cognitive Science Michael J. Kalsher Adv. Experimental Methods & Statistics PSYC 4310 / COGS 6310 Regression 1 PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2012, Michael Kalsher

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher Introduction to Regression If two variables covary, we should be able to predict the value of one variable from another. Correlation only tells us how much two variables covary. In regression, we construct an equation that uses one or more variables (the IV(s) or predictor variable(s)) to predict another variable (the DV or outcome variable). –Predicting from one IV = Simple Regression –Predicting from multiple IVs = Multiple Regression

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher Simple Regression: The Model The general equation: Outcome i = (model) + error i In regression the model is linear and we summarize a data set with a straight line. The regression line is determined through the method of least squares. 3

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 4 The Regression Line: Model = “things that define the line we fit to the data” Any straight line can be defined by: -The slope of the line (b 1 ) -The point at which the line crosses the ordinate, termed the intercept of the line (b 0 ) The general equation: Outcome i = (model) + error i … becomes Y i = (b 0 + b 1 X i ) + ε i b 1 and b 0 are termed regression coefficients b 1 tells us what the model looks like (it’s shape) b 0 tells us where the model is in geometric space ε i is the residual term and represents the difference between participant i’s predicted and obtained scores.

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 5 Method of Least Squares: Finding the line of best fit The method of least squares selects the line (regression line) that has the lowest sum of squared differences and therefore best represents the observed data. Once we determine the slope (b 1 )and intercept (b 0 ) of the line, we can insert different values of our predictor variable into the model to estimate the value of the outcome variable. Regression Line Slope = b 1 = dy / dx Individual Data Points Residual (Error in Prediction) Sum of residuals = 0 Intercept (Constant) b 0

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher Assessing Goodness of Fit Even the best fitting line can be a lousy fit to the data, so we need to assess the goodness of fit of the model against our best estimate--the mean. Let’s consider an example (see Field, p. 201): –A music mogul wants to know how many records her company will sell if she spends £100,000 on advertising. –In the absence of a model of the relationship between advertising and sales, the best guess would be the mean number of record sales (say 200,000)--regardless of amount of advertising. –So, as a basic strategy for predicting the outcome, we could use the mean, because on average it is a good guess. 6

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 7 Assessing Goodness of Fit Represents the total amount of differences present when the most basic model is applied to the data. SS T uses the differences between the observed data and the mean value of Y. Represents the degree of inaccuracy when the best model is fitted to the data. SS R uses the differences between the observed data and the regression line. Shows the reduction in inaccuracy resulting from fitting the regression model to the data. SS M uses the differences between the mean value of Y and the regression line. A large SS M implies the regression model predicts the outcome variable better than the mean. SS T SS R SS M

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 8 Assessing Goodness of Fit A large SS M implies the regression model is much better than using the mean to predict the outcome variable. How big is big? Assessed in two ways: (1) Via R 2 and (2) the F-test (assesses the ratio of systematic to unsystematic variance). R 2 = SS M SS T Represents the amount of variance in the outcome explained by the model relative to how much variance there was to explain. F =F = SS M / df SS R / df = MS M MS R df for SS M = number of variables in the model df for SS R = number of observations minus number of parameters being estimated.

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher Simple Regression Using SPSS: Predicting Record Sales (Y) from Advertising Budget (X) Record1.sav What’s the overall relationship between record sales and advertising budget?

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher

11

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher Interpreting a Simple Regression: Overall Fit of the Model 12 SS M SS R SS T MS M MS R The significant “F” test allows us to conclude that the regression model results in significantly better prediction of record sales than the mean value of record sales. Advertising expenditure accounts for 33.5% of the variation in record sales.

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 13 Df = 1, 198 F=99.587

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 14 Critical Values for F

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher b 0, the Y intercept b 1, the slope, or the change in the outcome associated with a unit change in the predictor Interpreting a Simple Regression: Model Parameters The ANOVA tells us whether the overall model results in a significantly good prediction of the outcome variable … not about the individual contribution of variables in the model. b 0 = Tells us that when no money is spent on ads, the model predicts 134,140 records will be sold. b 1 =.096. The amount of change in the outcome associated with a unit change in the predictor. Thus, we can predict 96 extra record sales for every £ 1000 in advertising. Regression coefficients should be sig. different from 0 and big relative to their S.E.

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 16 Unstandardized Regression Weights Y pred = b 0 + b 1 X Intercept and Slope are in original units of X and Y and so aren’t directly comparable Interpreting a Simple Regression: Model Parameters Standardized Regression Weights Z y(pred) =  Z x Standardized regression weights tell us the number of standard deviations that the outcome will change as a result of one standard deviation change in the predictor. Richards. (1982). Standardized versus Unstandardized Regression Weights. Applied Psychological Measurement, 6,

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 17 Interpreting a Simple Regression: Using the Model Since we’ve demonstrated the model significantly improves our ability to predict the outcome variable (record sales), we can plug in different values of the predictor variable(s). record sales i = b 0 + b 1 advertising budget i = (0.096 x advertising budget i ) What could the record executive expect if she spent £500,000 in advertising? How about £1,000,000?

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 18 Simple Regression: Supermodel.sav A fashion student interested in the factors that predict salaries of catwalk models collects data from 231 models. For each model, she asks them their salary per day on days they work (salary), their age (age), number of years they have worked as a model (years), and then gets a panel of experts from modeling agencies to rate the attractiveness of each model as a percentage with 100% being perfectly attractive (beauty). Use simple regression to predict the relationship between each of the potential predictor variables (i.e., age, years, beauty) to predict a model’s salary.

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 19

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 20 Attractiveness Age Years

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 21 Attractiveness

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 22 Age

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 23 Years

PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2011, Michael Kalsher 24