More on linear regression – regression to the mean  Baseball Examples from Web Quartiles 1 2.

Slides:



Advertisements
Similar presentations
Questions From Yesterday
Advertisements

Wednesday AM  Presentation of yesterday’s results  Associations  Correlation  Linear regression  Applications: reliability.
Correlation and Linear Regression.
Regression What is regression to the mean?
© McGraw-Hill Higher Education. All Rights Reserved. Chapter 2F Statistical Tools in Evaluation.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Statistics for the Social Sciences
Bivariate Regression CJ 526 Statistical Analysis in Criminal Justice.
Correlation and Regression Analysis
Lecture 5: Simple Linear Regression
PSY 307 – Statistics for the Behavioral Sciences Chapter 7 – Regression.
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
Multiple Regression Research Methods and Statistics.
Simple Linear Regression Analysis
Descriptive measures of the strength of a linear association r-squared and the (Pearson) correlation coefficient r.
Chapter 9 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 What is a Perfect Positive Linear Correlation? –It occurs when everyone has the.
Review Regression and Pearson’s R SPSS Demo
Relationships Among Variables
Chapter 11 Simple Regression
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
ASSOCIATION BETWEEN INTERVAL-RATIO VARIABLES
Agenda Review Association for Nominal/Ordinal Data –  2 Based Measures, PRE measures Introduce Association Measures for I-R data –Regression, Pearson’s.
MULTIPLE REGRESSION Using more than one variable to predict another.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Simple Linear Regression One reason for assessing correlation is to identify a variable that could be used to predict another variable If that is your.
Multiple Linear Regression. Purpose To analyze the relationship between a single dependent variable and several independent variables.
Linear Regression Model In regression, x = independent (predictor) variable y= dependent (response) variable regression line (prediction line) ŷ = a +
Hypothesis testing Intermediate Food Security Analysis Training Rome, July 2010.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Regression Chapter 16. Regression >Builds on Correlation >The difference is a question of prediction versus relation Regression predicts, correlation.
Regression Lesson 11. The General Linear Model n Relationship b/n predictor & outcome variables form straight line l Correlation, regression, t-tests,
Chapter 16 Data Analysis: Testing for Associations.
Aim: Review for Exam Tomorrow. Independent VS. Dependent Variable Response Variables (DV) measures an outcome of a study Explanatory Variables (IV) explains.
Analysis of Covariance (ANCOVA)
Chapter 10 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 A perfect correlation implies the ability to predict one score from another perfectly.
STA291 Statistical Methods Lecture LINEar Association o r measures “closeness” of data to the “best” line. What line is that? And best in what terms.
Regression Understanding relationships and predicting outcomes.
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
© Buddy Freeman, 2015 Let X and Y be two normally distributed random variables satisfying the equality of variance assumption both ways. For clarity let.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
Multiple Regression. PSYC 6130, PROF. J. ELDER 2 Multiple Regression Multiple regression extends linear regression to allow for 2 or more independent.
©2005, Pearson Education/Prentice Hall CHAPTER 6 Nonexperimental Strategies.
Continuous Outcome, Dependent Variable (Y-Axis) Child’s Height
Multiple Linear Regression An introduction, some assumptions, and then model reduction 1.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical.
Descriptive measures of the degree of linear association R-squared and correlation.
MGS4020_Minitab.ppt/Jul 14, 2011/Page 1 Georgia State University - Confidential MGS 4020 Business Intelligence Regression Analysis By Using Minitab Jul.
Chapter 14 Introduction to Multiple Regression
Regression Analysis.
REGRESSION G&W p
Introduction to Regression Analysis
Multiple Regression.
Linear Regression Prof. Andy Field.
Simple Linear Regression
Regression Analysis.
Chapter 15 Linear Regression
Merve denizci nazlıgül, M.s.
Simple Linear Regression
Section 6.2 Prediction.
Regression Analysis.
Chapter 14 Multiple Regression
Regression Part II.
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

More on linear regression – regression to the mean  Baseball Examples from Web Quartiles 1 2

More on linear regression – regression to the mean  What causes regression to the mean?  What causes the extreme score?  Random variation (chance)?  True score?  A bit of each…the more random variation there is, the more regression to the mean will occur  Example:  Think of 2 exams…in each of 2 cases  Case 1: Exam 1 and exam 2 scores totally determined by ability  Case 2: Exam 1 and exam 2 scores determined 50% by ability, 50% by chance variation (feeling good , knew that question , misunderstood a word , etc) KNR 445 Regression: slide 2 1 2

Exam 1 Exam 2 2 More on linear regression – regression to the mean  Correlate exam 1 with exam 2 in each case:  Case 1:Case 2: KNR 445 Regression: slide 3 1

 What if we take z-scores of each variable? More on linear regression – regression to the mean KNR 445 Regression: slide Intercept now = 0 Slope now = r 5 6 3

More on linear regression – regression to the mean  So, regression to the mean is proportional to r…useful to know in situations when post-test scores and pre-test scores are not perfectly correlated (all the time!)  Means outliers on the pre-test will generally drift towards the mean on the post-test  Originally demonstrated by Galton with offspring (taller parents had kids that were in general closer to mean height)  This is actually where the term regression came from in the procedure  See “the sophomore sink” and “but I regress” (web site)  Used in all good fantasy sport estimators 1

E.G. – Tattoos & workout time The thing we are trying to predict – workout time The predictor - # tattoos 1 2

E.G. - Output First box is just telling you what you analyzed - 2 nd box is quite informative – let’s look at that 1 2

E.G. - Output 4. 68% of people’s predicted # hours workout time will fall +/ of the value predicted using regression equation 3. Adjusts the R2 value based on sample size…small samples tend to overestimate the ability to predict the DV with the IV 1. Pearson’s r correlation between the two variables 2. Coefficient of Determination - % of variance in workout hours accounted for by # tattoos

E.G. – Output (continued) 1. ANOVA stuff – that’s after the midterm. For now, note that the two procedures give you the same answer 2. Still significant Intercept, slope 4. a) Intercept different from 0? b) Is this relationship reliable (significant, greater than chance, etc...)?

3 E.G. – Output (continued)  How effectively can we predict a (male) person’s workout hours per week from the number of tattoos he has?  Answer/interpretation:  A simple linear regression was calculated predicting males’ workout hours per week from their number of tattoos. A significant regression equation was found (F(1,53) = 7.27, p<.05), explaining 12.1% of the variance in workout hours. People’s workout hours are equal to times the number of tattoos they have. Males’ workout hours increase by.90 hours for each additional tattoo they possess.  Weird!  Next time… 1 2