Regression Continued. Example: Y [team finish] =  +  X [spending] Values of the Y variable (team finish) are a function of some constant, plus some.

Slides:



Advertisements
Similar presentations
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
Advertisements

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Limited Dependent Variables
Association for Interval Level Variables
Moderation: Assumptions
Data Analysis Statistics. Inferential statistics.
Université d’Ottawa / University of Ottawa 2001 Bio 4118 Applied Biostatistics L10.1 CorrelationCorrelation The underlying principle of correlation analysis.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
Multiple Linear Regression Model
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Chapter 10 Simple Regression.
Correlation and Simple Regression Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College Publishing.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
SIMPLE LINEAR REGRESSION
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Topic 3: Regression.
Elaboration Elaboration extends our knowledge about an association to see if it continues or changes under different situations, that is, when you introduce.
SIMPLE LINEAR REGRESSION
Data Analysis Statistics. Inferential statistics.
Pertemua 19 Regresi Linier
Crash Course in Correlation and Regression MEASURING ASSOCIATION Establishing a degree of association between two or more variables gets at the central.
Stat 112: Lecture 16 Notes Finish Chapter 6: –Influential Points for Multiple Regression (Section 6.7) –Assessing the Independence Assumptions and Remedies.
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Simple Linear Regression and Correlation
Simple Linear Regression Analysis
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Ordinary Least Squares
Correlation & Regression
The Practice of Social Research
Leedy and Ormrod Ch. 11 Gray Ch. 14
SIMPLE LINEAR REGRESSION
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Chapter 11 Simple Regression
Understanding Multivariate Research Berry & Sanders.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Soc 3306a Multiple Regression Testing a Model and Interpreting Coefficients.
Chapter 17 Partial Correlation and Multiple Regression and Correlation.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Multiple Linear Regression. Purpose To analyze the relationship between a single dependent variable and several independent variables.
Examining Relationships in Quantitative Research
Managerial Economics Demand Estimation & Forecasting.
SW388R6 Data Analysis and Computers I Slide 1 Multiple Regression Key Points about Multiple Regression Sample Homework Problem Solving the Problem with.
Chapter 16 Data Analysis: Testing for Associations.
INDE 6335 ENGINEERING ADMINISTRATION SURVEY DESIGN Dr. Christopher A. Chung Dept. of Industrial Engineering.
Regression Analysis © 2007 Prentice Hall17-1. © 2007 Prentice Hall17-2 Chapter Outline 1) Correlations 2) Bivariate Regression 3) Statistics Associated.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and Regression and Time Series CHAPTER 11 Correlation and Regression: Measuring and Predicting Relationships.
Examining Relationships in Quantitative Research
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 22.
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
Chapter 3 The Ethics and Politics of Social Research.
Multiple Regression Analysis Regression analysis with two or more independent variables. Leads to an improvement.
Research Methodology Lecture No :26 (Hypothesis Testing – Relationship)
Multiple Independent Variables POLS 300 Butz. Multivariate Analysis Problem with bivariate analysis in nonexperimental designs: –Spuriousness and Causality.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Chapter 11: Linear Regression E370, Spring From Simple Regression to Multiple Regression.
Inference about the slope parameter and correlation
BIVARIATE REGRESSION AND CORRELATION
Simple Linear Regression
Multiple Regression A curvilinear relationship between one variable and the values of two or more other independent variables. Y = intercept + (slope1.
LEARNING OUTCOMES After studying this chapter, you should be able to
Simple Linear Regression
Regression III.
SIMPLE LINEAR REGRESSION
Regression Part II.
Presentation transcript:

Regression Continued

Example: Y [team finish] =  +  X [spending] Values of the Y variable (team finish) are a function of some constant, plus some amount of the X variable. The question we are interested in is how much change in the Y variable (team finish) is associated with a one-unit change in the X variable (spending). The answer lies in β (beta), this is know as the regression coefficient. In terms of the baseball example, it would be the amount of improvement in team finish (1=first, 2 = second, …. and 7 = last) associated with an additional $1 million in spending on players’ salaries. Q: Would we expect the relationship to be positive or negative?

Todd Donovan using 1999 season data and a bivariate regression found: Team finish = 4.4 – x spending (in $millions) This means that the slope of the relationship between spending and team finish was – Or, for each million dollars that a team spends, there is only a 3 percent change in division position. This result is significant at the.01 level. These results show that a team spending $70 million on players will finish close to second place. We can also show that any given team would have to spend $35 million more to improve its team finish by one position ( x $35million = 1.105). Correlation was which means spending explains only 15 percent of variation in the team’s finish (r2 =.15).

Another Baseball Example Testing Causality Between Team Performance and Payroll : The Cases of Major League Baseball and English Soccer By Stephen Hall, Stefan Szymanski and Andrew S. Zimbalist Journal of Sports Economics 2002

Multiple Regression Multiple regression contains a single dependent variable and two or more independent variables. Multiple regression is particularly appropriate when the causes (independent variables) are inter-correlated, which again is usually the case.

II. Assumptions 1.Normality of the Dependent Variable: Inference and hypothesis testing require that the distribution of e is normally distributed. 2.Interval Level Measures: The dependent variable is measured at the interval level 3.The effects of the independent variables on Y are additive: For each independent variable Xi, the amount of change in E(Y) associated with a unit increase in Xi (holding all other ind. Variables constant) is the same. 4.The regression model is properly specified. This means there is no specification bias or error in the model, the functional form, i.e., linear, non-linear, is correct, and our assumptions of the variable are correct.

Multivariate Regression is a powerful tool to examine how multiple factors (independent variables) influence a dependent variable. It differs from bivariate regression in that it can identify the independent effect a variable has on a dependent variable by holding all other variables constant? What other variables would we include in the baseball model to predict winning %?

Why do we need to hold the independent variables constant? Figures 1 and 2 may help clarify. Each circle may be thought of as representing the variance of the variable. The overlap in the two circles indicates the proportion of variance in each variable that is shared with the other.

Y X1 X2

X1 Y X2 c

In figure 1 the fact that X1 and X2 do not overlap means that they are not correlated, but each is correlated with Y. This is great and means we don’t need sophisticated analysis, just two separate bivariate regressions. In figure 2, X1 and X2 are correlated. The area C is created by the correlation between X1 and X2; c represents the proportion of the variance in Y that is shared jointly with X1 and X2. How do we deal with C? We can’t count it twice or we will get a variation that is greater than 100%. Multivariate Regression

Other Types of Regression Logit (Logistic) – dichotomous dep. variable Probit – dichotomous dep. variable Ordered logit or probit – ordinal dep. variable Multinomial logit/probit – nominal dep. Variable (more than 2 categories) Conditional logit/probit – similar to multinomial Probit Hierarchical Linear Models – for data that is clustered by time, space (and other conditions)