Regression Analysis in Theory and Practice. DON’T WRITE THE FORMULAS AHEAD!!!

Slides:



Advertisements
Similar presentations
Here we add more independent variables to the regression.
Advertisements

C 3.7 Use the data in MEAP93.RAW to answer this question
Review for the chapter 6 test 6. 1 Scatter plots & Correlation 6
1 Multiple Regression Interpretation. 2 Correlation, Causation Think about a light switch and the light that is on the electrical circuit. If you and.
Cal State Northridge  320 Andrew Ainsworth PhD Regression.
9. SIMPLE LINEAR REGESSION AND CORRELATION
Statistics: Data Analysis and Presentation Fr Clinic II.
Lecture 22 Multiple Regression (Sections )
Correlation-Regression The correlation coefficient measures how well one can predict X from Y or Y from X.
The Basics of Regression continued
SIMPLE LINEAR REGRESSION
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Regression The basic problem Regression and Correlation Accuracy of prediction in regression Hypothesis testing Regression with multiple predictors.
SIMPLE LINEAR REGRESSION
Multiple Regression Research Methods and Statistics.
Introduction to Regression Analysis, Chapter 13,
Regression and Correlation BUSA 2100, Sect , 3.5.
Relationships Among Variables
Ordinary Least Squares
Lecture 5 Correlation and Regression
Example of Simple and Multiple Regression
Objectives of Multiple Regression
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
SIMPLE LINEAR REGRESSION
3.1 Ch. 3 Simple Linear Regression 1.To estimate relationships among economic variables, such as y = f(x) or c = f(i) 2.To test hypotheses about these.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
CORRELATION & REGRESSION
Chapter 15 Correlation and Regression
Social Science Models What elements (at least two) are necessary for a “social science model”?
CHAPTER 15 Simple Linear Regression and Correlation
12a. Regression Analysis, Part 1 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science,
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Statistics Review - 1 What is the difference between a variable and a constant? Why are we more interested in variables than constants? What are the four.
Simple Linear Regression. Deterministic Relationship If the value of y (dependent) is completely determined by the value of x (Independent variable) (Like.
Chapter 7 Relationships Among Variables What Correlational Research Investigates Understanding the Nature of Correlation Positive Correlation Negative.
Regression Chapter 16. Regression >Builds on Correlation >The difference is a question of prediction versus relation Regression predicts, correlation.
10B11PD311 Economics REGRESSION ANALYSIS. 10B11PD311 Economics Regression Techniques and Demand Estimation Some important questions before a firm are.
Simple Linear Regression. The term linear regression implies that  Y|x is linearly related to x by the population regression equation  Y|x =  +  x.
Active Learning Lecture Slides For use with Classroom Response Systems Association: Contingency, Correlation, and Regression.
Simple Linear Regression In the previous lectures, we only focus on one random variable. In many applications, we often work with a pair of variables.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
1 Regression & Correlation (1) 1.A relationship between 2 variables X and Y 2.The relationship seen as a straight line 3.Two problems 4.How can we tell.
Active Learning Lecture Slides For use with Classroom Response Systems Association: Contingency, Correlation, and Regression.
Basic Statistics Linear Regression. X Y Simple Linear Regression.
The temperature of the human body would be 37°C °C.
Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights.
Chapter 7 Calculation of Pearson Coefficient of Correlation, r and testing its significance.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 7: Regression.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
Active Learning Lecture Slides For use with Classroom Response Systems Association: Contingency, Correlation, and Regression.
Hypothesis Testing Example 3: Test the hypothesis that the average content of containers of a particular lubricant is 10 litters if the contents of random.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
METHODOLOGY OF ECONOMETRICS Broadly speaking, traditional econometric methodology proceeds along the following lines:Broadly speaking, traditional econometric.
The simple linear regression model and parameter estimation
Simple Linear Regression
Lecture 10 Regression Analysis
Regression Analysis: Statistical Inference
Active Learning Lecture Slides For use with Classroom Response Systems
Chapter 5 STATISTICS (PART 4).
Chapter 15 Linear Regression
Correlation and Simple Linear Regression
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
REGRESSION.
Introduction to bivariate data
Exam 5 Review GOVT 201.
Correlation and Simple Linear Regression
SIMPLE LINEAR REGRESSION
Simple Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Presentation transcript:

Regression Analysis in Theory and Practice

DON’T WRITE THE FORMULAS AHEAD!!!

REGRESSION ANALYSIS Formula for simple regression Formula for simple regression where is the predicted value of Y on the regression line. Do you remember y=mx + b? Same thing!

The dependence of Y on X can be of two types: “deterministic” or “probabilistic”. The classic case of deterministic relationship is that between Fahrenheit and Celsius measure of temperature: F 0 = 32 + (9/5)C Where a, the intercept, is So when C=0, degrees F=32, b beta, is the slope of the line, here (9/5) or 1.8. C is X, degrees Celsius.

So for every on degree of change in degrees C, Fahrenheit goes up by 1.8 degrees, starting at 32 degrees. So when C =0 F = (9/5)0 = 32 0 When C = F = 32 + (9/5)100=212 0 Note: 1.8 = 9/5

Probabilistic Regression  Not perfectly predictive.  On average, we expect a certain amount of change in Y for a certain change in X

Regression Example  Judges are advised to give longer sentences to repeat offenders than to first- time offenders. Does it really happen?  Hypothesis: In comparing criminals, those who illustrate the characteristic of having been convicted before will receive longer prison sentences than those with no prior convictions.  We collect data for 10 convicted criminals

Data and Formula: X (convctn) y (sen len) y (sen len) Σx = 40 Σy = 260 X = 4 Y = 26 X – X Y – Y Y – Y

Continued: (X-X) * (Y-Y) (X-X) Σ = 300 Σ= 100 X = 4 Y = 26 b = 3 X – X Y – Y Y – Y

Now Calculate “A” a = 26 – (3) * 4 a = 26 – 12 a = 14 Y = *X

Interpret the Equation Y = *X Interpret 14 Interpret 3

Scatterplot

Multiple Regression - 1  The mathematics of how the computer calculates regression coefficients in multiple regression is very complicated. Fortunately, there is an intuitive process that generates the correct answers and is much easier to understand. Let’s see how the computer obtained the value of for the impact of senator conservatism on the degree to which a senator voted for tax changes primarily benefitting households at, or below, the median income.

Multiple Regression - 2  Our “main equation” is:  Y = a 1 + b 1 X 1 + b 2 X 2 + b 3 X 3 + e 1  Y = percentage support for tax changes benefitting households with incomes at, or below, the median  X 1 = senator conservatism  X 2 = senator party affiliation  X 3 = state median household income  Our goal is to estimate b 1

Multiple Regression - 3  X 1 = a 2 + b 4 X 2 + b 5 X 3 + e 2  In the above equation e 2 represents that portion of a senator’s conservatism than CANNOT be explained by either their party affiliation or the median family income in their state.

Multiple Regression - 4  Y = a 3 + b 6 X 2 + b 7 X 3 + e 3  In the above equation e 3 represents that portion of a senator’s degree of support for tax changes favorable to households with incomes at, or below, the median that CANNOT be explained by either their party affiliation or the median family income in their state.

Multiple Regression - 5  e 3 = a 4 + b 8 e 2 + e 4  In the above equation b 8 represents the impact of that portion of a senator’s conservatism that CANNOT be explained by party and state median income on the percentage of times the senator voted in favor of tax changes primarily benefitting households at, or below, the median income that CANNOT be explained by either their party affiliation or the median income in their state. Thus, b 8 in the above equation = b 1 in the “main equation” (i.e., -.644).

Maximum Likelihood Estimation