Prediction, Goodness-of-Fit, and Modeling Issues ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Chapter 12 Simple Linear Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Regression Analysis Simple Regression. y = mx + b y = a + bx.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Ch.6 Simple Linear Regression: Continued
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Simple Regression. Major Questions Given an economic model involving a relationship between two economic variables, how do we go about specifying the.
Definition  Regression Model  Regression Equation Y i =  0 +  1 X i ^ Given a collection of paired data, the regression equation algebraically describes.
Lecture 8 Relationships between Scale variables: Regression Analysis
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
The Simple Linear Regression Model: Specification and Estimation
Copyright © 2008 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics, 9e Managerial Economics Thomas Maurice.
Chapter 10 Simple Regression.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
The Simple Regression Model
SIMPLE LINEAR REGRESSION
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 11 Notes Class notes for ISE 201 San Jose State University.
SIMPLE LINEAR REGRESSION
Pertemua 19 Regresi Linier
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Regression and Correlation Methods Judy Zhong Ph.D.
SIMPLE LINEAR REGRESSION
Prediction, Goodness-of-fit, and Modeling Issues
Introduction to Linear Regression and Correlation Analysis
Simple Linear Regression Models
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Go to Table of Content Single Variable Regression Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
Copyright © 2005 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics Thomas Maurice eighth edition Chapter 4.
Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Prediction, Goodness-of-Fit, and Modeling Issues Prepared by Vera Tabakova, East Carolina University.
The Simple Linear Regression Model: Specification and Estimation ECON 4550 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s.
1 Econometrics (NA1031) Lecture 4 Prediction, Goodness-of-fit, and Modeling Issues.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Chapter 13 Simple Linear Regression
The simple linear regression model and parameter estimation
Chapter 4: Basic Estimation Techniques
Regression Analysis AGEC 784.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Basic Estimation Techniques
The Simple Linear Regression Model: Specification and Estimation
Prediction, Goodness-of-Fit, and Modeling Issues
Econ 3790: Business and Economics Statistics
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
Regression Analysis Week 4.
CHAPTER 29: Multiple Regression*
6-1 Introduction To Empirical Models
Undergraduated Econometrics
The Simple Linear Regression Model: Specification and Estimation
Interval Estimation and Hypothesis Testing
SIMPLE LINEAR REGRESSION
Product moment correlation
SIMPLE LINEAR REGRESSION
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Prediction, Goodness-of-Fit, and Modeling Issues ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes

 4.1 Least Squares Prediction  4.2 Measuring Goodness-of-Fit  4.3 Modeling Issues  4.4 Log-Linear Models

where e 0 is a random error. We assume that and. We also assume that and

Figure 4.1 A point prediction Slide 4-4 Principles of Econometrics, 3rd Edition

The variance of the forecast error is smaller when i. the overall uncertainty in the model is smaller, as measured by the variance of the random errors ; ii. the sample size N is larger; iii. the variation in the explanatory variable is larger; and iv. the value of the square deviation of our predicted point from the mean of X is small, predictions far away from the mean will be more imprecise.

Figure 4.2 Point and interval prediction Slide 4-8 Principles of Econometrics, 3rd Edition

Figure 4.3 Explained and unexplained components of y i Slide 4-11 Principles of Econometrics, 3rd Edition

 = total sum of squares = SST: a measure of total variation in y about the sample mean.  = sum of squares due to the regression = SSR: that part of total variation in y, about the sample mean, that is explained by, or due to, the regression. Also known as the “explained sum of squares.”  = sum of squares due to error = SSE: that part of total variation in y about its mean that is not explained by the regression. Also known as the unexplained sum of squares, the residual sum of squares, or the sum of squared errors.  SST = SSR + SSE

 The closer R 2 is to one, the closer the sample values y i are to the fitted regression equation. If R 2 = 1, then all the sample data fall exactly on the fitted least squares line, so SSE = 0, and the model fits the data “perfectly.” If the sample data for y and x are uncorrelated and show no linear association, then the least squares fitted line is “horizontal,” so that SSR = 0 and R 2 = 0.

R 2 measures the linear association, or goodness-of-fit, between the sample data and their predicted values. Consequently R 2 is sometimes called a measure of “goodness-of-fit.”

Figure 4.4 Plot of predicted y, against y Slide 4-18 Principles of Econometrics, 3rd Edition

 FOOD_EXP = weekly food expenditure by a household of size 3, in dollars  INCOME = weekly household income, in $100 units * indicates significant at the 10% level ** indicates significant at the 5% level *** indicates significant at the 1% level

 The Effects of Scaling the Data  Changing the scale of x:  Changing the scale of y:

Variable transformations:  Power: if x is a variable then x p means raising the variable to the power p; examples are quadratic (x 2 ) and cubic (x 3 ) transformations.  The natural logarithm: if x is a variable then its natural logarithm is ln(x).  The reciprocal: if x is a variable then its reciprocal is 1/x.

Figure 4.5 A nonlinear relationship between food expenditure and income Slide 4-22 Principles of Econometrics, 3rd Edition

 The log-log model The parameter β is the elasticity of y with respect to x.  The log-linear model A one-unit increase in x leads to (approximately) a 100×β 2 percent change in y.  The linear-log model A 1% increase in x leads to a β 2 /100 unit change in y.

 The reciprocal model is  The linear-log model is

Remark: Given this array of models, that involve different transformations of the dependent and independent variables, and some of which have similar shapes, what are some guidelines for choosing a functional form? 1.Choose a shape that is consistent with what economic theory tells us about the relationship. 2.Choose a shape that is sufficiently flexible to “fit” the data 3.Choose a shape so that assumptions SR1-SR6 are satisfied, ensuring that the least squares estimators have the desirable properties described in Chapters 2 and 3.

Figure 4.6 EViews output: residuals histogram and summary statistics for food expenditure example Slide 4-26 Principles of Econometrics, 3rd Edition

 The Jarque-Bera statistic is given by where N is the sample size, S is skewness, and K is kurtosis.  In the food expenditure example

Figure 4.7 Scatter plot of wheat yield over time Slide 4-28 Principles of Econometrics, 3rd Edition

 use wa-wheat, clear  * examine data  summarize  rename greenough yield  twoway (scatter yield time)  more  * regression  reg yield time  predict yhat  * analyze residuals  predict ehat, residuals  histogram ehat, percent  more Undergraduate Econometrics, 3 rd Edition Principles of Econometrics, 3rd Edition

 twoway (scatter yield time) (connect yhat time)  more  twoway (scatter yield time) (lfit yield time)  more  twoway connected ehat time, yline(0)  more  rvpplot time, recast(bar) yline(0)  more Undergraduate Econometrics, 3 rd Edition Principles of Econometrics, 3rd Edition

Figure 4.8 Predicted, actual and residual values from straight line Slide 4-32 Principles of Econometrics, 3rd Edition

Figure 4.9 Bar chart of residuals from straight line Slide 4-33 Principles of Econometrics, 3rd Edition

Figure 4.10 Fitted, actual and residual values from equation with cubic term Slide 4-35 Principles of Econometrics, 3rd Edition

 The Growth Model

 A Wage Equation

 Prediction in the Log-Linear Model

 A Generalized R 2 Measure R 2 values tend to be small with microeconomic, cross-sectional data, because the variations in individual behavior are difficult to fully explain.

 Prediction Intervals in the Log-Linear Model

Slide 4-41 Principles of Econometrics, 3rd Edition  coefficient of determination  correlation  data scale  forecast error  forecast standard error  functional form  goodness-of-fit  growth model  Jarque-Bera test  kurtosis  least squares predictor  linear model  linear relationship  linear-log model  log-linear model  log-log model  log-normal distribution  prediction  prediction interval  R 2  residual  skewness

Slide 4-42 Principles of Econometrics, 3rd Edition

Slide 4-43 Principles of Econometrics, 3rd Edition

Slide 4-44 Principles of Econometrics, 3rd Edition

Slide 4-45 Principles of Econometrics, 3rd Edition (4A.1) (4A.2)

Slide 4-46 Principles of Econometrics, 3rd Edition

Slide 4-47 Principles of Econometrics, 3rd Edition

Slide 4-48 Principles of Econometrics, 3rd Edition If the model contains an intercept it is guaranteed that SST = SSR + SSE. If, however, the model does not contain an intercept, then and SST ≠ SSR + SSE.

Suppose that the variable y has a normal distribution, with mean μ and variance σ 2. If we consider then is said to have a log-normal distribution. Slide 4-49 Principles of Econometrics, 3rd Edition

Given the log-linear model If we assume that Slide 4-50 Principles of Econometrics, 3rd Edition

The growth and wage equations: and Slide 4-51 Principles of Econometrics, 3rd Edition