Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.

Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2

11 Y1Y1 E(Y 1 /X 1 ) Y X0X1X1 E(Y i /X i ) =  0 +  1 X i We assume that expected conditional values of Y associated with alternative values of X fall on a line.  1 = Y 1 - E(Y 1 /X 1 )

âSpecification âEstimation âEvaluation âForecasting

Econometric models posit causal relationships among economic variables. Simple regression analysis is used to test the hypothesis about the relationship between a dependent variable (Y, or in our case, C) and independent variable (X, or in our case, Y)).  Our model is specified as follows: C = f (Y), where C: personal consumption expenditure Y: Personal disposable income

 Simple linear regression begins by plotting C-Y values (see table 1)on a scatter diagram (see figure 1) to determine if there exists an approximate linear relationship: (1) Since the data points are unlikely to fall exactly on a line, (1) must be modified to include a disturbance term (u i ) (2)  0 and  1 are called parameters or population parameters.  We estimate these parameters using the data we have available

Table 1

 We estimate the values of  0 and  1 using the Ordinary Least Squares (OLS) method. OLS is a technique for fitting the "best" straight line to the sample of XY observations. The line of best fit is that which minimizes the sum of the squared (vertical) deviations of the sample points from the line: Where, C i are the actual observations of consumption are fitted values of consumption

C Y C1C1 Y1Y1 0 e1e1

The OLS estimators--single variable case are estimators of the true parameters  0 and  1 Note that we use X to denote the explanatory variable and Y is the dependent variable.

Table 2

Thus, we have: Thus the equation obtained from the regression is:

Table 3: Fitted values of consumption

 Goodness of fit criteria Standard errors of the estimates Are the estimates statistically significant? Constructing confidence intervals The coefficient of determination (R 2 ). The standard error of the regression These statistics tell us how well the equation obtained from the regression performs in terms of producing accurate forecasts

We assume that the regression coefficients are normally distributed variables. The standard error (or standard deviation) of the estimates is a measure of the dispersion of the estimates around their mean value. As a general principle, the smaller the standard error, the better the estimates (in terms of yielding accurate forecasts of the dependent variable). The following rule-of-thumb is useful:"[the] standard error of the regression coefficient should be less than half of the size of [the] corresponding regression coefficient." Let denote the standard error of our estimate of the slope parameter

By reference to the SPSS output, we see that the standard error of our estimate of  1 is 0.049, whereas our estimate of  1 is 0.861. Hence our estimate is about 17 times the size of its standard error Note that:

ß To test for the significance of our estimate of  1, we set the following null hypothesis, H 0, and the alternative hypothesis, H 1 ßH 0 :  1  0 ßH 1 :  1 > 0 ßThe t distribution is used to test for statistical significance of the estimate: The t test is a way of comparing the error suggested by the null hypothesis to the standard error of the estimate A rule-of thumb: if t > 2, reject H 0

Constructing confidence intervals To find the 95 percent confidence interval for  1, that is: Pr( a <  1 < b) =.95 To find the upper and lower boundaries of the confidence interval (a and b): Where t c is the critical value of t at the 5 percent confidence level (two-sided,10 degrees of freedom ). t c = 2.228. Working it out, we have: Pr(.752<  1 <.970) =.95  We can be 95 percent confident that the true value of the slope coefficient is in this range.

The coefficient of determination (R 2 )  The coefficient of determination, R 2, is defined as the proportion of the total variation in the dependent variable (Y) "explained" by the regression of Y on the independent variable (X). The total variation in Y or the total sum of squares (TSS) is defined as:  The explained variation in the dependent variable(Y) is called the regression sum of squares (RSS) and is given by: Note:

What remains is the unexplained variation in the dependent variable or the error sum of squares (ESS) We can say the following: TSS = RSS + ESS, or Total variation = Explained variation + Unexplained variation R 2 is defined as:

 Note that: 0  R 2  1  If R 2 = 0, all the sample points lie on a horizontal line or in a circle  If R 2 = 1, the sample points all lie on the regression line  In our case, R 2  0.984, meaning that 98.4 percent of the variation in the dependent variable (consumption) is explained by the regression. Think of R 2 as the proportion of the total deviation of the dependent variable from its mean value that is accounted for by the explanatory variable(s).

 The standard error of the regression (s) is given by  In our case, s = 3.40  Regression is based on the assumption that the error term is normally distributed, so that 6.87% of the actual values of the dependent variable should be within one standard error (  $3.4 billion in our example) of their fitted value.  Also, 95.45% of the observed values of consumption should be within 2 standard errors of their fitted values (  $6.8 billion).

Our forecasting equation was estimated as follows:  At the most basic level, forecasting consists of inserting forecasted values of the explanatory variable X (disposable income) into the forecasting equation to obtain forecasted values of the dependent variable Y (personal consumption expenditure).

Our ability to generate accurate forecasts of the dependent variable depends on two factors:  Do we have good forecasts of the explanatory variable?  Does our model exhibit structural stability, i.e., will the causal relationship between C and Y expressed in our forecasting equation hold up over time? After all, the estimated coefficients are average values for a specific time interval (1987-1998). While the past may be a serviceable guide to the future in the case of purely physical phenomena, the same principle does not necessarily hold in the realm of social phenomena (to which economy belongs). Can we make a good forecast?

Having forecasted values of income in hand, we can forecast consumption through the year 2003

Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.

Similar presentations

Presentation on theme: "Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2.

Similar presentations

Presentation on theme: "Y X 0 X and Y are not perfectly correlated. However, there is on average a positive relationship between Y and X X1X1 X2X2."— Presentation transcript:

Similar presentations

About project

Feedback