WHITE TEST FOR HETEROSCEDASTICITY 1 The White test for heteroscedasticity looks for evidence of an association between the variance of the disturbance.

Slides:



Advertisements
Similar presentations
Christopher Dougherty EC220 - Introduction to econometrics (chapter 7) Slideshow: White test for heteroscedasticity Original citation: Dougherty, C. (2012)
Advertisements

CHOW TEST AND DUMMY VARIABLE GROUP TEST
EC220 - Introduction to econometrics (chapter 5)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: slope dummy variables Original citation: Dougherty, C. (2012) EC220 -
1 THE NORMAL DISTRIBUTION In the analysis so far, we have discussed the mean and the variance of a distribution of a random variable, but we have not said.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 4) Slideshow: interactive explanatory variables Original citation: Dougherty, C. (2012)
HETEROSCEDASTICITY-CONSISTENT STANDARD ERRORS 1 Heteroscedasticity causes OLS standard errors to be biased is finite samples. However it can be demonstrated.
EC220 - Introduction to econometrics (chapter 7)
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: exercise 3.5 Original citation: Dougherty, C. (2012) EC220 - Introduction.
1 ASSUMPTIONS FOR MODEL C: REGRESSIONS WITH TIME SERIES DATA Assumptions C.1, C.3, C.4, C.5, and C.8, and the consequences of their violations are the.
EC220 - Introduction to econometrics (chapter 2)
00  sd  0 –sd  0 –1.96sd  0 +sd 2.5% CONFIDENCE INTERVALS probability density function of X null hypothesis H 0 :  =  0 In the sequence.
EXPECTED VALUE OF A RANDOM VARIABLE 1 The expected value of a random variable, also known as its population mean, is the weighted average of its possible.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 6) Slideshow: variable misspecification iii: consequences for diagnostics Original.
EC220 - Introduction to econometrics (chapter 1)
1 INTERPRETATION OF A REGRESSION EQUATION The scatter diagram shows hourly earnings in 2002 plotted against years of schooling, defined as highest grade.
TESTING A HYPOTHESIS RELATING TO A REGRESSION COEFFICIENT This sequence describes the testing of a hypotheses relating to regression coefficients. It is.
SLOPE DUMMY VARIABLES 1 The scatter diagram shows the data for the 74 schools in Shanghai and the cost functions derived from a regression of COST on N.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 3) Slideshow: precision of the multiple regression coefficients Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 4) Slideshow: semilogarithmic models Original citation: Dougherty, C. (2012) EC220.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 4) Slideshow: nonlinear regression Original citation: Dougherty, C. (2012) EC220 -
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: Chow test Original citation: Dougherty, C. (2012) EC220 - Introduction.
TOBIT ANALYSIS Sometimes the dependent variable in a regression model is subject to a lower limit or an upper limit, or both. Suppose that in the absence.
1 In a second variation, we shall consider the model shown above. x is the rate of growth of productivity, assumed to be exogenous. w is now hypothesized.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: dummy variable classification with two categories Original citation:
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: two sets of dummy variables Original citation: Dougherty, C. (2012) EC220.
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
FIXED EFFECTS REGRESSIONS: WITHIN-GROUPS METHOD The two main approaches to the fitting of models using panel data are known, for reasons that will be explained.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: the effects of changing the reference category Original citation: Dougherty,
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: dummy classification with more than two categories Original citation:
DUMMY CLASSIFICATION WITH MORE THAN TWO CATEGORIES This sequence explains how to extend the dummy variable technique to handle a qualitative explanatory.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 10) Slideshow: Tobit models Original citation: Dougherty, C. (2012) EC220 - Introduction.
1 INTERACTIVE EXPLANATORY VARIABLES The model shown above is linear in parameters and it may be fitted using straightforward OLS, provided that the regression.
THE FIXED AND RANDOM COMPONENTS OF A RANDOM VARIABLE 1 In this short sequence we shall decompose a random variable X into its fixed and random components.
1 TWO SETS OF DUMMY VARIABLES The explanatory variables in a regression model may include multiple sets of dummy variables. This sequence provides an example.
Confidence intervals were treated at length in the Review chapter and their application to regression analysis presents no problems. We will not repeat.
ALTERNATIVE EXPRESSION FOR POPULATION VARIANCE 1 This sequence derives an alternative expression for the population variance of a random variable. It provides.
1 PROXY VARIABLES Suppose that a variable Y is hypothesized to depend on a set of explanatory variables X 2,..., X k as shown above, and suppose that for.
F TEST OF GOODNESS OF FIT FOR THE WHOLE EQUATION 1 This sequence describes two F tests of goodness of fit in a multiple regression model. The first relates.
MULTIPLE REGRESSION WITH TWO EXPLANATORY VARIABLES: EXAMPLE 1 This sequence provides a geometrical interpretation of a multiple regression model with two.
TYPE II ERROR AND THE POWER OF A TEST A Type I error occurs when the null hypothesis is rejected when it is in fact true. A Type II error occurs when the.
Simple regression model: Y =  1 +  2 X + u 1 We have seen that the regression coefficients b 1 and b 2 are random variables. They provide point estimates.
A.1The model is linear in parameters and correctly specified. PROPERTIES OF THE MULTIPLE REGRESSION COEFFICIENTS 1 Moving from the simple to the multiple.
. reg LGEARN S WEIGHT85 Source | SS df MS Number of obs = F( 2, 537) = Model |
Christopher Dougherty EC220 - Introduction to econometrics (chapter 5) Slideshow: exercise 5.2 Original citation: Dougherty, C. (2012) EC220 - Introduction.
POSSIBLE DIRECT MEASURES FOR ALLEVIATING MULTICOLLINEARITY 1 What can you do about multicollinearity if you encounter it? We will discuss some possible.
(1)Combine the correlated variables. 1 In this sequence, we look at four possible indirect methods for alleviating a problem of multicollinearity. POSSIBLE.
1 We will continue with a variation on the basic model. We will now hypothesize that p is a function of m, the rate of growth of the money supply, as well.
COST 11 DUMMY VARIABLE CLASSIFICATION WITH TWO CATEGORIES 1 This sequence explains how you can include qualitative explanatory variables in your regression.
Definition of, the expected value of a function of X : 1 EXPECTED VALUE OF A FUNCTION OF A RANDOM VARIABLE To find the expected value of a function of.
RAMSEY’S RESET TEST OF FUNCTIONAL MISSPECIFICATION 1 Ramsey’s RESET test of functional misspecification is intended to provide a simple indicator of evidence.
1 NONLINEAR REGRESSION Suppose you believe that a variable Y depends on a variable X according to the relationship shown and you wish to obtain estimates.
4 In our case, the starting point should be the model with all the lagged variables. DYNAMIC MODEL SPECIFICATION General model with lagged variables Static.
HETEROSCEDASTICITY 1 This sequence relates to Assumption A.4 of the regression model assumptions and introduces the topic of heteroscedasticity. This relates.
1 HETEROSCEDASTICITY: WEIGHTED AND LOGARITHMIC REGRESSIONS This sequence presents two methods for dealing with the problem of heteroscedasticity. We will.
1 ESTIMATORS OF VARIANCE, COVARIANCE, AND CORRELATION We have seen that the variance of a random variable X is given by the expression above. Variance.
1 CHANGES IN THE UNITS OF MEASUREMENT Suppose that the units of measurement of Y or X are changed. How will this affect the regression results? Intuitively,
SEMILOGARITHMIC MODELS 1 This sequence introduces the semilogarithmic model and shows how it may be applied to an earnings function. The dependent variable.
GRAPHING A RELATIONSHIP IN A MULTIPLE REGRESSION MODEL The output above shows the result of regressing EARNINGS, hourly earnings in dollars, on S, years.
1 REPARAMETERIZATION OF A MODEL AND t TEST OF A LINEAR RESTRICTION Linear restrictions can also be tested using a t test. This involves the reparameterization.
F TESTS RELATING TO GROUPS OF EXPLANATORY VARIABLES 1 We now come to more general F tests of goodness of fit. This is a test of the joint explanatory power.
1 We will illustrate the heteroscedasticity theory with a Monte Carlo simulation. HETEROSCEDASTICITY: MONTE CARLO ILLUSTRATION 1 standard deviation of.
1 COMPARING LINEAR AND LOGARITHMIC SPECIFICATIONS When alternative specifications of a regression model have the same dependent variable, R 2 can be used.
VARIABLE MISSPECIFICATION II: INCLUSION OF AN IRRELEVANT VARIABLE In this sequence we will investigate the consequences of including an irrelevant variable.
VARIABLE MISSPECIFICATION I: OMISSION OF A RELEVANT VARIABLE In this sequence and the next we will investigate the consequences of misspecifying the regression.
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Introduction to Econometrics, 5th edition
Presentation transcript:

WHITE TEST FOR HETEROSCEDASTICITY 1 The White test for heteroscedasticity looks for evidence of an association between the variance of the disturbance term and the regressors without assuming any specific relationship.. reg MANU GDP Source | SS df MS Number of obs = F( 1, 26) = Model | e e+11 Prob > F = Residual | e R-squared = Adj R-squared = Total | e e+09 Root MSE = MANU | Coef. Std. Err. t P>|t| [95% Conf. Interval] GDP | _cons |

2 Since the variance of the disturbance term in observation i is unobservable, the squared residual for that observation is used as a proxy. WHITE TEST FOR HETEROSCEDASTICITY. reg MANU GDP Source | SS df MS Number of obs = F( 1, 26) = Model | e e+11 Prob > F = Residual | e R-squared = Adj R-squared = Total | e e+09 Root MSE = MANU | Coef. Std. Err. t P>|t| [95% Conf. Interval] GDP | _cons |

3 We will perform the test using the manufacturing and GDP data used to illustrate the Goldfeld–Quandt test. We have regressed MANU on GDP and have saved the residuals as EMANU. We define EMANUSQ to be the squared residual. WHITE TEST FOR HETEROSCEDASTICITY. reg MANU GDP Source | SS df MS Number of obs = F( 1, 26) = Model | e e+11 Prob > F = Residual | e R-squared = Adj R-squared = Total | e e+09 Root MSE = MANU | Coef. Std. Err. t P>|t| [95% Conf. Interval] GDP | _cons | predict EMANU, resid. gen EMANUSQ = EMANU*EMANU

4 The test consists of regressing the squared residuals on the explanatory variables in the model, their squares, and their cross-products, omitting any duplicative variables. (For example, the square of a dummy variable would be duplicative.) WHITE TEST FOR HETEROSCEDASTICITY. gen GDPSQ = GDP*GDP. reg EMANUSQ GDP GDPSQ Source | SS df MS Number of obs = F( 2, 25) = 3.35 Model | e e+18 Prob > F = Residual | e e+18 R-squared = Adj R-squared = Total | e e+18 Root MSE = 1.4e EMANUSQ | Coef. Std. Err. t P>|t| [95% Conf. Interval] GDP | GDPSQ | _cons | -4.21e e e e Test regression: regress squared residuals on the explanatory variables in the model, their squares, and their cross-products, omitting any duplicative variables.

5 In the present case we regress EMANUSQ on GDP and its square (and a constant). WHITE TEST FOR HETEROSCEDASTICITY Test regression: regress squared residuals on the explanatory variables in the model, their squares, and their cross-products, omitting any duplicative variables.. gen GDPSQ = GDP*GDP. reg EMANUSQ GDP GDPSQ Source | SS df MS Number of obs = F( 2, 25) = 3.35 Model | e e+18 Prob > F = Residual | e e+18 R-squared = Adj R-squared = Total | e e+18 Root MSE = 1.4e EMANUSQ | Coef. Std. Err. t P>|t| [95% Conf. Interval] GDP | GDPSQ | _cons | -4.21e e e e

6 The test statistic is nR 2, using R 2 from this regression. Under the null hypothesis of no association, it is distributed as a chi-squared statistic with degrees of freedom equal to the number of regressors, including the constant, minus one, in large samples. WHITE TEST FOR HETEROSCEDASTICITY Test statistic: nR 2, using R 2 from this regression. Under H 0, chi-squared statistic with degrees of freedom equal to the number of regressors, including the constant, minus one, in large samples.. gen GDPSQ = GDP*GDP. reg EMANUSQ GDP GDPSQ Source | SS df MS Number of obs = F( 2, 25) = 3.35 Model | e e+18 Prob > F = Residual | e e+18 R-squared = Adj R-squared = Total | e e+18 Root MSE = 1.4e EMANUSQ | Coef. Std. Err. t P>|t| [95% Conf. Interval] GDP | GDPSQ | _cons | -4.21e e e e

7 R 2 is and n is 28. The test statistic is therefore The critical value of chi-squared with two degrees of freedom is 5.99 at the 5 percent level and so the null hypothesis of homoscedasticity is not rejected. WHITE TEST FOR HETEROSCEDASTICITY. gen GDPSQ = GDP*GDP. reg EMANUSQ GDP GDPSQ Source | SS df MS Number of obs = F( 2, 25) = 3.35 Model | e e+18 Prob > F = Residual | e e+18 R-squared = Adj R-squared = Total | e e+18 Root MSE = 1.4e EMANUSQ | Coef. Std. Err. t P>|t| [95% Conf. Interval] GDP | GDPSQ | _cons | -4.21e e e e

8 Why has the White test failed to detect heteroscedasticity when the Goldfeld–Quandt test concluded that it was present at a high level of significance? One reason is that it is a large-sample test, and the sample is actually quite small. WHITE TEST FOR HETEROSCEDASTICITY. gen GDPSQ = GDP*GDP. reg EMANUSQ GDP GDPSQ Source | SS df MS Number of obs = F( 2, 25) = 3.35 Model | e e+18 Prob > F = Residual | e e+18 R-squared = Adj R-squared = Total | e e+18 Root MSE = 1.4e EMANUSQ | Coef. Std. Err. t P>|t| [95% Conf. Interval] GDP | GDPSQ | _cons | -4.21e e e e

9 A second is that the White test tends to have low power — a price that one has to pay for its generality. These problems can be exacerbated by a loss of degrees of freedom if there are many explanatory variables in the original model. WHITE TEST FOR HETEROSCEDASTICITY. gen GDPSQ = GDP*GDP. reg EMANUSQ GDP GDPSQ Source | SS df MS Number of obs = F( 2, 25) = 3.35 Model | e e+18 Prob > F = Residual | e e+18 R-squared = Adj R-squared = Total | e e+18 Root MSE = 1.4e EMANUSQ | Coef. Std. Err. t P>|t| [95% Conf. Interval] GDP | GDPSQ | _cons | -4.21e e e e

Copyright Christopher Dougherty These slideshows may be downloaded by anyone, anywhere for personal use. Subject to respect for copyright and, where appropriate, attribution, they may be used as a resource for teaching an econometrics course. There is no need to refer to the author. The content of this slideshow comes from Section 7.2 of C. Dougherty, Introduction to Econometrics, fourth edition 2011, Oxford University Press. Additional (free) resources for both students and instructors may be downloaded from the OUP Online Resource Centre Individuals studying econometrics on their own who feel that they might benefit from participation in a formal course should consider the London School of Economics summer school course EC212 Introduction to Econometrics or the University of London International Programmes distance learning course EC2020 Elements of Econometrics