1/68: Topic 1.3 – Linear Panel Data Regression Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA William.

Slides:



Advertisements
Similar presentations
Introduction Describe what panel data is and the reasons for using it in this format Assess the importance of fixed and random effects Examine the Hausman.
Advertisements

Econometric Analysis of Panel Data Panel Data Analysis – Random Effects Assumptions GLS Estimator Panel-Robust Variance-Covariance Matrix ML Estimator.
Econometric Analysis of Panel Data
PANEL DATA 1. Dummy Variable Regression 2. LSDV Estimator
Data organization.
Econometric Analysis of Panel Data
Econometric Analysis of Panel Data Random Regressors –Pooled (Constant Effects) Model Instrumental Variables –Fixed Effects Model –Random Effects Model.
Econometric Analysis of Panel Data Dynamic Panel Data Analysis –First Difference Model –Instrumental Variables Method –Generalized Method of Moments –Arellano-Bond-Bover.
Econometric Analysis of Panel Data Instrumental Variables in Panel Data –Assumptions of Instrumental Variables –Fixed Effects Model –Random Effects Model.
Econometric Analysis of Panel Data Lagged Dependent Variables –Pooled (Constant Effects) Model –Fixed Effects Model –Random Effects Model –First Difference.
Econometric Analysis Using Stata
Econometrics I Professor William Greene Stern School of Business
Part 7: Regression Extensions [ 1/59] Econometric Analysis of Panel Data William Greene Department of Economics Stern School of Business.
1/62: Topic 2.3 – Panel Data Binary Choice Models Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA.
Econometric Analysis of Panel Data Fixed Effects and Random Effects: Extensions – Time-invariant Variables – Two-way Effects – Nested Random Effects.
1. Descriptive Tools, Regression, Panel Data. Model Building in Econometrics Parameterizing the model Nonparametric analysis Semiparametric analysis Parametric.
Part 5: Random Effects [ 1/54] Econometric Analysis of Panel Data William Greene Department of Economics Stern School of Business.
[Part 3: Common Effects ] 1/57 Econometric Analysis of Panel Data William Greene Department of Economics Stern School of Business.
Topics in Microeconometrics William Greene Department of Economics Stern School of Business.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Panel Data Analysis Introduction
[Topic 2-Endogeneity] 1/33 Topics in Microeconometrics William Greene Department of Economics Stern School of Business.
1/69: Topic Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New.
1/62: Topic 2.3 – Panel Data Binary Choice Models Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA.
Panel Data Analysis Using GAUSS
Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2013 William Greene Department of Economics Stern School.
[Topic 1-Regression] 1/37 1. Descriptive Tools, Regression, Panel Data.
1/53: Topic 3.1 – Models for Ordered Choices Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA William.
Spatial Econometric Analysis Using GAUSS 8 Kuan-Pin Lin Portland State University.
Part 4A: GMM-MDE[ 1/33] Econometric Analysis of Panel Data William Greene Department of Economics Stern School of Business.
Econometric Analysis of Panel Data
1/61: Topic 1.2 – Extensions of the Linear Regression Model Microeconometric Modeling William Greene Stern School of Business New York University New York.
The Instrumental Variables Estimator The instrumental variables (IV) estimator is an alternative to Ordinary Least Squares (OLS) which generates consistent.
1/26: Topic 2.2 – Nonlinear Panel Data Models Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA William.
Chapter 15 Panel Data Models.
Econometric Analysis of Panel Data
Esman M. Nyamongo Central Bank of Kenya
PANEL DATA REGRESSION MODELS
REGRESSION DIAGNOSTIC II: HETEROSCEDASTICITY
PANEL DATA 1. Dummy Variable Regression 2. LSDV Estimator
Microeconometric Modeling
STOCHASTIC REGRESSORS AND THE METHOD OF INSTRUMENTAL VARIABLES
Microeconometric Modeling
Advanced Panel Data Methods
Microeconometric Modeling
Microeconometric Modeling
Microeconometric Modeling
Migration and the Labour Market
Microeconometric Modeling
Econometric Analysis of Panel Data
Econometrics I Professor William Greene Stern School of Business
Chengyuan Yin School of Mathematics
Econometric Analysis of Panel Data
Microeconometric Modeling
Tutorial 1: Misspecification
Microeconometric Modeling
Microeconometric Modeling
Econometrics I Professor William Greene Stern School of Business
Econometric Analysis of Panel Data
Microeconometric Modeling
Microeconometric Modeling
Econometrics I Professor William Greene Stern School of Business
Econometric Analysis of Panel Data
Econometrics I Professor William Greene Stern School of Business
Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2019 William Greene Department of Economics Stern School.
Advanced Panel Data Methods
Econometrics I Professor William Greene Stern School of Business
Econometrics I Professor William Greene Stern School of Business
Econometrics I Professor William Greene Stern School of Business
Presentation transcript:

1/68: Topic 1.3 – Linear Panel Data Regression Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA William Greene Stern School of Business New York University New York NY USA 1.3 Linear Panel Data Regression Models

2/68: Topic 1.3 – Linear Panel Data Regression Concepts Unbalanced Panel Cluster Estimator Block Bootstrap Difference in Differences Incidental Parameters Problem Endogeneity Instrumental Variable Control Function Estimator Mundlak Form Correlated Random Effects Hausman Test Lagrange Multiplier (LM) Test Variable Addition (Wu) Test Models Linear Regression Fixed Effects LR Model Random Effects LR Model

3/68: Topic 1.3 – Linear Panel Data Regression

4/68: Topic 1.3 – Linear Panel Data Regression

5/68: Topic 1.3 – Linear Panel Data Regression

6/68: Topic 1.3 – Linear Panel Data Regression BHPS Has Evolved

7/68: Topic 1.3 – Linear Panel Data Regression German Socioeconomic Panel

8/68: Topic 1.3 – Linear Panel Data Regression Balanced and Unbalanced Panels  Distinction: Balanced vs. Unbalanced Panels  A notation to help with mechanics z i,t, i = 1,…,N; t = 1,…,T i  The role of the assumption Mathematical and notational convenience:  Balanced, n=NT  Unbalanced: The fixed T i assumption almost never necessary. If unbalancedness is due to nonrandom attrition from an otherwise balanced panel, then this will require special considerations.

9/68: Topic 1.3 – Linear Panel Data Regression An Unbalanced Panel: RWM’s GSOEP Data on Health Care N = 7,293 Households Some households exited then returned

10/68: Topic 1.3 – Linear Panel Data Regression Cornwell and Rupert Data Cornwell and Rupert Returns to Schooling Data, 595 Individuals, 7 Years (Extracted from NLSY.) Variables in the file are EXP = work experience WKS = weeks worked OCC = occupation, 1 if blue collar, IND = 1 if manufacturing industry SOUTH = 1 if resides in south SMSA= 1 if resides in a city (SMSA) MS = 1 if married FEM = 1 if female UNION = 1 if wage set by union contract ED = years of education LWAGE = log of wage = dependent variable in regressions These data were analyzed in Cornwell, C. and Rupert, P., "Efficient Estimation with Panel Data: An Empirical Comparison of Instrumental Variable Estimators," Journal of Applied Econometrics, 3, 1988, pp See Baltagi, page 122 for further analysis. The data were downloaded from the website for Baltagi's text.

11/68: Topic 1.3 – Linear Panel Data Regression

12/68: Topic 1.3 – Linear Panel Data Regression Common Effects Models  Unobserved individual effects in regression: E[y it | x it, c i ] Notation:  Linear specification: Fixed Effects: E[c i | X i ] = g(X i ). Cov[x it,c i ] ≠0 effects are correlated with included variables. Random Effects: E[c i | X i ] = μ; effects are uncorrelated with included variables. If X i contains a constant term, μ=0 WLOG. Common: Cov[x it,c i ] =0, but E[c i | X i ] = μ is needed for the full model

13/68: Topic 1.3 – Linear Panel Data Regression Convenient Notation  Fixed Effects – the ‘dummy variable model’  Random Effects – the ‘error components model’ Individual specific constant terms. Compound (“composed”) disturbance

14/68: Topic 1.3 – Linear Panel Data Regression Estimating β  β is the partial effect of interest  Can it be estimated (consistently) in the presence of (unmeasured) c i ? Does pooled least squares “work?” Strategies for “controlling for c i ” using the sample data.

15/68: Topic 1.3 – Linear Panel Data Regression 1. The Pooled Regression  Presence of omitted effects  Potential bias/inconsistency of OLS – Depends on ‘fixed’ or ‘random’ If FE, X is endogenous: Omitted Variables Bias If RE, OLS is OK but standard errors are incorrect.

16/68: Topic 1.3 – Linear Panel Data Regression OLS with Individual Effects The omitted variable(s) are the group means

17/68: Topic 1.3 – Linear Panel Data Regression Ordinary Least Squares  Standard results for OLS in a generalized regression model Consistent if RE, inconsistent if FE. Unbiased for something in either case. Inefficient in all cases.  True Variance

18/68: Topic 1.3 – Linear Panel Data Regression Estimating the Sampling Variance of b  b may or may not be consistent for . We estimate its variance regardless  s 2 (X X) -1 is not the correct matrix Correlation across observations: Yes Heteroscedasticity: Maybe  Is there a “robust” covariance matrix? Robust estimation (in general) The White estimator for heteroscedasticity A Robust estimator for OLS.

19/68: Topic 1.3 – Linear Panel Data Regression A Cluster Estimator

20/68: Topic 1.3 – Linear Panel Data Regression Alternative OLS Variance Estimators Cluster correction increases SEs |Variable | Coefficient | Standard Error |b/St.Er.|P[|Z|>z] | Constant EXP EXPSQ D OCC SMSA MS FEM UNION ED Robust Constant EXP EXPSQ D OCC SMSA MS FEM UNION ED

21/68: Topic 1.3 – Linear Panel Data Regression Results of Bootstrap Estimation

22/68: Topic 1.3 – Linear Panel Data Regression Bootstrap variance for a panel data estimator  Panel Bootstrap = Block Bootstrap  Data set is N groups of size T i  Bootstrap sample is N groups of size T i drawn with replacement. The bootstrap replication must account for panel data nature of the data set.

23/68: Topic 1.3 – Linear Panel Data Regression

24/68: Topic 1.3 – Linear Panel Data Regression Difference-in-Differences Model With two periods and strict exogeneity of D and T, This is a linear regression model. If there are no regressors,

25/68: Topic 1.3 – Linear Panel Data Regression Difference in Differences

26/68: Topic 1.3 – Linear Panel Data Regression UK Office of Fair Trading, May 2012; Stephen Davies

27/68: Topic 1.3 – Linear Panel Data Regression Outcome is the fees charged. Activity is collusion on fees.

28/68: Topic 1.3 – Linear Panel Data Regression Treatment Schools: Treatment is an intervention by the Office of Fair Trading Control Schools were not involved in the conspiracy Treatment is not voluntary

29/68: Topic 1.3 – Linear Panel Data Regression Apparent Impact of the Intervention

30/68: Topic 1.3 – Linear Panel Data Regression

31/68: Topic 1.3 – Linear Panel Data Regression Treatment (Intervention) Effect =  1 +  2 if SS school

32/68: Topic 1.3 – Linear Panel Data Regression In order to test robustness two versions of the fixed effects model were run. The first is Ordinary Least Squares, and the second is heteroscedasticity and auto-correlation robust (HAC) standard errors in order to check for heteroscedasticity and autocorrelation.

33/68: Topic 1.3 – Linear Panel Data Regression

34/68: Topic 1.3 – Linear Panel Data Regression The cumulative impact of the intervention is the area between the two paths from intervention to time T.

35/68: Topic 1.3 – Linear Panel Data Regression

36/68: Topic 1.3 – Linear Panel Data Regression 2. Estimation with Fixed Effects  The fixed effects model  c i is arbitrarily correlated with x it but E[ε it |X i,c i ]=0  Dummy variable representation

37/68: Topic 1.3 – Linear Panel Data Regression The Fixed Effects Model y i = X i  + d i  i + ε i, for each individual E[c i | X i ] = g(X i ); Effects are correlated with included variables. Cov[x it,c i ] ≠0

38/68: Topic 1.3 – Linear Panel Data Regression Estimating the Fixed Effects Model  The FEM is a plain vanilla regression model but with many independent variables  Least squares is unbiased, consistent, efficient, but inconvenient if N is large.

39/68: Topic 1.3 – Linear Panel Data Regression The Within Transformation Removes the Effects Wooldridge notation for data in deviations from group means

40/68: Topic 1.3 – Linear Panel Data Regression Least Squares Dummy Variable Estimator  b is obtained by ‘within’ groups least squares (group mean deviations)  Normal equations for a are D’Xb+D’Da=D’y a = (D’D) -1 D’(y – Xb) Notes: This is simple algebra – the estimator is just OLS Least squares is an estimator, not a model. (Repeat twice.) Note what a i is when T i = 1. Follow this with y it -a i -x it ’b=0 if T i =1.

41/68: Topic 1.3 – Linear Panel Data Regression Inference About OLS  Assume strict exogeneity: Cov[ε it,(x js,c j )]=0. Every disturbance in every period for each person is uncorrelated with variables and effects for every person and across periods.  Now, it’s just least squares in a classical linear regression model.  Asy.Var[b] =

42/68: Topic 1.3 – Linear Panel Data Regression Application Cornwell and Rupert

43/68: Topic 1.3 – Linear Panel Data Regression LSDV Results Pooled OLS Note huge changes in the coefficients. SMSA and MS change signs. Significance changes completely.

44/68: Topic 1.3 – Linear Panel Data Regression Estimated Fixed Effects

45/68: Topic 1.3 – Linear Panel Data Regression The Effect of the Effects R 2 rises from to.90542

46/68: Topic 1.3 – Linear Panel Data Regression Robust Covariance Matrix for LSDV Cluster Estimator for Within Estimator Effect is less pronounced than for OLS

47/68: Topic 1.3 – Linear Panel Data Regression Endogeneity in the FEM y i = X i  + d i α i + ε i for each individual E[w i | X i ] = g(X i ); Effects are correlated with included variables. Cov[x it,w i ] ≠0 X is endogenous because of the correlation between x it and w i

48/68: Topic 1.3 – Linear Panel Data Regression The within (LSDV) estimator is an instrumental variable (IV) estimator

49/68: Topic 1.3 – Linear Panel Data Regression LSDV is a Control Function Estimator

50/68: Topic 1.3 – Linear Panel Data Regression LSDV is a Control Function Estimator

51/68: Topic 1.3 – Linear Panel Data Regression The problem here is the estimator of the disturbance variance. The matrix is OK. Note, for example, / (top panel) = / (bottom panel).

52/68: Topic 1.3 – Linear Panel Data Regression

53/68: Topic 1.3 – Linear Panel Data Regression 3. The Random Effects Model The random effects model c i is uncorrelated with x it for all t; E[c i |X i ] = 0 E[ε it |X i,c i ]=0

54/68: Topic 1.3 – Linear Panel Data Regression Random vs. Fixed Effects  Random Effects Small number of parameters Efficient estimation Objectionable orthogonality assumption (c i  X i )  Fixed Effects Robust – generally consistent Large number of parameters More reasonable assumption Precludes time invariant regressors   Which is the more reasonable model?

55/68: Topic 1.3 – Linear Panel Data Regression Mundlak’s Estimator Mundlak, Y., “On the Pooling of Time Series and Cross Section Data, Econometrica, 46, 1978, pp

56/68: Topic 1.3 – Linear Panel Data Regression Mundlak Form of FE Model |Variable| Coefficient | Standard Error |b/St.Er.|P[|Z|>z]| Mean of X| x(i,t) OCC | SMSA | MS | EXP | z(i) FEM | ED | Means of x(I,t) and constant Constant| OCCB | SMSAB | MSB | EXPB | Estimates: Var[e] = Var[u] =

57/68: Topic 1.3 – Linear Panel Data Regression A “Hierarchical” Model

58/68: Topic 1.3 – Linear Panel Data Regression Mundlak’s Approach for an FE Model with Time Invariant Variables

59/68: Topic 1.3 – Linear Panel Data Regression Correlated Random Effects

60/68: Topic 1.3 – Linear Panel Data Regression LM Tests | Random Effects Model: v(i,t) = e(i,t) + u(i) | Unbalanced Panel | Estimates: Var[e] = D+02 | #(T=1) = 1525 | Var[u] = D+01 | #(T=2) = 1079 | Corr[v(i,t),v(i,s)] = | #(T=3) = 825 | Lagrange Multiplier Test vs. Model (3) = | #(T=4) = 926 | ( 1 df, prob value = ) | #(T=5) = 1051 | (High values of LM favor FEM/REM over CR model.) | #(T=6) = 1200 | Baltagi-Li form of LM Statistic = | #(T=7) = | Random Effects Model: v(i,t) = e(i,t) + u(i) | | Estimates: Var[e] = D+02 | Balanced Panel | Var[u] = D+01 | T = 7 | Corr[v(i,t),v(i,s)] = | | Lagrange Multiplier Test vs. Model (3) = | | ( 1 df, prob value = ) | | (High values of LM favor FEM/REM over CR model.) | | Baltagi-Li form of LM Statistic = | REGRESS ; Lhs=docvis ; Rhs=one,hhninc,age,female,educ ; panel $

61/68: Topic 1.3 – Linear Panel Data Regression A One Way REM

62/68: Topic 1.3 – Linear Panel Data Regression A Hausman Test for FE vs. RE EstimatorRandom Effects E[c i |X i ] = 0 Fixed Effects E[c i |X i ] ≠ 0 FGLS (Random Effects) Consistent and Efficient Inconsistent LSDV (Fixed Effects) Consistent Inefficient Consistent Possibly Efficient

63/68: Topic 1.3 – Linear Panel Data Regression Hausman Test for Effects

64/68: Topic 1.3 – Linear Panel Data Regression Hausman There is a built in procedure for this. It is not always appropriate to compare estimators this way.

65/68: Topic 1.3 – Linear Panel Data Regression A Variable Addition Test  Asymptotically equivalent to Hausman  Also equivalent to Mundlak formulation  In the random effects model, using FGLS Only applies to time varying variables Add expanded group means to the regression (i.e., observation i,t gets same group means for all t. Use standard F or Wald test to test for coefficients on means equal to 0. Large F or chi-squared weighs against random effects specification.

66/68: Topic 1.3 – Linear Panel Data Regression Variable Addition

67/68: Topic 1.3 – Linear Panel Data Regression Application: Wu Test NAMELIST ; XV = exp,expsq,wks,occ,ind,south,smsa,ms,union$ NAMELIST ; (new) xmeans = expb,expsqb,wksb,occb,indb,southb,smsab,msb,unionb $ CREATE ; xmeans = Group Mean(xv,pds=ti) $ REGRESS ; Lhs = lwage ; Rhs = xmeans,Xv,ed,fem,one ; panel ; random ; Test: xmeans $

68/68: Topic 1.3 – Linear Panel Data Regression Means Added