Empirical Example Walter Sosa Escudero Universidad de San Andres - UNLP.

Slides:



Advertisements
Similar presentations
Autocorrelation and Heteroskedasticity
Advertisements

Introduction Describe what panel data is and the reasons for using it in this format Assess the importance of fixed and random effects Examine the Hausman.
Dynamic panels and unit roots
PANEL DATA 1. Dummy Variable Regression 2. LSDV Estimator
Panel Data Models Prepared by Vera Tabakova, East Carolina University.
Economics 20 - Prof. Anderson
Managerial Economics in a Global Economy
There are at least three generally recognized sources of endogeneity. (1) Model misspecification or Omitted Variables. (2) Measurement Error.
C 3.7 Use the data in MEAP93.RAW to answer this question
Econ 488 Lecture 5 – Hypothesis Testing Cameron Kaplan.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Specification Error II
Correlation and regression
Qualitative Variables and
Lecture 12 (Ch16) Simultaneous Equations Models (SEMs)
Random effects estimation RANDOM EFFECTS REGRESSIONS When the observed variables of interest are constant for each individual, a fixed effects regression.
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Multiple Linear Regression Model
Economics Prof. Buckles1 Time Series Data y t =  0 +  1 x t  k x tk + u t 1. Basic Analysis.
Chapter 10 Simple Regression.
CHAPTER 3 ECONOMETRICS x x x x x Chapter 2: Estimating the parameters of a linear regression model. Y i = b 1 + b 2 X i + e i Using OLS Chapter 3: Testing.
Prof. Dr. Rainer Stachuletz
1 Pertemuan 13 Uji Koefisien Korelasi dan Regresi Matakuliah: A0392 – Statistik Ekonomi Tahun: 2006.
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Sustainability of economic growth and inequality in incomes distribution Assistant, PhD, BURZ R ă zvan-Dorin West University of Timisoara, Romania Lecturer,
Chapter 11 Multiple Regression.
Topic 3: Regression.
Chapter 15: Model Building
Demand Estimation & Forecasting
12 Autocorrelation Serial Correlation exists when errors are correlated across periods -One source of serial correlation is misspecification of the model.
ECON 6012 Cost Benefit Analysis Memorial University of Newfoundland
Regression Method.
Chapter 4-5: Analytical Solutions to OLS
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
Chapter 10 Hetero- skedasticity Copyright © 2011 Pearson Addison-Wesley. All rights reserved. Slides by Niels-Hugo Blunch Washington and Lee University.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Lecturer: Kem Reat, Viseth, PhD (Economics)
Managerial Economics Demand Estimation & Forecasting.
Chapter 5 Demand Estimation Managerial Economics: Economic Tools for Today’s Decision Makers, 4/e By Paul Keat and Philip Young.
Chapter 4 The Classical Model Copyright © 2011 Pearson Addison-Wesley. All rights reserved. Slides by Niels-Hugo Blunch Washington and Lee University.
Copyright © 2005 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics Thomas Maurice eighth edition Chapter 7.
1/69: Topic Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Chapter 15 Panel Data Models Walter R. Paczkowski Rutgers University.
11 Chapter 5 The Research Process – Hypothesis Development – (Stage 4 in Research Process) © 2009 John Wiley & Sons Ltd.
When should you use fixed effects estimation rather than random effects estimation, or vice versa? FIXED EFFECTS OR RANDOM EFFECTS? 1 NLSY 1980–1996 Dependent.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
1 Empirical methods: endogeneity, instrumental variables and panel data Advanced Corporate Finance Semester
Chapter 13 Understanding research results: statistical inference.
Quantitative research methods in business administration Lecture 3 Multivariate analysis OLS, ENDOGENEITY BIAS, 2SLS Panel Data Exemplified by SPSS and.
Endogeneity in Econometrics: Simultaneous Equations Models Ming LU.
Multiple Regression Analysis: Inference
Regression and Correlation of Data Summary
Vera Tabakova, East Carolina University
Chapter 15 Panel Data Models.
Vera Tabakova, East Carolina University
Econometrics ITFD Week 8.
PANEL DATA 1. Dummy Variable Regression 2. LSDV Estimator
Simultaneous equation system
Microeconometric Modeling
More on Specification and Data Issues
Instrumental Variables and Two Stage Least Squares
Pooling Cross Sections across Time: Simple Panel Data Methods
Advanced Panel Data Methods
Instrumental Variables and Two Stage Least Squares
Instrumental Variables and Two Stage Least Squares
Tutorial 1: Misspecification
Seminar in Economics Econ. 470
Microeconometric Modeling
Instrumental Variables Estimation and Two Stage Least Squares
Presentation transcript:

Empirical Example Walter Sosa Escudero Universidad de San Andres - UNLP

Panel Data Models In this exercise, we will replicate the results in “Estimating the Economic Model of Crime with Panel Data”, by C. Cornwell and W. Trumbull (1994).

The Data Cornwell and Trumbull estimate an economic model of crime. Panel dataset of North Carolina counties. They use single and simultaneous equations panel data estimators to address two sources of endogeneity: unobserved heterogeneity and conventional simultaneity. The data are county level, so there is a relatively low level of aggregation.

Model and Alternative Estimators The basic assumption is: An individual´s participation in the criminal sector depends on the relative monetary return to illegal activities and the degree to which the criminal justice system is able to affect the probabilities of apprehension and punishment.

Cornwell and Trumbull specify the following crime equation: where: R it is the crime rate. X´ it contains variables which control for the relative return to legal opportunities. (wcon, wtuc, wtrd, wfir, wser, wser, wmfg, wfed, wsta, wloc, polpc, urban, density, west, central, pctymle, pctmin) P´ it contains a set of deterrent variables. (prbarr, prbconv, prbpris, avgsen)  i are fixed effects (may be correlated with (X´ it, P´ it )). e it are typical disturbance terms. ( 1 )

Dependant variable Probability of arrest Probability of conviction Probability of prison Sanction severity Summary of variables

The “between” transformation of (1) is: The data are expressed in county means: The “within” transformation of (1) is: The data are in deviations from means:. (3) Does NOT depend on the county effects. ( 2 ) ( 3 ) The authors adopt a log-linear specification so that their estimated coefficients are interpretable as elasticities.

“Between” Model: (2) leads to cross-section estimators which neglect unobserved county heterogeneity. If unobserved characteristics are correlated with (X´ it, P´ it ), such procedure will produce inconsistent estimates.

“Within” Model: By using (3), both sources of endogeneity may be addressed. If the only problem is correlation between (X´ it, P´ it ) and unobserved heterogeneity, then consistent estimation is possible by performing least squared on (3). Conventional simultaneity can be accounted for by using 2SLS to (3).

“Between” Model Balanced Panel: N = 90 T = 7 Test F: Joint significance, it rejects the null.

With the exception of P P, the elements of have the correct NEGATIVE signs. Only the estimated coefficient of P A and P C are statistically significant at the usual significance levels. The estimated arrest and conviction elasticities are, respectively, and –0.53. For the rest of the variables, only lpolpc, ldensity, west, central and lpctmin are statistically significant at 5%. For example, if the number of police per capita increases 1%, the crime rate increases in 0.36%.

The “between” estimator is consistent only if (X´ it, P´ it ) is orthogonal to both  i and e it. The “within” estimator is a simple solution to the violation of the orthogonality condition that (X´ it, P´ it ) is uncorrelated with unobserved heterogeneity.

Balanced Panel: N = 90 T = 7 Fixed Effects Estimation Test F: Joint significance, it rejects the null. Fixed Effects Test: it rejects the null. So, the fixed effects are significative. Region and urban dummies and percentage minority variable do not vary over time, they are eliminated by the within transformation.

Now, the estimated coefficient of P P has the correct (negative) sign and is statistically significant. The within estimate of the deterrent effect of S is small and statistically insignificant. Conditioning on the county effects causes the (absolute value of the) estimated deterrent elasticities associated with P A and P c decrease by 41% and 43%, respectively.

In the Fixed Effects model, both sources of endogeneity may be addressed. First, if the only problem is correlation between (X´ it, P´ it ) and unobserved heterogeneity, then consistent estimation is possible by performing OLS on (3). This within estimator can be viewed as an instrumental variables estimator with instruments (deviations from means) that are orthogonal to the effects by construction. Conventional simultaneity can be accounted for by using 2SLS to estimate (3).

If the constant terms specific for each county were randomly distributed, between counties, we can estimate a Random Effects Model. In order to estimate a Random Effects Model, it´s necessary to assume that the explanatory variables are uncorrelated to the specific term for each county. A Hausman test can be constructed to evaluate FE / RE estimates.

It rejects the null, so there are systematic differences between FE and RE coefficients. RE estimators: INCONSISTENT FE estimators: CONSISTENT Hausman Test

Random Effects and Serial Correlation Bera-Yoon-Sosa Escudero (2001): –BP Test for random effects implicitly assume no autocorrelation. –The presence of random effects confuse the BP test, inducing to reject H o, even though it is correct. –The same thing happens with the autocorrelation test. –BYS: modified tests. Joint Test Baltagi-Li (1991) –Test for the joint null of no autocorrelation and no random effects (low power, less informative). Sosa Escudero (2001): –Joint Test for random effects and positive serial correlation (one-sided, one-directional).

Results of the Tests In the Random Effects tests: the null is in the Random Effects model. The test rejects this null, so the OLS estimators are NOT BLUE. In the Serial Correlation tests: the null is The test rejects this null In all tests, we reject the null.

Note that the statistics decrease in all the adjusted versions of the tests: LM Test for random effects, assuming no serial correlation: Adjusted LM test for random effects, which works even under serial correlation: LM test for first order serial correlation, assuming no random effects: Adjusted test for first order serial correlation, which works even under random effects: LM Joint test for random effects and serial correlation: This Joint Test rejects the joint null, but is NOT informative about the direction of the misspecification.

Instrumental Variables Conventional simultaneity may exist between the crime rate, the probability of arrest and the number of police per capita. Counties experiencing rising crime rates, holding police resources constant, would see probabilities of arrest fall. But, increases in crime may motivate a county to increase policing resources which would increase the probability of arrest. Now, we also allow for the possibility that P A and the number of police per capita may be correlated with e it.

Applying 2SLS to the “within” model, we address simultaneity as well as unobserved heterogeneity. We need at least two exogenous instruments (uncorrelated with e and the effects). We use as instruments a mix of different offense types and per capita tax revenue.

2SLS with Fixed Effects P A, P C and P P are NOT statistically significant. Only lwfed and lwloc are statistically significant. The Fixed Effects are statistically significant.

2SLS to Between Model P A and P C are statistically significant and have the correct signs. P P is NOT statistically significant. P A is 30% lower in 2SLS to “between” than to 2SLS to “within” model.

The statistical consequences of neglecting unobserved heterogeneity in our sample are serious whether single or simultaneous equations estimators are used!