Presentation on theme: "Efficiency and Productivity Measurement: Bootstrapping DEA Scores"— Presentation transcript:
1Efficiency and Productivity Measurement: Bootstrapping DEA Scores D.S. Prasada RaoSchool of EconomicsThe University of Queensland, Australia
2Measures of Reliability for DEA Scores As DEA is a non-parametric and non-stochastic approach, efficiency scores from DEA have been treated as non-stochastic.However, there are attempts to see how DEA scores are affected by changes in data – mainly to see the effect of outliers.Simar and Wilson have been working on the problem of generating standard errors for DEA scores using “bootstrap” technique.An alternative to the bootstrap technique is the technique of “jackknife” which is a simpler technique.
3Jackknife TechniqueRun DEA and get efficiency scores for each of the DMUs in the data set.Drop one DMU at a time and use the remaining data to compute DEA scores for the remaining DMUs.Repeat this until the full sample is covered. At this stage, we will have M-1 efficiency scores for each of the M DMUs in the sample.Compute standard deviation for each of the efficiency scores using M-1 different estimates.This is a fairly mechanical procedure, but provides an indication about the presence of outliers – in such cases dropping a DMU may change the scores significantly.
4THE DEA BOOTSTRAPMonte Carlo simulation experiments are often used to estimatethe sampling distributions of econometric estimators. Suchexperiments typically involve several steps:Specify a data generating process (DGP)Use the DGP to generate data (i.e., simulate).Apply the estimator to the generated data.Repeat from Step 2.The distribution of the estimates obtained in step 3 approximates the sampling distribution of the estimator. The bootstrap is a form of Monte Carlo experiment where the DGP is unknown.
5Alternative DEA Bootstrap Methods Methods for conducting a DEA bootstrap have been suggested byFerrier and Hirschberg (1997)Lothgren and Tambour (1997)Simar and Wilson (1998)We only discuss the Lothgren-Tambour (LT) method becauseSimar and Wilson (1997) identify theoretical problems with the Ferrier-Hirschberg (FH) method.Lothgren (1998) provides evidence that the LT method outperforms the Simar-Wilson (SW) method.the LT method is relatively straightforward.
6The DGPLet us consider input-oriented DEA models where the output vectors q1, …, qI are treated as fixed. We need to specify a DGP that will allow us to generate data on x1, …, xI.Let Then is a technically-efficient input combination capable of producing qi. Suppose the process generating the distances for all firms is Then a DGP for x1, …, xI is completely characterised byq1, …, qI and F.
8Estimating the DGPLet denote the DEA estimate of ρi (computed as the inverse of the optimised value of the DEA objective function). We estimate by projecting xi onto the estimated frontier:i = 1, …, I,We estimate F using the empirical distribution function (EDF) of the
10The Bootstrap Algorithm To obtain B bootstrap samples:Use the observed data to estimate the input-oriented DEA model, and project the observed data points onto the frontier using Set b = 1.Draw independently from and generate the bootstrap sample usingUse the bootstrap sample to estimate the DEA frontier. Set b = b + 1.Repeat from Step 2 until b = B.These B bootstrap samples can be used to construct confidenceintervals.
11Example cont. In the hospital example and To illustrate generation of the first bootstrap sample, suppose 4 drawings from the U(0,1) distribution happen to be 0.46, 0.76, 0.18 and This implies andWe then solve the DEA problem using this data.
12Bias and SE’s for DEA Scores Let be the computed DEA score for firm i in the sample.Suppose be the scores generated from the bootstrapped sampling procedure which is conducted B times. Then we can compute bias and SE as:
13Some remarksIt is a computationally intensive exercise to compute bias and standard errors for DEA scores but the idea is quite simple.The analytical aspects involved in proving that the bootstrapped bias and standard errors are consistent are quite difficult. That is where much of the work is focused.The model we have looked at simply generates technical efficiency scores using a simple random sample without replacement – this ignores any firm-specific characteristics that may drive inefficiencies.It may be possible to make use of a second stage regression and residuals from the regression to bootstrap after taking into account firm specific characteristics.