

In the previous lecture, we dealt with the unboundedness problem of the LPM using the logit model. In this lecture, we will consider another alternative, i.e. the probit model.
Adapted from “Introduction to Econometrics” by Christopher Dougherty

BINARY CHOICE MODELS: PROBIT ANALYSIS
In the case of probit analysis, the sigmoid function is the cumulative standardized normal distribution. The maximum likelihood principle is again used to obtain estimates of the parameters.

Estimating the probability of success
Suppose that the probit equation yields a positive value of Z. Since Z is positive, the cumulative area to the left of Z (the larger portion under the standard normal curve) exceeds 0.5. In this example the area is 0.5859, i.e. a predicted success rate of 58.59%. [You can use a standard normal table or the Excel function NORMSDIST.]
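The table or NORMSDIST lookup can also be reproduced in code. The following is a minimal Python sketch using only the standard library; the helper name `probit_probability` and the index value 0.25 are illustrative assumptions, not values from the lecture.

```python
from statistics import NormalDist


def probit_probability(z: float) -> float:
    """Return the predicted probability Phi(z), the standard normal CDF at z."""
    return NormalDist().cdf(z)


# Hypothetical index value, chosen only to demonstrate the lookup.
z_example = 0.25
p_example = probit_probability(z_example)
print(f"Z = {z_example:.2f} -> predicted probability = {p_example:.4f}")
```

A positive Z always gives a probability above 0.5, and Z = 0 gives exactly 0.5, which is the same reading one would take from a standard normal table.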

Quantifying the Marginal Effect
Since p is a function of Z, and Z is a function of the X variables, the marginal effect of $X_i$ on p can be written as:
$$\frac{\partial p}{\partial X_i} = \frac{dp}{dZ}\,\frac{\partial Z}{\partial X_i}$$
We will derive this for the general case where Z is a function of several explanatory variables.

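(1) The marginal effect of Z on p is given by the standardized normal density: $\frac{dp}{dZ} = f(Z) = \frac{1}{\sqrt{2\pi}}\, e^{-Z^2/2}$.
(2) The marginal effect of $X_i$ on Z is given by its coefficient: $\frac{\partial Z}{\partial X_i} = \beta_i$.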

(3) Hence we obtain an expression for the marginal effect of $X_i$ on p: $\frac{\partial p}{\partial X_i} = f(Z)\,\beta_i$.
As with logit analysis, the marginal effects vary with Z. A common procedure is to evaluate them for the value of Z given by the sample means of the explanatory variables.
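The marginal-effect expression, the standard normal density at Z multiplied by the coefficient, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `probit_marginal_effects` and the coefficients for `X1` and `X2` are hypothetical and do not come from the lecture.

```python
from statistics import NormalDist


def probit_marginal_effects(z: float, betas: dict) -> dict:
    """Return f(z) * beta_i for each variable, with f the standard normal density."""
    density = NormalDist().pdf(z)
    return {name: density * beta for name, beta in betas.items()}


# Hypothetical coefficients, used only to show how the effects shrink as |Z| grows.
hypothetical_betas = {"X1": 0.5, "X2": -0.2}
effects_at_zero = probit_marginal_effects(0.0, hypothetical_betas)
effects_at_two = probit_marginal_effects(2.0, hypothetical_betas)

for name in hypothetical_betas:
    print(f"{name}: effect at Z=0 is {effects_at_zero[name]:+.4f}, "
          f"at Z=2 is {effects_at_two[name]:+.4f}")
```

Because the density peaks at Z = 0 and falls away on both sides, the same coefficient implies a smaller marginal effect the further Z lies from zero, which is why the evaluation point has to be stated.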

ILLUSTRATION
Why do some people graduate from high school while others drop out? Here we use the same multivariate example as in the logit lecture (see Illustration 2 in the logit lecture slides), so as to facilitate comparison.

. probit GRAD ASVABC SM SF MALE

Iteration 0: log likelihood =
Iteration 1: log likelihood =
Iteration 2: log likelihood =
Iteration 3: log likelihood =
Iteration 4: log likelihood =

Probit estimates        Number of obs = 540
                        LR chi2(4)    =
                        Prob > chi2   =
Log likelihood =        Pseudo R2     =

GRAD   | Coef.  Std. Err.  z  P>|z|  [95% Conf. Interval]
ASVABC |
SM     |
SF     |
MALE   |
_cons  |

. sum GRAD ASVABC SM SF MALE

Variable | Obs  Mean  Std. Dev.  Min  Max
GRAD     |
ASVABC   |
SM       |
SF       |
MALE     |

As with logit analysis, the coefficients have no direct interpretation. However, we can use them to quantify the marginal effects of the explanatory variables on the probability of graduating from high school. We will estimate the marginal effects, putting all the explanatory variables equal to their sample means.

Step 1: Calculate Z, when the X variables are equal to their sample means.

Variable    Mean     b         Mean × b
ASVABC
SM          11.58    –0.008    –0.094
SF
MALE
Constant    1.00     –1.451    –1.451
Total                          1.881

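Step 2: Calculate the standardized normal density at this value of Z: $f(Z) = \frac{1}{\sqrt{2\pi}}\, e^{-Z^2/2}$, which for Z = 1.881 is approximately 0.068.
Step 3: Calculate $f(Z)\,b_i$ for each explanatory variable.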
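Steps 2 and 3 can be checked numerically. The sketch below is hedged: Z = 1.881 and the SM coefficient of –0.008 are taken from the lecture, but the ASVABC coefficient is an inference, backed out from the two Z totals the lecture reports for ASVABC = 30 and ASVABC = 50 (0.503 and 1.803), not a value read from the regression output.

```python
from statistics import NormalDist

# Step 2: standard normal density at the Z obtained from the sample means.
Z_AT_MEANS = 1.881  # total from Step 1 of the lecture
density = NormalDist().pdf(Z_AT_MEANS)

# Coefficient reported in the lecture's table.
b_sm = -0.008

# ASSUMPTION: inferred from the lecture's Z totals at ASVABC = 30 and 50,
# since only ASVABC changes between those two calculations.
b_asvabc = (1.803 - 0.503) / (50 - 30)

# Step 3: marginal effect = f(Z) * b for each variable.
me_sm = density * b_sm
me_asvabc = density * b_asvabc

print(f"f(Z) at Z = {Z_AT_MEANS}: {density:.3f}")
print(f"marginal effect of SM:     {me_sm:+.4f}")
print(f"marginal effect of ASVABC: {me_asvabc:+.4f}")
```

Rounded to three decimals, these reproduce the lecture's figures of about 0.004 for ASVABC and –0.001 for SM.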

We see that a one-point increase in ASVABC increases the probability of graduating from high school by about 0.004, i.e. 0.4 percentage points. Mother's schooling (SM) has a negligible effect and father's schooling (SF) has no discernible effect at all. Males have a probability of graduating about 0.4 percentage points higher than females.

What is the probability of graduating when ASVABC is equal to (a) 30, (b) 50? Set the values of the other X variables equal to their sample means.

Variable    Value    b         Value × b
ASVABC      30
SM          11.58    –0.008    –0.094
SF
MALE
Constant    1.00     –1.451    –1.451
Total                          0.503

When ASVABC = 30, the probability of graduating is 69.25%: NORMSDIST(0.503) = 0.6925.

Variable    Value    b         Value × b
ASVABC      50
SM          11.58    –0.008    –0.094
SF
MALE
Constant    1.00     –1.451    –1.451
Total                          1.803

When ASVABC = 50, the probability of graduating is 96.43%: NORMSDIST(1.803) = 0.9643.
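The two predicted probabilities can be verified without Excel. This short Python sketch feeds the lecture's Z totals (0.503 for ASVABC = 30 and 1.803 for ASVABC = 50) into the standard normal CDF; the variable names are illustrative.

```python
from statistics import NormalDist

# Z totals from the lecture, with the other X variables at their sample means.
z_by_asvabc = {30: 0.503, 50: 1.803}

# Predicted probability of graduating = Phi(Z).
probabilities = {score: NormalDist().cdf(z) for score, z in z_by_asvabc.items()}

for score, prob in probabilities.items():
    print(f"ASVABC = {score}: P(graduate) = {prob:.4f}")
```

`NormalDist().cdf` plays the same role here as NORMSDIST in Excel.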

Logit versus Probit

            Logit     Probit    Linear
            f(Z)b     f(Z)b     b
ASVABC
SM          –0.001    –0.001    –0.002
SF
MALE                            –0.007

The logit and probit results are displayed for comparison. The coefficients in the regressions are very different because different mathematical functions are being fitted. Nevertheless, the estimates of the marginal effects are usually similar.

However, if the outcomes in the sample are divided between a large majority and a small minority, the logit and probit marginal effects can differ. This is because the observations are then concentrated in a tail of the distribution. Although the logit and probit functions share the same sigmoid outline, their tails are somewhat different. This is the case here, but even so the estimates are identical to three decimal places.
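The difference in the tails can be illustrated numerically. The sketch below compares lower-tail probabilities of the standard normal and the logistic distribution. It rests on an assumption that is not part of the lecture: the logistic index is rescaled by π/√3 so that both distributions have unit variance, which is one common way to put them on a comparable footing.

```python
import math
from statistics import NormalDist


def logistic_cdf(z: float) -> float:
    """Cumulative distribution function of the standard logistic distribution."""
    return 1.0 / (1.0 + math.exp(-z))


# ASSUMPTION: match variances. The standard logistic has sd = pi / sqrt(3).
LOGISTIC_SD = math.pi / math.sqrt(3)

for z in (-1.0, -2.0, -3.0):
    normal_tail = NormalDist().cdf(z)
    logistic_tail = logistic_cdf(z * LOGISTIC_SD)
    print(f"z = {z:+.0f}: normal tail = {normal_tail:.5f}, "
          f"logistic tail = {logistic_tail:.5f}")
```

Under this scaling the two curves are close near the centre, while far from it the logistic distribution leaves more probability in the tail than the normal does.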

So, logit or probit?
The logit model is easier to compute, and used to be more popular than the probit model. The probit model is theoretically more appealing as it is based on the normal distribution. However, it uses more computer time. Given the advances in computer technology, the choice between the logit model and the probit model is now a matter of taste.