Presentation is loading. Please wait.

Presentation is loading. Please wait.

In previous lecture, we highlighted 3 shortcomings of the LPM. The most serious one is the unboundedness problem, i.e., the LPM may make the nonsense predictions.

Similar presentations


Presentation on theme: "In previous lecture, we highlighted 3 shortcomings of the LPM. The most serious one is the unboundedness problem, i.e., the LPM may make the nonsense predictions."— Presentation transcript:

1 In previous lecture, we highlighted 3 shortcomings of the LPM. The most serious one is the unboundedness problem, i.e., the LPM may make the nonsense predictions that an event will occur with probability greater than 1 or less than 0. In this lecture, we will deal with this problem using the logit model. 1 Adapted from “Introduction to Econometrics” by Christopher Dougherty

2 The usual way of avoiding this problem is to hypothesize that the probability is a sigmoid (S-shaped) function of Z, F(Z), where Z is a function of the explanatory variables. The "input" is Z and the "output" is F(Z). BINARY CHOICE MODELS: LOGIT ANALYSIS 2

3 Several mathematical functions are sigmoid in character. One is the logistic function. The logistic function is useful because it can take as an input any value from negative infinity to positive infinity, whereas the output is confined to values between 0 and 1. The variable Z represents the exposure to some set of independent variables, in this case, Z =  1 +  2 X. Z is also known as logit. F(Z) represents the probability of an event occurring (probability of success, p), given that set of explanatory variables. 3

4 As Z goes to + , e –Z goes to 0 and p goes to 1 (but cannot exceed 1). As Z goes to - , e –Z goes to  and p goes to 0 (but cannot be below 0). 4

5 For a nonlinear model of this kind, maximum likelihood estimation is much superior to the use of OLS for estimating the parameters. ILLUSTRATION 1 Why do some people graduate from high school while others drop out? Our LPM model is in the form (see previous lecture): GRAD =  1 +  2 ASVABC +  i Our logit model is in the form: Z =  1 +  2 ASVABC i 5

6 6

7 . logit GRAD ASVABC Iteration 0: log likelihood = -118.67769 Iteration 1: log likelihood = -104.45292 Iteration 2: log likelihood = -97.135677 Iteration 3: log likelihood = -96.887294 Iteration 4: log likelihood = -96.886017 Logit estimates Number of obs = 540 LR chi2(1) = 43.58 Prob > chi2 = 0.0000 Log likelihood = -96.886017 Pseudo R2 = 0.1836 ------------------------------------------------------------------------------ GRAD | Coef. Std. Err. z P>|z| [95% Conf. Interval] -------------+---------------------------------------------------------------- ASVABC |.1313626.022428 5.86 0.000.0874045.1753206 _cons | -3.240218.9444844 -3.43 0.001 -5.091373 -1.389063 ------------------------------------------------------------------------------ 7

8 8

9 What is the probability that an individual with a score of 30 will graduate? When ASVABC = 30, An individual with a score of 30 has a 67 percent probability of graduating. 9

10 The coefficients of the Z function do not have any direct intuitive interpretation. However, we can use them to quantify the marginal effect of a change in ASVABC on the probability of graduating. We will do this theoretically for the general case where Z is a function of several explanatory variables. Since p is a function of Z, and Z is a function of the X variables, the marginal effect of X i on p can be written as: 10

11 (1) 11

12 (2) (3) Hence we obtain an expression for the marginal effect of X i on p. The marginal effect is not constant because it depends on the value of Z, which in turn depends on the values of the explanatory variables. A common procedure is to evaluate it for the sample means of the explanatory variables. 12

13 The sample mean of ASVABC in this sample is 51.36.. sum GRAD ASVABC Variable | Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- GRAD | 540.9425926.2328351 0 1 ASVABC | 540 51.36271 9.567646 25.45931 66.07963 When evaluated at the mean, Z is equal to 3.507. 13

14 The marginal effect, evaluated at the mean, is therefore 0.004. This implies that a one point increase in ASVABC would increase the probability of graduating from high school by 0.4%. 14

15 As noted earlier, the marginal effect does vary because it depends on the value of Z, which in turn depends on the values of X. How much is the marginal effect when ASVABC equal to 30? A one point increase in ASVABC then increases the probability of graduating by 2.9% 15

16 16

17 . logit GRAD ASVABC SM SF MALE Iteration 0: log likelihood = -118.67769 Iteration 1: log likelihood = -104.73493 Iteration 2: log likelihood = -97.080528 Iteration 3: log likelihood = -96.806623 Iteration 4: log likelihood = -96.804845 Iteration 5: log likelihood = -96.804844 Here we consider a multivariate model with a somewhat better specification. ILLUSTRATION 2 Why do some people graduate from high school while others drop out? 17

18 Logit estimates Number of obs = 540 LR chi2(4) = 43.75 Prob > chi2 = 0.0000 Log likelihood = -96.804844 Pseudo R2 = 0.1843 ------------------------------------------------------------------------------ GRAD | Coef. Std. Err. z P>|z| [95% Conf. Interval] -------------+---------------------------------------------------------------- ASVABC |.1329127.0245718 5.41 0.000.0847528.1810726 SM | -.023178.0868122 -0.27 0.789 -.1933267.1469708 SF |.0122663.0718876 0.17 0.865 -.1286307.1531634 MALE |.1279654.3989345 0.32 0.748 -.6539318.9098627 _cons | -3.252373 1.065524 -3.05 0.002 -5.340761 -1.163985 ------------------------------------------------------------------------------. sum GRAD ASVABC SM SF MALE Variable | Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- GRAD | 540.9425926.2328351 0 1 ASVABC | 540 51.36271 9.567646 25.45931 66.07963 SM | 540 11.57963 2.816456 0 20 SF | 540 11.83704 3.53715 0 20 MALE | 540.5.5004636 0 1 18

19 ASVABC51.360.1336.826 SM11.58–0.023–0.269 SF11.840.0120.146 MALE0.500.1280.064 Constant1.00–3.252–3.252 Total3.514 We will estimate the marginal effects, putting all the explanatory variables equal to their sample means. Step 1: Calculate Z, when the X variables are equal to their sample means. 19

20 Step 2: Calculate Step 3: Calculate Note that: 20

21 We see that the effect of ASVABC is about the same as before. Mother's schooling (SM) has negligible effect and father's schooling (SF) has no discernible effect at all. Males have 0.4 percent higher probability of graduating than females. ASVABC51.360.1336.8260.0280.004 SM11.58–0.023–0.2690.028–0.001 SF11.840.0120.1460.0280.000 MALE0.500.1280.0640.0280.004 21

22 Suppose that we want to examine the effect of a teaching method known as PSI on the performance of students in GD30703. The question was whether students exposed to the method scored higher on exams in the class. Data were collected from 32 students in two classes, one in which PSI was used and another in which a traditional teaching method was employed. 1) GPA — Grade point average before taking the class. 2) TUCE — the score on an exam given at the beginning of the term to test entering knowledge of the material. 3) PSI — a dummy variable indicating the teaching method used (1 = used Psi, 0 = other method). 4) GRADE — coded 1 if the final grade was an A, 0 if the final grade was a B or C. GRADE was the dependent variable, and of particular interest was whether PSI had a significant effect on GRADE. TUCE and GPA are included as control variables. ILLUSTRATION 3 22

23 . logit GRADE GPA TUCE PSI Logit estimates Number of obs = 32 LR chi2(4) = 15.40 Prob > chi2 = 0.0015 Log likelihood = -12.889633 Pseudo R2 = 0.3740 ------------------------------------------------------------------------------ GRADE | Coef. Std. Err. z P>|z| [95% Conf. Interval] -------------+---------------------------------------------------------------- GPA | 2.826113 1.262941 2.24 0.025.3507938 5.301432 TUCE |.0951577.1415542 0.67 0.501 -.1822835.3725988 PSI | 2.378688 1.064564 2.23 0.025.29218 4.465195 _cons | -13.02135 4.931325 -2.64 0.008 -22.68657 -3.35613 ------------------------------------------------------------------------------ 23

24 GPA3.02.8268.478 TUCE 20 0.0951.903 PSI 0 2.3790.000 Constant1.00–13.021–13.021 Total-2.6399 What is the probability of getting an A for a student who has a GPA of 3.0, is taught by traditional methods, and has a score of 20 on TUCE? This student has a 6.66% chance of getting an A. 24

25 GPA3.02.8268.478 TUCE 20 0.0951.903 PSI 1 2.3792.379 Constant1.00–13.021–13.021 Total-0.26117 What is the probability of getting an A for the same student if he is placed in the PSI class? Hence, getting into the PSI class would substantially increase the chances of getting an A, i.e., the student would have about a 37% better chance. 25


Download ppt "In previous lecture, we highlighted 3 shortcomings of the LPM. The most serious one is the unboundedness problem, i.e., the LPM may make the nonsense predictions."

Similar presentations


Ads by Google