Ordered probit models.

Slides:



Advertisements
Similar presentations
Continued Psy 524 Ainsworth
Advertisements

Qualitative predictor variables
Econometrics I Professor William Greene Stern School of Business
Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2013 William Greene Department of Economics Stern School.
VII. Ordinal & Multinomial
Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/
Gologit2: Generalized Logistic Regression/ Partial Proportional Odds Models for Ordinal Dependent Variables Part 1: The gologit model & gologit2 program.
EC220 - Introduction to econometrics (chapter 10)
Simple Logistic Regression
Introduction to Logistic Regression In Stata Maria T. Kaylen, Ph.D. Indiana Statistical Consulting Center WIM Spring 2014 April 11, 2014, 3:00-4:30pm.
COPYRIGHT OF: ABHINAV ANAND JYOTI ARORA SHRADDHA RAMSWAMY DISCRETE CHOICE MODELING IN HEALTH ECONOMICS.
Gologit2: Generalized Logistic Regression/ Partial Proportional Odds Models for Ordinal Dependent Variables Richard Williams Department of Sociology University.
1 BINARY CHOICE MODELS: LOGIT ANALYSIS The linear probability model may make the nonsense predictions that an event will occur with probability greater.
1 BINARY CHOICE MODELS: PROBIT ANALYSIS In the case of probit analysis, the sigmoid function F(Z) giving the probability is the cumulative standardized.
Multilevel Models 4 Sociology 8811, Class 26 Copyright © 2007 by Evan Schofer Do not copy or distribute without permission.
Introduction to Categorical Data Analysis
Statistical Tests Karen H. Hagglund, M.S.
1 Logit/Probit Models. 2 Making sense of the decision rule Suppose we have a kid with great scores, great grades, etc. For this kid, x i β is large. What.
Models with Discrete Dependent Variables
Generalized Ordered Logit Models Part II: Interpretation
In previous lecture, we highlighted 3 shortcomings of the LPM. The most serious one is the unboundedness problem, i.e., the LPM may make the nonsense predictions.
Sociology 601: Class 5, September 15, 2009
Duration models Bill Evans 1. timet0t0 t2t2 t 0 initial period t 2 followup period a b c d e f h g i Flow sample.
1 Discrete and Categorical Data William N. Evans Department of Economics University of Maryland.
So far, we have considered regression models with dummy variables of independent variables. In this lecture, we will study regression models whose dependent.
1 Discrete and Categorical Data William N. Evans Department of Economics/MPRC University of Maryland.
1 Section 3 Probit and Logit Models. 2 Dichotomous Data Suppose data is discrete but there are only 2 outcomes Examples –Graduate high school or not –Patient.
In previous lecture, we dealt with the unboundedness problem of LPM using the logit model. In this lecture, we will consider another alternative, i.e.
Section Count Data Models. Introduction Many outcomes of interest are integer counts –Doctor visits –Low work days –Cigarettes smoked per day –Missed.
MR2300: MARKETING RESEARCH PAUL TILLEY Unit 10: Basic Data Analysis.
Missing Data.. What do we mean by missing data? Missing observations which were intended to be collected but: –Never collected –Lost accidently –Wrongly.
BINARY CHOICE MODELS: LOGIT ANALYSIS
Discrete Choice Modeling William Greene Stern School of Business New York University Lab Sessions.
Christopher Dougherty EC220 - Introduction to econometrics (chapter 10) Slideshow: binary choice logit models Original citation: Dougherty, C. (2012) EC220.
Lecture 14-1 (Wooldridge Ch 17) Linear probability, Probit, and
Methods Workshop (3/10/07) Topic: Event Count Models.
1 The Receiver Operating Characteristic (ROC) Curve EPP 245 Statistical Analysis of Laboratory Data.
1 BINARY CHOICE MODELS: PROBIT ANALYSIS In the case of probit analysis, the sigmoid function is the cumulative standardized normal distribution.
Repeated Measures, Part 2 May, 2009 Charles E. McCulloch, Division of Biostatistics, Dept of Epidemiology and Biostatistics, UCSF.
Wei Sun Renmin University, Hanqing Advanced Institute of Economics and Finance and School of Finance Anthony Webb Center for Retirement Research at Boston.
Limited Dependent Variable Models ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Discrete Choice Modeling William Greene Stern School of Business New York University.
Discrete Choice Modeling William Greene Stern School of Business New York University.
Lecture 18 Ordinal and Polytomous Logistic Regression BMTRY 701 Biostatistical Methods II.
Discrete Choice Modeling William Greene Stern School of Business New York University.
A first order model with one binary and one quantitative predictor variable.
The dangers of an immediate use of model based methods The chronic bronchitis study: bronc: 0= no 1=yes poll: pollution level cig: cigarettes smokes per.
DON’T WRITE DOWN THE MATERIAL ON THE FOLLOWING SLIDES, JUST LISTEN TO THE DISCUSSION AND TRY TO INTERPRET DIAGRAMS AND STATISTICAL RESULTS.
Qualitative and Limited Dependent Variable Models ECON 6002 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s notes.
Analysis of Experimental Data IV Christoph Engel.
1/53: Topic 3.1 – Models for Ordered Choices Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA William.
Exact Logistic Regression
1 BINARY CHOICE MODELS: LINEAR PROBABILITY MODEL Economists are often interested in the factors behind the decision-making of individuals or enterprises,
6. Ordered Choice Models. Ordered Choices Ordered Discrete Outcomes E.g.: Taste test, credit rating, course grade, preference scale Underlying random.
Birthweight (gms) BPDNProp Total BPD (Bronchopulmonary Dysplasia) by birth weight Proportion.
[Part 5] 1/43 Discrete Choice Modeling Ordered Choice Models Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction.
1 BINARY CHOICE MODELS: LOGIT ANALYSIS The linear probability model may make the nonsense predictions that an event will occur with probability greater.
The Probit Model Alexander Spermann University of Freiburg SS 2008.
Logit VS Probit Pseudo R 2 LogitProbit Gain Loss Consistent Prospect
The Probit Model Alexander Spermann University of Freiburg SoSe 2009
Logistic Regression APKC – STATS AFAC (2016).
Advanced Quantitative Techniques
Advanced Quantitative Techniques
Lecture 18 Matched Case Control Studies
William Greene Stern School of Business New York University
Introduction to Logistic Regression
Problems with infinite solutions in logistic regression
Introduction to Econometrics, 5th edition
Presentation transcript:

Ordered probit models

Ordered Probit Many discrete outcomes are to questions that have a natural ordering but no quantitative interpretation: Examples: Self reported health status (excellent, very good, good, fair, poor) Do you agree with the following statement Strongly agree, agree, disagree, strongly disagree

Can use the same type of model as in the previous section to analyze these outcomes Another ‘latent variable’ model Key to the model: there is a monotonic ordering of the qualitative responses

Self reported health status Excellent, very good, good, fair, poor Coded as 1, 2, 3, 4, 5 on National Health Interview Survey We will code as 5,4,3,2,1 (easier to think of this way) Asked on every major health survey Important predictor of health outcomes, e.g. mortality Key question: what predicts health status?

Important to note – the numbers 1-5 mean nothing in terms of their value, just an ordering to show you the lowest to highest The example below is easily adapted to include categorical variables with any number of outcomes

Model yi* = latent index of reported health The latent index measures your own scale of health. Once yi* crosses a certain value you report poor, then good, then very good, then excellent health

yi = (1,2,3,4,5) for (fair, poor, VG, G, excel) Interval decision rule yi=1 if yi* ≤ u1 yi=2 if u1 < yi* ≤ u2 yi=3 if u2 < yi* ≤ u3 yi=4 if u3 < yi* ≤ u4 yi=5 if yi* > u4

As with logit and probit models, we will assume yi As with logit and probit models, we will assume yi* is a function of observed and unobserved variables yi* = β0 + x1i β1 + x2i β2 …. xki βk + εi yi* = xi β + εi

The threshold values (u1, u2, u3, u4) are unknown The threshold values (u1, u2, u3, u4) are unknown. We do not know the value of the index necessary to push you from very good to excellent. In theory, the threshold values are different for everyone Computer will not only estimate the β’s, but also the thresholds – average across people

As with probit and logit, the model will be determined by the assumed distribution of ε In practice, most people pick nornal, generating an ‘ordered probit’ (I have no idea why) We will generate the math for the probit version

Probabilities Lets do the outliers, Pr(yi=1) and Pr(yi=5) first = Pr(yi* ≤ u1) = Pr(xi β +εi ≤ u1 ) =Pr(εi ≤ u1 - xi β) = Φ[u1 - xi β] = 1- Φ[xi β – u1]

Pr(yi=5) = Pr(yi* > u4) = Pr(xi β +εi > u4 ) =Pr(εi > u4 - xi β) = 1 - Φ[u4 - xi β] = Φ[xi β – u4]

Sample one for y=3 Pr(yi=3) = Pr(u2 < yi* ≤ u3) = Pr(yi* ≤ u3) – Pr(yi* ≤ u2) = Pr(xi β +εi ≤ u3) – Pr(xi β +εi ≤ u2) = Pr(εi ≤ u3- xi β) - Pr(εi ≤ u2 - xi β) = Φ[u3- xi β] - Φ[u2 - xi β] = 1 - Φ[xi β - u3] – 1 + Φ[xi β - u2] = Φ[xi β - u2] - Φ[xi β - u3]

Summary Pr(yi=1) = 1- Φ[xi β – u1] Pr(yi=2) = Φ[xi β – u1] - Φ[xi β – u2] Pr(yi=3) = Φ[xi β – u2] - Φ[xi β – u3] Pr(yi=4) = Φ[xi β – u3] - Φ[xi β – u4] Pr(yi=5) = Φ[xi β – u4]

Likelihood function There are 5 possible choices for each person Only 1 is observed L = Σi ln[Pr(yi=k)] for k

Programming example Cancer control supplement to 1994 National Health Interview Survey Question: what observed characteristics predict self reported health (1-5 scale) 1=poor, 5=excellent Key covariates: income, education, age, current and former smoking status Programs sr_health_status.do, .dta, .log

desc; male byte %9.0g =1 if male age byte %9.0g age in years educ byte %9.0g years of education smoke byte %9.0g current smoker smoke5 byte %9.0g smoked in past 5 years black float %9.0g =1 if respondent is black othrace float %9.0g =1 if other race (white is ref) sr_health float %9.0g 1-5 self reported health, 5=excel, 1=poor famincl float %9.0g log family income

tab sr_health; 1-5 self | reported | health, | 5=excel, | 1=poor | Freq. Percent Cum. ------------+----------------------------------- 1 | 342 2.65 2.65 2 | 991 7.68 10.33 3 | 3,068 23.78 34.12 4 | 3,855 29.88 64.00 5 | 4,644 36.00 100.00 Total | 12,900 100.00

In STATA oprobit sr_health male age educ famincl black othrace smoke smoke5;

Ordered probit estimates Number of obs = 12900 LR chi2(8) = 2379.61 Prob > chi2 = 0.0000 Log likelihood = -16401.987 Pseudo R2 = 0.0676 ------------------------------------------------------------------------------ sr_health | Coef. Std. Err. z P>|z| [95% Conf. Interval] -------------+---------------------------------------------------------------- male | .1281241 .0195747 6.55 0.000 .0897583 .1664899 age | -.0202308 .0008499 -23.80 0.000 -.0218966 -.018565 educ | .0827086 .0038547 21.46 0.000 .0751535 .0902637 famincl | .2398957 .0112206 21.38 0.000 .2179037 .2618878 black | -.221508 .029528 -7.50 0.000 -.2793818 -.1636341 othrace | -.2425083 .0480047 -5.05 0.000 -.3365958 -.1484208 smoke | -.2086096 .0219779 -9.49 0.000 -.2516855 -.1655337 smoke5 | -.1529619 .0357995 -4.27 0.000 -.2231277 -.0827961 _cut1 | .4858634 .113179 (Ancillary parameters) _cut2 | 1.269036 .11282 _cut3 | 2.247251 .1138171 _cut4 | 3.094606 .1145781

Interpret coefficients Marginal effects/changes in probabilities are now a function of 2 things Point of expansion (x’s) Frame of reference for outcome (y) STATA Picks mean values for x’s You pick the value of y

Continuous x’s Consider y=5 d Pr(yi=5)/dxi = d Φ[xi β – u4]/dxi = βφ[xi β – u4] Consider y=3 d Pr(yi=3)/dxi = βφ[xi β – u3] - βφ[xi β – u4]

Discrete X’s xi β = β0 + x1i β1 + x2i β2 …. xki βk ΔPr(yi=5) = X2i is yes or no (1 or 0) ΔPr(yi=5) = Φ[β0 + x1i β1 + β2 + x3i β3 +.. xki βk] - Φ[β0 + x1i β1 + x3i β3 …. xki βk] Change in the probabilities when x2i=1 and x2i=0

Ask for marginal effects mfx compute, predict(outcome(5));

mfx compute, predict(outcome(5)); Marginal effects after oprobit y = Pr(sr_health==5) (predict, outcome(5)) = .34103717 ------------------------------------------------------------------------------ variable | dy/dx Std. Err. z P>|z| [ 95% C.I. ] X ---------+-------------------------------------------------------------------- male*| .0471251 .00722 6.53 0.000 .03298 .06127 .438062 age | -.0074214 .00031 -23.77 0.000 -.008033 -.00681 39.8412 educ | .0303405 .00142 21.42 0.000 .027565 .033116 13.2402 famincl | .0880025 .00412 21.37 0.000 .07993 .096075 10.2131 black*| -.0781411 .00996 -7.84 0.000 -.097665 -.058617 .124264 othrace*| -.0843227 .01567 -5.38 0.000 -.115043 -.053602 .04124 smoke*| -.0749785 .00773 -9.71 0.000 -.09012 -.059837 .289147 smoke5*| -.0545062 .01235 -4.41 0.000 -.078719 -.030294 .081395 (*) dy/dx is for discrete change of dummy variable from 0 to 1

Interpret the results Males are 4.7 percentage points more likely to report excellent Each year of age decreases chance of reporting excellent by 0.7 percentage points Current smokers are 7.5 percentage points less likely to report excellent health

Minor notes about estimation Wald tests/-2 log likelihood tests are done the exact same was as in PROBIT and LOGIT

Use PRCHANGE to calculate marginal effect for a specific person prchange, x(age=40 black=0 othrace=0 smoke=0 smoke5=0 educ=16); When a variable is NOT specified (famincl), STATA takes the sample mean.

PRCHANGE will produce results for all outcomes male Avg|Chg| 1 2 3 4 0->1 .0203868 -.0020257 -.00886671 -.02677558 -.01329902 5 0->1 .05096698

age Avg|Chg| 1 2 3 4 Min->Max .13358317 .0184785 .06797072 .17686112 .07064757 -+1/2 .00321942 .00032518 .00141642 .00424452 .00206241 -+sd/2 .03728014 .00382077 .01648743 .04910323 .0237889 MargEfct .00321947 .00032515 .00141639 .00424462 .00206252