DON’T WRITE DOWN THE MATERIAL ON THE FOLLOWING SLIDES, JUST LISTEN TO THE DISCUSSION AND TRY TO INTERPRET DIAGRAMS AND STATISTICAL RESULTS.

Slides:



Advertisements
Similar presentations
Brief introduction on Logistic Regression
Advertisements

Hypothesis Testing Steps in Hypothesis Testing:
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Regression Extensions W&W Chapter 14. Introduction So far we have assumed that our independent variables are measured intervally. Today we will discuss.
Assignment 12 – Appendix B- 1 DON’T TAKE NOTES! ALL THAT FOLLOWS IS IN THE COURSE MATERIALS! The purpose of Appendix B is to statistically analyze a behavioral.
Statistical Issues in Research Planning and Evaluation
Major power intervention in international crises, Paul K. Huth.
Linear Regression: Making Sense of Regression Results
1 BINARY CHOICE MODELS: LOGIT ANALYSIS The linear probability model may make the nonsense predictions that an event will occur with probability greater.
1 BINARY CHOICE MODELS: PROBIT ANALYSIS In the case of probit analysis, the sigmoid function F(Z) giving the probability is the cumulative standardized.
Sociology 601 Class 13: October 13, 2009 Measures of association for tables (8.4) –Difference of proportions –Ratios of proportions –the odds ratio Measures.
Correlation Chapter 9.
Multinomial Logit Sociology 8811 Lecture 11 Copyright © 2007 by Evan Schofer Do not copy or distribute without permission.
Models with Discrete Dependent Variables
In previous lecture, we highlighted 3 shortcomings of the LPM. The most serious one is the unboundedness problem, i.e., the LPM may make the nonsense predictions.
Sociology 601 Class 28: December 8, 2009 Homework 10 Review –polynomials –interaction effects Logistic regressions –log odds as outcome –compared to linear.
Ordered probit models.
Chi-square Test of Independence
So far, we have considered regression models with dummy variables of independent variables. In this lecture, we will study regression models whose dependent.
Exam 1 Review GOVT 120.
In previous lecture, we dealt with the unboundedness problem of LPM using the logit model. In this lecture, we will consider another alternative, i.e.
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
Game Theory and Empirics: What Can Go Wrong. Using Formal Theory to Identify and Characterize Biases James Fearon, “Signaling versus the Balance of Power.
Richard M. Jacobs, OSA, Ph.D.
DESIGNING, CONDUCTING, ANALYZING & INTERPRETING DESCRIPTIVE RESEARCH CHAPTERS 7 & 11 Kristina Feldner.
BINARY CHOICE MODELS: LOGIT ANALYSIS
Ordinal Logistic Regression “Good, better, best; never let it rest till your good is better and your better is best” (Anonymous)
Measures of Central Tendency
Hypothesis Testing:.
Logistic Regression- Dichotomous Dependent Variables March 21 & 23, 2011.
1 BINARY CHOICE MODELS: PROBIT ANALYSIS In the case of probit analysis, the sigmoid function is the cumulative standardized normal distribution.
Understanding Statistics
Using the Margins Command to Estimate and Interpret Adjusted Predictions and Marginal Effects Richard Williams
Social Science Models What elements (at least two) are necessary for a “social science model”?
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Hypothesis Testing PowerPoint Prepared by Alfred.
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
DON’T TAKE NOTES! MUCH OF THE FOLLOWING IS IN THE COURSEPACK! Just follow the discussion and try to interpret the statistical results that follow.
From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.
Statistics Review - 1 What is the difference between a variable and a constant? Why are we more interested in variables than constants? What are the four.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
Voluntary Disclosure Not Covered in Textbook. You’re on a job interview and the interviewer knows what the distribution of GPAs are for MBA students at.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 8 First Part.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Examples of Presentations  The following are examples of presentations of regression tables and their interpretations.  These interpretations target.
Exam 1 Review GOVT 120. Review: Levels of Analysis Theory: Concept 1 is related to Concept 2 Hypothesis: Variable 1 (IV) is related to Variable 2 (DV)
U.S. Senate Voting on Taxation – Variable List -1 Tax = Percentage of times the senator voted in favor of federal tax changes where over 50% of the benefits.
1.What is Pearson’s coefficient of correlation? 2.What proportion of the variation in SAT scores is explained by variation in class sizes? 3.What is the.
Logistic Regression Analysis Gerrit Rooks
Describing Variables & Hypotheses Testing May 26, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
1 BINARY CHOICE MODELS: LINEAR PROBABILITY MODEL Economists are often interested in the factors behind the decision-making of individuals or enterprises,
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
1 BINARY CHOICE MODELS: LOGIT ANALYSIS The linear probability model may make the nonsense predictions that an event will occur with probability greater.
Techniques for Decision-Making: Data Visualization Sam Affolter.
What statistical tests have we learned so far? Descriptive statistics (chp. 12) –Mean, median, mode –Frequency of each response (frequencies), range, standard.
Political Socialization. Political socialization – The process through which an individual acquires his or her particular political orientations, including.
Statistics 19 Confidence Intervals for Proportions.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Chapter 9 Introduction to the t Statistic
The Probit Model Alexander Spermann University of Freiburg SoSe 2009
Logic of Hypothesis Testing
Notes on Logistic Regression
Discussion: Week 4 Phillip Keung.
QM222 Class 16 & 17 Today’s New topic: Estimating nonlinear relationships QM222 Fall 2017 Section A1.
Hypothesis Testing Review
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
Introduction to Logistic Regression
Political Affiliation and per capita income
Count Models 2 Sociology 8811 Lecture 13
Introduction to Econometrics, 5th edition
Presentation transcript:

DON’T WRITE DOWN THE MATERIAL ON THE FOLLOWING SLIDES, JUST LISTEN TO THE DISCUSSION AND TRY TO INTERPRET DIAGRAMS AND STATISTICAL RESULTS.

Social Science Models What elements (at least two) are necessary for a “social science model”?

Why Regression? - 1 Measures of Association (e.g., correlation) only tell us the strength of the relationship between X and Y, NOT the MAGNITUDE of the relationship. Regression tells us the MAGNITUDE of the relationship (i.e., how MUCH the dependent variable changes for a specified amount of change in the independent variable).

Why Regression? - 2

California Election Correlation of the Percent of the Countywide Vote for Barbara Boxer and Jerry Brown in 2010 with the Percentage of those 25, and Older, Who Have at Least a Bachelor’s Degree in 2000 and Median Household Income in correlate boxer10 brown10 coll00 medinc08 (obs=58) | boxer10 brown10 coll00 medinc boxer10 | brown10 | coll00 | medinc08 |

Graph of.97 Correlation of Brown10 and Boxer10

Graph of.74 Correlation of Coll00 and Boxer10

Graph of -.58 Correlation of %White in 2005 and Boxer10

Graph of -.23 Correlation of %Senior in 2005 and Boxer10

California Election Given the correlations below, what should you expect in the regression table on the next slide where the dependent variable is “boxer 10” (percent of county vote for Boxer in 2010)? correlate boxer10 brown10 coll00 medinc08 (obs=58) | boxer10 brown10 coll00 medinc boxer10 | brown10 | coll00 | medinc08 |

Probit and Logit - 1 One of the assumptions of regression is that the dependent variable has an unlimited number of categories of responses. This assumption is often violated. For example, in this class the dependent variable has frequently been a percentage. If we are not using decimals, there are 101 possible values (i.e., 0 plus 1 through 100).

Probit and Logit - 2 Having 101 options in the dependent variable is not a serious violation of the assumption of an unlimited number of options in the dependent variable. However, having less than 7 options is a serious violation. The “basic rule” is that if the dependent variable has fewer than 6-7 options, we should not use regression.

Visualizing Why We Need Probit or Logit - 1

Visualizing Why We Need Probit or Logit - 2

Visualizing the Estimation of Probit and Logit Coefficients

Probit and Logit - 3 Fortunately, probit and logit provide the many advantages of regression discussed in the readings in situations where the dependent variable has very few options. This situation occurs frequently in political science (e.g., in an international dispute war either broke out or did not – i.e., two categories – a voter voted either Democratic or Republican).

Probit and Logit - 4 Depending upon the dataset you select for your term paper, you will use either regression or probit/logit. The choice between probit and logit is largely arbitrary (i.e., typically, either is fine). The following two examples are from the vast quantitative literature in international relations.

Probit – International Relations-1 Major powers have frequently been in conflict over a small nation that is aligned with one of the major powers. For example, suppose China threatened to invade the Philippine Islands. In this case, the United States would be the “defender,” China the “attacker” and the Philippine Islands the “protégé” of the United States.

Probit – International Relations-2 The dependent variable in the ensuing analysis, “outcom” can assume only two values: 1 – deterrence succeeded – i.e., the potential attacker decided not to attack the smaller nation in question (called the “protégé”) or 0 – the potential attacker did attack and, hence, deterrence failed.

Probit – International Relations-3 Even if the defender defeated the attacker in an ensuing war, deterrence still “failed” because the attacker wasn’t deterred (i.e., the attacker did attack – regardless of whether the attacker “won” or “lost” the war). The following analysis is based on a dataset of 58 cases where a protégé was threatened by a major power.

Probit – International Relations-4 Dependent Variable: outcom: 1 = success – i.e., the potential attacking major power did not attack – deterrence was successful; 0 = failure – the potential attacker did attack – i.e., deterrence failed.

Probit – International Relations-5 The independent variables focus either on the value of the protégé (i.e., the more valuable the protégé the more likely the attacker will attack – that deterrence will “fail”) and the military capability of the defender (i.e., the greater the military capability of the defender the less likely the attacker will attack).

Probit - International Relations-6 Independent Variables: min: 1 = protégé possesses strategic minerals; 0 = protégé does not possess strategic minerals vppa: population size of the protégé nw: 1 = defender possesses nuclear weapons; 0 = defender does not possess nuclear weapons

Probit – International Relations-7 Stata Results: Probit regression Number of obs = 58 LR chi2(3) = 4.44 Prob > chi2 = Log likelihood = Pseudo R2 = outcom | Coef. Std. Err. z P>|z| [95% Conf. Interval] min | vppa | -3.34e nw | _cons | Probit coefficients don’t directly yield the magnitude of change but they tell the direction and statistical significance of the relationship.

Probit – International Relations-8 How disputes are solved has frequently been a topic of quantitative studies in international relations. In the study ahead, the dependent variable is the number of times a nation was the first to use force during a dispute with another nation. Both nations receive scores on the dependent variable. The scores range from 0 to 2. Hence, probit is used.

Probit – International Relations-9 If a nation is the first to use a minor level of force (commit up to 1,000 troops to a combat zone) and the first to escalate the crisis to a major level (commit more than a 1,000 troops to a combat zone), then that nation receives a score of “2.”

Probit–International Relations-10 Alternatively, if one nation is the first to escalate a crisis to a low level of force while the other nation is the first to escalate to a high level of force, both nations receive a score of “1.” Finally, if neither nation uses military force, each nation receives a score of “0.”

Probit–International Relations-11 Independent variables include the level of democracy of each nation (higher scores mean greater democracy), the balance of military power between the two nations (0 to 1: scores >.50 indicate the nation has greater military capability than it’s opponent), shared alliance ties (coded “1” if the two nations share a defense alliance, “0” if not) and

Probit–International Relations-12 satisfaction with the status quo (based upon policy statements the nation made prior to the dispute – coded “1” if satisfied, “0” if not satisfied).

Probit–International Relations-13 Results: Coefficient St. Error Actor’s Democracy Actor’s Democracy x Opponent’s Democracy Opponent’s Democracy (“x” signals an interaction term)

Probit–International Relations-14 Results: Coefficient St. Error Balance of Forces Shared Alliance Satisfaction with Status Quo What do these results tell you?

Probit – State Adoption of TRAP Abortion Laws DON’T WRITE THE NUMBERS! Ind. Var. Coefficient St. Error Dem. Control State Ideology % Catholic % Fundamental Public Opinion about Abortion

Probit – State Adoption of an Income Tax Over DON’T WRITE THE NUMBERS! Ind. Var. Coefficient St. Error Liberal Control Real Per Capita Income Governor Election Year

Probit – Issues and Voting - 1 In the ensuing analysis the dependent variable (the probability of voting Democratic for President) has only two categories of responses (the voter either voted Democratic for President – coded as “1”, or did not vote Democratic for President - coded “0”). Therefore, we will use probit instead of regression.

Probit – Issues and Voting - 2 The independent variables are the voter’s position on various policy oriented scales. The voter’s responses to each of the policy questions are coded -1, 0, or +1. The more liberal the answer the higher the score.

For example, since liberals support government services and typically want to spend less on defense, a respondent who wanted the government to provide many more services and increase spending a lot was coded “1” (0 = keep the same, -1 decrease a lot) Probit – Issues and Voting - 3

Probit – Issues and Voting - 4 while those wanting to decrease military spending a lot were also coded “1” (0 = keep the same, -1 increase spending a lot). Do the results on the next slide support the view that social issues have replaced economic issues as the prime issue basis of voting?

Probit – Issues and Voting - 5 High Low Income Voters Income Voters Govt. services 1.08 (.10).52 (.17) Defense spending 1.00 (.14).83 (.15) Govt. jobs.52 (.05).31 (.08) Abortion.47 (.10).23 (.10) Aid to Blacks.35 (.11).18 (.08) Women’s Role.25 (.07).33 (.08) Note the relative size of the coefficients