Presentation on theme: "Simpson’s Paradox Jeff Witmer 18 March 2010 G. Udny Yule (Not Edward H. Simpson)"— Presentation transcript:
Simpson’s Paradox Jeff Witmer 18 March 2010 G. Udny Yule (Not Edward H. Simpson)
Treatment A 81/8793% Treatment B234/27087% Blue disease Red disease Treatment C192/26373% Treatment D 55/8069% Black disease Treatment E78% Treatment F 83% 273/ / Treatment A Treatment B Treatment A Treatment B
Applicants % admitted Men % Women % Department Men Women Applicants % admitted Applicants % admitted A % % B % 25 68% C % % D % % E % % F 272 6% 341 7% UC Berkeley 1973 grad school admissions 6 large departments:
AirportLateTotal %LateTotal % Newark LaGuardia Pittsburg Detroit ContinentalUnited Percent of Planes Delayed from City of Origin January 2009 Totals
Showing two of the four airports graphically: Circles correspond to Continental (Newark much larger than LGA) Squares correspond to United Airlines (LGA larger than Newark) X equals average for the two airports Continental at 23.4% and United at 21.9% ContinentalUnited Marginal effect (ignoring/combining airports)
X equals average for the two airports Continental at 23.4% and United at 21.9% ContinentalUnited Marginal effect (ignoring/combining airports)
Hat tip: Roger Nelsen
Justice104/ Jeter12/ / / / / Combined What level of aggregation is “right”?
WSJ article, 2 December Unemployment higher now than in 1980s recession for each education level, but lower overall (graph only shows two education levels…)
White def19/ Black def 11/ / / / / White victim Black victim All victims 1977 data from 20 counties in Florida White def Black def The probability of a convicted murderer being given the death penalty (vs life in prison) depends more on the victim’s race than on the defendant’s race Hat tip: Mike Radelet (and Alan Agresti) and Jimmy Doi Sentence = death penalty?
Regression data: Olympic 1500 winners Hat tip: Phil Everson
State average SAT scores (1995) Average SAT vs Expenditure per student Average SAT vs Average teacher salary
Average SAT vs Pupil/teacher ratio
The 10 states with the lowest per pupil spending included four -- North Dakota, South Dakota, Tennessee, Utah -- among the 10 states with the top SAT scores. Only one of the 10 states with the highest per pupil expenditures -- Wisconsin -- was among the 10 states with the highest SAT scores. New Jersey has the highest per pupil expenditures, an astonishing $10,561, which teachers' unions elsewhere try to use as a negotiating benchmark. New Jersey's rank regarding SAT scores? Thirty-ninth... The fact that the quality of schools... [fails to correlate] with education appropriations will have no effect on the teacher unions' insistence that money is the crucial variable. -- George F. Will, (September 12, 1993), "Meaningless Money Factor," The Washington Post, C7.
Consider the fraction of students in a state who take the SAT States with high SAT scores but low fractions taking the test North Dakota Iowa Minnesota Wisconsin Utah South Dakota Average SAT vs Fraction of students taking the test
Added variable plot for “expend” Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) < 2e-16 *** expend * frac e-12 *** I(frac^2) e-06 *** Hat tip: Stacey Hancock and Albyn Jones
Other examples US Senate: replace moderate Dems with moderate Reps and the Senate could become more more conservative while each party caucus becomes more liberal The 20-yr death rate was higher for non-smokers than for smokers in a UK city – but lower in (almost) each age group. Hat tip: Jo Hardin
G. U. Yule (1903). "Notes on the Theory of Association of Attributes in Statistics". Biometrika 2: 121–134. Simpson, Edward H. (1951). "The Interpretation of Interaction in Contingency Tables". Journal of the Royal Statistical Society, Ser. B 13: 238–241. Stigler, S. M. (1980). Stigler's law of eponymy. Transactions of the New York Academy of Sciences, 39: (Merton Frestschrift Volume, F. Gieryn (ed)).