X 11 X 12 X 13 X 21 X 22 X 23 X 31 X 32 X 33. Research Question Are nursing homes dangerous for seniors? Does admittance to a nursing home increase risk.

Slides:



Advertisements
Similar presentations
From study objectives to analysis plan Helen Maguire.
Advertisements

Surviving Survival Analysis
Three or more categorical variables
M2 Medical Epidemiology
1 Arlene Ash QMC - Third Tuesday September 21, 2010 (as amended, Sept 23) Analyzing Observational Data: Focus on Propensity Scores.
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
Simple Logistic Regression
1 Contingency Tables: Tests for independence and homogeneity (§10.5) How to test hypotheses of independence (association) and homogeneity (similarity)
Propensity Score Matching Lava Timsina Kristina Rabarison CPH Doctoral Seminar Fall 2012.
Nearest neighbor matching USING THE GREEDY MATCH MACRO Note: Much of the code originally was written by Lori Parsons
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
1 Arlene Ash QMC - Third Tuesday September 21, 2010 Analyzing Observational Data: Focus on Propensity Scores.
Chapter 11 Survival Analysis Part 3. 2 Considering Interactions Adapted from "Anderson" leukemia data as presented in Survival Analysis: A Self-Learning.
PH6415 Review Questions. 2 Question 1 A journal article reports a 95%CI for the relative risk (RR) of an event (treatment versus control as (0.55, 0.97).
Magnet Schools and Peers: Effects on Student Achievement Dale Ballou Vanderbilt University November, 2007 Thanks to Steve Rivkin, Julie Berry Cullen, Adam.
BIOST 536 Lecture 9 1 Lecture 9 – Prediction and Association example Low birth weight dataset Consider a prediction model for low birth weight (< 2500.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
EPI 809/Spring Multiple Logistic Regression.
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
1 G Lect 11M Binary outcomes in psychology Can Binary Outcomes Be Studied Using OLS Multiple Regression? Transforming the binary outcome Logistic.
Adjusting for extraneous factors Topics for today More on logistic regression analysis for binary data and how it relates to the Wolf and Mantel- Haenszel.
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
Lecture 8 Chi-Square STAT 3120 Statistical Methods I.
STAT 3130 Statistical Methods II Missing Data and Imputation.
Logistic Regression III: Advanced topics Conditional Logistic Regression for Matched Data Conditional Logistic Regression for Matched Data.
Chapter 9 Producing Descriptive Statistics PROC MEANS; Summarize descriptive statistics for continuous numeric variables. PROC FREQ; Summarize frequency.
April 11 Logistic Regression –Modeling interactions –Analysis of case-control studies –Data presentation.
Proc freq: Five secrets* *Okay, well, lesser known facts.
18b. PROC SURVEY Procedures in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Categorical Data Analysis: When life fits in little boxes AnnMaria DeMars, PhD.
Logistic Regression STA2101/442 F 2014 See last slide for copyright information.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Propensity Scores How to do it – Part 1. X 11 X 12 X 13 X 21 X 22 X 23 X 31 X 32 X 33 No matrices were harmed in this presentation.
Article Review Cara Carty 09-Mar-06. “Confounding by indication in non-experimental evaluation of vaccine effectiveness: the example of prevention of.
O RGANIZING & DISPLAYING DATA Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Applied Epidemiologic Analysis - P8400 Fall 2002
1 היחידה לייעוץ סטטיסטי אוניברסיטת חיפה פרופ’ בנימין רייזר פרופ’ דוד פרג’י גב’ אפרת ישכיל.
Grant Brown.  AIDS patients – compliance with treatment  Binary response – complied or no  Attempt to find factors associated with better compliance.
Categorical Data Analysis: When life fits in little boxes AnnMaria DeMars, PhD.
Using Weighted Data Donald Miller Population Research Institute 812 Oswald Tower, December 2008.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
AnnMaria De Mars, Ph.D. The Julia Group Santa Monica, CA Categorical data analysis: For when your data DO fit in little boxes.
Please turn off cell phones, pagers, etc. The lecture will begin shortly.
Categorical data analysis: An overview of statistical techniques AnnMaria De Mars The Julia Group AnnMaria De Mars The Julia Group.
Lecture 18 Ordinal and Polytomous Logistic Regression BMTRY 701 Biostatistical Methods II.
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Multiple Imputation using SAS Don Miller 812 Oswald Tower
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Doing Analyses on Binary Outcome. From November 14 th Dr Sainani talked about how the math works for binomial data.
Logistic Regression For a binary response variable: 1=Yes, 0=No This slide show is a free open source document. See the last slide for copyright information.
Four way analysis Nursing home residence Gender Age Death.
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
Analysis of matched data Analysis of matched data.
March 28 Analyses of binary outcomes 2 x 2 tables
BINARY LOGISTIC REGRESSION
Advanced Quantitative Techniques
April 18 Intro to survival analysis Le 11.1 – 11.2
Notes on Logistic Regression
Advanced Quantitative Techniques
Sec 9C – Logistic Regression and Propensity scores
Typical biostatistics tasks
Multiple logistic regression
SPSS Propensity Score Matching: An overview
Categorical Data Analysis Review for Final
How to realize random in SAS Professor: Lijue Student: Zhengliang
Improving Overlap Farrokh Alemi, Ph.D.
Kaplan-Meier survival curves and the log rank test
Presentation transcript:

X 11 X 12 X 13 X 21 X 22 X 23 X 31 X 32 X 33

Research Question Are nursing homes dangerous for seniors? Does admittance to a nursing home increase risk of death in adults over 65 years of age when controlling for age, gender, race, and number of emergency room visits?

Propensity Score Matching or Do nursing homes kill you? ANNMARIA DE MARS, PH.D. & CHELSEA HEAVEN THE JULIA GROUP

WHY YOU NEED IT TWO NON- EQUIVALENT GROUPS Patients in specialized units People who attend a fundraising event

Any time you can ask the question …. Is there a difference on OUTCOME between levels of “treatment” A, controlling for X, Y and Z ?

Examples OUTCOME“TREATMENT” LEVELS COVARIATES DROP OUTPUBLIC, PRIVATEINCOME PARENT EDUCATION GR. 8 ACHIEVEMENT BMIDAILY SOFT DRINKS NO SOFT DRINKS GENDER AGE RACE EXERCISE FREQ. DEATHLIVES AT HOME NURSING HOME AGE GENDER TOTAL ER VISITS

1. Make sure there are pre-existing differences (Thank you, Captain Obvious)

2a. Decide on covariates Are the differences pre-existing or could they possibly be due to the different “treatment” levels? Are the differences pre-existing or could they possibly be due to the different “treatment” levels? Race and gender are good choices for covariates. If more students at private vs public schools are black or female, the schooling probably didn’t cause that Race and gender are good choices for covariates. If more students at private vs public schools are black or female, the schooling probably didn’t cause that Differences in grade 10 math scores may be a result of the type of school Differences in grade 10 math scores may be a result of the type of school

2b. Decide on covariates Don’t use your outcome variable as one of your covariates

3. Run logistic regression to generate propensity scores PROC LOGISTIC DATA= datasetname ; CLASS categorical variables ; MODEL dependent = list-of-covariates ; OUTPUT OUT = newdataset PREDICTED= propensity-score; PREDICTED= propensity-score;

4. Select matching method 1.Quintiles 2.Nearest neighbors 3.Calipers ALL OF THE ABOVE CAN BE DONE EITHER WITH OR WITHOUT REPLACEMENT

5. Run matching program & test its effectiveness 6. Run your analysis using the matched data set

An actual example Do nursing homes kill you?

Our data Kaiser Permanente Study of the Oldest Old, and : [California] DEPENDENT VARIABLE: Dthflag = 1 if Died during study period 0 if alive at end of study period 0 if alive at end of study period

Our data TREATMENT VARIABLE athome = 1 if lived at home continuously 0 if admitted to nursing home any time during study period 0 if admitted to nursing home any time during study period

Before matching AT HOME >NOYESTOTAL DIED Frequency (Column %) ======= == NO184 (14.6) 2,486 (52.6) 2,670 (44.6) YES1,077 (85.4) 2,239 (47.4) 3,316 (55.4) TOTAL1,2614,7255,986

Covariates * AGEAGE RACERACE GENDERGENDER TOTAL Emergency Room VISITS **TOTAL Emergency Room VISITS ** * Three out of four were DEFINITELY pre-existing differences ** Proxy for health

PROC LOGISTIC PROC LOGISTIC DATA= saslib.old ; CLASS athome race sex ; MODEL athome = race sex age_comp vissum1; OUTPUT OUT =study.allpropen PREDICTED = prob; Create propensity scores NOTE: No DESCENDING option

ODDS Ratios

Yes, pre-existing differences TYPE 3 ANALYSIS OF EFFECTS EffectDF Wald Chi-SquarePr > ChiSq RACE SEX age_comp <.0001 VISSUM <.0001

QUINTILE MATCHING EXAMPLE ONE

Part on creating quintiles blatantly copied (almost) /sas/quintiles.php

Calculate Quintile Cutpoints PROC UNIVARIATE DATA= saslib.allpropen; VAR prob; VAR prob; OUTPUT OUT=quintile OUTPUT OUT=quintile PCTLPTS= PCTLPRE=pct; PCTLPTS= PCTLPRE=pct; Remember the dataset we created with the predicted probabilities saved in it?

PROC UNIVARIATE VAR prob; *** predicted probability as variable OUTPUT OUT=quintile OUTPUT OUT=quintile PCTLPTS= PCTLPRE=pct; PCTLPTS= PCTLPRE=pct; *** output to a dataset named quintile, *** create four variables at these percentiles *** with the prefix pct ;

/* write the quintiles to macro variables */ data _null_ ; set quintile; call symput('q1',pct20) ; call symput('q2',pct40) ; call symput('q3',pct60) ; call symput('q4',pct80) ; Just because I am too lazy to write down the percentiles

Create quintiles data STUDY.AllPropen; set STUDY.AllPropen ; if prob =. then quintile =.; if prob =. then quintile =.; else if prob le &q1 then quintile=1; else if prob le &q1 then quintile=1; else if prob le &q2 then quintile=2; else if prob le &q2 then quintile=2; else if prob le &q3 then quintile=3; else if prob le &q3 then quintile=3; else if prob le &q4 then quintile=4; else if prob le &q4 then quintile=4; else quintile=5; else quintile=5;

Quintiles QuintileFrequencyPercent Cumulative Frequency Cumulative Percent

The matching part Try to control your excitement

Create case & control data sets DATA small large ; SET study.allpropen ; IF athome = 0 THEN OUTPUT small ; ELSE IF athome = 1 THEN OUTPUT large ;

Create data set of sampling percentages PROC FREQ DATA = small ; quintile / OUT = samp_pct ; quintile / OUT = samp_pct ;

Quintiles in smaller data set QuintileFrequencyPercent Cumulative Frequency Cumulative Percent

Create data set of sampling percentages PROC FREQ DATA = small ; quintile / OUT = samp_pct ; quintile / OUT = samp_pct ;

Create sampling data set DATA samp_pct ; SET samp_pct ; _NSIZE_ = 1 ; _NSIZE_ = _NSIZE_ * COUNT ; DROP PERCENT ; Just here to make it easy to modify

PROC SURVEYSELECT SAMPSIZE= input data set can provide stratum sample sizes in the _NSIZE_ variable STRATA groups should appear in the same order in the secondary data set as in the DATA= data set.

SELECT RANDOM SAMPLE PROC SORT DATA = large ; BY quintile ; PROC SURVEYSELECT DATA= large SAMPSIZE = samp_pct OUT = largesamp ; STRATA quintile ;

Concatenate data sets DATA study.psm_sample ; SET largesamp small ;

Did it work? VariableBeforeAfter AT Home NOT Home ProbAT HomeNOT Home Prob Age ER visits ****3.8 ****.0001 Female49%54%.0152%54%.36 Race ** P <.01 **** P <.0001

Before odds ratio 6.5 : 1 Effect Point Estimate 95% Wald Confidence Limits athome 0 vs Effect Point Estimate 95% Wald Confidence Limits quintile athome 0 vs AFTER ODDS RATIO = 3.7: 1