Why Dummy Tables are Smart!

Slides:



Advertisements
Similar presentations
From study objectives to analysis plan Helen Maguire.
Advertisements

Study Objectives and Questions for Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare Research and Quality (AHRQ)
The road map to problem solving The analysis plan FETP India.
V.: 9/7/2007 AC Submit1 Statistical Review of the Observational Studies of Aprotinin Safety Part I: Methods, Mangano and Karkouti Studies CRDAC and DSaRM.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition Instructor’s Presentation Slides 1.
Chance, bias and confounding
Estimation and Reporting of Heterogeneity of Treatment Effects in Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare.
Elements of a clinical trial research protocol
Effect of Physician Asthma Education on Health Care Utilization of Children at Different Income Levels Randall Brown, Noreen Clark, Niko Kaciroti, Molly.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
THE QUANTITATIVE RESEARCH APPROACH
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
STRATEGIES FOR RESEARCH Approaching the Paper Assignment.
Reporting Results P9419 Class #6 November 17, 2003.
1 Statewide Assessment Elements of a Successful Statewide Assessment.
Cohort Studies Hanna E. Bloomfield, MD, MPH Professor of Medicine Associate Chief of Staff, Research Minneapolis VA Medical Center.
CHAPTER 14, QUANTITATIVE DATA ANALYSIS. Chapter Outline  Quantification of Data  Univariate Analysis  Subgroup Comparisons  Bivariate Analysis  Introduction.
Quantifying Data.
Studying treatment of suicidal ideation & attempts: Designs, Statistical Analysis, and Methodological Considerations Jill M. Harkavy-Friedman, Ph.D.
Quality Improvement Prepeared By Dr: Manal Moussa.
ASSESSING GRADE 5 PUPIL PERFORMANCE (Reading and Mathematics in 2001) Socialist Republic of Vietnam.
David Card, Carlos Dobkin, Nicole Maestas
Marketing Research Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides.
STrengthening the Reporting of OBservational Studies in Epidemiology
Research Methods. Research Projects  Background Literature  Aims and Hypothesis  Methods: Study Design Data collection approach Sample Size and Power.
Annex I: Methods & Tools prepared by some members of the ICH Q9 EWG for example only; not an official policy/guidance July 2006, slide 1 ICH Q9 QUALITY.
Simple Linear Regression
THE RELATIONSHIP BETWEEN BMI AND SUICIDALITY IN YOUNG ADULT WOMEN Alexis E. Duncan, Pamela A.F. Madden, and Andrew C. Heath Washington University Department.
Frequency and type of adverse events associated with treating women with trauma in community substance abuse treatment programs T. KIlleen 1, C. Brown.
Survey Methodology Data interpretation and presentation EPID 626 Lecture 11.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 5: Analysis Issues in Large Observational Studies.
Self-reported cognitive and emotional effects and lifestyle changes shortly after preventive cardiovascular consultations in general practice Dea Kehler.
Slide 1 Long-Term Care (LTC) Collaborative PIP: Medication Review Tuesday, October 29, 2013 Presenter: Christi Melendez, RN, CPHQ Associate Director, PIP.
1 Research Methods in Psychology Overview of the Research Report Chapter 16.
Copyright restrictions may apply Household, Family, and Child Risk Factors After an Investigation for Suspected Child Maltreatment: A Missed Opportunity.
Evaluating a Research Report
Statistics and Quantitative Analysis U4320 Segment 12: Extension of Multiple Regression Analysis Prof. Sharyn O’Halloran.
Tips for Researchers on Completing the Data Analysis Section of the IRB Application Don Allensworth-Davies, MSc Statistical Manager, Data Coordinating.
Meredith Cook – PharmD Candidate Mercer University COPHS August, 2012 Cognitive Trajectories after Postoperative Delirium.
Systematic Review Module 7: Rating the Quality of Individual Studies Meera Viswanathan, PhD RTI-UNC EPC.
Organizing a project, making a table Biostatistics 212 Lecture 7.
Organizing a project, making a table Biostatistics 212 Session 5.
 2008 Johns Hopkins Bloomberg School of Public Health Evaluating Mass Media Anti-Smoking Campaigns Marc Boulay, PhD Center for Communication Programs.
ANALYSIS PLAN: MOCK TABLES Lu Ann Aday, Ph.D. The University of Texas School of Public Health.
Basic epidemiologic analysis with Stata Biostatistics 212 Lecture 5.
The Practice of Social Research Chapter 14 – Quantitative Data Analysis.
Organizing a project, making a table Biostatistics 212 Lecture 7.
EPID Introduction to Analysis and Interpretation of HIV/STD Data LECTURE 1: Examining Your Data and Steps in Data Analysis Manya Magnus, Ph.D. Summer.
Descriptive Research Study Investigation of Positive and Negative Affect of UniJos PhD Students toward their PhD Research Project Dr. K. A. Korb University.
Research Project Title Researcher Name #1 Researcher Name #2 Researcher Name #3 Researcher Name #4 Date of Presentation Name of Conference Presenting at.
HIV INFECTION AND INJECTION DRUG USE: The Importance of Gender 1 Amy B. Wisniewski, Ph.D. 2 Adrian S. Dobs, M.D., MPH Departments of Pediatrics 1 and Medicine.
Psychological Distress and Recurrent Pain: Results from the 2002 NHIS Psychological Distress and Recurrent Pain: Results from the 2002 NHIS Loren Toussaint,
Can Physical Activity Attenuate Aging- related Weight Loss in Older People? The Yale Health and Aging Study, James Dziura, Carlos Mendes de Leon,
Case-Control Studies Abdualziz BinSaeed. Case-Control Studies Type of analytic study Unit of observation and analysis: Individual (not group)
1 Statistical Review of the Observational Studies of Aprotinin Safety Part II: The i3 Drug Safety Study CRDAC and DSaRM Meeting September 12, 2007 P. Chris.
Monitoring Afghanistan, 2015 Food Security and Agriculture Working Group – 9 December 2015.
Unit 11: Evaluating Epidemiologic Literature. Unit 11 Learning Objectives: 1. Recognize uniform guidelines used in preparing manuscripts for publication.
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
REGRESSION MODEL FITTING & IDENTIFICATION OF PROGNOSTIC FACTORS BISMA FAROOQI.
Template provided by: “posters4research.com”   Ideals: mental constructs that represent an idea of traits we are attracted to in potential partners (Fletcher.
BREAST CANCER BY STAGE OF DISEASE AT DIAGNOSIS, CENTRAL OKLAHOMA Arthur Owora, MPH; Aaron Wendelboe, PhD; David Thompson, PhD; Janis Campbell, PhD The.
Implementing Digital Campaigns Clive Whysall CAM Examiner Assignment brief December 2013 / March 2014.
Transportation-related Injuries among US Immigrants: Findings from National Health Interview Survey.
Meta-analysis of observational studies Nicole Vogelzangs Department of Psychiatry & EMGO + institute.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Stats 242.3(02) Statistical Theory and Methodology.
Sofija Zagarins1, PhD, Garry Welch1, PhD, Jane Garb2, MS
Part III: Designing Psychological Research
Psych 231: Research Methods in Psychology
Presentation transcript:

Why Dummy Tables are Smart! A Systematic Approach to Data Analysis for Your M.Sc. Thesis Lisa Fredman, Ph.D. Department of Epidemiology, BUSPH CREST Seminar March 17, 2009

Outline: 1. Research fundamentals (the basics) 2. Analytic plan in research a. Hypothesis guides plan b. Identify measures for E, D, and covariables c. Descriptive statistics on E, D, and covariables d. Analyses on E-D association i. Crude analyses ii. Evaluate potential confounders iii. Multivariable analyses 3. Present results in tables and text Aim: describe how dummy tables used in Steps 2a-d, 3

Research fundamentals: - systematic investigation of E-D association - analysis follows sequential steps from descriptive analyses -> univariate E-D association -> confounder assessment -> multivariate modeling - document methods and variables - document analytic steps, results at each step, decisions that influence next steps - clear communication throughout - hypothesis - methods - analytic steps - results

Dummy tables Definition: Dummy tables (aka mock tables) are shells of tables with variable names, SAS names, and statistical measures. Do not include data. Create dummy tables when develop analysis plan. Fill in dummy tables as perform analyses. Use dummy tables to guide analyses record SAS programs used for analyses names of measures used document interim results draft methods and results

Example of generic dummy table: Title: Distribution of key variables (SAS program used to generate results, date) Variable Distribution Exposure: Variable (VARNAME) (mean, std, range) Outcome: Variable (VARNAME) (%) Covariables Covar1 (VARNAME) (%) Covar2 (VARNAME) (%) Covar3 (VARNAME) (%) … . . Brief notes on results, decisions, next steps

Why are dummy tables smart? Stay focused on analyses to test YOUR hypothesis. Provides template for systematic steps in your analysis. Internal documentation. Centralized record of analyses, results, decisions. Communication aid.

Dumb things that smart researchers often do: Analyze associations that look interesting but are tangential to their hypothesis. DON’T BE TEMPTED TO DO THIS! Revise analytic variables and not rename vars or record changes. DON’T LET YOURSELF FALL INTO THIS TRAP! Dummy tables help you avoid doing these dumb things.

Guide to dummy tables for analyses for epidemiologic study: Before starting analyses: Write down hypothesis Make dummy table for each stage of analysis Make note to write summary of table, decisions, next steps.

Guide to dummy tables for analyses for epidemiologic study, con’t: Start with 4-5 dummy tables: Descriptive analyses: variable distributions Crude analyses Bivariate analyses Confounder analysis Multivariable analyses

Guide to dummy tables for analyses for epidemiologic study, con’t: While doing analyses, at each step: Fill in dummy table and/or checklist at each stage Make decisions based on analyses at this stage (operationalizing variables, selecting confounders, excluding variables from multivariate model) that will influence next stage Write each decision and rationale for it Proceed to next stage

EX: Making Corned Beef with Cabbage dinner

Generic dummy table aka “Shopping List” EX: Making Corned Beef with Cabbage dinner Generic dummy table aka “Shopping List” Shopping list for Corned Beef dinner Ingredients Amount Cost Cabbage 1 head Carrots 3 large Corned brisket or beef 4 lbs Toadstools 6 small … Stop & Shop, or Shaws? (Title) (Variables)

Stop & Shop or Shaws? Need subgroup analyses!

Shopping list for Corned Beef dinner EX: Making Corned Beef with Cabbage dinner Fill in shopping list! Shopping list for Corned Beef dinner Ingredients Amount Cost Cabbage 1 head Carrots 3 large Corned brisket or beef -- Hummell 4 lbs $1.49/lb Toadstools 6 small … Stop & Shop, or Shaws? Either (Title) (Variables)

Make notes to improve recipe LF: use fewer onions, more carrots LF: definitely plan on 2 hrs! Use less water

Another example: is positive affect associated with better recovery in physical functioning following hip fracture? Main study hypothesis: Elderly hip fracture patients with high positive affect will show recovery in more ADLs, and in more mobility-related ADLs over 2-years following fracture than patients with low positive affect or depression.

Dummy tables for Positive Affect study: Title: Table 1 (manuscript): baseline characteristics of hip fracture sample, by positive affect category (OCESD) (SAS pgm used for results, date) Total sample High PA (n=xxx) Low PA (n=xxx) Depressed (n=xxx) p-value Sociodemographic variables Age groups: % (AGE) Sex: % female (RACE) Medical conditions Past stroke: % (V508) Past hip fx: % (V515) Functional status at baseline ADL limitations (0-7): mean, std (KATZ0)

More dummy tables: Age-adjusted mean KATZ ADL score at each interview point, by baseline Positive Affect Category Positive Affect category Baseline (KATZ0) 2-month (KATZ02) 6-months (KATZ06) 12-months (KATZ12) 18- months (KATZ18) 24-months (KATZ24)  (OCESD) Mean (se) High pos. affect Low pos. affect Depressive symptoms

Filled-in dummy table and summary: Age-adjusted mean KATZ ADL score at each interview point, by baseline Positive Affect Category (pgm=hipKatz2_age adjusted means, 5/3/06) Positive Affect category Baseline (KATZ0) 2-month (KATZ02) 6-months (KATZ06) 12-months (KATZ12) 18- months (KATZ18) 24-months (KATZ24)  (OCESD) Mean (se) High pos. affect 0.72 (0.12) 3.76 (0.13) 2.49 (0.16) 2.03 (0.17) 1.98 (0.19) 2.02 (0.18) Low pos. affect 0.49 (0.20) 3.82 (0.21) 2.59 (0.27) 2.28 (0.28) 2.14 (0.30) 1.91 (0.28) Depressive symptoms 1.29 (0.10) 4.20 (0.11) 3.05 (0.13) 2.83 (0.14) 2.86 (0.16) 2.63 (0.15) Summary of age-adjusted analyses: Respondents with low positive affect (PA) reported the fewest ADL limitations at baseline, and those with depressive symptoms reported the most. On average, respondents in each affect category reported more ADL limitations at each interview following the fracture. On the KatzADL variable, the high PA group reported the fewest ADL limitations 2-months through 18-months post-fracture. However, there were no statistically significant differences between respondents with high and low PA.

Dummy table for confounder assessment: Confounder assessment for Positive Affect_ADLs analyses Beta coefficient Beta coefficients for models with individual potential confounders Outcome OCESD level Age % change Race %change medsum42 KATZ ADL measure: model with cesd* time interaction term  OCESD-level 1 OCESD-level 2

Filled-in dummy table for confounder assessment: Confounder assessment for Positive Affect_ADLs analyses Beta coefficient Beta coefficients for models with individual potential confounders Outcome OCESD level Age % change Race %change medsum42 KATZ ADL measure: model with cesd* time interaction term  OCESD-level 1 -0.3805 -0.354 107.5 -0.3969 95.9 -0.3612 105.3 OCESD-level 2 -0.2796 -0.4252 65.8 -0.3021 92.6 -0.2544 109.9 from hipKatzmix1_mixed models baseline, 5/3/06 Summary: Age and 1 or more medical conditions (medsum42) met the criteria as potential confounders. I will also include race in the multivariable models since it may turn out to be a confounder in the models of the KatzADL outcome.

Dummy tables for multivariable analyses: Predicted mean KATZ ADL score at each interview point, by baseline Positive Affect Category, PROC MIXED results (pgm=hipKatzmix2_mixed models, prelim multivariable models, 5/4/06) Positive Affect category 2-months (KATZ02) 6-months (KATZ06) 12-months (KATZ12) 18-months (KATZ18) 24-months (KATZ24) (n=352) (n=321) (n=306) (n=245) (n=232)  (OCESD) Mean (se) High positive affect Low positive affect Depressive symptoms Differences and 95% CI’s: High vs. low positive affect High positive affect vs. depr.

Filled-in dummy tables and summary for multivariable analyses: Predicted mean KATZ ADL score at each interview point, by baseline Positive Affect Category, PROC MIXED results (pgm=hipKatzmix2_mixed models, prelim multivariable models, 5/4/06) Positive Affect category 2-months 6-months 12-months 18-months 24-months (n=352) (n=321) (n=306) (n=245) (n=232)   Mean (se) High positive affect 3.87 (0.14) 2.62 (0.14) 2.18 (0.14) 2.19 (0.15) 2.35 (0.16) Low positive affect 3.96 (0.23) 2.75 (0.24) 2.51 (0.23) 2.35 (0.24) 2.27 (0.25) Depressive symptoms 3.97 (0.12) 2.94 (0.12) 2.75 (0.12) 2.88 (0.13) 2.70 (0.13) Differences and 95% CI’s: High vs. low positive affect -0.09 (-0.61,0.43) -0.14 (-.68,0.41) -0.34 (-.88,0.20) -0.15 (-0.72,0.42) 0.07 (-0.50,.65) High positive affect vs. depr. -0.10 (-0.46,0.25) -0.32 (-.68,0.05) -0.57 (-0.94,-.20) -0.68 (-1.08, .29) -0.35 (-0.76,.06) Summary: In the multivariable model, positive affect and followup time were associated with the KatzADL score over time. Mean KatzADL scores were significantly lower (ie, less impaired) in respondents with high positive affect compared to those with depressive symptoms at months 12 and 18; there were no differences between respondents with high and low positive affect.

Additional records to supplement dummy tables: Data memos to co-investigators/self Footers and WORD file names with filename and date created/revised ex: Positive Affect ADLs_datamemo3_050306

Conclusion: Dummy tables are an organizational tool to ensure that data analyses follow hypothesis and are systematically recorded. Provide internal documentation. Link analytic plan, interim results, final tables and manuscript. That’s why dummy tables are smart!