Institute for Social Research University of Michigan The Use of Structured Survey Instrument Metadata throughout the Data Lifecycle Sue Ellen Hansen Institute for Social Research University of Michigan ESRA 2011
Presentation Outline Data Lifecycle Survey Lifecycle MQDS Moving through the data lifecycle Study example
Data Lifecycle (DDI) http://www.ddialliance.org/
Survey Lifecycle (CCSG) http://ccsg.isr.umich.edu/
MQDS Michigan Questionnaire Documentation System Documents Blaise™ survey instruments and datasets Produces relational and eXtended Markup Language (XML) question-level metadata that meet the Data Documentation Initiative (DDI) metadata standard Now available to licensed Blaise ™ users
MQDS Version 3 Relational database DDI compliant standardized tables Flexibility to add extensions that meet client and organizational needs Allows Automated documentation of any Blaise survey instrument Importing and documenting data produced by other software (e.g., SAS) Lower cost development of other tools that facilitate editing and disseminating data
MQDS V3 Relational Database
Using MQDS across the Data Lifecycle
From Concepts to Instrument Specifications Concepts → Constructs → Indicators → Measures Questionnaire design Focus groups, cognitive interviews, etc. Questionnaire Instrument design Instrument specifications → programming & screen layout → MQDS documentation (instrument and codebook)
Instrument Specifications
Blaise™ Programming
Blaise™ Screen
MQDS Instrument Documentation
Using MQDS across the Data Lifecycle Check programming against instrument specifications Compare versions of instruments Test programmed instruments Document preliminary datasets Document cleaned datasets Harmonize data Disseminate data for public use data
Study Example Collaborative Psychiatric Epidemiological Surveys (CPES) National Comorbidity Survey Replication (NCS-R) National Survey of American Life (NSAL) National Latino and Asian American Study (NLAAS) – 5 languages U.S. National Institute of Mental Health (NIMH) http://www.icpsr.umich.edu/icpsrweb/CPES/
CPES Programming and data collection Harmonization project Survey Research Operations (SRO), Survey Research Center, (SRC), Institute for Social Research (ISR) Harmonization project SRO and the Inter-University Consortium for Political and Social Research (ICSPR), ISR Facilitated by MQDS standardized structured XML metadata for all three CPES surveys
CPES Harmonization Project Primary goals: Harmonize and merge CPES datasets Prepare codebooks and data for on-line analysis Develop Website Disseminate for public use
The Data ~ 6,000 variables ~ 20,000 Respondents NCS-R: 9,282 adults NSAL: 6,199 adults 3,570 African American 1623 Afro-Caribbean 891 non-Latino White NLAAS: 4,864 adults 1095 Asian 2,554 Latino 215 non-Latino, non-Asian White ~ 6,000 variables from 9,000 before harmonization and disclosure analysis
Process for Creating CPES XML
Process Repeated for NCS-R NSAL NLAAS NLAAS English NLAAS Spanish NLAAS Tagolog NLAAS Vietnamese NLAAS Chinese
CPES Website Development and Data Dissemination
CPES Interactive Codebook http://www.icpsr.umich.edu/icpsrweb/CPES/
CPES Interactive Codebook
CPES Interactive Codebook
CPES Interactive Codebook
Summary MQDS
Questions? Data Documentation Initiative CPES Cross-Cultural Survey Guidelines MQDS http://www.ddialliance.org/ http://www.icpsr.umich.edu/icpsrweb/CPES/ http://ccsg.isr.umich.edu/ http://www.blaise.com/Tools