Metadata standards: Using DDI to Inform, Organize, and Drive Survey Data Production

Presentation transcript:

Overview
  What metadata standards try to achieve
  DDI – the metadata standard for social science
  Barriers to adoption
  Implementing metadata in a complex environment
  Use cases:
    MIDUS portal
    CLOSER portal
  Takeaway

Barriers to sharing data and metadata
  Different agencies and clients have different systems
    Taking over a survey from another agency often requires re-inputting everything
    Questionnaire specification quality and format differences
    Different clients have different requirements
  Barriers are also internal within organisations
    Different disciplines have different attitudes to what is most important
    Different departments speak different languages
    Communication is always an issue
  Difficulties are also intra-personal
    “The person with the information you want is yourself, 6 months ago, and he or she doesn’t respond to emails.”
  Manual processes reduce transparency within and between organisations
  “Survey Metadata: Barriers and Opportunities” meeting, June 26, 2014

Tackling diversity in the Survey Process

What are survey questions trying to achieve
  Accurate communication and accurate response
  Most important considerations are:
    Language used
    Frame of reference
    Arrangement of questions
    Length of the questionnaire
    Form of the response
      Dichotomous
      Multiple choice
      Check lists
      Open ended
      Pictorial
  From Young, Pauline (1956) “Scientific Social Surveys & Research”, 3rd Edition. Prentice Hall

What we are trying to capture with DDI
  How the survey was communicated and how participants responded
  Most important considerations are:
    Language used in the questions
    Frame of reference
    Arrangement of questions
    Length of the questionnaire
    Form of the response
      Dichotomous
      Multiple choice
      Check lists
      Open ended
      Pictorial
    Who was asked
    Who responded
    Is the question asked related to another question
    Who was responsible for the collection
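The items on this slide can be read as the fields of a record kept for each question. The sketch below is only an illustration of that idea in plain Python; the class and field names (QuestionRecord, universe, related_to and so on) are hypothetical and are not DDI element names, and the two sample questions are invented.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class ResponseForm(Enum):
    """Forms of response listed on the slide."""
    DICHOTOMOUS = "dichotomous"
    MULTIPLE_CHOICE = "multiple choice"
    CHECK_LIST = "check list"
    OPEN_ENDED = "open ended"
    PICTORIAL = "pictorial"


@dataclass
class QuestionRecord:
    """Hypothetical per-question record; field names are illustrative only."""
    question_id: str
    text: str                           # language used in the question
    response_form: ResponseForm         # form of the response
    position: int                       # arrangement of questions
    universe: str                       # who was asked
    collector: str                      # who was responsible for the collection
    frame_of_reference: Optional[str] = None
    related_to: Optional[str] = None    # id of a related question, if any
    categories: List[str] = field(default_factory=list)


def questionnaire_length(questions: List[QuestionRecord]) -> int:
    """Length of the questionnaire, counted in questions."""
    return len(questions)


def follow_ups(questions: List[QuestionRecord], question_id: str) -> List[QuestionRecord]:
    """Questions that declare a relationship to the given question."""
    return [q for q in questions if q.related_to == question_id]


# Invented sample data, for illustration only.
QUESTIONS = [
    QuestionRecord("Q1", "Do you currently smoke?", ResponseForm.DICHOTOMOUS,
                   position=1, universe="All respondents",
                   collector="Example Agency", categories=["Yes", "No"]),
    QuestionRecord("Q2", "How many cigarettes do you smoke per day?",
                   ResponseForm.OPEN_ENDED, position=2,
                   universe="Respondents answering Yes to Q1",
                   collector="Example Agency", related_to="Q1"),
]
```

Calling follow_ups(QUESTIONS, "Q1") returns the record for Q2, which is the kind of "is this question related to another question" link the slide asks the metadata to preserve.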

What DDI provides
  Capture what was intended
    What, where it came from and why
  Capture exactly what was used in the survey implementation
    How, the logic employed and under what conditions
  To specify what the data output will be
    That it mirrors what was captured and its source
  To keep the connection between the survey implementation through to the data received -> data management at a study -> to archiving
  Generalised solution
    So that it can be actioned efficiently and is self-describing
    So that it can be rendered in different forms for different purposes
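A minimal sketch of the "rendered in different forms for different purposes" point, using only the Python standard library. The XML fragment is a simplified, namespace-free illustration loosely modelled on DDI Codebook element names (var, labl, qstn, qstnLit, catgry, catValu); it is an assumption for demonstration and is not a schema-valid DDI document. The function names and the sample variable are hypothetical.

```python
import xml.etree.ElementTree as ET

# Simplified illustrative fragment, NOT a valid DDI instance.
FRAGMENT = """
<codeBook>
  <var name="SMOKE_NOW">
    <labl>Currently smokes</labl>
    <qstn><qstnLit>Do you currently smoke?</qstnLit></qstn>
    <catgry><catValu>1</catValu><labl>Yes</labl></catgry>
    <catgry><catValu>2</catValu><labl>No</labl></catgry>
  </var>
</codeBook>
"""


def read_variables(xml_text):
    """Read each variable with its label, source question and categories."""
    root = ET.fromstring(xml_text)
    variables = []
    for var in root.iter("var"):
        variables.append({
            "name": var.get("name"),
            "label": var.findtext("labl"),            # direct child only
            "question": var.findtext("qstn/qstnLit"),
            "categories": {
                cat.findtext("catValu"): cat.findtext("labl")
                for cat in var.findall("catgry")
            },
        })
    return variables


def as_codebook_line(variable):
    """Rendering 1: a human-readable codebook entry."""
    cats = ", ".join(f"{code}={label}"
                     for code, label in variable["categories"].items())
    return (f'{variable["name"]}: {variable["label"]} [{cats}] '
            f'<- "{variable["question"]}"')


def as_value_labels(variable):
    """Rendering 2: a value-label mapping for a statistics package."""
    return {int(code): label
            for code, label in variable["categories"].items()}


VARIABLES = read_variables(FRAGMENT)
```

Both renderings come from the same source record, so the link from question text to variable to category codes survives whichever output is produced.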

And a framework to do this
  Methodology and Instrument Design
  Instrument Fielding and Data Collection
  Data Cleaning, Labeling, and Transformations
  Documentation, READMEs, Descriptions (non-dataset or variable)

Data production
  Questionnaire development
    Colectica (DDI to Blaise / CASES etc.)
  Questionnaire metadata and data export
    Blaise (well-used add-on)
    Unicom Intelligence (Colectica export)
    CASES (SDAtoDDI, and Colectica)
    REDCap (API and Colectica)
  Data production
    StatTransfer
    SledgeHammer
    Colectica

Discovery and dissemination
  Archives
    Australian Data Archive (2 million variables)
    ICPSR (4.5 million variables)
    CESSDA (14 European social science data archives)
  Research portals
    MIDUS (25,000 variables)
    CLOSER (45,000 variables, 18,000 questions)

Choosing the right flavor
  Codebook
    Cross-sectional data (codebook)
    Easy to implement
  Lifecycle
    Longitudinal data
    Harmonization
    Re-use of data and questions
    Support process description, e.g. questions

DDI in practice

Use Case: MIDUS
  Key strength of MIDUS:
    Multiple longitudinal samples
    Multidisciplinary design
  Products:
    N<13,000
    25,000 variables
    20 datasets
  Wide secondary usage – Open Data philosophy
    Top data download at ICPSR
    68k data downloads; 30k users
    700+ publications
  Survey data collection with CASES (DDI friendly)

Use Case: MIDUS
  Metadata capture is crucial for:
    Harmonization
    Discovery
    Data download capabilities

Use Case: MIDUS - Harmonization

Use Case: MIDUS - Discovery

Use Case: MIDUS - Download Dataset Codebook

Use Case: CLOSER
  Key strength of CLOSER:
    Multiple longitudinal samples
    Multiple cohorts (1930 – present)
    Biomedical & Social Science
  Products:
    N ~ 150,000
    ~ 250,000 variables
    ~ 300 datasets
  Metadata only platform
    Full questionnaire flow and contents
    Cross-cohort comparison
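Cross-cohort comparison depends on being able to line up equivalent questions across studies once their text is held as structured metadata. The sketch below shows one naive way to do that, by grouping questions on normalised wording. It is an assumption for illustration, not a description of how the CLOSER platform works; the cohort names, question ids and question texts are invented.

```python
import re
from collections import defaultdict


def normalise(text):
    """Lower-case, strip punctuation and collapse whitespace."""
    cleaned = re.sub(r"[^a-z0-9 ]", "", text.lower())
    return " ".join(cleaned.split())


def shared_questions(questions):
    """Map normalised question text to the cohorts that asked it,
    keeping only wording that appears in more than one cohort."""
    cohorts_by_text = defaultdict(set)
    for cohort, _question_id, text in questions:
        cohorts_by_text[normalise(text)].add(cohort)
    return {text: sorted(cohorts)
            for text, cohorts in cohorts_by_text.items()
            if len(cohorts) > 1}


# Invented sample data: (cohort, question id, question text).
QUESTIONS = [
    ("Cohort A", "A_Q12", "Do you currently smoke?"),
    ("Cohort B", "B_Q07", "Do you currently  smoke"),
    ("Cohort B", "B_Q08", "How many hours do you sleep per night?"),
]

SHARED = shared_questions(QUESTIONS)
```

Exact matching on normalised text only finds near-identical wording; questions that measure the same concept with different wording would need explicit links recorded in the metadata.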

Millennium Cohort Study (workflow diagram; labels recovered from the figure)
  Data Extraction
  QA
  Clean and Edit
  SPSS (Edited)
  DDI XML
  Colectica Designer
  Dimensions MDD + DDF
  Export to SPSS

Use Case: CLOSER - Scope

Use Case: CLOSER - Questions

Use Case: CLOSER - Data

A common mechanism for communication
  Capture what was intended
    What, where it came from and why
  Capture exactly what was used in the survey implementation
    How, the logic employed and under what conditions
  To specify what the data output will be
    Mirrors what was captured and its source
  To keep the connection between the survey implementation through to the data received -> data management -> to the archive
  Generalised solution
    So that it can be actioned efficiently and is self-describing
    So that it can be rendered in different forms for different purposes

Some final thoughts
  Reduction in manual processes
    More accurate, cheaper and quicker
  One DDI document -> multiple uses
  Enables distributed data collection
    Across different platforms and organizations
  Enables distributed research
  Increased quality of documentation of data collection
  Raises visibility of needs
  Encourages users to better understand the data and the data collection process
  New tools to think in more interesting ways can be built

Another perspective Dan Gillman: Information Scientist, Bureau of Labor Statistics